Professional Documents
Culture Documents
Wonderful World
of
Speech Enhancement
Femi Odelowo
Definition
Speech enhancement is concerned with improving
some perceptual aspect of speech that has been
degraded by additive noise
Speech
Enhancement Theory and Practice, P. C. Loizou
Signal Model
The additive noise model is the most commonly
considered model
STFT
STFT
Phase
Paramet
er
Estimati
on
Gain
Calculatio
n
Spectral
Modificati
on
Inverse
STFT
Algorithms
Spectral subtraction
Conceptually the simplest to design/implement
Based on the assumed additive nature of the noise
Subspace algorithms
Based on a linear algebra framework
Typically use eigenvalue/eigenvector decomposition or SVD
-20
-25
-10
-30
-20
-30
-40
PS (dB)
PN (dB)
-35
-45
-50
True Noise
Est. Noise,
Est. Noise,
Est. Noise,
Est. Noise,
Est. Noise,
-55
-60
-65
-70
0.5
1.5
Time (sec)
2.5
-40
-50
-60
= 0.7
= 0.9
=1
=2
=5
-70
-80
-90
0.5
1.5
Time (sec)
True Signal
Est. Signal, = 0.7
Est. Signal, = 1
Est. Signal, = 5
2.5
Clean Signal PSD vs. Signal PSD Estimates With imcra Noise Estimation for f = 500
0
-30
-10
-35
-20
-40
-30
-45
-40
PS (dB)
PN (dB)
-25
-50
-55
True Noise
Est. Noise,
Est. Noise,
Est. Noise,
Est. Noise,
Est. Noise,
-60
-65
-70
0.5
1.5
Time (sec)
2.5
-50
-60
= 0.7
= 0.9
=1
=2
=5
-70
True Signal
Est. Signal, = 0.7
Est. Signal, = 1
Est. Signal, = 5
-80
-90
0.5
1.5
Time (sec)
2.5
Noisy Signal
Enhanced Signal, oracle PSDs
Enhanced Signal, = 0.7
Enhanced Signal, = 1
Enhanced Signal, = 1
Enhanced Signal, = 5
Enhanced Signal, = 5
Noisy Signal
Enhanced Signal, oracle PSDs
Enhanced Signal, = 0.7
Enhanced Signal, = 1
Enhanced Signal, = 1
Enhanced Signal, = 5
Enhanced Signal, = 5
True SNR
DD SNR Estimate
Anderson DD Estimate
-10
SNR (dB)
20
0
-20
-40
-60
0
-20
-30
-40
-50
0.5
1.5
Time (sec)
2.5
-60
0
True
DD Gain
Anderson DD Gain
0.5
1.5
Time (sec)
2.5
0.9
0.8
Wiener Gains
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
-50
Ideal
Realized
-40
-30
-20
-10
SNR (dB)
10
20
30
40
50
5
Mean COMP SIG Score
3.5
3
2.5
Learned Response
True Signal/Noise WF
DD Wiener Filter
2
1.5
dB
10
3.5
3
Learned Response
True Signal/Noise WF
DD Wiener Filter
2.5
0
dB
10
15
4.5
3.5
15
3
2.5
Learned Response
True Signal/Noise WF
DD Wiener Filter
2
1.5
4.5
dB
10
15
4
3.5
3
2.5
Learned Response
True Signal/Noise WF
DD Wiener Filter
2
1.5
dB
10
15
3
Learned Response
True Signal/Noise WF
DD Wiener Filter
2.5
1.5
airport
babble
car
exhibition restaurant
Noise Types
station
street
2.5
babble
car
exhibition restaurant
Noise Types
station
street
train
15dB Signals
3.8
3.4
Learned Response
True Signal/Noise WF
DD Wiener Filter
3.2
3
2.8
Learned Response
True Signal/Noise WF
DD Wiener Filter
3.6
3.6
Learned Response
True Signal/Noise WF
DD Wiener Filter
3.4
3.2
3
2.6
airport
2
airport
train
10dB Signals
3.8
5dB Signals
3.5
3.5
babble
car
exhibition restaurant
Noise Types
station
street
train
2.8
airport
babble
car
exhibition restaurant
Noise Types
station
street
train
Questions/Discussion