Noise Supression Techniques For Speech Enhancement Using Adaptive Filtering
Noise Supression Techniques For Speech Enhancement Using Adaptive Filtering
Noise Supression Techniques For Speech Enhancement Using Adaptive Filtering
Overview
Objective/Problem Description
Applications
Overview of Noise Reduction Methods
System Description
Filter analysis
Linear methods
Wiener approximation
KLT preprocessing
Signal subspace embedding
Current results
Future work
Implementation/ practical considerations
Conclusions
Objective/Problem
Description
The goal of my project was to
research noise reduction techniques
specifically for automatic speech
recognition system front-end
processing on a single microphone
without an independent noise
recording or clean reference signal.
Applications
(1)
(2)
(3)
(1)
Speaker identification
(2)
(3)
Overview of Speech
Enhancement
System Descriptions
BSS/ICA
ANC
Filter Analysis
(1)
yk sk nk
MMSE cost
function
Min
E s k g ( y k )
g
error sk sk
sk g ( yk )
Min
E yk g ( y k )
g
2 E nk g ( y k ) 2 E yk nk E nk 2
1
Min
g
N
k 1
yk g ( y k )
2 E nk g ( y )
Filter Analysis
(2)
sk w T y k
Minimizing the MMSE cost function with respect to w the result is as follows:
w R -1y (ry rn )
This is an approximation to the Wiener solution where we are estimating
the crosscorrelation vector p with (ry rn) (similar to spectral subtraction)
Filter Analysis
(3)
U eig ( R y )
~y Uy
k
k
The resulting closed form solution for the weight vector is:
1 T
w U R y U U ry rn
T
Filter Analysis
(4)
~ T
~
Min
E sk W ( y k ) Q sk W ( y k )
g
U i 1 U i
Filter Analysis
(5)
computationally efficient
Standard linear state-space model for Kalman filter
x (n 1) F (n 1, n) x (n) v1 (n)
y ( n) C ( n) x ( n) v 2 ( n)
Filter Analysis
(6)
Nonlinear filtering
Many nonlinear filtering methods exist to suppress noise in
noisy speech. Examples include filters based on neural networks
or phase space reconstruction. In general, they are very complex
to analyze, but do not require estimation of noise or speech
spectra and are not characterized by musical tone artifacts.
(1) http://research.yale.edu/ysm/images/78.2/articles-neural-network.jpg
Typical Results
Segmental SNR results
(left) and SNR results
(below) for various
linear and nonlinear
noise reduction
methods [8]
Noisy Speech Signal
(white noise)
Wiener Filtered
Ephraim Filtered
Future Work
Experiments
Implementation/
Practical Considerations
Real-time processing
Conclusions
References
1.
2.
3.
4.
5.
6.
7.
8.
9.
Eric A. Wan and Rudolph van der Merwe, Noise-Regularized Adaptive Filtering for Speech
Enhancement, Proc. Eurospeech, pp. 2643-2646, 1999.
Ki Yong Lee., Byung-Gook Lee, Iickho Song, and Souguil Ann, Robust Estimation of AR Parameters
and its Application for Speech Enhancement, Proc. IEEE ICASSP, pp. 309 - 312, 1992.
Phil S. Whitehead, David V. Anderson, and Mark A. Clements, Adaptive, Acoustic Noise Suppression
for Speech Enhancement. Proc. IEEE ICME, pp. 565 568, 2003.
A. V. Oppenheim, E. Weinstein, K. C. Zangi, M. Feder, and D. Gauger, Single Sensor Active Noise
Cancellation Based on the EM Algorithm, Proc. IEEE ICASSP, pp. 277 280, 1992.
T. Rutkowski, A. Cichocki, and A. K. Barros, Speech Enhancement Using Adaptive Filters and
Independent Component Analysis Approach, Proc. AISAT, 2000.
H. Saruwatari, K. Sawai, A. Lee, K. Shikano, A. Kaminuma, and M. Sakata, Speech Enhancement and
Recognition in Car Environment Using Blind Source Separation and Subband Elimination Processing,
Proc. ICA, pp. 367 372, 2003.
Simon Haykin, Adaptive Filter Theory, Prentice-Hall Inc., Upper Saddle River, NJ, pp 466 501,
2002.
M. T. Johnson, A. C. Lindgren, R. J. Povinelli, and X. Yuan, Performance of Nonlinear Speech
Enhancement using Phase Space Reconstruction, Proc IEEE ICASSP, pp. 872 875, 2003.
Andrew C. Lindgren, Speech Recognition Using Features Extracted from Phase Space
Reconstructions, Thesis, Marquette University, Milwaukee WI, May 2003.
END