Abstract
Background noise added to speech can decrease the performance of speech segmentation and enhancement. To solve this problem, new methods have been developed in this thesis. First, a new speech segmentation method (ATF-based SONFIN algorithm) is proposed in fixed noise-level environment. This method contains the multiband analysis and a neural fuzzy network, and it achieves higher recognition rate than the TF-based robust algorithm by 5%. In addition, a new speech segmentation method called RTF-based RSONFIN algorithm is proposed for variable noise-level environment. The RTF-based RSONFIN algorithm contains a recurrent neural fuzzy network. This method contains the multiband analysis and achieve higher recognition rate than the TFbased robust algorithm by 12%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
L.F. Lamel, L.R. Rabiner, A.E. Rosenberg, and J.G. Wilson, “An improved endpoint detector for isolated word recognition,” IEEE ASSP Mag., vol.29, pp.777–785, August, 1981.
Y. Qi and B.R. Hunt, “Voiced-unvoiced-silence classification of speech using hybrid features and a network classifier,” IEEE Tran. Speech Audio Processing, vol.1, pp. 250–255, April, 1993.
B. Reaves, “Comments on an improved endpoint detector for isolated word recognition,” IEEE Trans. Signal Processing, vol.39, pp.526–527, February, 1991.
J.C. Junqua, B. Mak, and B. Reaves, “A robust algorithm for word boundary detection in the presence of noise,” IEEE Trans. Speech Audio Processing, vol.2, pp.406–412, July, 1994.
T. Ghiselli-Crippa and A. El-Jaroudi, “A fast neural net training algorithm and its application to voiced-unvoiced-silence classification of speech,” ICASSP91, vol.1, pp.441–444, 1991.
C.F. Juang and C.T. Lin, “An on-line self-constructing neural fuzzy inference network and its application,” IEEE Trans. Fuzzy System, vol. 6, pp. 12–32, February 1998.
C.F. Juang and C.T. Lin, “A Recurrent Self-Organizing Neural Fuzzy Inference Network”, IEEE Trans. Neural Networks, vol. 10, no. 4, pp. 828–845, July, 1999.
C.T. Lin, Neural Fuzzy Control Systems with Structure and Parameter Learning, World Scientific, 1994.
C.T. Lin and C.S.G.J Lee, Neural Fuzzy Systems: A Neural-Fuzzy Synergism to Intelligent Systems, Englewood Cliffs, NJ: Prentice-Hall, May, 1996.
C.T. Lin, H.W. Nein, and J.Y. Hwu, “GA-based Noisy Speech Recognition using Two Dimensional Cepstrum,” accepted to appear in IEEE Trans. Speech and Audio Processing.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lin, CT., Liu, DJ., Wu, RC., Wu, GD. (2002). Noisy Speech Segmentation/Enhancement with Multiband Analysis and Neural Fuzzy Networks. In: Pal, N.R., Sugeno, M. (eds) Advances in Soft Computing — AFSS 2002. AFSS 2002. Lecture Notes in Computer Science(), vol 2275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45631-7_40
Download citation
DOI: https://doi.org/10.1007/3-540-45631-7_40
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43150-3
Online ISBN: 978-3-540-45631-5
eBook Packages: Springer Book Archive