Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Noisy Speech Segmentation/Enhancement with Multiband Analysis and Neural Fuzzy Networks

  • Conference paper
  • First Online:
Advances in Soft Computing — AFSS 2002 (AFSS 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2275))

Included in the following conference series:

Abstract

Background noise added to speech can decrease the performance of speech segmentation and enhancement. To solve this problem, new methods have been developed in this thesis. First, a new speech segmentation method (ATF-based SONFIN algorithm) is proposed in fixed noise-level environment. This method contains the multiband analysis and a neural fuzzy network, and it achieves higher recognition rate than the TF-based robust algorithm by 5%. In addition, a new speech segmentation method called RTF-based RSONFIN algorithm is proposed for variable noise-level environment. The RTF-based RSONFIN algorithm contains a recurrent neural fuzzy network. This method contains the multiband analysis and achieve higher recognition rate than the TFbased robust algorithm by 12%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. L.F. Lamel, L.R. Rabiner, A.E. Rosenberg, and J.G. Wilson, “An improved endpoint detector for isolated word recognition,” IEEE ASSP Mag., vol.29, pp.777–785, August, 1981.

    Google Scholar 

  2. Y. Qi and B.R. Hunt, “Voiced-unvoiced-silence classification of speech using hybrid features and a network classifier,” IEEE Tran. Speech Audio Processing, vol.1, pp. 250–255, April, 1993.

    Google Scholar 

  3. B. Reaves, “Comments on an improved endpoint detector for isolated word recognition,” IEEE Trans. Signal Processing, vol.39, pp.526–527, February, 1991.

    Google Scholar 

  4. J.C. Junqua, B. Mak, and B. Reaves, “A robust algorithm for word boundary detection in the presence of noise,” IEEE Trans. Speech Audio Processing, vol.2, pp.406–412, July, 1994.

    Google Scholar 

  5. T. Ghiselli-Crippa and A. El-Jaroudi, “A fast neural net training algorithm and its application to voiced-unvoiced-silence classification of speech,” ICASSP91, vol.1, pp.441–444, 1991.

    Google Scholar 

  6. C.F. Juang and C.T. Lin, “An on-line self-constructing neural fuzzy inference network and its application,” IEEE Trans. Fuzzy System, vol. 6, pp. 12–32, February 1998.

    Google Scholar 

  7. C.F. Juang and C.T. Lin, “A Recurrent Self-Organizing Neural Fuzzy Inference Network”, IEEE Trans. Neural Networks, vol. 10, no. 4, pp. 828–845, July, 1999.

    Article  Google Scholar 

  8. C.T. Lin, Neural Fuzzy Control Systems with Structure and Parameter Learning, World Scientific, 1994.

    Google Scholar 

  9. C.T. Lin and C.S.G.J Lee, Neural Fuzzy Systems: A Neural-Fuzzy Synergism to Intelligent Systems, Englewood Cliffs, NJ: Prentice-Hall, May, 1996.

    Google Scholar 

  10. C.T. Lin, H.W. Nein, and J.Y. Hwu, “GA-based Noisy Speech Recognition using Two Dimensional Cepstrum,” accepted to appear in IEEE Trans. Speech and Audio Processing.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lin, CT., Liu, DJ., Wu, RC., Wu, GD. (2002). Noisy Speech Segmentation/Enhancement with Multiband Analysis and Neural Fuzzy Networks. In: Pal, N.R., Sugeno, M. (eds) Advances in Soft Computing — AFSS 2002. AFSS 2002. Lecture Notes in Computer Science(), vol 2275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45631-7_40

Download citation

  • DOI: https://doi.org/10.1007/3-540-45631-7_40

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43150-3

  • Online ISBN: 978-3-540-45631-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics