Digits Speech Recognition Based on Geometrical Learning

  • Conference paper
Advanced Data Mining and Applications (ADMA 2005)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3584)

Abstract

We investigate the use of independent component analysis (ICA) for speech feature extraction in digits speech recognition systems. We observe that this may be true for recognition tasks based on geometrical learning with little training data. In contrast to image processing, phase information is not essential for digits speech recognition. We therefore propose a new scheme in which the phase sensitivity is removed by using an analytical description of the ICA-adapted basis functions via the Hilbert transform. Furthermore, since the basis functions are not shift invariant, we extend the method to include a frequency-based ICA stage that removes redundant time shift information. The digits speech recognition results show promising accuracy: experiments show that the method based on ICA and geometrical learning outperforms HMM for different numbers of training samples.
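
The phase-removal step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a Hann-windowed sinusoid as a stand-in for an ICA-adapted basis function, builds its analytic form (the function plus j times its Hilbert transform) by discarding negative frequencies, and compares the magnitude of the real projection with the magnitude of the analytic projection. All function names and the toy signal are hypothetical, and a naive DFT is used only to keep the example dependency-free.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) discrete Fourier transform, adequate for short toy signals."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Inverse of dft()."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def analytic_signal(x):
    """Return x + j*H{x} by zeroing negative frequencies (even length assumed)."""
    n = len(x)
    if n % 2 != 0:
        raise ValueError("this sketch assumes an even number of samples")
    gain = [0.0] * n
    gain[0] = 1.0          # DC is kept once
    gain[n // 2] = 1.0     # Nyquist bin is kept once
    for k in range(1, n // 2):
        gain[k] = 2.0      # positive frequencies are doubled
    return idft([g * v for g, v in zip(gain, dft(x))])


def tone(n, cycles, phase):
    """Cosine with `cycles` periods over n samples and a given phase offset."""
    return [math.cos(2 * math.pi * cycles * t / n + phase) for t in range(n)]


def toy_basis(n, cycles):
    """Hann-windowed cosine: an assumed stand-in for an ICA-adapted basis function."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * t / n) for t in range(n)]
    return [w * c for w, c in zip(window, tone(n, cycles, 0.0))]


def real_response(basis, signal):
    """Magnitude of the plain inner product; depends on the signal's phase."""
    return abs(sum(b * s for b, s in zip(basis, signal)))


def envelope_response(basis, signal):
    """Magnitude of the inner product with the analytic basis function."""
    analytic = analytic_signal(basis)
    return abs(sum(a.conjugate() * s for a, s in zip(analytic, signal)))


if __name__ == "__main__":
    basis = toy_basis(64, 4)
    for phase in (0.0, math.pi / 4, math.pi / 2):
        signal = tone(64, 4, phase)
        print(phase, real_response(basis, signal), envelope_response(basis, signal))
```

In this toy setting the real projection shrinks toward zero as the input tone drifts out of phase with the basis function, while the analytic (envelope) projection stays constant, which is the sense in which the feature becomes phase insensitive.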

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cao, W., Pan, X., Wang, S., Hu, J. (2005). Digits Speech Recognition Based on Geometrical Learning. In: Li, X., Wang, S., Dong, Z.Y. (eds) Advanced Data Mining and Applications. ADMA 2005. Lecture Notes in Computer Science, vol 3584. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11527503_50

  • DOI: https://doi.org/10.1007/11527503_50

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27894-8

  • Online ISBN: 978-3-540-31877-4

  • eBook Packages: Computer Science, Computer Science (R0)
