Using PCA to Improve the Generation of Speech Keys

Nolazco-Flores, Juan A.; Mex-Perera, J. Carlos; Garcia-Perera, L. Paola; Sanchez-Torres, Brenda

doi:10.1007/11925231_104

Juan A. Nolazco-Flores²⁰,
J. Carlos Mex-Perera²⁰,
L. Paola Garcia-Perera²⁰ &
…
Brenda Sanchez-Torres²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4293))

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

995 Accesses

Abstract

This research shows the improvement obtained by including the principal component analysis as part of the feature production in the generation of a speech key. The main architecture includes an automatic segmentation of speech and a classifier. The first one, by using a forced alignment configuration, computes a set of primary features, obtains a phonetic acoustic model, and finds the beginnings and ends of the phones in each utterance. The primary features are then transformed according to both the phone model parameters and the phones segments per utterance. Before feeding these processed features to the classifier, the principal component analysis algorithm is applied to the data and a new set of secondary features is built. Then a support vector machine classifier generates an hyperplane that is capable to produce a phone key. Finally, by performing a phone spotting technique, the key is hardened. In this research the results for 10, 20 and 30 users are given using the YOHO database. 90% accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 239.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Robust phoneme classification for automatic speech recognition using hybrid features and an amalgamated learning model

Article 11 October 2016

A Review of Feature Extraction and Classification Techniques in Speech Recognition

Article 12 October 2023

Source and system features for phone recognition

Article 09 December 2014

References

Campbell Jr., J.P.: Features and Measures for Speaker Recognition. Ph.D. Dissertation, Oklahoma State University (1992)
Google Scholar
Beringer, N., Schiel, F.: Independent Automatic Segmentation of Speech by Pronunciation Modeling. In: Proc. of the ICPhS 1999, San Francisco, August 1999, pp. 1653–1656 (1999)
Google Scholar
Binnenpoorte, D., Goddijn, S., Cucchiarini, C.: How to Improve Human and Machine Transcriptions of Spontaneous Speech. In: Proceedings ISCA and IEEE Workshop on Spontaneous Speech Processing and Recognition (SSPR), Tokyo, Japan, April 2003, pp. 147–150 (2003)
Google Scholar
Cortes, C., Vapnik, V.: Support-vector network. Machine Learning 20, 273–297 (1995)
MATH Google Scholar
Furui, S.: Digital Speech Processing, Synthesis, and Recognition. Marcel Dekker, New York (2001)
Google Scholar
Huang, X., Acero, A., Hon, H.: Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice-Hall, Upper Saddle River (2001)
Google Scholar
Higgins, A., Porter, J.J., Bahler, L.: YOHO Speaker Authentication Final Report. ITT Defense Communications Division (1989)
Google Scholar
Young, S., Woodland, P.: HTK Hidden Markov Model Toolkit home page, http://htk.eng.cam.ac.uk/
Monrose, F., Reiter, M.K., Li, Q., Wetzel, S.: Cryptographic Key Generation From Voice. In: Proceedings of the IEEE Conference on Security and Privacy, Oakland, CA (2001)
Google Scholar
Osuna, E., Freund R., Girosi F.: Support vector machines: Training and applications. Technical Report AIM-1602, MIT A.I. Lab. (1996)
Google Scholar
Osuna, E., Freund, R., Girosi, F.: Training Support Vector Machines: An Application to Face Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 130–136 (1997)
Google Scholar
Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77(2), 257–286 (1989)
Article Google Scholar
Rabiner, L.R., Juang, B.-H.: Fundamentals of speech recognition. Prentice-Hall, New-Jersey (1993)
Google Scholar
Uludag, U., Pankanti, S., Prabhakar, S., Jain, A.K.: Biometric cryptosystems: issues and challenges. Proceedings of the IEEE 92(6) (2004)
Google Scholar
Joachims, T.: SVMLight: Support Vector Machine, SVM-Light Support Vector Machine, University of Dortmund (1999), http://svmlight.joachims.org/
Wan, V., Renals, S.: Speaker Verification Using Sequence Discriminant Support Vector Machines. IEEE Transactions on speech and audio processing 13(2) (March 2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, ITESM, Campus Monterrey., Av. Eugenio Garza Sada 2501 Sur, Col. Tecnológico, Monterrey, N.L., México, C.P. 6484
Juan A. Nolazco-Flores, J. Carlos Mex-Perera, L. Paola Garcia-Perera & Brenda Sanchez-Torres

Authors

Juan A. Nolazco-Flores
View author publications
You can also search for this author in PubMed Google Scholar
J. Carlos Mex-Perera
View author publications
You can also search for this author in PubMed Google Scholar
L. Paola Garcia-Perera
View author publications
You can also search for this author in PubMed Google Scholar
Brenda Sanchez-Torres
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Computing Research, National Polytechnic Institute, 07738, Mexico City, México
Alexander Gelbukh
Instituto Nacional de Astrofísica, Óptica y Electrónica (INAOE), Luis Enrique Erro No. 1, Sta. Ma. Tonanzintla, 72840, Puebla, México
Carlos Alberto Reyes-Garcia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nolazco-Flores, J.A., Mex-Perera, J.C., Garcia-Perera, L.P., Sanchez-Torres, B. (2006). Using PCA to Improve the Generation of Speech Keys. In: Gelbukh, A., Reyes-Garcia, C.A. (eds) MICAI 2006: Advances in Artificial Intelligence. MICAI 2006. Lecture Notes in Computer Science(), vol 4293. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11925231_104

Download citation

DOI: https://doi.org/10.1007/11925231_104
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49026-5
Online ISBN: 978-3-540-49058-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Using PCA to Improve the Generation of Speech Keys

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Robust phoneme classification for automatic speech recognition using hybrid features and an amalgamated learning model

A Review of Feature Extraction and Classification Techniques in Speech Recognition

Source and system features for phone recognition

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Using PCA to Improve the Generation of Speech Keys

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Robust phoneme classification for automatic speech recognition using hybrid features and an amalgamated learning model

A Review of Feature Extraction and Classification Techniques in Speech Recognition

Source and system features for phone recognition

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation