Abstract
This work presents an infant cry automatic recognizer development, with the objective of classifying three kinds of infant cries, normal, deaf and asphyxia from recently born babies. We use extraction of acoustic features such as LPC (Linear Predictive Coefficients) and MFCC (Mel Frequency Cepstral Coefficients) for the cry’s sound waves, and a genetic feature selection system combined with a feed forward input delay neural network, trained by adaptive learning rate back-propagation. We show a comparison between Principal Component Analysis and the proposed genetic feature selection system, to reduce the feature vectors. In this paper we describe the whole process; in which we include the acoustic features extraction, the hybrid system design, implementation, training and testing. We also show the results from some experiments, in which we improve the infant cry recognition up to 96.79% using our genetic system. We also show different features extractions that result on vectors that go from 145 up to 928 features, from cry segments of 1 and 3 seconds respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Orozco Garcia, J., Reyes Garcia, C.A.: Mel-Frequency Cepstrum coefficients Extraction from Infant Cry for Classification of Normal and Pathological Cry with Feed-forward Neural Networks. In: ESANN 2003, Bruges, Belgium (2003)
Petroni, M., Malowany, A.S., Johnston, C.C., Stevens, B.J.: Identification of pain from infant cry vocalizations using artificial neural networks (ANNs). In: The International Society for Optical Engineering, vol. 2492, Part two of two. Paper #, pp. 2492–79 (1995)
Ekkel, T.: Neural Network-Based Classification of Cries from Infants Suffering from Hypoxia-Related CNS Damage, Master’s Thesis. University of Twente, The Netherlands (2002)
Cano, S.D., Coello, D.I.E.y.E.: El Uso de los Mapas Auto-Organizados de Kohonen en la Clasificación de Unidades de Llanto Infantil. In: Voice Processing Group, 1st Workshop AIRENE, Universidad Católica del Norte, Chile, pp. 24–29 (1999)
Gold, B., Morgan, N.: Speech and Audio Signal Processing. Processing and perception of speech and music. John Wiley & Sons, Chichester (2000)
Santo Orcero, D.: Estrategias Evolutivas (2004), http://www.orcero.org/irbis/disertacion/node217.html
Hussain, T.S.: An Introduction to Evolutionary Computation, Department of Computing and Information Science Queens University, Kingston, Ont. K7L 3N6 (1998)
DARPA Neural Network Study, p. 60. AFCEA International Press (1988)
Fu, L.: Neural Networks in Computer Intelligence. McGraw-Hill International Editions, Computer Science Series (1994)
Neural Network Toolbox Guide, Matlab V.6.0.8, Developed by MathWoks, Inc.
Boersma, P., Weenink, D.: Praat v. 4.0.8. A system for doing phonetics by computer. Institute of Phonetic Sciences of the University of Amsterdam (February 2002)
Weibel, A., Hanazawa, T., Hinton, G., Shikano, K., Lang, K.J.: Phoneme Recognition Using Time Delay Neural Networks. IEEE Trans. Acoustics, Speech, Signal Proc. ASSP-37, 332–339 (1989)
Reyes Galaviz, O.F.: Clasificación de Llanto de Bebés para Identificación de Hipoacúsia y Asfixia por medio de un Sistema Híbrido (Genético – Neuronal). In: Master’s Thesis on Computer Science, at the Apizaco Institute of Technology, ITA (March 2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Galaviz, O.F.R., García, C.A.R. (2005). Infant Cry Classification to Identify Hypo Acoustics and Asphyxia Comparing an Evolutionary-Neural System with a Neural Network System. In: Gelbukh, A., de Albornoz, Á., Terashima-Marín, H. (eds) MICAI 2005: Advances in Artificial Intelligence. MICAI 2005. Lecture Notes in Computer Science(), vol 3789. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11579427_97
Download citation
DOI: https://doi.org/10.1007/11579427_97
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29896-0
Online ISBN: 978-3-540-31653-4
eBook Packages: Computer ScienceComputer Science (R0)