Abstract
In this work we propose an algorithm for continuous speech segmentation with text independency. In our approach we do not use feature vectors in order to detect phoneme boundaries, instead we only make use of the intensity measure. Obtaining with this a remarkable reduction in the amount of information needed and simplified rules on the processing. In the process only a pre-emphasis filter, and one strategy based on a distance measure with normalized fuzzy memberships over the signal patterns are used. In the preliminary results the method reaches up to 77.54% of correct segmentation with a 20 msec. accuracy and an over segmentation rate near to 0%. The algorithm implementation, the experiments, as well as some results are shown.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Hu, Z., Schalwyk, J., Bernard, E., Cole, R.: Speech recognition using syllabe-like units. In: ICSLP 1996, vol. 2, pp. 1117–1120 (1996)
Suh, Y., Lee, Y.: Phoneme Segmentation of Continuous speech using multi-layer perceptron. IEEE Trans. Speech and Audio Proc. 7(6), 697–708 (1999)
Ratsameewichai, S., Theera, N., Vilasdechanon, J., Uatrongjit, S., Likit-Anurucks, K.: Thai phoneme segmentation using dual-band energy contour., ITC- CSCC-2002
Pellom, B., Hansen, J.: Automatic segmentation of speech recorded in unknown noisy channel characteristics. Speech Communication 25, 97–116 (1998)
Schwartz, R., Makhoul, J.: Where the Phonemes Are: Dealing with Ambiguity in Acoustic Phonetic Recognition. IEEE Trans. ASSP 23, 50–53 (1975)
Zue, V.: The Use of Speech Knowledge in Automatic Speech Recognition. Proceeding of the IEEE 73, 1602–1615 (1985)
Weinstein, C., McCandless, S., Mondstein, L., Zue, V.: A system for Acoustic-Phonetic Analysis of Continuous Speech. IEEE Trans. ASSP 23, 54–67 (1975)
Grayden, D., Scordilis, M.: Phonemic Segmentation of Fluent Speech. In: Proc. ICASSP 1994, pp. 73–76 (1994)
Aversano, G., Esposito, A.: A new text-independent method for phoneme segmentation. In: Proc. the 44th IEEE Midwest Symposium on Circuits and Systems, vol. 2, pp. 516–519 (2001)
Aversano, G., Esposito, A.: Automátic Parameter Estimation for a Context-Independent Speech Segmentation Algorithm. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2002. LNCS (LNAI), vol. 2448, pp. 293–300. Springer, Heidelberg (2002)
Saraswhati, S., Geetha, T.V., Saravanan, K.: Integrating Language Independent Segmentation and Language Dependent Phoneme Based Modeling for Tamil Speech Recognition System. Asian Journal of Information Technology 5(1), 38–43 (2006)
Gold, B., Morgan, N.: Speech and audio signal processing. John Wiley, Chichester (2000)
Rodman, R.: Computer Speech Technology. Artech House Inc. (1999)
Boersma, P.: Praat, a system for doing phonetics by computer. Glot International 5(9/10), 341–345
Wang, D., Le, L., Zhang, H.: Speech segmentation without speech recognition. In: Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2003), Hong Kong, vol. I, pp. 468–471 (2003)
Petek, B., Andersen, O., Dalsgaard, P.: On the robust automatic segmentation of spontaneous speech. In: Proceedings of ICSLP 1996, pp. 913–916 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huerta-Hernández, L.D., Reyes-García, C.A. (2006). On the Processing of Fuzzy Patterns for Text Independent Phonetic Speech Segmentation. In: Martínez-Trinidad, J.F., Carrasco Ochoa, J.A., Kittler, J. (eds) Progress in Pattern Recognition, Image Analysis and Applications. CIARP 2006. Lecture Notes in Computer Science, vol 4225. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11892755_45
Download citation
DOI: https://doi.org/10.1007/11892755_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46556-0
Online ISBN: 978-3-540-46557-7
eBook Packages: Computer ScienceComputer Science (R0)