Abstract
In this paper we propose several approaches to the language model (LM) integrated in a Continuous Speech Recognition (CSR) system. All of them are based on classes made up of phrases or segments of words. The proposed models were evaluated in terms of Word Error Rate (WER) on a corpus of spontaneous dialogues in Spanish. The experiments show that the CSR system performs better when segments of words are introduced into a class-based LM.
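For readers unfamiliar with the evaluation metric, the following is a minimal sketch of how Word Error Rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcription and the recognizer output, divided by the number of reference words. The function name `word_error_rate` and the example sentences are illustrative assumptions, not taken from the paper, and the paper's own scoring tool may differ in details such as text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; whitespace tokenization is assumed.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Made-up example: one of five reference words is deleted.
    print(word_error_rate("quiero un billete a madrid", "quiero billete a madrid"))
```

Because the denominator is the reference length, WER can exceed 1.0 when the hypothesis contains many insertions.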
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Justo, R., Inés Torres, M. (2007). Different Approaches to Class-Based Language Models Using Word Segments. In: Kurzynski, M., Puchala, E., Wozniak, M., Zolnierek, A. (eds) Computer Recognition Systems 2. Advances in Soft Computing, vol 45. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75175-5_53
DOI: https://doi.org/10.1007/978-3-540-75175-5_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75174-8
Online ISBN: 978-3-540-75175-5
eBook Packages: Engineering, Engineering (R0)