Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Different Approaches to Class-Based Language Models Using Word Segments

  • Conference paper
Computer Recognition Systems 2

Part of the book series: Advances in Soft Computing ((AINSC,volume 45))

  • 805 Accesses

Abstract

In this paper we propose different approaches to the LM integrated in a Continuous Speech Recognition system. All of them are based on classes that are made up of phrases or segments of words. The proposed models were evaluated in terms of Word Error Rate over a spontaneous dialogue corpus in Spanish. The experiments carried out show that better performance of the CSR system can be achieved introducing segments of words into a class-based LM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Brown, P.F., Pietra, V.J.D., Souza, P.V.d., Lai, J.C., Mercer, R.L.: Class-based n-gram Models of Natural Language. Computational Linguistics 18(4) (1992) 467–480

    Google Scholar 

  2. Niesler, T., Whittaker, E., Woodland, P.: Comparison of part-of-speech and automatically derived category-based language models for speech recognition. In: ICASSP’98, Seattle. (1998) 177–180

    Google Scholar 

  3. Zitouni, I.: Backoff hierarchical class n-gram language models: effectiveness to model unseen events in speech recognition. Computer Speech and Language 21(1) (2007) 99–104

    Article  Google Scholar 

  4. Deligne, S., Bimbot, F.: Language modeling by variable length sequences: Theoretical formulation and evaluation of multigrams. In: Proc. ICASSP’ 95, Detroit, MI (1995) 169–172

    Google Scholar 

  5. Marcu, D., Wong, W.: A phrase-based, joint probability model for statistical machine translation. (EMNLP), Philadelphia, PA, July 6–7 (2002)

    Google Scholar 

  6. Ries, K., Buo, F.D., Waibel, A.: Class phrase models for language modelling. In: Proc. ICSLP’ 96. Volume 1., Philadelphia, PA (oct 1996) 398–401

    Google Scholar 

  7. Garcia, P., Vidal, E.: Inference of k-testable languages in the strict sense and application to syntactic pattern recognition. IEEE Trans. Pattern Anal. Mach. Intell. 12(9) (1990) 920–925

    Article  Google Scholar 

  8. Torres, I., Varona, A.: k-tss language models in speech recognition systems. Computer Speech and Language 15(2) (2001) 127–149

    Article  Google Scholar 

  9. Caseiro, D., Trancoso, L: Transducer composition for on-the-fly lexicon and language model integration. In: Proceedings ASRU’2001, Madonna di Campiglio, Italy (December 2001)

    Google Scholar 

  10. Kuo, H.K.J., Reichl, W.: Phrase-based language models for speech recognition. In: Proceedings of EUROSPEECH 99. Volume 4. (September 1999) 1595–1598 Budapest, Hungary.

    Google Scholar 

  11. Och, F. J.: An efficient method for determining bilingual word classes. In: EACL99, Bergen (1999) 71–76

    Google Scholar 

  12. Justo, R., Torres, M.I., Benedi, J.M.: Category-based language model in a Spanish spoken dialogue system. Procesamiento del Lenguaje Natural 37(1) (2006) 19–24

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Justo, R., Inés Torres, M. (2007). Different Approaches to Class-Based Language Models Using Word Segments. In: Kurzynski, M., Puchala, E., Wozniak, M., Zolnierek, A. (eds) Computer Recognition Systems 2. Advances in Soft Computing, vol 45. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75175-5_53

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-75175-5_53

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75174-8

  • Online ISBN: 978-3-540-75175-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics