Morpheme-Based Automatic Speech Recognition of Basque

Guijarrubia, Víctor G.; Torres, M. Inés; Justo, Raquel

doi:10.1007/978-3-642-02172-5_50

Víctor G. Guijarrubia²⁰,
M. Inés Torres²⁰ &
Raquel Justo²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5524))

Included in the following conference series:

Iberian Conference on Pattern Recognition and Image Analysis

1812 Accesses

Abstract

In this work, we focus on studying a morpheme-based speech recognition system for Basque, an highly inflected language that is official language in the Basque Country (northern Spain). Two different techniques are presented to decompose the words into their morphological units. The morphological units are then integrated into an Automatic Speech Recognition System, and those systems are then compared to a word-based approach in terms of accuracy and processing speed. Results show that whereas the morpheme-based approaches perform similarly from an accuracy point of view, they can be significantly faster than the word-based system when applied to a weather-forecast task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Turkish Speech Recognition

Lexical modeling for the development of Amharic automatic speech recognition systems

Article 03 May 2023

Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems

References

Kirchhoff, K., Vergyri, D., Bilmes, J., Duh, K., Stolcke, A.: Morphology-based language modeling for conversational Arabic speech recognition. Computer Speech and Language 20, 589–608 (2006)
Article Google Scholar
Rotovnik, T., Maučec, M.S., Kačič, Z.: Large vocabulary continuous speech recognition of an inflected language using stems and endings. Speech Communication 49(6), 437–452 (2007)
Article Google Scholar
Kirsimäki, T., Creutz, M., Siivola, V., Kurimo, M., Virpioja, S., Pylkkönen, J.: Unlimited vocabulary speech recognition with morph language models applied to Finnish. Computer Speech and Language 20, 515–541 (2006)
Article Google Scholar
Arisoy, E., Dutag̈aci, H., Arslan, L.M.: A unified language model for large vocabulary continuous spech recognition of Turkish. Signal Processing 86, 2844–2862 (2006)
Article MATH Google Scholar
Kwon, O.W., Park, J.: Korean large vocabulary continuous speech recognition with morpheme-based recognition units. Speech Communication 39, 287–300 (2003)
Article MATH Google Scholar
Creutz, M., Lagus, K.: Inducing the morphological lexicon of a natural language from unannotated text. In: Proceedings of the International and Interdisciplinary Conference on Aadaptive Knowledge Representation and Reasoning (AKRR), Espoo, Finland (June 2005)
Google Scholar
Pérez, A., Torres, M.I., Casacuberta, F., Guijarrubia, V.: A Spanish-Basque weather forecast corpus for probabilistic speech translation. In: 5th SALTMIL Workshop on Minority Languages, Genoa, pp. 99–101 (May 2006)
Google Scholar
Kneissler, J., Klakow, D.: Speech recognition for huge vocabularies by using optimized sub-word units. In: Proc. Eurospeech 2001, Aalborg, pp. 69–72 (2001)
Google Scholar
Guijarrubia, V., Torres, M.I., Rodríguez, L.J.: Evaluation of a spoken phonetic database in Basque language. In: Proceedings of LREC, Lisbon, vol. 6, pp. 2127–2130 (2004)
Google Scholar
Bisani, M., Ney, H.: Bootstrap estimates for confidence intervals in ASR performance evaluation. In: Proc. IEEE ICASSP, vol. 1, pp. 409–412 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Electricidad y Electrónica, Universidad del País Vasco, Apartado 644, 48080, Bilbao, Spain
Víctor G. Guijarrubia, M. Inés Torres & Raquel Justo

Authors

Víctor G. Guijarrubia
View author publications
You can also search for this author in PubMed Google Scholar
M. Inés Torres
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Justo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Systems and Robotics, Dept. of Electrical and Computer Eng.-Polo II, University of Coimbra, 3030-290, Coimbra, Portugal
Helder Araujo
Institute of Biomedical Engineering, Faculty of Engineering, University of Porto, Rua Dr. Roberto Frias, 4200-465, Porto, Portugal
Ana Maria Mendonça
Dept. de Electrónica e Telecomunicações / IEETA, Universidade de Aveiro, Signal Processing Lab, DETI/IEETA, University of Aveiro, 3810–193, Aveiro, Portugal
Armando J. Pinho
Departamento de Electricidad y Electrónica, Fac. Ciencia y Tecnología - UPV/EHU, Universidad del País Vasco, Apartado 644, 48080, Bilbao, Spain
María Inés Torres

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guijarrubia, V.G., Torres, M.I., Justo, R. (2009). Morpheme-Based Automatic Speech Recognition of Basque. In: Araujo, H., Mendonça, A.M., Pinho, A.J., Torres, M.I. (eds) Pattern Recognition and Image Analysis. IbPRIA 2009. Lecture Notes in Computer Science, vol 5524. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02172-5_50

Download citation

DOI: https://doi.org/10.1007/978-3-642-02172-5_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02171-8
Online ISBN: 978-3-642-02172-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Morpheme-Based Automatic Speech Recognition of Basque

Abstract

Access this chapter

Preview

Similar content being viewed by others

Turkish Speech Recognition

Lexical modeling for the development of Amharic automatic speech recognition systems

Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Morpheme-Based Automatic Speech Recognition of Basque

Abstract

Access this chapter

Preview

Similar content being viewed by others

Turkish Speech Recognition

Lexical modeling for the development of Amharic automatic speech recognition systems

Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation