Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1499224.1499269acmconferencesArticle/Chapter ViewAbstractPublication PagesicicConference Proceedingsconference-collections
research-article

Phonetic decomposition for speech recognition of lesser-studied languages

Published: 20 February 2009 Publication History

Abstract

This paper deals with voice transcription systems for lesser-studied languages. In particular, it deals with creating phonetic decompositions for words in these languages, an important step in creating a voice transcription system. The two languages cited here as examples of lesser-studied languages are the San Juan Quiahije variety of Chatino and Vlax Romani. For these languages, an English-based phonemic decomposition is inadequate, because lesser-studied languages are not written with the orthographic rules used for English. The phonemic decomposition proposed here is composed of two stages: separation of words into sounds and expansion of sounds into phonemes.

References

[1]
A. Varela, H. Cuayáhuitl and J.A. Nolazco-Flores, "Creating a Mexican Spanish Version of the CMU Sphinx-III Speech Recognition System", Progress in Pattern Recognition, Speech and Image Analysis, 251--258, Springer 2003
[2]
Hsiao-Wuen Hon; Baosheng Yuan; Yen-Lu Chow; Narayan, S.; Kai-Fu Lee, "Towards large vocabulary Mandarin Chinese speech recognition," Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., vol.1, 19-22 Apr 1994
[3]
Juan A. Nolazco-Flores, Luis R. Salgado-Garza  and Marco Peña-Díaz, "Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl Languages," Lecture Notes in Computer Science:  Pattern Recognition and Image Analysis, 595--602, Springer, 2005
[4]
Peter Ladefoged, A Course in Phonetics, 4th Edition, Heinle & Heinle, 2001.
[5]
Carnegie Mellon Speech Group, Sphinx Knowledge Base Tool, http://www.speech.cs.cmu.edu/tools/lmtool.html and Simple LM from http://sourceforge.net/project/showfiles.php?group_id=1904
[6]
Emiliana Cruz and Tony Woodbury, "El sandhi de los tonos en el Chatino de Quiahije," Las memorias del Congreso de Idiomas Indígenas de Latinoamérica-II. Archive of the Indigenous Languages of Latin America, 2006. http://www.ailla.utexas.org/site/cilla2/ECruzWoodbury_CILLA2_sandhi.pdf.
[7]
Ian Hancock, A Handbook of Vlax Romani, Slavica, 1995.
[8]
Vijay John. A Method for Enhancing Search Using Transliteration of Mandarin Chinese. Texas Linguistics Society 10, 2006 University of Texas. http://uts.cc.utexas.edu/~tls/2006tls/papers/john_tlsx.pdf.

Cited By

View all
  • (2015)Multilingual Voice Control for Endoscopic ProceduresInternet of Things. User-Centric IoT10.1007/978-3-319-19656-5_33(229-235)Online publication date: 26-Jun-2015

Index Terms

  1. Phonetic decomposition for speech recognition of lesser-studied languages

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      IWIC '09: Proceedings of the 2009 international workshop on Intercultural collaboration
      February 2009
      342 pages
      ISBN:9781605585024
      DOI:10.1145/1499224
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 20 February 2009

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. chatino
      2. phonetics
      3. romani
      4. speech
      5. sphinx

      Qualifiers

      • Research-article

      Conference

      IWIC 09
      Sponsor:
      IWIC 09: International Workshop on Intercultural Collaboration 2009
      February 20 - 21, 2009
      California, Palo Alto, USA

      Acceptance Rates

      Overall Acceptance Rate 47 of 77 submissions, 61%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 24 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2015)Multilingual Voice Control for Endoscopic ProceduresInternet of Things. User-Centric IoT10.1007/978-3-319-19656-5_33(229-235)Online publication date: 26-Jun-2015

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media