research-article

Phonetic decomposition for speech recognition of lesser-studied languages

Author:

Vijay JohnAuthors Info & Claims

IWIC '09: Proceedings of the 2009 international workshop on Intercultural collaboration

Pages 253 - 256

https://doi.org/10.1145/1499224.1499269

Published: 20 February 2009 Publication History

Get Access

Abstract

This paper deals with voice transcription systems for lesser-studied languages. In particular, it deals with creating phonetic decompositions for words in these languages, an important step in creating a voice transcription system. The two languages cited here as examples of lesser-studied languages are the San Juan Quiahije variety of Chatino and Vlax Romani. For these languages, an English-based phonemic decomposition is inadequate, because lesser-studied languages are not written with the orthographic rules used for English. The phonemic decomposition proposed here is composed of two stages: separation of words into sounds and expansion of sounds into phonemes.

References

[1]

A. Varela, H. Cuayáhuitl and J.A. Nolazco-Flores, "Creating a Mexican Spanish Version of the CMU Sphinx-III Speech Recognition System", Progress in Pattern Recognition, Speech and Image Analysis, 251--258, Springer 2003

Google Scholar

[2]

Hsiao-Wuen Hon; Baosheng Yuan; Yen-Lu Chow; Narayan, S.; Kai-Fu Lee, "Towards large vocabulary Mandarin Chinese speech recognition," Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., vol.1, 19-22 Apr 1994

Google Scholar

[3]

Juan A. Nolazco-Flores, Luis R. Salgado-Garza and Marco Peña-Díaz, "Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl Languages," Lecture Notes in Computer Science: Pattern Recognition and Image Analysis, 595--602, Springer, 2005

Digital Library

Google Scholar

[4]

Peter Ladefoged, A Course in Phonetics, 4th Edition, Heinle & Heinle, 2001.

Google Scholar

[5]

Carnegie Mellon Speech Group, Sphinx Knowledge Base Tool, http://www.speech.cs.cmu.edu/tools/lmtool.html and Simple LM from http://sourceforge.net/project/showfiles.php?group_id=1904

Google Scholar

[6]

Emiliana Cruz and Tony Woodbury, "El sandhi de los tonos en el Chatino de Quiahije," Las memorias del Congreso de Idiomas Indígenas de Latinoamérica-II. Archive of the Indigenous Languages of Latin America, 2006. http://www.ailla.utexas.org/site/cilla2/ECruzWoodbury_CILLA2_sandhi.pdf.

Google Scholar

[7]

Ian Hancock, A Handbook of Vlax Romani, Slavica, 1995.

Google Scholar

[8]

Vijay John. A Method for Enhancing Search Using Transliteration of Mandarin Chinese. Texas Linguistics Society 10, 2006 University of Texas. http://uts.cc.utexas.edu/~tls/2006tls/papers/john_tlsx.pdf.

Google Scholar

Cited By

View all

Afonso SLaranjo IBraga JAlves VNeves J(2015)Multilingual Voice Control for Endoscopic ProceduresInternet of Things. User-Centric IoT10.1007/978-3-319-19656-5_33(229-235)Online publication date: 26-Jun-2015
https://doi.org/10.1007/978-3-319-19656-5_33

Index Terms

Phonetic decomposition for speech recognition of lesser-studied languages
1. Applied computing
  1. Arts and humanities
    1. Language translation
2. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing

Recommendations

Large vocabulary continuous speech recognition for Urdu
FIT '10: Proceedings of the 8th International Conference on Frontiers of Information Technology

This paper presents the development of acoustic and language models for robust Urdu speech recognition using the CMU Sphinx Open Source Toolkit for speech recognition. Three models have been developed incrementally, with the addition of speech data of ...
Psycho-acoustics inspired automatic speech recognition
Abstract
Understanding the human spoken language recognition process is still a far scientific goal. Nowadays, commercial automatic speech recognisers (ASRs) achieve high performance at recognising clean speech, but their approaches are poorly ...
Highlights
- We propose a novel Automatic Speech Recognizer inspired by psycho-acoustic studies.
Automatic phonetic transcription by phonological derivation
PROPOR'12: Proceedings of the 10th international conference on Computational Processing of the Portuguese Language

Automatic phonetic transcription tools usually perform phonetic transcriptions directly from orthographic representations. Although these approaches often achieve good results, theoretical studies suggest that including morphophonological knowledge ...

Comments

Information & Contributors

Information

Published In

IWIC '09: Proceedings of the 2009 international workshop on Intercultural collaboration

February 2009

342 pages

ISBN:9781605585024

DOI:10.1145/1499224

General Chairs:
Susan Fussell
Carnegie Mellon University, USA
,
Pamela Hinds
Stanford University, USA
,
Toru Ishida
Kyoto University, Japan

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 February 2009

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

IWIC 09

Sponsor:

IWIC 09: International Workshop on Intercultural Collaboration 2009

February 20 - 21, 2009

California, Palo Alto, USA

Acceptance Rates

Overall Acceptance Rate 47 of 77 submissions, 61%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
371
Total Downloads

Downloads (Last 12 months)3
Downloads (Last 6 weeks)1

Reflects downloads up to 24 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Afonso SLaranjo IBraga JAlves VNeves J(2015)Multilingual Voice Control for Endoscopic ProceduresInternet of Things. User-Centric IoT10.1007/978-3-319-19656-5_33(229-235)Online publication date: 26-Jun-2015
https://doi.org/10.1007/978-3-319-19656-5_33

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Large vocabulary continuous speech recognition for Urdu

Psycho-acoustics inspired automatic speech recognition

Automatic phonetic transcription by phonological derivation

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations