Abstract
In this paper the Speech-to-Speech Translation (SST) system, which is mainly focused on translation from English to Dravidian languages (Tamil and Malayalam) has been proposed. Three major techniques involved in SST system are Automatic continuous speech recognition, machine translation, and text-to-speech synthesis system. In this paper automatic Continuous Speech Recognition (CSR) has been developed based on the Auto Associative Neural Network (AANN), Support Vector Machine (SVM) and Hidden Markov Model (HMM). The HMM yields better results compared with SVM and AANN. Hence the HMM based Speech recognizer for English language has been taken. We propose a hybrid Machine Translation (MT) system (combination of Rule based and Statistical) for converting English to Dravidian languages text. A syllable based concatenative Text To Speech Synthesis (TTS) for Tamil and Malayalam has been proposed. AANN based prosody prediction has been done for the Tamil language which is used to improve the naturalness and intelligibility. The domain is restricted to sentences that cover the announcements in the railway station, bus stop and airport. This work is framed a novel translation method for English to Dravidian languages. The improved performance of each module HMM based CSR, Hybrid MT and concatenative TTS increases the overall speech translation performance. This proposed speech translation system can be applied to English to any Indian languages if we train and create a parallel corpus for those languages.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Arora K, Arora S, Roy MK (2013) Speech to speech translation: a communication boon. CSI Trans ICT 1(3):207–213
Condon S, Arehart M, Parvaz D, Sanders G, Doran C, Aberdeen J (2012) Evaluation of 2-way iraqi arabic - english speech translation systems using automated metrics. Mach Transl Springer 26(1):159–176
Cui J, Deng Y, Zhou B (2009) Reinforcing language model for speech translation with auxiliary data. ASRU IEEE502–506
Dutoit T (1997) An introduction to text-to-speech synthesis. Kulwer Academic Publishers
Dyer C, Muresan S, Resnik P (2008) Generalizing word lattice translation. In: Proceedings of ACL 08: HLT, pp 1012–1020
Ettelaie E, Georgiou PG, Narayanan SS (2013) Unsupervised data processing for classifier-based speech translator. Comput Speech Lang 27(2):438–454
Gao Y, Gu L, Zhou B, Sarikaya R, Afity M, Juo HK, Zhong W (2006) IBMmaster system: Multilingual automatic speech to speech translator. In: Proceedings of the workshop on medical speech translation at HLT-NAACL, pp 57–60
Guan Y, Zheng L, Tian J (2010) Real-time speaker adapted speech to speech translation system in mobile environment. In: 10th international conference on signal processing (ICSP). IEEE, pp 577–580
Juang BH, Rabiner LR (2005) Automatic speech recognition – a brief history of the technology development
Kalyanasundaram K A brief History and Features of the Tamil Language. http://www.tamilelibrary.org/teli/tamil7.html, Accessed April 5, 2015
Kao CL, Saleem S, Prasad R, Choi F, Natrajan P, Stallard D, Krstovski K, Kamali M (2008) Rapid development of an english/farsi speech to speech translation system. In: Proceedings of IWSLT, pp 166–173
Lewis MP, Simons GF, Fennig CD Ethnologue: languages of the World. http://www.ethnologue.com/country/IN, Accessed April 5, 2015
Matusov E, Ney H (2011) Lattice-based ASR-MT interface for speech translation. IEEE Trans Audio Speech Lang Process 19(4):721–732
Nakamura S (2009) Overcoming the language barriers with speech translation technology. Sci Technol Q Rev 31:35–48
Nakamura S, Markov K, Nakaiwa H, Kikui G, Kawai H, Jitsuhiro T, Zhang JS (2006) The ATR multilingual speech to speech translation system. IEEE Trans Acoust Speech Signal Process 14(2):365–376
Papineni K, Roukos S, Ward T, Zhu WJ (2002) BLEU: A method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting on association for computational linguistics, ACL ’02. Association for Computational Linguistics, Stroudsburg, pp 311–318
Rabiner L, Juang BH (1993) Fundamentals of speech recognition. Prentice-Hall, Inc., Upper Saddle River
Rajaram BSR, Ramakrishnan AG, Kumar HRS (2013) An accessible translation system between simple kannada and tamil sentences. In: Proceedings of 6th language and technology conference
Sakti PS, Maia M, Sakai R, Kimura S, Ashikari N, Sumita Y, Nakamura E (2009) Toward translating indonesian spoken utterances to/from other languages. In: International conference on oriental COCOSDA, pp 137–142
Saraswathi S, Geetha TV (2010) Design of language models at various phases of tamil speech recognition system. Int J Eng Sci Technol 2(5):244–257
Szekely E, Steiner I, Ahmed Z, Carson-Berndsen J (2013) Facial expression based affective speech translation. J Multimodal User Interfaces 8(1):87–96
Weaver W (1955) Machine translation of languages, chap. translation. The MIT Press, pp 15–23
Yegnanarayana B (1999) Artificial neural networks. Prentice-Hall, New Delhi
Zhou B, Cui X, Huang S, Cmejrek M, Zhang W (2013) The IBM speech-to-speech translation system for smartphone: Improvements for resource-constrained tasks. Comput Speech Lang 27(2):397–418
Zhou B, Gao Y, Sorensen J, Diao Z, Picheny M (2002) Statistical natural language generation for speech to speech machine translation systems. In: International conference on spoken language processing, vol 3. Denver, pp 1897–1900
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sangeetha, J., Jothilakshmi, S. Speech translation system for english to dravidian languages. Appl Intell 46, 534–550 (2017). https://doi.org/10.1007/s10489-016-0846-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-016-0846-3