Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Text-Text Neural Machine Translation: A Survey

  • Published:
Optical Memory and Neural Networks Aims and scope Submit manuscript

Abstract

We present a review of Neural Machine Translation (NMT), which has got much popularity in recent decades. Machine translation eased the way we do massive language translation in the new digital era. Otherwise, language translation would have been manually done by human experts. However, manual translation is very costly, time-consuming, and prominently inefficient. So far, three main Machine Translation (MT) techniques have been developed over the past few decades. Viz rule-based, statistical, and neural machine translations. We have presented the merits and demerits of each of these methods and discussed a more detailed review of articles under each category. In the present survey, we conducted an in-depth review of existing approaches, basic architecture, and models for MT systems. Our effort is to shed light on the existing MT systems and assist potential researchers, in revealing related works in the literature. In the process, critical research gaps have been identified. This review intrinsically helps researchers who are interested in the study of MT.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.

Similar content being viewed by others

REFERENCES

  1. Arivai, H. and Joorg, T., Rule-based machine translation from english to finnish, Proc. of the Conference on Machine Translation, vol. 2: Shared Papers, Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 323–329.

  2. Phan-Vu, H.-H., Nguyen, V.-N., Tran, V.-T., and Do, P.-T., Towards state-of-the-art English-Vietnamese neural nachine translation, Proc. of the 8th International Symposium on Information and Communication Technology-SoICT, 2017, pp. 120–126; https://doi.org/10.1145/3155133.3155205

  3. Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., and Dyer, C., Moses: Open-source toolkit for statistical machine translation, in Proc. of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions, 2007, pp. 177–180.

  4. Koehn, P., Och, F.J., and Marcu, D., Statistical phrase-based translation, in Proc. of the Human Language Technology Conference of the North American Chapter, Association for Computational Linguistics, 2003, pp. 127–133.

  5. Zhang, B., Xiong, D., Su, J., and Duan, H., A context-aware recurrent encoder for neural machine translation, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2017, vol. 25, no. 12, pp. 2424–2432, DOI.org/10.1109/TASLP. 2017.2751420

  6. Huang, Jin-Xia, Lee, Kyung-Soon, Kim, and Young-Kil, Hybrid Translation with classification: Revisiting rule-based and Neural Machine Translation, Electronics, 2020, vol. 9, no. 2, p. 201. https://doi.org/10.3390/electronics9020201

    Article  Google Scholar 

  7. Vathsala, M.K. and Holi, G., RNN based machine translation and transliteration for Twitter data, Int. J. Speech Technol., 2020. https://doi.org/10.1007/s10772-020-09724-9

  8. Yonghui Wu, Mike, S., Zhifeng, C., Quoc V. Le, and Norouzi, Mohammad, Google Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, 2016. arXiv: 1609.08144v2.

  9. Mohamed, M., Bidirectional internal memory gate recurrent neural networks for spoken language understanding, Int. J. Speech Technol., 2020. https://doi.org/10.1007/s10772-020-09708-9

  10. Chen, K., Zhao, T., Yang, M., Liu, L., Tamura, A., Wang, R., Utiyama, M., and Sumita, E., A neural approach to source dependence based context model for statistical machine translation, IEEE/ACM Trans. Audio, Speech, Language Process., 2018, vol. 26, no. 2, pp. 266–280. https://doi.org/10.1109/TASLP.2017.2772846

    Article  Google Scholar 

  11. Minh-Thang, L., Hieu, P., and Christopher, D., Effective approaches to attention-based Neural Machine Translation, 2015. arXiv: 1508.04025v5.

  12. Shah and Bakrola, V., Neural Machine Translation System of Indic Languages – An attention-based approach, 2nd Int. Conf. on Advanced Computational and Communication Paradigms (ICACCP), Gangtok, India, 2019.

  13. Daniel, T., Nivranshu, P., Bharathi, R.C., Mariam, M., Michael, A., Juan, A., and Noe, C., Leveraging rule-based machine translation knowledge for under-resourced Neural Machine Translation Models, Proc. of MT Summit XVII, Dublin, Ireland, 2019, vol. 2, pp. 125–133.

  14. Alawneh, M.F., Sembok, T.M., and Mohd, M., Grammar-based and example-based techniques in machine translation from English to Arabic, IEEE 2013 5th Int. Conf. on Information and Communication Technology for the Muslim World 2013 (ICT4M) – Rabat (2013.3.26–2013.3.27), 2013, pp. 1–6. https://doi.org/10.1109/ICT4M.2013. 6518910

  15. Brown, P.F., Cocke, J., Della Pietra, S.A., Della Pietra, V.J., Jelinek, F., Lafferty, J.D., and Roossin, P.S., A statistical approach to machine translation, Comput. Linguist., 1990, vol. 16, no. 2, pp. 79–85.

    Google Scholar 

  16. Million, M. and Yitayew, S., English-Afaan Oromo Machine Translation, Int. J. Comput. Linguist. (IJCL), 2018, vol. 9, Issue 1.

  17. Sisay, A., English-Afaan Oromo Machine Translation: An experiment using a statistical approach, Thesis, Addis Ababa, AAU, 2009.

  18. Solomon, T. Michael, M., Martha, Y., Million, M., Solomon, A., Wondwossen, M., Yaregal, A., Hafte, A., Biniyam, E., Tewodros, A., Wondimagegnhue, T., Tsegaye, A., and Seifedin, Sh., Parallel corpora for bi-directional statistical machine translation for seven Ethiopian language pairs, Proc. of the First Workshop on Linguistic Resources for Natural Language Processing, Santa Fe, New Mexico, USA, 2018, pp. 83–90.

  19. Yitayew, S., Million, M., and Wendewesen, E., Optimal alignment for bi-directional Afaan Oromo-English statistical machine translation, Int. J. Adv. Res. Publ., 2019, vol. 3, Iss. 7, ISSN: 2456-9992.

  20. Bahdanau, D, Cho, K.H., and Bengio, Y., Neural Machine Translation by jointly learning to align and translate, 3rd Int. Conf. on Learning Representations, ICLR 2015, San Diego, USA, 2015.

  21. Zhang, B., Xiong, D., and Su, J., Neural Machine Translation with deep attention, IEEE Trans. Pattern Anal. Mach. Intell., 2018, vol. 1-1. https://doi.org/10.1109/tpami.2018.2876404

  22. Han, D., Li, J., Li, Y., Zhang, M., and Zhou, G., Explicitly modelling word translations in Neural Machine Translation, ACM Trans. Asian Low-Res. Lang. Inform. Process., 2019, vol. 19, no. 1, pp. 1–17, https://doi.org/10.1145/3342353

    Article  Google Scholar 

  23. ShweSin, Y.M., Soe, K.M., and Htwe, K.Y., Large scale Myanmar to English Neural Machine Translation system, IEEE 7th Global Conf. on Consumer Electronics (GCCE), Nara, Japan, 2018, ISSN: 2378-8143.https://doi.org/10.1109/gcce.2018.8574614

  24. Arfaso, B., Bi-directional english-Afan Oromo Machine Translation using convolutional Neural Network, Thesis, AAU, Addis Ababa, 2019.

  25. S. Kumar, N.K. and Malarvizhi, N., Bi-directional LSTM-CNN combined method for sentiment analysis in part of speech tagging (PoS), Int. J. Speech Technol., 2020, vol. 23, pp. 373–380. https://doi.org/10.1007/s10772-020-09716-9

    Article  Google Scholar 

  26. Sennrich, R., Barry, H., and Alexandra, B., Neural Machine Translation of rare words with sub-word units, Proc. of the 54th Meeting of the Association for Computational Linguistics, Vol. 1: Long Papers, Association for Computational Linguistics, 2016, pp. 1715–1725.

  27. Schuster, M. and Keisuke, N., Japanese and Korean voice search, in Proc. of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'12), IEEE, 2012, pp. 5149–5152.

  28. Taku, K. and John, R., SentencePiece: A simple and language-independent sub-word tokenizer and de-tokenizer for neural text processing, in Proceedings of the Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Association for Computational Linguistics, 2018, pp. 66–71. v1/D18-2012, DOI:https://doi.org/10.18653/

  29. Chen, K., Wang, R., Utiyama, M., Sumita, E., Zhao, T., Yang, Muyun, and Zhao, Hai, Towards More Diverse Input Representation for Neural Machine Translation. IEEE/ACM Trans. Audio, Speech, Lang. Process., 2020, vol. 1-1. https://doi.org/10.1109/TASLP.2020.2996077

  30. Surafel, M.L., Mauro, C., and Marcello, F., A comparison of transformer and recurrent neural networks on multilingual Neural Machine Translation, in Proc. of the 27th International Conference on Computational Linguistics, Association for Computational Linguistics, 2018, pp. 641–652.

  31. Singh, M., Kumar, R., and Chana, I., Neural-based Machine Translation System outperforming statistical phrase-based machine translation for low-resource languages, 12fth Int. Conf. on Contemporary Computing (IC3), Noida, India, 2019, pp. 1–7, https://doi.org/10.1109/IC3.2019.8844915

  32. Yang, Z., Wang, Y., Zhang, J., and Zong, C., Phrase table as recommendation memory for Neural Machine Translation, Proc. of the 27th International Joint Conference on Artificial Intelligence (IJCAI-18), 2018, pp. 4609–4615.

  33. Wang, X., Tu, Z., and Zhang, M., Incorporating statistical machine translation Word knowledge into Neural Machine Translation, IEEE/ACM Trans. on Audio, Speech, Lang. Process., 2018, vol. 26, no. 12, pp. 2255–2266. https://doi.org/10.1109/taslp.2018.2860287

    Article  Google Scholar 

  34. Ram, R., V.S. and Devi, S.L., Overview of verb phrase translation in machine translation: English to Tamil and Hindi to Tamil, Proc. of the 10th Annual Meeting of the Forum for Information Retrieval Evaluation on—FIRE'18, Gandhinagar, India, 2018, pp. 6–10. https://doi.org/10.1145/3293339.3293341

  35. Jabesa, D., Bidirectional English-Afaan Oromo Machine Translation using a hybrid approach, Thesis, AAU, Addis Ababa, 2013.

  36. Mulu, G., Besacier, L., Taye, G., and Teferi, D., Phoneme-based english-amharic statistical machine translation, AFRICON, 2015. https://doi.org/10.1109/afrcon.2015.7331921

    Book  Google Scholar 

  37. Tiwari, G., Sharma, A., Sahotra A., and Kapoor, R., English-Hindi Neural Machine Translation-LSTM Seq2Seq and ConvS2S, Int. Conf. on Communication and Signal Processing (ICCSP), Chennai, India, 2020, pp. 871–875, https://doi.org/10.1109/ICCSP48568.2020.9182117

  38. Cho, K., Bahdanau, D., Fethi, B., Holger, S., and Yoshua, B., Learning phrase representations using RNN encoder-decoder for statistical machine translation, 2014. arXiv:1406.1078v3.

  39. Chen, K., Wang, R., Utiyama, M., Sumita, E., and Zhao, T., Neural Machine Translation with sentence-level topic context, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2019, vol. 1-1. doi: . 2937190https://doi.org/10.1109/TASLP.2019

  40. Wu, S., Zhang, D., Zhang, Z., Yang, N., Li, M., and Zhou, M., Dependency-to-dependency Neural Machine Translation, IEEE/ACM Trans. on Audio, Speech, Language Process., 2018, vol. 26, no. 11, pp. 2132–2141. https://doi.org/10.1109/taslp.2018.2855968

    Article  Google Scholar 

  41. Su, J., Zeng, J., Xiong, D., Liu, Y., Wang, M., and Xie, J., A hierarchy-to-sequence attentional Neural Machine Translation model, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2018, vol. 26, no. 3, pp. 623–632.

    Article  Google Scholar 

  42. Ibrahim, G. and Shashirekha, H.L., Amharic-Arabic Neural Machine Translation, 2019. https://doi.org/10.5121/csit.2019.91606

  43. Matiss, R., Marcis, P., and Richards, K., Training and adapting multilingual NMT for less-resourced and morphologically rich languages, in Proc. of the 11th International Conference on Language Resources and Evaluation (LREC'18), European Language Resources Association (ELRA), 2018, pp. 3766–3773.

Download references

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sector.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Ebisa Gemechu or G. R. Kanagachidambaresan.

Ethics declarations

The authors declare that they have no conflicts of interest.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ebisa Gemechu, Kanagachidambaresan, G.R. Text-Text Neural Machine Translation: A Survey. Opt. Mem. Neural Networks 32, 59–72 (2023). https://doi.org/10.3103/S1060992X23020042

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3103/S1060992X23020042

Keywords