Abstract
We present a review of Neural Machine Translation (NMT), which has gained considerable popularity in recent decades. Machine translation (MT) has made large-scale language translation practical in the digital era. Without it, translation would have to be done manually by human experts, which is costly, time-consuming, and inefficient. Three main MT techniques have been developed over the past few decades: rule-based, statistical, and neural machine translation. We present the merits and demerits of each method and review the articles in each category in detail. The survey covers existing approaches, basic architectures, and models for MT systems. Our aim is to shed light on existing MT systems and to help researchers locate related work in the literature. In the process, we identify critical research gaps. This review is intended for researchers interested in the study of MT.
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sector.
Ethics declarations
The authors declare that they have no conflicts of interest.
About this article
Cite this article
Ebisa Gemechu, Kanagachidambaresan, G.R. Text-Text Neural Machine Translation: A Survey. Opt. Mem. Neural Networks 32, 59–72 (2023). https://doi.org/10.3103/S1060992X23020042
DOI: https://doi.org/10.3103/S1060992X23020042