Low-resource Neural Machine Translation: Methods and Trends

Published: 15 November 2022


Neural Machine Translation (NMT) has brought promising improvements in translation quality, but these models have, until recently, relied on large-scale parallel corpora. Because such corpora exist for only a handful of language pairs, translation performance falls far short of the desired level for the majority of low-resource languages. Developing translation techniques for low-resource languages is therefore crucial, and it has become a popular research area in neural machine translation. In this article, we present a comprehensive review of existing deep learning techniques for low-resource NMT. We first describe the current state of research and some widely used low-resource datasets. We then categorize the existing methods and discuss representative works in detail. Finally, we summarize the characteristics these methods share and outline future directions in this field.







    Published In

    ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 21, Issue 5
    September 2022
    486 pages


    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 November 2022
    Online AM: 15 March 2022
    Accepted: 27 January 2022
    Revised: 15 December 2021
    Received: 10 June 2021
    Published in TALLIP Volume 21, Issue 5


    Author Tags

    1. Low-resource
    2. neural machine translation
    3. semi-supervised
    4. unsupervised
    5. transfer learning
    6. pivot-based methods
    7. data augmentation


    • Research-article
    • Refereed

    Funding Sources

    • National Natural Science Foundation of China



