Synonym-Based Reordering Model for Statistical Machine Translation

Yang, Zhenxin; Li, Miao; Chen, Lei; Sun, Kai

doi:10.1007/978-3-319-42297-8_35

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9773)

Included in the following conference series:

International Conference on Intelligent Computing

Abstract

The reordering model is a crucial component of statistical machine translation (SMT), since it plays an important role in generating fluent translations. However, data sparseness is a key factor that limits the performance of reordering models in SMT. In this paper, we exploit synonym information to alleviate data sparseness, taking Chinese-Mongolian SMT as an example. First, we propose a synonym-based reordering model that uses Chinese synonyms for Chinese-Mongolian SMT. Then, we integrate the synonym-based reordering model into the baseline SMT system as additional feature functions. In addition, we use source-side reordering as a pre-processing module to verify the extensibility of the synonym-based reordering model. Experiments on a Chinese-Mongolian dataset show that the synonym-based reordering model achieves a significant improvement over the baseline SMT system.
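
The intuition behind using synonyms against data sparseness can be illustrated with a small sketch. It is not the authors' implementation: the synonym dictionary, the orientation labels (mono, swap, disc), the toy training events, and the additive smoothing are all assumptions made for illustration. The sketch estimates orientation probabilities once per surface word and once per synonym class, and shows that a word unseen in training can still receive an informative estimate through its class. The log of such a probability is the kind of value that could serve as an additional feature function in a log-linear SMT model.

```python
import math
from collections import Counter, defaultdict

# Hypothetical synonym dictionary: surface word -> synonym-class label.
SYNONYM_CLASS = {
    "美丽": "C_beautiful",
    "漂亮": "C_beautiful",
    "购买": "C_buy",
    "买": "C_buy",
}

# Assumed orientation labels: monotone, swap, discontinuous.
LABELS = ("mono", "swap", "disc")


def to_class(word):
    """Map a word to its synonym class, or to itself if it has none."""
    return SYNONYM_CLASS.get(word, word)


def train(events, key=lambda w: w):
    """Count orientations per key; events are (source_word, orientation) pairs."""
    counts = defaultdict(Counter)
    for word, orientation in events:
        counts[key(word)][orientation] += 1
    return counts


def prob(counts, key, orientation, alpha=0.5):
    """Additively smoothed relative frequency P(orientation | key)."""
    c = counts.get(key, Counter())
    total = sum(c.values()) + alpha * len(LABELS)
    return (c[orientation] + alpha) / total


# Toy training events (invented for illustration only).
events = [
    ("美丽", "swap"),
    ("美丽", "swap"),
    ("买", "mono"),
    ("购买", "mono"),
    ("购买", "swap"),
]

word_model = train(events)                 # keyed by surface word
class_model = train(events, key=to_class)  # keyed by synonym class

# "漂亮" never occurs in the training events.
p_word = prob(word_model, "漂亮", "swap")              # falls back to uniform
p_class = prob(class_model, to_class("漂亮"), "swap")  # shares counts with "美丽"

# A log-probability like this could be used as one extra feature value.
synonym_feature = math.log(p_class)

print(f"word-level  P(swap | 漂亮) = {p_word:.3f}")
print(f"class-level P(swap | 漂亮) = {p_class:.3f}")
```

In this toy setting the word-level estimate for the unseen word is uniform over the three orientations, while the class-level estimate inherits the evidence observed for its synonym.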

Acknowledgement

This work is supported by the National Natural Science Foundation of China under Grants No. 61572462 and No. 61502445, and by the Informationization Special Projects of the Chinese Academy of Sciences under Grant No. XXH12504-1-10.

Author information

Authors and Affiliations

Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei, 230031, China
Zhenxin Yang, Miao Li & Lei Chen
University of Science and Technology of China, Hefei, 230026, China
Zhenxin Yang & Kai Sun

Corresponding author

Correspondence to Zhenxin Yang.

Editor information

Editors and Affiliations

Tongji University, Shanghai, China
De-Shuang Huang
Inha University, Incheon, Korea (Republic of)
Kyungsook Han
Liverpool John Moores University, Liverpool, United Kingdom
Abir Hussain

About this paper

Cite this paper

Yang, Z., Li, M., Chen, L., Sun, K. (2016). Synonym-Based Reordering Model for Statistical Machine Translation. In: Huang, DS., Han, K., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2016. Lecture Notes in Computer Science, vol 9773. Springer, Cham. https://doi.org/10.1007/978-3-319-42297-8_35

DOI: https://doi.org/10.1007/978-3-319-42297-8_35
Published: 12 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42296-1
Online ISBN: 978-3-319-42297-8
eBook Packages: Computer Science, Computer Science (R0)