Translation from a source language into a target language has become a very important activity in recent years, both in official institutions (such as the United Nations and the EU, or in the parliaments of multilingual countries like Canada and Spain), as well as in the private sector (for example, to translate user's manuals or newspapers articles). Prestigious clients such as these cannot make do with approximate translations; for all kinds of reasons, ranging from the legal obligations to good marketing practice, they require target-language texts of the highest quality. The task of producing such high-quality translations is a demanding and time-consuming one that is generally conferred to expert human translators. The problem is that, with growing globalization, the demand for high-quality translation has been steadily increasing, to the point where there are just not enough qualified translators available today to satisfy it. This has dramatically raised the need for improved machine translation (MT) technologies.

The field of MT has undergone something of a revolution over the last 15 years, with the adoption of empirical, data-driven techniques originally inspired by the success of automatic speech recognition. Given the requisite corpora, it is now possible to develop new MT systems in a fraction of the time and with much less effort than was previously required under the formerly dominant rule-based paradigm. As for the quality of the translations produced by this new generation of MT systems, there has also been considerable progress; generally speaking, however, it remains well below that of human translation. No one would seriously consider directly using the output of even the best of these systems to translate a CV or a corporate Web site, for example, without submitting the machine translation to a careful human revision. As a result, those who require publication-quality translation are forced to make a diffcult choice between systems that are fully automatic but whose output must be attentively post-edited, and computer-assisted translation systems (or CAT tools for short) that allow for high quality but to the detriment of full automation.

Currently, the best known CAT tools are translation memory (TM) systems. These systems recycle sentences that have previously been translated, either within the current document or earlier in other documents. This is very useful for highly repetitive texts, but not of much help for the vast majority of texts composed of original materials.

Since TM systems were first introduced, very few other types of CAT tools have been forthcoming. Notable exceptions are the TransType system and its successor TransType2 (TT2). These systems represent a novel rework-ing of the old idea of interactive machine translation (IMT). Initial efforts on TransType are described in detail in Foster; suffice it to say here the system's principal novelty lies in the fact the human-machine interaction focuses on the drafting of the target text, rather than on the disambiguation of the source text, as in all former IMT systems.

In the TT2 project, this idea was further developed. A full-fledged MT engine was embedded in an interactive editing environment and used to generate suggested completions of each target sentence being translated. These completions may be accepted or amended by the translator; but once validated, they are exploited by the MT engine to produce further, hopefully improved suggestions. This is in marked contrast with traditional MT, where typically the system is first used to produce a complete draft translation of a source text, which is then post-edited (corrected) offline by a human translator. TT2's interactive approach offers a significant advantage over traditional post-editing. In the latter paradigm, there is no way for the system, which is off-line, to benefit from the user's corrections; in TransType, just the opposite is true. As soon as the user begins to revise an incorrect segment, the system immediately responds to that new information by proposing an alternative completion to the target segment, which is compatible with the prefix that the user has input.

Another notable feature of the work described in this article is the importance accorded to a formal treatment of human-machine interaction, something that is seldom considered in the now-prevalent framework of statistical pattern recognition.

References

[1]

Barrachina, S., Bender, et al. Statistical approaches to computer-assisted translation. Computational Linquistics 35, 8 (2009), 3--28.

Digital Library

Google Scholar

[2]

Brown, P. F., Della Pietra, S. A., Della Pietra, V. J., and Mercer, R. L. The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics 19, 2 (1993), 263--310.

Digital Library

Google Scholar

[3]

Casacuberta, F., and Vidal, E. Machine translation with inferred stochastic finite-state transducers. Computational Linguistics 30, 2 (2004), 205--225.

Digital Library

Google Scholar

[4]

Esteban, J., Lorenzo, J., Valderrabanos, A. S., and Lapalme, G. TransType2 - -An innovative computer-assisted translation system. In The Companion Volume to the Proceedings of 42st Annual Meeting of the Association for Computational Linguistics, (Barcelona, Spain, July 2004), 94--97.

Digital Library

Google Scholar

[5]

Foster, G. Text Prediction for Translators. PhD thesis, Université de Montreal, May 2002.

Digital Library

Google Scholar

[6]

Foster, G., Isabelle, P., and Plamondon, P. Target-text mediated interactive machine translation. Machine Translation 12, 1--2, (1997), 175--194.

Digital Library

Google Scholar

[7]

Isabelle, P. and Church, K. Special issue on new tools for human translators. Machine Translation 12, 1--2, 1997.

Digital Library

Google Scholar

[8]

King, M., Popescu-Belis, A., and Hovy, E. FEMTI: creating and using a framework for MT evaluation. In Proceedings of the Machine Translation Summit IX, 224--231, (Sept 2003), New Orleans.

Google Scholar

[9]

Macklovitch, E. TransType2: The last word. In Proceedings of the 5th International Conference on Languages Resources and Evaluation, (May 2006, Genoa, Italy), 167--172.

Google Scholar

[10]

Ney, H. One decade of statistical machine translation: 1996--2005. In Proceedings of the MT Summit X, (Sept. 2005, Phuket, Thailand), 12--17.

Crossref

Google Scholar

[11]

Vidal, E., Casacuberta, F., Rodriguez, L, Civera, J., and Martinez, C. Computer-assisted translation using speech recognition. IEEE Transactions on Speech and Audio Processing, 14, 3, (2006), 941--951.

Digital Library

Google Scholar

Cited By

View all

Gupta AVerbert KParra DHammond TKnijnenburg BO'Donovan JTaele P(2021)User-Controlled Content Translation in Social MediaCompanion Proceedings of the 26th International Conference on Intelligent User Interfaces10.1145/3397482.3450714(96-98)Online publication date: 14-Apr-2021
https://dl.acm.org/doi/10.1145/3397482.3450714
Peris ÁCasacuberta F(2019)Online Learning for Effort Reduction in Interactive Neural Machine TranslationComputer Speech & Language10.1016/j.csl.2019.04.001Online publication date: Apr-2019
https://doi.org/10.1016/j.csl.2019.04.001
González-Rubio JCasacuberta F(2017)Minimum description length inference of phrase-based translation modelsNeural Computing and Applications10.1007/s00521-016-2257-028:9(2403-2413)Online publication date: 1-Sep-2017
https://dl.acm.org/doi/10.1007/s00521-016-2257-0
Show More Cited By

Index Terms

Human interaction for high-quality machine translation
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing

Recommendations

N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination
EACL '09: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics

In this paper we compare and contrast two approaches to Machine Translation (MT): the CMU-UKA Syntax Augmented Machine Translation system (SAMT) and UPC-TALP N-gram-based Statistical Machine Translation (SMT). SAMT is a hierarchical syntax-driven ...
Preventing translation quality deterioration caused by beam search decoding in neural machine translation using statistical machine translation
Graphical abstract

Display Omitted

Abstract
Decoding is an important part of machine translation systems, and the most popular inference algorithm used here is beam search. Beam search algorithm improves translation by allowing a larger search space to be traversed than greedy ...
Evaluation of machine translation
ICWET '11: Proceedings of the International Conference & Workshop on Emerging Trends in Technology

Machine Translation (MT) refers to the use of a machine for performing translation task which converts text or speech from one Natural Language (NL) into another Natural Language. Machine Translation is an important technology for localization, and is ...

Comments

Information & Contributors

Information

Published In

Communications of the ACM Volume 52, Issue 10

A View of Parallel Computing

October 2009

134 pages

ISSN:0001-0782

EISSN:1557-7317

DOI:10.1145/1562764

Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2009

Published in CACM Volume 52, Issue 10

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article
Popular
Refereed

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

21
Total Citations
View Citations
1,818
Total Downloads

Downloads (Last 12 months)275
Downloads (Last 6 weeks)36

Reflects downloads up to 18 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Gupta AVerbert KParra DHammond TKnijnenburg BO'Donovan JTaele P(2021)User-Controlled Content Translation in Social MediaCompanion Proceedings of the 26th International Conference on Intelligent User Interfaces10.1145/3397482.3450714(96-98)Online publication date: 14-Apr-2021
https://dl.acm.org/doi/10.1145/3397482.3450714
Peris ÁCasacuberta F(2019)Online Learning for Effort Reduction in Interactive Neural Machine TranslationComputer Speech & Language10.1016/j.csl.2019.04.001Online publication date: Apr-2019
https://doi.org/10.1016/j.csl.2019.04.001
González-Rubio JCasacuberta F(2017)Minimum description length inference of phrase-based translation modelsNeural Computing and Applications10.1007/s00521-016-2257-028:9(2403-2413)Online publication date: 1-Sep-2017
https://dl.acm.org/doi/10.1007/s00521-016-2257-0
Alves FKoglin AMesa-Lao BMartínez Mde Lima Fonseca Nde Melo Sá AGonçalves JSzpak KSekino KAquino M(2016)Analysing the Impact of Interactive Machine Translation on Post-editing EffortNew Directions in Empirical Translation Process Research10.1007/978-3-319-20358-4_4(77-94)Online publication date: 2016
https://doi.org/10.1007/978-3-319-20358-4_4
Ortiz-Martínez DGonzález-Rubio JAlabau VSanchis-Trilles GCasacuberta F(2016)Integrating Online and Active Learning in a Computer-Assisted Translation WorkbenchNew Directions in Empirical Translation Process Research10.1007/978-3-319-20358-4_3(57-76)Online publication date: 2016
https://doi.org/10.1007/978-3-319-20358-4_3
Sanentz SLesk M(2016)Toward a semantic stability index (SSI) via a preliminary exploration of translation loopingProceedings of the Association for Information Science and Technology10.1002/pra2.2015.14505201007352:1(1-4)Online publication date: 24-Feb-2016
https://dl.acm.org/doi/10.1002/pra2.2015.145052010073
Sanentz SLesk MGiven L(2015)Toward a semantic stability index (SSI) via a preliminary exploration of translation loopingProceedings of the 78th ASIS&T Annual Meeting: Information Science with Impact: Research in and for the Community10.5555/2857070.2857143(1-4)Online publication date: 6-Nov-2015
https://dl.acm.org/doi/10.5555/2857070.2857143
Valor Miró JSilvestre-Cerdà JCivera JTurró CJuan A(2015)Efficiency and usability study of innovative computer-aided transcription strategies for video lecture repositoriesSpeech Communication10.1016/j.specom.2015.09.00674:C(65-75)Online publication date: 1-Nov-2015
https://dl.acm.org/doi/10.1016/j.specom.2015.09.006
Gispert ABlackwood GIglesias GByrne W(2013)N-gram posterior probability confidence measures for statistical machine translationMachine Translation10.1007/s10590-012-9132-227:2(85-114)Online publication date: 1-Jun-2013
https://dl.acm.org/doi/10.1007/s10590-012-9132-2
Burchardt ATscherwinka CAvramidis EUszkoreit H(2013)Machine Translation at WorkComputational Linguistics10.1007/978-3-642-34399-5_13(241-261)Online publication date: 2013
https://doi.org/10.1007/978-3-642-34399-5_13
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Magazine Site

View this article on the magazine site (external)

Magazine Site

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

References

Cited By

Index Terms

Recommendations

N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination

Preventing translation quality deterioration caused by beam search decoding in neural machine translation using statistical machine translation

Evaluation of machine translation

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Magazine Site

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations