Using Mechanical Turk to Build Machine Translation Evaluation Sets

Bloodgood, Michael; Callison-Burch, Chris

Computer Science > Computation and Language

arXiv:1410.5491 (cs)

[Submitted on 20 Oct 2014]

Title:Using Mechanical Turk to Build Machine Translation Evaluation Sets

Authors:Michael Bloodgood, Chris Callison-Burch

View PDF

Abstract:Building machine translation (MT) test sets is a relatively expensive task. As MT becomes increasingly desired for more and more language pairs and more and more domains, it becomes necessary to build test sets for each case. In this paper, we investigate using Amazon's Mechanical Turk (MTurk) to make MT test sets cheaply. We find that MTurk can be used to make test sets much cheaper than professionally-produced test sets. More importantly, in experiments with multiple MT systems, we find that the MTurk-produced test sets yield essentially the same conclusions regarding system performance as the professionally-produced test sets yield.

Comments:	4 pages, 2 tables; appeared in Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, June 2010
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
ACM classes:	I.2.7; I.2.6; I.5.1; I.5.4
Cite as:	arXiv:1410.5491 [cs.CL]
	(or arXiv:1410.5491v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1410.5491
Journal reference:	In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, pages 208-211, Los Angeles, California, June 2010. Association for Computational Linguistics

Submission history

From: Michael Bloodgood [view email]
[v1] Mon, 20 Oct 2014 22:28:55 UTC (12 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2014-10

Change to browse by:

cs
cs.LG
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Michael Bloodgood
Chris Callison-Burch

export BibTeX citation

Computer Science > Computation and Language

Title:Using Mechanical Turk to Build Machine Translation Evaluation Sets

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Using Mechanical Turk to Build Machine Translation Evaluation Sets

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators