DOI: 10.3115/1219840.1219898

Alignment model adaptation for domain-specific word alignment

Published: 25 June 2005

Abstract

This paper proposes an alignment adaptation approach to improve domain-specific (in-domain) word alignment. The basic idea of alignment adaptation is to use an out-of-domain corpus to improve in-domain word alignment results. We first train two statistical word alignment models, one on a large-scale out-of-domain corpus and one on a small-scale in-domain corpus, and then interpolate the two models to improve domain-specific word alignment. Experimental results show that our approach improves domain-specific word alignment in terms of both precision and recall, achieving a relative error rate reduction of 6.56% compared with state-of-the-art techniques.
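
The interpolation step and the error metric mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which parameters are interpolated or how the weight is chosen, so everything here is an assumption: the sketch linearly interpolates two word translation probability tables with a fixed weight `lam`, falls back to whichever model knows a source word when the other does not, and uses hypothetical names (`interpolate_models`, `relative_error_reduction`) and made-up toy probabilities.

```python
def interpolate_models(p_in, p_out, lam):
    """Linearly interpolate two translation tables of the form
    {source_word: {target_word: probability}}.

    lam weights the in-domain model; (1 - lam) weights the out-of-domain
    model. Linear interpolation and the fallback rule are assumptions,
    not details taken from the paper.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    merged = {}
    for src in set(p_in) | set(p_out):
        in_row = p_in.get(src)
        out_row = p_out.get(src)
        if in_row is None:
            # Source word unseen in-domain: use the out-of-domain row as is.
            merged[src] = dict(out_row)
        elif out_row is None:
            # Source word unseen out-of-domain: use the in-domain row as is.
            merged[src] = dict(in_row)
        else:
            targets = set(in_row) | set(out_row)
            merged[src] = {
                tgt: lam * in_row.get(tgt, 0.0) + (1.0 - lam) * out_row.get(tgt, 0.0)
                for tgt in targets
            }
    return merged


def relative_error_reduction(baseline_error, new_error):
    """Fraction by which the error rate drops relative to the baseline."""
    if baseline_error <= 0.0:
        raise ValueError("baseline_error must be positive")
    return (baseline_error - new_error) / baseline_error


# Toy, made-up probabilities for illustration only.
in_domain = {"bank": {"banque": 0.2, "rive": 0.8}}
out_of_domain = {
    "bank": {"banque": 0.9, "rive": 0.1},
    "house": {"maison": 1.0},
}
adapted = interpolate_models(in_domain, out_of_domain, lam=0.5)
```

With `lam` closer to 1 the adapted table leans on the small in-domain corpus; closer to 0 it leans on the large out-of-domain corpus. A relative error rate reduction such as the reported 6.56% is the kind of ratio that `relative_error_reduction` computes from a baseline error rate and an improved one.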

Cited By

  • (2010) Discriminative instance weighting for domain adaptation in statistical machine translation. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 451-459. DOI: 10.5555/1870658.1870702. Online publication date: 9-Oct-2010
  • (2006) Boosting statistical word alignment using labeled and unlabeled data. Proceedings of the COLING/ACL on Main conference poster sessions, pp. 913-920. DOI: 10.5555/1273073.1273190. Online publication date: 17-Jul-2006
  • (2006) Word alignment for languages with scarce resources using bilingual corpora of other language pairs. Proceedings of the COLING/ACL on Main conference poster sessions, pp. 874-881. DOI: 10.5555/1273073.1273185. Online publication date: 17-Jul-2006

Published In

ACL '05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
June 2005
657 pages
  • General Chair: Kevin Knight

Publisher

Association for Computational Linguistics

United States

Qualifiers

  • Article

Acceptance Rates

ACL '05 Paper Acceptance Rate 77 of 423 submissions, 18%;
Overall Acceptance Rate 85 of 443 submissions, 19%
