Abstract
This paper presents a statistical lexical ambiguity resolution method in direct transfer machine translation models in which the target language is Turkish. Since direct transfer MT models do not have full syntactic information, most of the lexical ambiguity resolution methods are not very helpful. Our disambiguation model is based on statistical language models. We have investigated the performances of some statistical language model types and parameters in lexical ambiguity resolution for our direct transfer MT system.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hajič, J., Hric, J., Kubon, V.: Machine Translation of Very Close Languages. Applied NLP Processing. NAACL, Washington (2000)
Canals, R., et al.: interNOSTRUM: A Spanish-Catalan Machine Translation System. EAMT Machine Translation Summit VIII, Spain (2001)
Hirst, G., Charniak, E.: Word Sense and Case Slot Disambiguation. In: AIII 1982, pp. 95–98 (1982)
Black, E.: An Experiment in Computational Discrimination of English Word Senses. IBM Journal of Research and Development 32(2), 185–194 (1988)
Gale, W., Church, K.W., Yarowsky, D.: A Method for Disambiguating Word Senses in a Large Corpus, Statistical Research Report 104, Bell Laboratories (1992)
Leacock, C., Towell, G., Voorhees, E.: Corpus-based statistical sense resolution. In: Proceedings of the ARPA Human Language Technology Worskshop, Morgan Kaufmann, San Francisco (1993)
Schütze, H.: Automatic word sense discrimination. Computational Linguistics 24(1) (1998)
Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics (ACL 1995), Cambridge, MA (1995)
Hearst, M.A.: Noun homograph disambiguation using local context in large corpora. In: Proceedings of the 7th Annual Conf. of the University of Waterloo Centre for the New OED and Text Research, Oxford, United Kingdom, pp. 1–19 (1991)
McRoy, S.W.: Using multiple knowledge sources for word sense discrimination. Computational Linguistics 18(1), 1–30 (1992)
Clarkson, P.R., Rosenfeld, R.: Statistical Language Modeling Using the CMU-Cambridge Toolkit. In: Proceedings ESCA Eurospeech (1997)
Fomey Jr., G.D.: The Viterbi Algorithm. IEEE Proc. 61, 268–278 (1973)
NIST Report, Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics (2002), http://www.nist.gov/speech/tests/mt/doc/ngramstudy.pdf
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tantuğ, A.C., Adalı, E., Oflazer, K. (2006). Lexical Ambiguity Resolution for Turkish in Direct Transfer Machine Translation Models. In: Levi, A., Savaş, E., Yenigün, H., Balcısoy, S., Saygın, Y. (eds) Computer and Information Sciences – ISCIS 2006. ISCIS 2006. Lecture Notes in Computer Science, vol 4263. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11902140_26
Download citation
DOI: https://doi.org/10.1007/11902140_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-47242-1
Online ISBN: 978-3-540-47243-8
eBook Packages: Computer ScienceComputer Science (R0)