Abstract
Word sense disambiguation (WSD) is the ability to identify the meaning of words in context in a computational manner. WSD is considered as a task whose solution is at least as hard as the most difficult problems in artificial intelligence. This is basically used in application like information retrieval, machine translation, information extraction because of its semantics understanding. This paper describes the proposed approach W3SD (This paper is an extended version of our work [4] published in the 8th International Conference on Computational Collective Intelligence.) which is based on the words surrounding the polysemous word in a context. Each meaning of these words is represented by a vector composed of weighted nouns using WordNet and Wiktionary features through the taxonomic information content from WordNet and the glosses from Wiktionary. The main emphasis of this paper is feature selection for disambiguation purpose. The assessment of WSD systems is discussed in the context of the Senseval campaign, aiming at the objective evaluation of our proposal to the systems participating in several different disambiguation tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Weaver, W.: Translation. In: Locke, W.N., Boothe, A.D. (eds.) Machine Translation of Languages, pp. 15–23. MIT Press, Cambridge (1949)
Brill, E.: Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging. Comput. Linguist. 21, 543–565 (1995)
Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database (Language, Speech, and Communication), illustrated edn. The MIT Press, Cambridge (1998)
Ben Aouicha, M., Hadj Taieb, M.A., Ibn Marai, H.: WSD-TIC: word sense disambiguation using taxonomic information content. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016. LNCS (LNAI), vol. 9875, pp. 131–142. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45243-2_12
Berners-Lee, T., Hendler, J., Lassila, O.: The semantic web. Sci. Am. 284, 34–43 (2001)
Zhong, Z., Ng, H.T.: Word sense disambiguation improves information retrieval. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Long Papers, vol. 1, pp. 273–282. Association for Computational Linguistics, Jeju Island (2012)
Ng, H.T.: Does word sense disambiguation improve information retrieval? In: Proceedings of the Fourth Workshop on Exploiting Semantic Annotations in Information Retrieval, ESAIR 2011, Glasgow, United Kingdom, 28 October 2011, pp. 17–18 (2011)
Guyot, J., Falquet, G., Radhouani, S., Benzineb, K.: Analysis of word sense disambiguation-based information retrieval. In: Evaluating Systems for Multilingual and Multimodal Information Access: 9th Workshop of the Cross-language Evaluation Forum, CLEF 2008, Aarhus, Denmark, 17–19 September 2008, Revised Selected Papers, pp. 146–154 (2008)
Sanderson, M.: Word sense disambiguation and information retrieval. In: Croft, B.W., van Rijsbergen, C.J. (eds.) SIGIR 1994, pp. 142–151. Springer, London (1994). https://doi.org/10.1007/978-1-4471-2099-5_15
Stokoe, C., Oakes, M.P., Tait, J.: Word sense disambiguation in information retrieval revisited. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 159–166. ACM, Toronto (2003)
Carpuat, M., Wu, D.: Word sense disambiguation vs. statistical machine translation. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 387–394. Association for Computational Linguistics, Ann Arbor (2005)
Carpuat, M., Wu, D.: How phrase sense disambiguation outperforms word sense disambiguation for statistical machine translation. In: Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation (TMI) (2007)
Neale, S., Gomes, L., Agirre, E., de Lacalle, O.L., Branco, A.: Word sense-aware machine translation: including senses as contextual features for improved translation models. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, Portorož, Slovenia, 23–28 May 2016 (2016)
Liu, Y., Scheuermann, P., Li, X., Zhu, X.: Using WordNet to disambiguate word senses for text classification. In: Shi, Y., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2007. LNCS, vol. 4489, pp. 781–789. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-72588-6_127
Schadd, F.C., Roos, N.: Word-sense disambiguation for ontology mapping: concept disambiguation using virtual documents and information retrieval techniques. J. Data Semant. 4, 167–186 (2015)
Semeraro, G., Degemmis, M., Lops, P., Basile, P.: Combining learning and word sense disambiguation for intelligent user profiling. In: Twentieth International Joint Conference on Artificial Intelligence, 6–12 January 2007, Hyderabad, India (2007, to appear)
Che, W., Liu, T.: Using word sense disambiguation for semantic role labeling. In: 4th International Universal Communication Symposium, IUCS 2010, Beijing, China, 18–19 October 2010, pp. 167–174 (2010)
Kulkarni, A., Heilman, M., Eskénazi, M., Callan, J.: Word sense disambiguation for vocabulary learning. In: Proceedings of the 9th International Conference Intelligent Tutoring Systems, ITS 2008, Montreal, Canada, 23–27 June 2008, pp. 500–509 (2008)
Moro, A., Raganato, A., Navigli, R.: Entity linking meets word sense disambiguation: a unified approach. TACL 2, 231–244 (2014)
Véronis, J.: HyperLex: lexical cartography for information retrieval. Comput. Speech Lang. 18, 223–252 (2004)
Nameh, M., Fakhrahmad, S.M., Jahromi, M.Z.: A new approach to word sense disambiguation based on context similarity. In: Proceedings of the Workshop of the World Congress on Engineering (2011)
Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th Annual International Conference on Systems Documentation, pp. 24–26. ACM, Toronto (1986)
Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence, pp. 805–810. Morgan Kaufmann Publishers Inc., Acapulco (2003)
Sinha, R., Mihalcea, R.: Unsupervised graph-based word sense disambiguation using measures of word semantic similarity. In: Proceedings of the International Conference on Semantic Computing, pp. 363–369. IEEE Computer Society, Washington, DC (2007)
Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. CoRR cmp-lg/9709008 (1997)
Leacock, C., Chodorow, M.: Combining local context and WordNet similarity for word sense identification. In: Fellfaum, C. (ed.) WordNet: An Electronic Lexical Database, pp. 265–283. MIT Press, Cambridge (1998)
López-Arévalo, I., Sosa, V.J.S., López, F.R., Tello-Leal, E.: Improving selection of synsets from WordNet for domain-specific word sense disambiguation. Comput. Speech Lang. 41, 128–145 (2017)
Panchenko, A., Faralli, S., Ponzetto, S.P., Biemann, C.: Using linked disambiguated distributional networks for word sense disambiguation. In: EACL 2017 Workshop on Sense, Concept and Entity Representations and their Applications: Proceedings of the Workshop, 4 April 2017, Valencia, Spain, pp. 72–78. Association for Computational Linguistics, Stroudsburg (2017)
Faralli, S., Panchenko, A., Biemann, C., Ponzetto, S.P.: Linked disambiguated distributional semantic networks. In: Groth, P., Simperl, E., Gray, A., Sabou, M., Krötzsch, M., Lecue, F., Flöck, F., Gil, Y. (eds.) ISWC 2016. LNCS, vol. 9982, pp. 56–64. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46547-0_7
Rothe, S., Schütze, H.: AutoExtend: extending word embeddings to embeddings for synsets and lexemes. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL 2015, 26–31 July 2015, Beijing, China, Long Papers, vol. 1, pp. 1793–1803 (2015)
Hessami, E., Mahmoudi, F., Jadidinejad, A.H.: Unsupervised graph-based word sense disambiguation using lexical relation of WordNet. Int. J. Comput. Sci. Issues 8(6), 225–230 (2011). No. 3
Basile, P., Caputo, A., Semeraro, G.: An enhanced lesk word sense disambiguation algorithm through a distributional semantic model. In: COLING 2014, 25th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers, 23–29 August 2014, Dublin, Ireland, pp. 1591–1600 (2014)
Navigli, R., Ponzetto, S.P.: BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif. Intell. 193, 217–250 (2012)
Zesch, T., Müller, C., Gurevych, I.: Extracting lexical semantic knowledge from Wikipedia and Wiktionary. In: Proceedings of the International Conference on Language Resources and Evaluation, LREC 2008, 26 May–1 June 2008, Marrakech, Morocco (2008)
Hadj Taieb, M.A., Ben Aouicha, M., Ben Hamadou, A.: A new semantic relatedness measurement using WordNet features. Knowl. Inf. Syst. 41, 467–497 (2014)
Ben Aouicha, M., Hadj Taieb, M.A., Ben Hamadou, A.: SISR: system for integrating semantic relatedness and similarity measures. Soft. Comput. 22, 1855–1879 (2016)
Didion, J.: JWNL (Java WordNet Library) (2004)
Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. 41, 10:1–10:69 (2009)
Palmer, M., Fellbaum, C., Cotton, S., Delfs, L., Dang, H.T.: English tasks: all-words and verb lexical sample. In: Proceedings of SENSEVAL-2 Second International Workshop on Evaluating Word Sense Disambiguation Systems, pp. 21–24. Association for Computational Linguistics, Toulouse (2001)
McCarthy, D., Carroll, J., Preiss, J.: Disambiguating noun and verb senses using automatically acquired selectional preferences. In: The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems, pp. 119–122. Association for Computational Linguistics, Toulouse (2001)
Fernández-Amorós, D., Gonzalo, J., Verdejo, F.: The UNED systems at Senseval-2. In: The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems, pp. 75–78. Association for Computational Linguistics, Toulouse (2001)
Kenneth, C.L.: Use of machine readable dictionaries for word-sense disambiguation in SENSEVAL-2. In: The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems, pp. 75–78. Association for Computational Linguistics, Toulouse (2001)
Snyder, B., Palmer, M.: The English all-words task. In: Mihalcea, R., Edmonds, P. (eds.) Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 41–43. Association for Computational Linguistics, Barcelona (2004)
Mccarthy, D., Koeling, R., Weeds, J., Carroll, J.: Using automatically acquired predominant senses for word sense disambiguation. In: Proceedings of the ACL SENSEVAL-3 Workshop, pp. 151–154 (2004)
Strapparava, C., Gliozzo, A., Giuliano, C.: Pattern abstraction and term similarity for word sense disambiguation: IRST at Senseval-3. In: Mihalcea, R., Edmonds, P. (eds.) Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 229–234. Association for Computational Linguistics, Barcelona (2004)
Seo, H.-C., Rim, H.-C., Kim, S.-H.: KUNLP system in Senseval-3. In: Mihalcea, R., Edmonds, P. (eds.) Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 222–225. Association for Computational Linguistics, Barcelona (2004)
Buscaldi, D., Rosso, P., Masulli, F.: The upv-unige-CIAOSENSO WSD system. In: Mihalcea, R., Edmonds, P. (eds.) Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 77–82. Association for Computational Linguistics, Barcelona (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Ben Aouicha, M., Hadj Taieb, M.A., Ibn Marai, H. (2018). WordNet and Wiktionary-Based Approach for Word Sense Disambiguation. In: Nguyen, N., Kowalczyk, R. (eds) Transactions on Computational Collective Intelligence XXIX. Lecture Notes in Computer Science(), vol 10840. Springer, Cham. https://doi.org/10.1007/978-3-319-90287-6_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-90287-6_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-90286-9
Online ISBN: 978-3-319-90287-6
eBook Packages: Computer ScienceComputer Science (R0)