Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.3115/1118675.1118684dlproceedingsArticle/Chapter ViewAbstractPublication PageswsdConference Proceedingsconference-collections
Article
Free access

Dutch word sense disambiguation: optimizing the localness of context

Published: 11 July 2002 Publication History

Abstract

We describe a new version of the Dutch word sense disambiguation system trained and tested on a corrected version of the SENSEVAL-2 data. The system is an ensemble of word experts; each word expert is a memory-based classifier of which the parameters are automatically determined through cross-validation on training material. The original best-performing system, which used only local context features for disambiguation, is further refined by performing additional parallel cross-validation experiments for optimizing algorithmic parameters and the amount of local context available to each of the word experts' memory-based kernels. This procedure produces an accuracy of 84.8% on test material, improving on a baseline score of 77.2% and the previous SENSEVAL-2 score of 84.2%. We show that cross-validation overfits; had the local context been held constant at two left and right neighbouring words, the system would have scored 85.0%.

References

[1]
D. W. Aha, D. Kibler, and M. Albert. 1991. Instance-based learning algorithms. Machine Learning, 6:37--66.]]
[2]
S. Cost and S. Salzberg. 1993. A weighted nearest neighbour algorithm for learning with symbolic features. Machine Learning, 10:57--78.]]
[3]
W. Daelemans, J. Zavrel, and P. Berck. 1996. Part-of-speech tagging for dutch with MBT, a memory-based tagger generator. In K. van der Meer, editor, Informatiewetenschap 1996, Wetenschappelijke bijdrage aan de Vierde Interdisciplinaire Onderzoeksconferentie Informatiewetenchap, pages 33--40, The Netherlands. TU Delft.]]
[4]
W. Daelemans, A. van den Bosch, and A. Weijters. 1997. IGTree: using trees for compression and classification in lazy learning algorithms. Artificial Intelligence Review, 11:407--423.]]
[5]
W. Daelemans, A. van den Bosch, and J. Zavrel. 1999. Forgetting exceptions is harmful in language learning. Machine Learning, Special issue on Natural Language Learning, 34:11--41.]]
[6]
W. Daelemans, J. Zavrel, K. van der Sloot, and A. van den Bosch. 2001. TiMBL: Tilburg memory based learner, version 4.0, reference guide. ILK Technical Report 01-04, Tilburg University. available from http://ilk.kub.nl.]]
[7]
I. Hendrickx and A. van den Bosch. 2002. Dutch word sense disambiguation: Data and preliminary results. To be published in the Proceedings of the SENSEVAL-2 Workshop, Toulouse, France, 2001.]]
[8]
V. Hoste, W. Daelemans, I. Hendrickx, and A. van den Bosch. 2002a. Evaluating the results of a memory-based word-expert approach to unrestricted word-sense disambiguation. To be published in the Proceedings of the SENSEVAL-2 Workshop, Toulouse, France, 2001.]]
[9]
V. Hoste, I. Hendrickx, W. Daelemans, and A. van den Bosch. 2002b. Parameter optimization for machine-learned word sense disambiguation. To be published in Natural Language Engineering.]]
[10]
A. Kilgarriff and M. Palmer. 2000. Introduction to the special issue on SENSEVAL. Computing in the Humanities, 34(1-2):1--13.]]
[11]
H. T. Ng and H. B. Lee. 1996. Integrating multiple knowledge sources to disambiguate word sense: An exemplar-based approach. In Proc. of 34th meeting of the Assiociation for Computational Linguistics.]]
[12]
W. Schrooten and A. Vermeer. 1994. Woorden in het basisonderwijs. 15.000 woorden aangeboden aan leerlingen. TUP(Studies in meertaligheid 6).]]
[13]
M. Stevenson and Y. Wilks. 2001. The interaction of knowledge sources in word sense disambiguation. Computational Linguistics, 27(3):321--351.]]
[14]
Van Dale. 1996. Van Dale Basiswoordenboek van de Nederlandse taal. Van Dale, Utrecht.]]
[15]
J. Veenstra, A. van den Bosch, S. Buchholz, W. Daelemans, and J. Zavrel. 2000. Memory-based word sense disambiguation. Computers and the Humanities, special issue on Senseval, Word Sense Disambiguations, 34(1-2).]]
[16]
J. Zavrel and W. Daelemans. 1997. Memory-based learning: Using similarity for smoothing. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, Madrid, Spain, Madrid.]]

Cited By

View all
  • (2010)UvT-WSD1: A cross-lingual word sense disambiguation systemProceedings of the 5th International Workshop on Semantic Evaluation10.5555/1859664.1859717(238-241)Online publication date: 15-Jul-2010
  • (2004)A lemma-based approach to a maximum entropy word sense disambiguation system for DutchProceedings of the 20th international conference on Computational Linguistics10.3115/1220355.1220467(778-es)Online publication date: 23-Aug-2004
  1. Dutch word sense disambiguation: optimizing the localness of context

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      WSD '02: Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions - Volume 8
      July 2002
      123 pages

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 11 July 2002

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)36
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 31 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2010)UvT-WSD1: A cross-lingual word sense disambiguation systemProceedings of the 5th International Workshop on Semantic Evaluation10.5555/1859664.1859717(238-241)Online publication date: 15-Jul-2010
      • (2004)A lemma-based approach to a maximum entropy word sense disambiguation system for DutchProceedings of the 20th international conference on Computational Linguistics10.3115/1220355.1220467(778-es)Online publication date: 23-Aug-2004

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media