Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.3115/1118627.1118630dlproceedingsArticle/Chapter ViewAbstractPublication PagesulaConference Proceedingsconference-collections
Article
Free access

Boosting automatic lexical acquisition with morphological information

Published: 12 July 2002 Publication History

Abstract

In this paper we investigate the impact of morphological features on the task of automatically extending a dictionary. We approach the problem as a pattern classification task and compare the performance of several models in classifying nouns that are unknown to a broad coverage dictionary. We used a boosting classifier to compare the performance of models that use different sets of features. We show how adding simple morphological features to a model greatly improves the classification performance.

References

[1]
M. Berland and E. Charniak. 1999. Finding parts in very large corpora. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics.
[2]
S. Caraballo. 1999. Automatic acquisition of a hypernym-labeled noun hierarchy from text. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics.
[3]
M. Collins and Y. Singer. 1999. Unsupervised models for named entity classification. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora.
[4]
M. Collins. 2000. Discriminative reranking for natural language parsing. In Proceedings of the 17th ICML.
[5]
C. Fellbaum. 1998. WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA.
[6]
R. Granger. 1977. Foul-up: A program that figures out meanings of words from context. In Proceedings of the Fifth International Joint Conference on Artificial Intelligence.
[7]
P. M. Hastings and S. L. Lytinen. 1994. The ups and downs of lexical acquisition. In AAAI-94.
[8]
M. Hearst. 1992. Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th International Conference on Computational Linguistics.
[9]
P. Jacobs and U. Zernik. 1988. Acquiring lexical knowledge from text: A case study. In AAAI-88.
[10]
G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. Miller. 1990. Introduction to wordnet: An on-line lexical database. International Journal of Lexicography, 3(4).
[11]
A. Ratnaparkhi. 1996. A maximum entropy model for part-of-speech tagging. In Proceedings of the First Empirical Methods in Natural Language Processing Conference.
[12]
E. Riloff. 1996. An empirical study of automated dictionary construction for information extraction in three domains. Artificial Intelligence, 85.
[13]
B. Roark and E. Charniak. 1998. Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics.
[14]
R. E. Schapire and Y. Singer. 1998. Improved boosting algorithms using confidence-rated predictions. In Proceedings of the Eleventh Annual Conference on Computational Learning Theory.
[15]
R. E. Schapire and Y. Singer. 2000. Boostexter: A boosting-based system for text categorization. Machine Learning, 39.
[16]
M. Stevenson and Y. Wilks. 2001. The interaction of knowledge sources in word sense disambiguation. Computational Linguistics, 27.
[17]
Y. Yang. 1999. An evaluation of statistical approaches to text categorization. Information Retrieval, 1.
[18]
D. Yarowsky. 1995. Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics.

Cited By

View all
  • (2008)Extending a thesaurus with words from Pan-Chinese sourcesProceedings of the 22nd International Conference on Computational Linguistics - Volume 110.5555/1599081.1599139(457-464)Online publication date: 18-Aug-2008
  • (2005)Supersense tagging of unknown nouns using semantic similarityProceedings of the 43rd Annual Meeting on Association for Computational Linguistics10.3115/1219840.1219844(26-33)Online publication date: 25-Jun-2005
  • (2004)Linguistic preprocessing for distributional classification of wordsProceedings of the Workshop on Enhancing and Using Electronic Dictionaries10.5555/1610042.1610046(15-21)Online publication date: 29-Aug-2004
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ULA '02: Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
July 2002
77 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 12 July 2002

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)46
  • Downloads (Last 6 weeks)7
Reflects downloads up to 26 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2008)Extending a thesaurus with words from Pan-Chinese sourcesProceedings of the 22nd International Conference on Computational Linguistics - Volume 110.5555/1599081.1599139(457-464)Online publication date: 18-Aug-2008
  • (2005)Supersense tagging of unknown nouns using semantic similarityProceedings of the 43rd Annual Meeting on Association for Computational Linguistics10.3115/1219840.1219844(26-33)Online publication date: 25-Jun-2005
  • (2004)Linguistic preprocessing for distributional classification of wordsProceedings of the Workshop on Enhancing and Using Electronic Dictionaries10.5555/1610042.1610046(15-21)Online publication date: 29-Aug-2004
  • (2004)Feature weighting for co-occurrence-based classification of wordsProceedings of the 20th international conference on Computational Linguistics10.3115/1220355.1220470(799-es)Online publication date: 23-Aug-2004
  • (2003)Hierarchical semantic classificationProceedings of the 18th international joint conference on Artificial intelligence10.5555/1630659.1630777(817-822)Online publication date: 9-Aug-2003
  • (2003)Supersense tagging of unknown nouns in WordNetProceedings of the 2003 conference on Empirical methods in natural language processing10.3115/1119355.1119377(168-175)Online publication date: 11-Jul-2003
  • (2003)Semantic classification of Chinese unknown wordsProceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 210.3115/1075178.1075188(72-79)Online publication date: 7-Jul-2003

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media