Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2976040.2976203guideproceedingsArticle/Chapter ViewAbstractPublication PagesnipsConference Proceedingsconference-collections
Article

Learning syntactic patterns for automatic hypernym discovery

Published: 01 December 2004 Publication History

Abstract

Semantic taxonomies such as WordNet provide a rich source of knowledge for natural language processing applications, but are expensive to build, maintain, and extend. Motivated by the problem of automatically constructing and extending such taxonomies, in this paper we present a new algorithm for automatically learning hypernym (is-a) relations from text. Our method generalizes earlier work that had relied on using small numbers of hand-crafted regular expression patterns to identify hypernym pairs. Using "dependency path" features extracted from parse trees, we introduce a general-purpose formalization and generalization of these patterns. Given a training set of text containing known hypernym pairs, our algorithm automatically extracts useful dependency paths and applies them to new corpora to identify novel pairs. On our evaluation task (determining whether two nouns in a news article participate in a hypernym relationship), our automatically extracted database of hypernyms attains both higher precision and higher recall than WordNet.

References

[1]
Caraballo, S.A. (2001) Automatic Acquisition of a Hypernym-Labeled Noun Hierarchy from Text. Brown University Ph.D. Thesis.
[2]
Cederberg, S. & Widdows, D. (2003) Using LSA and Noun Coordination Information to Improve the Precision and Recall of Automatic Hyponymy Extraction. Proc. of CoNLL-2003, pp. 111-118.
[3]
Ciaramita, M. & Johnson, M. (2003) Supersense Tagging of Unknown Nouns in WordNet. Proc. of EMNLP-2003.
[4]
Ciaramita, M., Hofmann, T., & Johnson, M. (2003) Hierarchical Semantic Classification: Word Sense Disambiguation with World Knowledge. Proc. of IJCAI-2003.
[5]
Fellbaum, C. (1998) WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press.
[6]
Girju, R., Badulescu A., & Moldovan D. (2003) Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations. Proc. of HLT-2003.
[7]
Harman, D. (1992) The DARPA TIPSTER project. ACM SIGIR Forum 26(2), Fall, pp. 26-28.
[8]
Hearst, M. (1992) Automatic Acquisition of Hyponyms from Large Text Corpora. Proc. of the Fourteenth International Conference on Computational Linguistics, Nantes, France.
[9]
Hearst, M. & Schütze, H. (1993) Customizing a lexicon to better suit a computational task. In Proc. of the ACL SIGLEX Workshop on Acquisition of Lexical Knowledge from Text.
[10]
Lin, D. (1998) Dependency-based Evaluation of MINIPAR. Workshop on the Evaluation of Parsing Systems, Granada, Spain
[11]
Lin, D. & Pantel P. (2001) Discovery of Inference Rules for Question Answering. Natural Language Engineering, 7(4), pp. 343-360.
[12]
Pantel, P. (2003) Clustering by Committee. Ph.D. Dissertation. Department of Computing Science, University of Alberta.
[13]
Pantel, P. & Ravichandran, D. (2004) Automatically Labeling Semantic Classes. Proc. of NAACL-2004.
[14]
Pereira, F., Tishby, N., & Lee, L. (1993) Distributional Clustering of English Words. Proc. of ACL-1993, pp. 183-190.
[15]
Ravichandran, D. & Hovy, E. (2002) Learning Surface Text Patterns for a Question Answering system. Proc. of ACL-2002.
[16]
Rennie J., Shih, L., Teevan, J., & Karger, D. (2003) Tackling the Poor Assumptions of Naive Bayes Text Classifiers. Proc. of ICLM-2003.
[17]
Riloff, E. & Shepherd, J. (1997) A Corpus-Based Approach for Building Semantic Lexicons. Proc of EMNLP-1997.
[18]
Roark, B. & Charniak, E. (1998) Noun-phrase co-occurerence statistics for semi-automatic-semantic lexicon construction. Proc. of ACL-1998, 1110-1116.
[19]
Tseng, H. (2003) Semantic classification of unknown words in Chinese. Proc. of ACL-2003.
[20]
Turney, P.D., Littman, M.L., Bigham, J. & Shanyder, V. (2003) Combining independent modules to solve multiple-choice synonym and analogy problems. Proc. of RANLP-2003, pp. 482-489.
[21]
Widdows, D. (2003) Unsupervised methods for developing taxonomies by combining syntactic and statistical information. Proc. of HLT/NAACL 2003, pp. 276-283.

Cited By

View all
  • (2020)On the Construction of Web NER Model Training Tool based on Distant SupervisionACM Transactions on Asian and Low-Resource Language Information Processing10.1145/342281719:6(1-28)Online publication date: 15-Nov-2020
  • (2020)CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation TransferringProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3394486.3403244(1928-1936)Online publication date: 23-Aug-2020
  • (2020)A Survey on Computational Metaphor ProcessingACM Computing Surveys10.1145/337326553:2(1-37)Online publication date: 20-Mar-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
NIPS'04: Proceedings of the 17th International Conference on Neural Information Processing Systems
December 2004
1656 pages

Publisher

MIT Press

Cambridge, MA, United States

Publication History

Published: 01 December 2004

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2020)On the Construction of Web NER Model Training Tool based on Distant SupervisionACM Transactions on Asian and Low-Resource Language Information Processing10.1145/342281719:6(1-28)Online publication date: 15-Nov-2020
  • (2020)CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation TransferringProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3394486.3403244(1928-1936)Online publication date: 23-Aug-2020
  • (2020)A Survey on Computational Metaphor ProcessingACM Computing Surveys10.1145/337326553:2(1-37)Online publication date: 20-Mar-2020
  • (2019)TiFi: Taxonomy Induction for Fictional DomainsThe World Wide Web Conference10.1145/3308558.3313519(2673-2679)Online publication date: 13-May-2019
  • (2019)A Family of Fuzzy Orthogonal Projection Models for Monolingual and Cross-lingual Hypernymy PredictionThe World Wide Web Conference10.1145/3308558.3313439(1965-1976)Online publication date: 13-May-2019
  • (2019)Visual Segmentation for Information Extraction from Heterogeneous Visually Rich DocumentsProceedings of the 2019 International Conference on Management of Data10.1145/3299869.3319867(247-262)Online publication date: 25-Jun-2019
  • (2019)Mining Algorithm Roadmap in Scientific PublicationsProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3292500.3330913(1083-1092)Online publication date: 25-Jul-2019
  • (2019)DuerQuizProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3292500.3330706(2165-2173)Online publication date: 25-Jul-2019
  • (2019)Conceptual Representations for Computational Concept CreationACM Computing Surveys10.1145/318672952:1(1-33)Online publication date: 25-Feb-2019
  • (2019)Predicting hypernym---hyponym relations for Chinese taxonomy learningKnowledge and Information Systems10.1007/s10115-018-1166-158:3(585-610)Online publication date: 1-Mar-2019
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media