DOI: 10.3115/992133.992154

Automatic acquisition of hyponyms from large text corpora

Published: 23 August 1992

Abstract

We describe a method for the automatic acquisition of the hyponymy lexical relation from unrestricted text. Two goals motivate the approach: (i) avoidance of the need for pre-encoded knowledge and (ii) applicability across a wide range of text. We identify a set of lexico-syntactic patterns that are easily recognizable, that occur frequently and across text genre boundaries, and that indisputably indicate the lexical relation of interest. We describe a method for discovering these patterns and suggest that other lexical relations will also be acquirable in this way. A subset of the acquisition algorithm is implemented and the results are used to augment and critique the structure of a large hand-built thesaurus. Extensions and applications to areas such as information retrieval are suggested.
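
The pattern-based approach described in the abstract can be illustrated with a small sketch. The abstract itself does not list the patterns; the forms below ("such as", "such ... as", "including", "especially", "and/or other") are the ones widely associated with this paper, reduced here to regular expressions. Everything in the code is an assumption made for illustration: the names `NP`, `NP_LIST`, `PATTERNS`, and `extract_hyponyms` are hypothetical, and a noun phrase is approximated by a single word, whereas a real system would need a part-of-speech tagger or chunker to find noun phrase boundaries.

```python
import re

# Assumption: a noun phrase is approximated by ONE word. This keeps the
# sketch short but drops modifiers ("broken bones" would become "bones").
NP = r"[A-Za-z][A-Za-z-]*"
# Separators inside a list of noun phrases; longer alternatives come first
# so that ", and " is not misread as ", " followed by the word "and".
SEP = r"(?:, and |, or |, | and | or )"
NP_LIST = rf"{NP}(?:{SEP}{NP})*"

# Each pattern names the general term "hyper" and the list of specific
# terms "hypos". The first four put the hypernym first, the last one last.
PATTERNS = [
    re.compile(rf"\b(?P<hyper>{NP}),? such as (?P<hypos>{NP_LIST})"),
    re.compile(rf"\bsuch (?P<hyper>{NP}) as (?P<hypos>{NP_LIST})"),
    re.compile(rf"\b(?P<hyper>{NP}),? including (?P<hypos>{NP_LIST})"),
    re.compile(rf"\b(?P<hyper>{NP}),? especially (?P<hypos>{NP_LIST})"),
    re.compile(rf"\b(?P<hypos>{NP}(?:, {NP})*),? (?:and|or) other (?P<hyper>{NP})"),
]


def extract_hyponyms(sentence):
    """Return (hyponym, hypernym) pairs found by any pattern in the sentence."""
    pairs = []
    for pattern in PATTERNS:
        for match in pattern.finditer(sentence):
            # Break the matched list apart at commas and at "and"/"or".
            for hypo in re.split(r",? (?:and|or) |, ", match.group("hypos")):
                pairs.append((hypo, match.group("hyper")))
    return pairs


if __name__ == "__main__":
    # Toy input written for this sketch, not taken from the paper's corpus.
    print(extract_hyponyms("He plays guitars, violins, and other instruments."))
    # [('guitars', 'instruments'), ('violins', 'instruments')]
```

Because the patterns rely only on surface word order, the sketch needs no pre-encoded lexical knowledge, which matches the first goal stated in the abstract. The single-word noun phrase is the main limitation: on realistic text it would return head nouns without their modifiers, or miss a match entirely.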


Published In

COLING '92: Proceedings of the 14th conference on Computational linguistics - Volume 2
August 1992
433 pages

Sponsors

  • SITE
  • Ministère de la recherche et de la technologie
  • Ville de Nantes
  • Ministère des Affaires Étrangères
  • Conseil Général de Loire Atlantique
  • CNRS: Centre National de la Recherche Scientifique
  • Université de Nantes
  • ATALA
  • ACL
  • AFCET
  • Universités de Grenoble
  • IMAG

Publisher

Association for Computational Linguistics

United States

Acceptance Rates

Overall Acceptance Rate 1,537 of 1,537 submissions, 100%
