DOI: 10.3115/1219840.1219916

Automatic acquisition of adjectival subcategorization from corpora

Published: 25 June 2005

Abstract

This paper describes a novel system for acquiring adjectival subcategorization frames (SCFs) and associated frequency information from English corpus data. The system incorporates a decision-tree classifier for 30 SCF types that tests for the presence of grammatical relations (GRs) in the output of a robust statistical parser. It uses a powerful pattern-matching language to classify GRs into frames hierarchically, in a way that mirrors inheritance-based lexica. The experiments show that the system detects SCF types with 70% precision and 66% recall. The paper also introduces a new tool for linguistic annotation of SCFs in corpus data, which can considerably ease the process of obtaining training and test data for subcategorization acquisition.
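
To make the reported figures concrete, the following is a minimal sketch of how precision and recall over detected SCF types can be computed for a single adjective. It assumes the standard set-based definitions; the abstract does not spell out the paper's exact evaluation procedure. The function name and the frame labels are hypothetical and are not taken from the paper's 30-frame inventory.

```python
def type_precision_recall(predicted, gold):
    """Return (precision, recall) for two collections of SCF type labels.

    Assumes the usual definitions: precision is the share of predicted
    types that are in the gold standard, recall is the share of gold
    types that were predicted.
    """
    predicted, gold = set(predicted), set(gold)
    correct = predicted & gold
    precision = len(correct) / len(predicted) if predicted else 0.0
    recall = len(correct) / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical frame labels for one adjective (illustration only).
gold = {"ADJ", "ADJ-PP", "ADJ-THAT-S", "ADJ-TO-INF"}
predicted = {"ADJ", "ADJ-PP", "ADJ-TO-INF", "ADJ-WH-S"}

precision, recall = type_precision_recall(predicted, gold)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy case three of the four predicted types are correct and three of the four gold types are found, so both scores are 0.75.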

Published In

ACL '05: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics
June 2005
657 pages
General Chair: Kevin Knight

Publisher

Association for Computational Linguistics

United States

Acceptance Rates

ACL '05 Paper Acceptance Rate 77 of 423 submissions, 18%;
Overall Acceptance Rate 85 of 443 submissions, 19%
