Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.3115/1119176.1119179dlproceedingsArticle/Chapter ViewAbstractPublication PagesconllConference Proceedingsconference-collections
Article
Free access

Active learning for HPSG parse selection

Published: 31 May 2003 Publication History

Abstract

We describe new features and algorithms for HPSG parse selection models and address the task of creating annotated material to train them. We evaluate the ability of several sample selection methods to reduce the number of annotated sentences necessary to achieve a given level of performance. Our best method achieves a 60% reduction in the amount of training material without any loss in accuracy.

References

[1]
Steven Abney. 2002. Bootstrapping. In Proceedings of the 40th Annual Meeting of the ACL, pages 360--367, Philadelphia, PA.
[2]
Shlomo Argamon-Engelson and Ido Dagan. 1999. Committee-based sample selection for probabilistic classifiers. Journal of Artificial Intelligence Research, 11:335--360.
[3]
David Cohn, Les Atlas, and Richard Ladner. 1994. Improving generalization with active learning. Machine Learning, 15(2):201--221.
[4]
Michael Collins and Nigel Duffy. 2002. New ranking algorithms for parsing and tagging: Kernels over discrete structures and the voted perceptron. In Proceedings of the 40th Annual Meeting of the ACL, pages 263--270, Philadelphia, Pennsylvania.
[5]
Michael Collins. 1997. Three generative, lexicalised models for statistical parsing. In Proceedings of the 35th Annual Meeting of the ACL, pages 16--23, Madrid, Spain.
[6]
Sean P. Engelson and Ido Dagan. 1996. Minimizing manual annotation cost in supervised training from copora. In Proceedings of the 34th Annual Meeting of the ACL, pages 319--326.
[7]
Dan Flickinger. 2000. On building a more efficient grammar by exploiting types. Natural Language Engineering, 6(1): 15--28. Special Issue on Efficient Processing with HPSG.
[8]
Yoav Freund, H. Sebastian Seung, Eli Shamir, and Naftali Tishby. 1997. Selective sampling using the query by committee algorithm. Machine Learning, 28(2-3):133--168.
[9]
Sally Goldman and Yan Zhou. 2000. Enhancing supervised learning with unlabeled data. In Proceedings of the 17th International Conference on Machine Learning, Stanford, CA.
[10]
Rebecca Hwa. 2000. Sample selection for statistical grammar induction. In Proceedings of the 2000 Joint SIGDAT Conference on EMNLP and VLC, pages 45--52, Hong Kong, China, October.
[11]
Mark Johnson, Stuart Geman, Stephen Cannon, Zhiyi Chi, and Stephan Riezler. 1999. Estimators for Stochastic "Unification-Based" Grammars. In 37th Annual Meeting of the ACL.
[12]
David D. Lewis and William A. Gale. 1994. A sequential algorithm for training text classifiers. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 3--12.
[13]
Robert Malouf. 2002. A comparison of algorithms for maximum entropy parameter estimation. In Proceedings of the Sixth Workshop on Natural Language Learning, pages 49--55, Taipei, Taiwan.
[14]
Ion Muslea, Steven Minton, and Craig Knoblock. 2000. Selective sampling with redundant views. In Proceedings of National Conference on Artificial Intelligence (AAAI-2000), pages 621--626.
[15]
Stephan Oepen, Kristina Toutanova, Stuart Shieber, Christopher Manning, Dan Flickinger, and Thorsten Brants. 2002. The LinGO Redwoods Treebank: Motivation and preliminary applications. In Proceedings of the 19th International Conference on Computational Linguistics, Taipei, Taiwan.
[16]
Miles Osborne. 2000. Estimation of Stochastic Attribute-Value Grammars using an Informative Sample. In The 18th International Conference on Computational Linguistics, Saarbrücken.
[17]
Cynthia A. Thompson, Mary Elaine Califf, and Raymond J. Mooney. 1999. Active learning for natural language parsing and information extraction. In Proc. 16th International Conf. on Machine Learning, pages 406--414. Morgan Kaufmann, San Francisco, CA.
[18]
Kristina Toutanova and Chris Manning. 2002. Feature selection for a rich HPSG grammar using decision trees. In Proceedings of the 6th Conference on Natural Language Learning, Taipei, Taiwan.

Cited By

View all
  • (2018)Active learning and logarithmic opinion pools for hpsg parse selectionNatural Language Engineering10.1017/S135132490600439614:2(191-222)Online publication date: 21-Dec-2018
  • (2018)Bootstrapping parsers via syntactic projection across parallel textsNatural Language Engineering10.1017/S135132490500384011:3(311-325)Online publication date: 21-Dec-2018
  • (2015)Active imitation learning of hierarchical policiesProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832581.2832744(3554-3560)Online publication date: 25-Jul-2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
CONLL '03: Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
May 2003
213 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 31 May 2003

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)43
  • Downloads (Last 6 weeks)10
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Active learning and logarithmic opinion pools for hpsg parse selectionNatural Language Engineering10.1017/S135132490600439614:2(191-222)Online publication date: 21-Dec-2018
  • (2018)Bootstrapping parsers via syntactic projection across parallel textsNatural Language Engineering10.1017/S135132490500384011:3(311-325)Online publication date: 21-Dec-2018
  • (2015)Active imitation learning of hierarchical policiesProceedings of the 24th International Conference on Artificial Intelligence10.5555/2832581.2832744(3554-3560)Online publication date: 25-Jul-2015
  • (2014)Toward detection of aliases without string similarityInformation Sciences: an International Journal10.1016/j.ins.2013.11.010261(89-100)Online publication date: 1-Mar-2014
  • (2009)Active Zipfian sampling for statistical parser trainingProceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers10.5555/1620853.1620922(249-252)Online publication date: 31-May-2009
  • (2009)Proactive learning for building machine translation systems for minority languagesProceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing10.5555/1564131.1564143(58-61)Online publication date: 5-Jun-2009
  • (2009)Active learning for anaphora resolutionProceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing10.5555/1564131.1564133(1-8)Online publication date: 5-Jun-2009
  • (2008)Specialized models and ranking for coreference resolutionProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/1613715.1613797(660-669)Online publication date: 25-Oct-2008
  • (2007)The Lextype DBProceedings of the 1st international conference on Intercultural collaboration10.5555/1769901.1769909(76-90)Online publication date: 25-Jan-2007
  • (2006)Margin-Based active learning for structured output spacesProceedings of the 17th European conference on Machine Learning10.1007/11871842_40(413-424)Online publication date: 18-Sep-2006
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media