Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.3115/1219840.1219841dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

A high-performance semi-supervised learning method for text chunking

Published: 25 June 2005 Publication History

Abstract

In machine learning, whether one can build a more accurate classifier by using unlabeled data (semi-supervised learning) is an important issue. Although a number of semi-supervised methods have been proposed, their effectiveness on NLP tasks is not always clear. This paper presents a novel semi-supervised method that employs a learning paradigm which we call structural learning. The idea is to find "what good classifiers are like" by learning from thousands of automatically generated auxiliary classification problems on unlabeled data. By doing so, the common predictive structure shared by the multiple classification problems can be discovered, which can then be used to improve performance on the target problem. The method produces performance higher than the previous best results on CoNLL'00 syntactic chunking and CoNLL'03 named entity chunking (English and German).

References

[1]
Rie Kubota Ando and Tong Zhang. 2004. A framework for learning predictive structures from multiple tasks and unlabeled data. Technical report, IBM. RC23462.
[2]
Rie Kubota Ando. 2004. Semantic lexicon construction: Learning from unlabeled data via spectral analysis. In Proceedings of CoNLL-2004.
[3]
Avrim Blum and Tom Mitchell. 1998. Combining labeled and unlabeled data with co-training. In proceedings of COLT-98.
[4]
Xavier Carreras and Lluis Marquez. 2003. Phrase recognition by filtering and ranking with perceptrons. In Proceedings of RANLP-2003.
[5]
Hai Leong Chieu and Hwee Tou Ng. 2003. Named entity recognition with a maximum entropy approach. In Proceedings CoNLL-2003, pages 160--163.
[6]
Michael Collins and Yoram Singer. 1999. Unsupervised models for named entity classification. In Proceedings of EMNLP/VLC'99.
[7]
Radu Florian, Abe Ittycheriah, Hongyan Jing, and Tong Zhang. 2003. Named entity recognition through classifier combination. In Proceedings CoNLL-2003, pages 168--171.
[8]
Gene H. Golub and Charles F. Van Loan. 1996. Matrix computations third edition.
[9]
Dan Klein, Joseph Smarr, Huy Nguyen, and Christopher D. Manning. 2003. Named entity recognition with character-level models. In Proceedings CoNLL-2003, pages 188--191.
[10]
Taku Kudoh and Yuji Matsumoto. 2001. Chunking with support vector machines. In Proceedings of NAACL 2001.
[11]
Bernard Merialdo. 1994. Tagging English text with a probabilistic model. Computational Linguistics, 20(2):155--171.
[12]
Scott Miller, Jethran Guinness, and Alex Zamanian. 2004. Name tagging with word clusters and discriminative training. In Proceedings of HLT-NAACL-2004.
[13]
Vincent Ng and Claire Cardie. 2003. Weakly supervised natural language learning without redundant views. In Proceedings of HLT-NAACL-2003.
[14]
David Pierce and Claire Cardie. 2001. Limitations of co-training for natural language learning from large datasets. In Proceedings of EMNLP-2001.
[15]
Ellen Riloff and Rosie Jones. 1999. Learning dictionaries for information extraction by multi-level bootstrapping. In Proceedings of AAAI-99.
[16]
Fei Sha and Fernando Pereira. 2003. Shallow parsing with conditional random fields. In Proceedings of HLT-NAACL'03.
[17]
David Yarowsky. 1995. Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of ACL-95.
[18]
Tong Zhang and David E. Johnson. 2003. A robust risk minimization based named entity recognition system. In Proceedings CoNLL-2003, pages 204--207.
[19]
Tong Zhang, Fred Damerau, and David E. Johnson. 2002. Text chunking based on a generalization of Winnow. Journal of Machine Learning Research, 2:615--637.
[20]
Tong Zhang. 2004. Solving large scale linear prediction problems using stochastic gradient descent algorithms. In ICML 04, pages 919--926.

Cited By

View all
  • (2024)Automated patch correctness predicting to fix software defectExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.124877256:COnline publication date: 5-Dec-2024
  • (2021)Lightweight Object Detection Ensemble Framework for Autonomous Vehicles in Challenging Weather ConditionsComputational Intelligence and Neuroscience10.1155/2021/52788202021Online publication date: 1-Jan-2021
  • (2018)SVMs Classification Based Two-side Cross Domain Collaborative Filtering by inferring intrinsic user and item featuresKnowledge-Based Systems10.1016/j.knosys.2017.11.010141:C(80-91)Online publication date: 1-Feb-2018
  • Show More Cited By
  1. A high-performance semi-supervised learning method for text chunking

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      ACL '05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
      June 2005
      657 pages
      • General Chair:
      • Kevin Knight

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 25 June 2005

      Qualifiers

      • Article

      Acceptance Rates

      ACL '05 Paper Acceptance Rate 77 of 423 submissions, 18%;
      Overall Acceptance Rate 85 of 443 submissions, 19%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)81
      • Downloads (Last 6 weeks)11
      Reflects downloads up to 25 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Automated patch correctness predicting to fix software defectExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.124877256:COnline publication date: 5-Dec-2024
      • (2021)Lightweight Object Detection Ensemble Framework for Autonomous Vehicles in Challenging Weather ConditionsComputational Intelligence and Neuroscience10.1155/2021/52788202021Online publication date: 1-Jan-2021
      • (2018)SVMs Classification Based Two-side Cross Domain Collaborative Filtering by inferring intrinsic user and item featuresKnowledge-Based Systems10.1016/j.knosys.2017.11.010141:C(80-91)Online publication date: 1-Feb-2018
      • (2017)Scaling Reflection Prompts in Large Classrooms via Mobile Interfaces and Natural Language ProcessingProceedings of the 22nd International Conference on Intelligent User Interfaces10.1145/3025171.3025204(363-374)Online publication date: 7-Mar-2017
      • (2017)Summit-TrainingApplied Soft Computing10.1016/j.asoc.2016.06.00850:C(1-20)Online publication date: 1-Jan-2017
      • (2017)UFuzzyApplied Soft Computing10.1016/j.asoc.2016.05.04152:C(1296-1315)Online publication date: 1-Mar-2017
      • (2017)A systematic review and comparative analysis of cross-document coreference resolution methods and toolsComputing10.1007/s00607-016-0490-099:4(313-349)Online publication date: 1-Apr-2017
      • (2016)Multi-source iterative adaptation for cross-domain classificationProceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence10.5555/3061053.3061135(3691-3697)Online publication date: 9-Jul-2016
      • (2016)Boosted Web Named Entity Recognition via Tri-TrainingACM Transactions on Asian and Low-Resource Language Information Processing10.1145/296310016:2(1-23)Online publication date: 14-Oct-2016
      • (2016)An Empirical Study on Recommendation with Multiple Types of FeedbackProceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining10.1145/2939672.2939690(283-292)Online publication date: 13-Aug-2016
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media