Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/956863.956921acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Lattice-based tagging using support vector machines

Published: 03 November 2003 Publication History

Abstract

Tagging algorithms have become increasingly important for identifying lexical and semantic features of unstructured text. We describe an approach to lattice-based tagging that estimates joint transition and emission probabilities using support vector machines. The technique offers several advantages over alternative methods, including the ability to accommodate non-local features, support for hundreds of thousands of features, and language-neutrality. We demonstrate the technique on two tagging applications: named entity recognition and part-of-speech tagging.

References

[1]
D. M. Bikel, S. Miller, R. Schwartz, and R. Weischedel, 1997. 'Nymble: a high-performance learning name-finder.' Proceedings of the 5th Conference on Applied Natural Language Processing (ANLP-97) pp. 194--201.
[2]
Thorsten Brants, 2000. 'TnT-A statistical part-of-speech tagger.' In Proceedings of ANLP-2000, Seattle, Washington.
[3]
Hai Leong Chieu and Hwee Tou Ng, 2002. 'Named entity recognition: A maximum entropy approach using global information.' Proceedings of the 19th International Conference on Computational Linguistics (COLING 2002), pp. 190--196, Taipei, Taiwan.
[4]
Thorsten Joachims. 1999. 'Making large-scale SVM learning practical.' In B. Schölkopf, C. Burges and A. Smola, eds., Support Vector Learning. MIT Press.
[5]
Mitchell P. Marcus, Mary Ann Marcinkiewicz and Beatrice Santorini, 1993. 'Building a large annotated corpus of English: The Penn Treebank.' Computational Linguistics 19(2):313--330.
[6]
John C. Platt. 1999. 'Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods.' In Advances in Large Margin Classifiers, A. Smola, P. Bartlett, B. Scholkopf, D. Schuurmans (eds.), MIT Press.
[7]
Adwait Ratnaparkhi, 1996. 'A maximum entropy part-of-speech tagger.' Proceedings of the Empirical Methods in Natural Language Processing Conference, Philadelphia, Pennsylvania. Available from <http://www.cis.upenn.edu/ adwait/statnlp.html>, visited 28 May 2003.
[8]
Beatrice Santorini, 1990. Part-of-Speech Tagging Guidelines for the Penn Treebank Project. 3rd revision. Available from <http://www.cis.upenn.edu/ treebank/>, visited 28 May 2003.
[9]
Erik F. Tjong Kim Sang, 2002. 'Introduction to the CoNLL-2002 shared task: Language-independent named entity recognition.' In Dan Roth and Antal van den Bosch, eds., Proceedings of CoNLL-2002, Taipei, Taiwan. pp. 155--158.
[10]
Erik F. Tjong Kim Sang and Fien De Meulder, 2003. 'Introduction to the CoNLL-2003 Shared Task: Language Independent Named Entity Recognition.' In Walter Daelemans and Miles Osborne (eds.), Proceedings of CoNLL-2003, Edmonton, Canada.
[11]
Vladimir N. Vapnik. 1995. The Nature of Statistical Learning Theory. Springer-Verlag.

Cited By

View all
  • (2012)Part-of-speech taggingWIREs Computational Statistics10.1002/wics.1954:1(107-113)Online publication date: 1-Jan-2012
  • (2010)Multi-facet product information search and retrieval using semantically annotated product family ontologyInformation Processing and Management: an International Journal10.1016/j.ipm.2009.09.00146:4(479-493)Online publication date: 1-Jul-2010
  • (2009)Faceted search and retrieval based on semantically annotated product family ontologyProceedings of the WSDM '09 Workshop on Exploiting Semantic Annotations in Information Retrieval10.1145/1506250.1506254(15-24)Online publication date: 9-Feb-2009

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '03: Proceedings of the twelfth international conference on Information and knowledge management
November 2003
592 pages
ISBN:1581137230
DOI:10.1145/956863
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2003

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. SVM-Lattice
  2. named entity recognition
  3. part of speech tagging
  4. support vector machines
  5. tagging

Qualifiers

  • Article

Conference

CIKM03

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)2
Reflects downloads up to 18 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2012)Part-of-speech taggingWIREs Computational Statistics10.1002/wics.1954:1(107-113)Online publication date: 1-Jan-2012
  • (2010)Multi-facet product information search and retrieval using semantically annotated product family ontologyInformation Processing and Management: an International Journal10.1016/j.ipm.2009.09.00146:4(479-493)Online publication date: 1-Jul-2010
  • (2009)Faceted search and retrieval based on semantically annotated product family ontologyProceedings of the WSDM '09 Workshop on Exploiting Semantic Annotations in Information Retrieval10.1145/1506250.1506254(15-24)Online publication date: 9-Feb-2009

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media