research-article

Named entity recognition using point prediction and active learning

Authors:

Koga Kobayashi and

Kei WakabayashiAuthors Info & Claims

iiWAS2019: Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services

December 2019

Pages 287 - 293

https://doi.org/10.1145/3366030.3366072

Published: 22 February 2020 Publication History

Abstract

Named entity recognition (NER) research has been spreading into specialty domains. A specialty domain corpus is smaller than a general domain corpus. Moreover, annotating a specialty domain corpus is more expensive than annotating a general corpus. Therefore, in this paper, we introduce a model that uses point-wise prediction and active learning to achieve a high extraction performance even in a small annotation corpus. We demonstrate the effectiveness of our approach through a simulation of active learning.

References

[1]

Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural Language Processing (Almost) from Scratch. J. Mach. Learn. Res. 12 (Nov. 2011), 2493--2537. http://dl.acm.org/citation.cfm?id=1953048.2078186

[2]

Radu Florian, Abe Ittycheriah, Hongyan Jing, and Tong Zhang. 2003. Named entity recognition through classifier combination. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003-Volume 4. Association for Computational Linguistics, 168--171.

Digital Library

[3]

Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF Models for Sequence Tagging. CoRR abs/1508.01991 (2015). arXiv:1508.01991 http://arxiv.org/abs/1508.01991

[4]

Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. 2016. Neural Architectures for Named Entity Recognition. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 260--270. https://doi.org/10.18653/v1/N16-1030

[5]

David D Lewis and Jason Catlett. 1994. Heterogeneous uncertainty sampling for supervised learning. In Machine Learning Proceedings 1994. Elsevier, 148--156.

[6]

Yijia Liu, Yue Zhang, Wanxiang Che, Ting Liu, and Fan Wu. 2014. Domain adaptation for CRF-based Chinese word segmentation using free annotations. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 864--874.

[7]

Xuezhe Ma and Eduard Hovy. 2016. End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1064--1074. https://doi.org/10.18653/v1/P16-1101

[8]

Andrew McCallum and Wei Li. 2003. Early Results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-enhanced Lexicons. In Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 - Volume 4 (CONLL '03). Association for Computational Linguistics, Stroudsburg, PA, USA, 188--191. https://doi.org/10.3115/1119176.1119206

Digital Library

[9]

Andrew McCallum and Kamal Nigam. 1998. Employing EM and Pool-Based Active Learning for Text Classification. In Proceedings of the Fifteenth International Conference on Machine Learning (ICML '98). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 350--358. http://dl.acm.org/citation.cfm?id=645527.757765

[10]

Shinsuke Mori, Yosuke Nakata, Graham Neubig, and Tatsuya Kawahara. 2011. Morphological Analysis with Pointwise Predictors. Journal of Natural Language Processing 18, 4 (2011), 367--381. https://doi.org/10.5715/jnlp.18.367

[11]

Graham Neubig, Yosuke Nakata, and Shinsuke Mori. 2011. Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis. In The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT). Portland, Oregon, USA, 529--533. http://www.phontron.com/paper/neubig11aclshort.pdf

[12]

Nicholas Roy and Andrew McCallum. 2001. Toward Optimal Active Learning through Sampling Estimation of Error Reduction. In ICML.

[13]

Tobias Scheffer, Christian Decomain, and Stefan Wrobel. 2001. Active hidden markov models for information extraction. In International Symposium on Intelligent Data Analysis. Springer, 309--318.

Digital Library

[14]

Burr Settles. 2009. Active Learning Literature Survey. Computer Sciences Technical Report 1648. University of Wisconsin-Madison.

[15]

Burr Settles and Mark Craven. 2008. An Analysis of Active Learning Strategies for Sequence Labeling Tasks. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP '08). Association for Computational Linguistics, Stroudsburg, PA, USA, 1070--1079. http://dl.acm.org/citation.cfm?id=1613715.1613855

Digital Library

[16]

Burr Settles, Mark Craven, and Soumya Ray. 2008. Multiple-Instance Active Learning. In Advances in Neural Information Processing Systems 20, J. C. Platt, D. Koller, Y. Singer, and S. T. Roweis (Eds.). Curran Associates, Inc., 1289--1296. http://papers.nips.cc/paper/3252-multiple-instance-active-learning.pdf

[17]

H. S. Seung, M. Opper, and H. Sompolinsky. 1992. Query by Committee. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory (COLT '92). ACM, New York, NY, USA, 287--294. https://doi.org/10.1145/130385.130417

Digital Library

[18]

Yanyao Shen, Hyokun Yun, Zachary C. Lipton, Yakov Kronrod, and Animashree Anandkumar. 2017. Deep Active Learning for Named Entity Recognition. CoRR abs/1707.05928 (2017). arXiv:1707.05928 http://arxiv.org/abs/1707.05928

[19]

Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 Shared Task: Language-independent Named Entity Recognition. In Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 - Volume 4 (CONLL '03). Association for Computational Linguistics, Stroudsburg, PA, USA, 142--147. https://doi.org/10.3115/1119176.1119195

Cited By

Cunha Santos A. V. Silva Neto Jde Paulo Faleiros T(2023)Investigation of Deep Active Self-learning Algorithms Applied to Named Entity RecognitionIntelligent Systems10.1007/978-3-031-45392-2_31(470-484)Online publication date: 12-Oct-2023
https://doi.org/10.1007/978-3-031-45392-2_31
Saito RKobayashi KWakabayashi K(2021)Efficient Training Method for Phrase Extraction Models using Natural Language ExplanationsThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487703(288-295)Online publication date: 29-Nov-2021
https://dl.acm.org/doi/10.1145/3487664.3487703

Index Terms

Named entity recognition using point prediction and active learning
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction

Recommendations

Mitigating Effect of Dictionary Matching Errors in Distantly Supervised Named Entity Recognition
iiWAS '20: Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services

Named entity recognition (NER) is a fundamental technique that brings basic semantic awareness to natural language processing applications and services. Since we need a large amount of training data to train a custom NER model, distant supervision that ...
Read More
Learning multilingual named entity recognition from Wikipedia

We automatically create enormous, free and multilingual silver-standard training annotations for named entity recognition (ner) by exploiting the text and structure of Wikipedia. Most ner systems rely on statistical models of annotated data to identify ...
Read More
Two-stage approach to named entity recognition using Wikipedia and DBpedia
IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication

In natural language understanding, extraction of named entity (NE) mentions in given text and classification of the mentions into pre-defined NE types are important processes. Most NE recognition (NER) relies on resources such as a training corpus or NE ...
Read More

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

iiWAS2019: Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services

December 2019

709 pages

ISBN:9781450371797

DOI:10.1145/3366030

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

JKU: Johannes Kepler Universität Linz
@WAS: International Organization of Information Integration and Web-based Applications and Services

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 February 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

iiWAS2019

iiWAS2019: The 21st International Conference on Information Integration and Web-based Applications & Services

December 2 - 4, 2019

Munich, Germany

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
101
Total Downloads

Downloads (Last 12 months)4
Downloads (Last 6 weeks)0

Other Metrics

View Author Metrics

Citations

Cited By

Cunha Santos A. V. Silva Neto Jde Paulo Faleiros T(2023)Investigation of Deep Active Self-learning Algorithms Applied to Named Entity RecognitionIntelligent Systems10.1007/978-3-031-45392-2_31(470-484)Online publication date: 12-Oct-2023
https://doi.org/10.1007/978-3-031-45392-2_31
Saito RKobayashi KWakabayashi K(2021)Efficient Training Method for Phrase Extraction Models using Natural Language ExplanationsThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487703(288-295)Online publication date: 29-Nov-2021
https://dl.acm.org/doi/10.1145/3487664.3487703

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents