Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3366030.3366072acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

Named entity recognition using point prediction and active learning

Published: 22 February 2020 Publication History
  • Get Citation Alerts
  • Abstract

    Named entity recognition (NER) research has been spreading into specialty domains. A specialty domain corpus is smaller than a general domain corpus. Moreover, annotating a specialty domain corpus is more expensive than annotating a general corpus. Therefore, in this paper, we introduce a model that uses point-wise prediction and active learning to achieve a high extraction performance even in a small annotation corpus. We demonstrate the effectiveness of our approach through a simulation of active learning.

    References

    [1]
    Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural Language Processing (Almost) from Scratch. J. Mach. Learn. Res. 12 (Nov. 2011), 2493--2537. http://dl.acm.org/citation.cfm?id=1953048.2078186
    [2]
    Radu Florian, Abe Ittycheriah, Hongyan Jing, and Tong Zhang. 2003. Named entity recognition through classifier combination. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003-Volume 4. Association for Computational Linguistics, 168--171.
    [3]
    Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF Models for Sequence Tagging. CoRR abs/1508.01991 (2015). arXiv:1508.01991 http://arxiv.org/abs/1508.01991
    [4]
    Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. 2016. Neural Architectures for Named Entity Recognition. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 260--270. https://doi.org/10.18653/v1/N16-1030
    [5]
    David D Lewis and Jason Catlett. 1994. Heterogeneous uncertainty sampling for supervised learning. In Machine Learning Proceedings 1994. Elsevier, 148--156.
    [6]
    Yijia Liu, Yue Zhang, Wanxiang Che, Ting Liu, and Fan Wu. 2014. Domain adaptation for CRF-based Chinese word segmentation using free annotations. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 864--874.
    [7]
    Xuezhe Ma and Eduard Hovy. 2016. End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1064--1074. https://doi.org/10.18653/v1/P16-1101
    [8]
    Andrew McCallum and Wei Li. 2003. Early Results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-enhanced Lexicons. In Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 - Volume 4 (CONLL '03). Association for Computational Linguistics, Stroudsburg, PA, USA, 188--191. https://doi.org/10.3115/1119176.1119206
    [9]
    Andrew McCallum and Kamal Nigam. 1998. Employing EM and Pool-Based Active Learning for Text Classification. In Proceedings of the Fifteenth International Conference on Machine Learning (ICML '98). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 350--358. http://dl.acm.org/citation.cfm?id=645527.757765
    [10]
    Shinsuke Mori, Yosuke Nakata, Graham Neubig, and Tatsuya Kawahara. 2011. Morphological Analysis with Pointwise Predictors. Journal of Natural Language Processing 18, 4 (2011), 367--381. https://doi.org/10.5715/jnlp.18.367
    [11]
    Graham Neubig, Yosuke Nakata, and Shinsuke Mori. 2011. Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis. In The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT). Portland, Oregon, USA, 529--533. http://www.phontron.com/paper/neubig11aclshort.pdf
    [12]
    Nicholas Roy and Andrew McCallum. 2001. Toward Optimal Active Learning through Sampling Estimation of Error Reduction. In ICML.
    [13]
    Tobias Scheffer, Christian Decomain, and Stefan Wrobel. 2001. Active hidden markov models for information extraction. In International Symposium on Intelligent Data Analysis. Springer, 309--318.
    [14]
    Burr Settles. 2009. Active Learning Literature Survey. Computer Sciences Technical Report 1648. University of Wisconsin-Madison.
    [15]
    Burr Settles and Mark Craven. 2008. An Analysis of Active Learning Strategies for Sequence Labeling Tasks. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP '08). Association for Computational Linguistics, Stroudsburg, PA, USA, 1070--1079. http://dl.acm.org/citation.cfm?id=1613715.1613855
    [16]
    Burr Settles, Mark Craven, and Soumya Ray. 2008. Multiple-Instance Active Learning. In Advances in Neural Information Processing Systems 20, J. C. Platt, D. Koller, Y. Singer, and S. T. Roweis (Eds.). Curran Associates, Inc., 1289--1296. http://papers.nips.cc/paper/3252-multiple-instance-active-learning.pdf
    [17]
    H. S. Seung, M. Opper, and H. Sompolinsky. 1992. Query by Committee. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory (COLT '92). ACM, New York, NY, USA, 287--294. https://doi.org/10.1145/130385.130417
    [18]
    Yanyao Shen, Hyokun Yun, Zachary C. Lipton, Yakov Kronrod, and Animashree Anandkumar. 2017. Deep Active Learning for Named Entity Recognition. CoRR abs/1707.05928 (2017). arXiv:1707.05928 http://arxiv.org/abs/1707.05928
    [19]
    Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 Shared Task: Language-independent Named Entity Recognition. In Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 - Volume 4 (CONLL '03). Association for Computational Linguistics, Stroudsburg, PA, USA, 142--147. https://doi.org/10.3115/1119176.1119195

    Cited By

    View all
    • (2023)Investigation of Deep Active Self-learning Algorithms Applied to Named Entity RecognitionIntelligent Systems10.1007/978-3-031-45392-2_31(470-484)Online publication date: 12-Oct-2023
    • (2021)Efficient Training Method for Phrase Extraction Models using Natural Language ExplanationsThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487703(288-295)Online publication date: 29-Nov-2021

    Index Terms

    1. Named entity recognition using point prediction and active learning

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      iiWAS2019: Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services
      December 2019
      709 pages
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      In-Cooperation

      • JKU: Johannes Kepler Universität Linz
      • @WAS: International Organization of Information Integration and Web-based Applications and Services

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 22 February 2020

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. datasets
      2. named entity recognition
      3. neural networks
      4. text tagging

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      iiWAS2019

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)4
      • Downloads (Last 6 weeks)0

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Investigation of Deep Active Self-learning Algorithms Applied to Named Entity RecognitionIntelligent Systems10.1007/978-3-031-45392-2_31(470-484)Online publication date: 12-Oct-2023
      • (2021)Efficient Training Method for Phrase Extraction Models using Natural Language ExplanationsThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487703(288-295)Online publication date: 29-Nov-2021

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media