Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3308558.3313743acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article

Neural Chinese Named Entity Recognition via CNN-LSTM-CRF and Joint Training with Word Segmentation

Published: 13 May 2019 Publication History
  • Get Citation Alerts
  • Abstract

    Chinese named entity recognition (CNER) is an important task in Chinese natural language processing field. However, CNER is very challenging since Chinese entity names are highly context-dependent. In addition, Chinese texts lack delimiters to separate words, making it difficult to identify the boundary of entities. Besides, the training data for CNER in many domains is usually insufficient, and annotating enough training data for CNER is very expensive and time-consuming. In this paper, we propose a neural approach for CNER. First, we introduce a CNN-LSTM-CRF neural architecture to capture both local and long-distance contexts for CNER. Second, we propose a unified framework to jointly train CNER and word segmentation models in order to enhance the ability of CNER model in identifying entity boundaries. Third, we introduce an automatic method to generate pseudo labeled samples from existing labeled data which can enrich the training data. Experiments on two benchmark datasets show that our approach can effectively improve the performance of Chinese named entity recognition, especially when training data is insufficient.

    References

    [1]
    Aitao Chen, Fuchun Peng, Roy Shan, and Gordon Sun. 2006. Chinese named entity recognition with conditional probabilistic models. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing. 173-176.
    [2]
    Xinchi Chen, Xipeng Qiu, Chenxi Zhu, Pengfei Liu, and Xuanjing Huang. 2015. Long short-term memory neural networks for chinese word segmentation. In EMNLP. 1197-1206.
    [3]
    Jason P. C. Chiu and Eric Nichols. 2016. Named Entity Recognition with Bidirectional LSTM-CNNs. TACL4(2016), 357-370.
    [4]
    Ronan Collobert, Jason Weston, Le´on Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. 2011. Natural language processing (almost) from scratch. Journal of Machine Learning Research12, Aug (2011), 2493-2537.
    [5]
    Yann Dauphin, Harm de Vries, and Yoshua Bengio. 2015. Equilibrated adaptive learning rates for non-convex optimization. In NIPS. 1504-1512.
    [6]
    Chuanhai Dong, Huijia Wu, Jiajun Zhang, and Chengqing Zong. 2017. Multichannel LSTM-CRF for Named Entity Recognition in Chinese Social Media. In Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. 197-208.
    [7]
    Chuanhai Dong, Jiajun Zhang, Chengqing Zong, Masanori Hattori, and Hui Di. 2016. Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition. In International Conference on Computer Processing of Oriental Languages. 239-250.
    [8]
    Li Dong, Furu Wei, Ming Zhou, and Ke Xu. 2015. Question answering over freebase with multi-column convolutional neural networks. In ACL. 260-269.
    [9]
    Cicero dos Santos and Victor Guimarães. 2015. Boosting Named Entity Recognition with Neural Character Embeddings. In Proceedings of the Fifth Named Entity Workshop. 25-33.
    [10]
    Jianfeng Gao, Mu Li, Andi Wu, and Chang-Ning Huang. 2005. Chinese word segmentation and named entity recognition: A pragmatic approach. Computational Linguistics31, 4 (2005), 531-574.
    [11]
    Xianpei Han, Le Sun, and Jun Zhao. 2011. Collective entity linking in web text: a graph-based method. In SIGIR. 765-774.
    [12]
    Hangfeng He and Xu Sun. 2017. A Unified Model for Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media. In AAAI. 3216-3222.
    [13]
    Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation9, 8 (1997), 1735-1780.
    [14]
    Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991(2015).
    [15]
    John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira. 2001. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In ICML. 282-289.
    [16]
    Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. 2016. Neural Architectures for Named Entity Recognition. In NAACL. 260-270.
    [17]
    Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. nature521, 7553 (2015), 436.
    [18]
    Yann LeCun, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, and Lawrence D Jackel. 1989. Backpropagation applied to handwritten zip code recognition. Neural computation1, 4 (1989), 541-551.
    [19]
    Gina-Anne Levow. 2006. The third international Chinese language processing bakeoff: Word segmentation and named entity recognition. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing. 108-117.
    [20]
    Yankai Lin, Shiqi Shen, Zhiyuan Liu, Huanbo Luan, and Maosong Sun. 2016. Neural relation extraction with selective attention over instances. In ACL. 2124-2133.
    [21]
    Junxin Liu, Fangzhao Wu, Chuhan Wu, Yongfeng Huang, and Xing Xie. 2018. Neural Chinese Word Segmentation with Dictionary Knowledge. In NLPCC. 80-91.
    [22]
    Zhangxun Liu, Conghui Zhu, and Tiejun Zhao. 2010. Chinese named entity recognition with a sequence labeling approach: based on characters, or based on words?In Advanced intelligent computing theories and applications with aspects of artificial intelligence. Springer, 634-640.
    [23]
    Gang Luo, Xiaojiang Huang, Chin-Yew Lin, and Zaiqing Nie. 2015. Joint entity recognition and disambiguation. In EMNLP. 879-888.
    [24]
    Wencan Luo and Fan Yang. 2016. An Empirical Study of Automatic Chinese Word Segmentation for Spoken Language Understanding and Named Entity Recognition. In NAACL. 238-248.
    [25]
    Xuezhe Ma and Eduard H. Hovy. 2016. End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF. In ACL. 1064-1074.
    [26]
    Shotaro Misawa, Motoki Taniguchi, Yasuhide Miura, and Tomoko Ohkuma. 2017. Character-based Bidirectional LSTM-CRF with words and characters for Japanese Named Entity Recognition. In Proceedings of the First Workshop on Subword and Character Level Models in NLP. 97-102.
    [27]
    Nanyun Peng and Mark Dredze. 2015. Named Entity Recognition for Chinese Social Media with Jointly Trained Embeddings. In EMNLP. 548-554.
    [28]
    Nanyun Peng and Mark Dredze. 2017. Multi-task Domain Adaptation for Sequence Tagging. In Proceedings of the 2nd Workshop on Representation Learning for NLP. 91-100.
    [29]
    Burr Settles. 2004. Biomedical named entity recognition using conditional random fields and rich feature sets. In Proceedings of the international joint workshop on natural language processing in biomedicine and its applications. 104-107.
    [30]
    Nitish Srivastava, Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting.Journal of machine learning research15, 1 (2014), 1929-1958.
    [31]
    Chunqi Wang, Wei Chen, and Bo Xu. 2017. Named Entity Recognition with Gated Convolutional Neural Networks. In Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. 110-121.
    [32]
    Xiaofeng Yu, Wai Lam, Shing-Kit Chan, Yiu Kei Wu, and Bo Chen. 2008. Chinese NER Using CRFs and Logic for the Fourth SIGHAN Bakeoff. In IJCNLP. 102-105.
    [33]
    Yuejie Zhang, Zhiting Xu, and Tao Zhang. 2008. Fusion of multiple features for chinese named entity recognition based on CRF model. In Asia Information Retrieval Symposium. 95-106.

    Cited By

    View all
    • (2024)Relief Supply-Demand Estimation Based on Social Media in Typhoon Disasters Using Deep Learning and a Spatial Information Diffusion ModelISPRS International Journal of Geo-Information10.3390/ijgi1301002913:1(29)Online publication date: 16-Jan-2024
    • (2024)Research on Information Fusion Method for Enterprise Fuzzy Technical RequirementsOperations Research and Fuzziology10.12677/orf.2024.14218614:02(857-869)Online publication date: 2024
    • (2024)Enhancing Named Entity Recognition in Safety Hazard Analysis through GBD and LLMs2024 7th International Conference on Information and Computer Technologies (ICICT)10.1109/ICICT62343.2024.00009(13-19)Online publication date: 15-Mar-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '19: The World Wide Web Conference
    May 2019
    3620 pages
    ISBN:9781450366748
    DOI:10.1145/3308558
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • IW3C2: International World Wide Web Conference Committee

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 May 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Named Entity Recognition
    2. Neural Network
    3. Word Segmentation

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    WWW '19
    WWW '19: The Web Conference
    May 13 - 17, 2019
    CA, San Francisco, USA

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)57
    • Downloads (Last 6 weeks)5

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Relief Supply-Demand Estimation Based on Social Media in Typhoon Disasters Using Deep Learning and a Spatial Information Diffusion ModelISPRS International Journal of Geo-Information10.3390/ijgi1301002913:1(29)Online publication date: 16-Jan-2024
    • (2024)Research on Information Fusion Method for Enterprise Fuzzy Technical RequirementsOperations Research and Fuzziology10.12677/orf.2024.14218614:02(857-869)Online publication date: 2024
    • (2024)Enhancing Named Entity Recognition in Safety Hazard Analysis through GBD and LLMs2024 7th International Conference on Information and Computer Technologies (ICICT)10.1109/ICICT62343.2024.00009(13-19)Online publication date: 15-Mar-2024
    • (2024)Research on Chinese Named Entity Recognition Methods for Machine Learning2024 7th International Conference on Advanced Algorithms and Control Engineering (ICAACE)10.1109/ICAACE61206.2024.10548563(419-422)Online publication date: 1-Mar-2024
    • (2024)Study on Chinese Semantic Entity Recognition Method for Cabin Utilizing BERT-BiGRU ModelIEEE Access10.1109/ACCESS.2024.338676012(56042-56049)Online publication date: 2024
    • (2024)Few-Shot Blockchain Domain Named Entity Recognition with Fused Lexical FeaturesComputational and Experimental Simulations in Engineering10.1007/978-3-031-44947-5_48(595-613)Online publication date: 25-Jan-2024
    • (2023)Banking System Incidents Analysis Using Knowledge GraphInternational Journal of Knowledge and Systems Science10.4018/IJKSS.32579414:1(1-23)Online publication date: 10-Jul-2023
    • (2023)Research on Chinese Semantic Named Entity Recognition in Marine Engine Room Systems Based on BERTJournal of Marine Science and Engineering10.3390/jmse1107126611:7(1266)Online publication date: 21-Jun-2023
    • (2023)Reassembling Fragmented Entity Names: A Novel Model for Chinese Compound Noun ProcessingElectronics10.3390/electronics1220425112:20(4251)Online publication date: 14-Oct-2023
    • (2023)Fusion of SoftLexicon and RoBERTa for Purpose-Driven Electronic Medical Record Named Entity RecognitionApplied Sciences10.3390/app13241329613:24(13296)Online publication date: 15-Dec-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media