Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Enhancing Chinese Named Entity Recognition with Disentangled Expert Knowledge

  • Conference paper
  • First Online:
Emerging Information Security and Applications (EISA 2023)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2004 ))

  • 145 Accesses

Abstract

Chinese Named Entity Recognition (NER) requires model identify entity boundaries in the sentence i.e., entity segmentation, and meanwhile assign entities to pre-defined categories, i.e., entity classification. Current NER tasks follows sequence tagging scheme and assign the characters to different labels by considering both segmentation position and entity categories. In such a scheme, the characters in the same entity will be regarded as different classes in the training process according to different positions. In fact, the knowledge of entity segmentation is shared across different entity categories, while entity category knowledge is relatively independent of entity segmentation. Such labeling scheme will lead to the entanglement of these two objectives, hindering the effective knowledge acquisition by the models. To address the entanglement issue and comprehensively extract useful knowledge of two objectives, we propose a novel framework that disentangle the original NER labels into two additional training labels for entity segmentation and entity classification respectively. Then we introduce two dedicated expert models to effectively extract specific knowledge from the disentangled labels. Afterwards, their predictions will be integrated into the original model as auxiliary knowledge, further enhancing the primary NER model learning process. We conduct experiments on three publicly available datasets to demonstrate the effectiveness of our proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Wang, D., Fan, H., Liu, J.: Learning with joint cross-document information via multi-task learning for named entity recognition. Inf. Sci. 579, 454–467 (2021)

    Article  MathSciNet  Google Scholar 

  2. Jimeno, A., Jimenez-Ruiz, E., Lee, V., Gaudan, S., Berlanga, R., Rebholz-Schuhmann, D.: Assessment of disease named entity recognition on a corpus of annotated sentences. BMC Bioinform. (2008)

    Google Scholar 

  3. Cabrera-Diego, L.A., Moreno, J.G., Doucet, A.: Using a frustratingly easy domain and tagset adaptation for creating slavic named entity recognition systems. In: Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing (2021)

    Google Scholar 

  4. Chiu, J.P., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016)

    Article  Google Scholar 

  5. Cucerzan, S., Yarowsky, D.: Language independent named entity recognition combining morphological and contextual evidence. In: Empirical Methods in Natural Language Processing (1999)

    Google Scholar 

  6. Dniken, P.V., Cieliebak, M.: Transfer learning and sentence level features for named entity recognition on tweets. In: Workshop on Noisy User-Generated Text (2017)

    Google Scholar 

  7. Feng, Y., Sun, L., Zhang, J.: Early results for Chinese named entity recognition using conditional random fields model, hmm and maximum entropy. In: Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2005. Proceedings of 2005 IEEE International Conference on (2005)

    Google Scholar 

  8. Jin, G., Chen, X.: The fourth international Chinese language processing bakeoff: Chinese word segmentation, named entity recognition and Chinese POS tagging. In: Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing (2008)

    Google Scholar 

  9. Kendall, A., Gal, Y., Cipolla, R.: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7482–7491 (2018)

    Google Scholar 

  10. Kenton, J.D.M.W.C., Toutanova, L.K.: Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, vol. 1, p. 2 (2019)

    Google Scholar 

  11. Khalifa, M., Shaalan, K.: Character convolutions for Arabic named entity recognition with long short-term memory networks. Comput. Speech Lang. 58(Nov), 335–346 (2019)

    Google Scholar 

  12. Lee, S.H., Song, Y.K., Kim, H.S.: Named entity recognition using distant supervision and active bagging. J. KIISE 43(2), 269–274 (2016)

    Article  Google Scholar 

  13. Lee, S., Song, Y., Choi, M., Kim, H.: Bagging-based active learning model for named entity recognition with distant supervision. In: International Conference on Big Data & Smart Computing (2016)

    Google Scholar 

  14. Lin, Y., Chengjie, S., Xiaolong, W., Xuan, W.: Combining self learning and active learning for Chinese named entity recognition. J. Softw. 5(5), 530–537 (2010)

    Google Scholar 

  15. Luo, J., Jianqiang, D.U., Nie, B., Xiong, W., Jia, H.E., Yang, Y.: TCM named entity recognition based on character vector with bidirectional LSTM-CRF. In: International Conference on eHealth, Telemedicine, and Social Medicine (2019)

    Google Scholar 

  16. Lyu, C., Chen, B., Ren, Y., Ji, D.: Long short-term memory RNN for biomedical named entity recognition. BMC Bioinform. 18(1), 462 (2017)

    Article  Google Scholar 

  17. Ma, X., Hovy, E.: End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. arXiv preprint arXiv:1603.01354 (2016)

  18. Mesfar, S.: Named entity recognition for Arabic using syntactic grammars. In: International Conference on Applications of Natural Language to Information Systems (2007)

    Google Scholar 

  19. Mukherjee, S., Awadallah, A.H.: Tinymbert: multi-stage distillation framework for massive multi-lingual NER. CoRR abs/2004.05686 (2020)

    Google Scholar 

  20. Neves Oliveira, B.S., et al.: HELD: Hierarchical entity-label disambiguation in named entity recognition task using deep learning. Intell. Data Anal. 26(3), 637–657 (2022)

    Article  Google Scholar 

  21. Ning, G., Bai, Y.: Biomedical named entity recognition based on glove-BLSTM-CRF model. J. Comput. Methods Sci. Eng. 3, 1–9 (2020)

    Google Scholar 

  22. Nozza, D., Manchanda, P., Fersini, E., Palmonari, M., Messina, E.: Learningtoadapt with word embeddings: domain adaptation of named entity recognition systems. Inf. Process. Manag. 58(3), 102537 (2021)

    Article  Google Scholar 

  23. Ouyang, E., Li, Y., Jin, L., Li, Z., Zhang, X.: Exploring N-gram character presentation in bidirectional RNN-CRF for Chinese clinical named entity recognition. In: CCKS: China Conference on Knowledge Graph and Semantic Computing 2017 (2017)

    Google Scholar 

  24. Patra, R., Saha, S.K.: Utilizing external corpora through kernel function: application in biomedical named entity recognition. Prog. Artif. Intell 9(3), 209–219 (2020)

    Article  Google Scholar 

  25. Peng, N., Dredze, M.: Named entity recognition for Chinese social media with jointly trained embeddings. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 548–554 (2015)

    Google Scholar 

  26. Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014). http://www.aclweb.org/anthology/D14-1162

  27. Peters, M.E., et al.: Deep contextualized word representations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 2227–2237 (2018)

    Google Scholar 

  28. Rong, X.: word2vec parameter learning explained (2016)

    Google Scholar 

  29. Rouhou, A.C., Dhiaf, M., Kessentini, Y., Ben Salem, S.: Transformer-based approach for joint handwriting and named entity recognition in historical document. Pattern Recognit. Lett. 155, 128–134 (2022)

    Article  Google Scholar 

  30. Steinberger, R., Pouliquen, B.: Cross-lingual named entity recognition. Lingvisticae Investigationes 30(1), 135–162 (2007)

    Article  Google Scholar 

  31. Tran, V.C., Nguyen, N.T., Fujita, H., Hoang, D.T., Hwang, D.: A combination of active learning and self-learning for named entity recognition on twitter using conditional random fields. Knowl.-Based Syst. 132(15), 179–187 (2017)

    Article  Google Scholar 

  32. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)

    Google Scholar 

  33. Xu, L., et al.: Cluener 2020: fine-grained named entity recognition dataset and benchmark for Chinese. arXiv preprint arXiv:2001.04351 (2020)

  34. Yin, M., Mou, C., Xiong, K., Ren, J.: Chinese clinical named entity recognition with radical-level feature and self-attention mechanism. J. Biomed. Inform. 98, 103289 (2019)

    Article  Google Scholar 

  35. Yu, K., Kurohashi, S., Liu, H., Nakazawa, T.: Chinese word segmentation and named entity recognition by character tagging. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 146–149 (2006)

    Google Scholar 

  36. Zhang, Y., Yang, J.: Chinese NER using lattice LSTM. arXiv preprint arXiv:1805.02023 (2018)

Download references

Acknowledgements

This work was supported by the Science and Technology Project of State Grid Zhejiang Electric Power Co., Ltd. (Project number: B311XT220007).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yidan Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, H., Feng, J., Wang, Y., Pan, S., Zhao, S., Xue, Y. (2024). Enhancing Chinese Named Entity Recognition with Disentangled Expert Knowledge. In: Shao, J., Katsikas, S.K., Meng, W. (eds) Emerging Information Security and Applications. EISA 2023. Communications in Computer and Information Science, vol 2004 . Springer, Singapore. https://doi.org/10.1007/978-981-99-9614-8_6

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-9614-8_6

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-9613-1

  • Online ISBN: 978-981-99-9614-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics