Enhancing Chinese Named Entity Recognition with Disentangled Expert Knowledge

Wang, Hongkai; Feng, Jun; Wang, Yidan; Pan, Sichen; Zhao, Shuai; Xue, Yi

doi:10.1007/978-981-99-9614-8_6

Hongkai Wang⁸,
Jun Feng⁸,
Yidan Wang⁸,
Sichen Pan⁸,
Shuai Zhao⁸ &
…
Yi Xue⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2004 ))

Included in the following conference series:

International Symposium on Emerging Information Security and Applications

145 Accesses

Abstract

Chinese Named Entity Recognition (NER) requires model identify entity boundaries in the sentence i.e., entity segmentation, and meanwhile assign entities to pre-defined categories, i.e., entity classification. Current NER tasks follows sequence tagging scheme and assign the characters to different labels by considering both segmentation position and entity categories. In such a scheme, the characters in the same entity will be regarded as different classes in the training process according to different positions. In fact, the knowledge of entity segmentation is shared across different entity categories, while entity category knowledge is relatively independent of entity segmentation. Such labeling scheme will lead to the entanglement of these two objectives, hindering the effective knowledge acquisition by the models. To address the entanglement issue and comprehensively extract useful knowledge of two objectives, we propose a novel framework that disentangle the original NER labels into two additional training labels for entity segmentation and entity classification respectively. Then we introduce two dedicated expert models to effectively extract specific knowledge from the disentangled labels. Afterwards, their predictions will be integrated into the original model as auxiliary knowledge, further enhancing the primary NER model learning process. We conduct experiments on three publicly available datasets to demonstrate the effectiveness of our proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Chinese named entity recognition model: integrating label knowledge and lexicon information

Article 16 May 2024

TL-NER: A Transfer Learning Model for Chinese Named Entity Recognition

Article 04 June 2019

Incorporating Boundary and Category Feature for Nested Named Entity Recognition

References

Wang, D., Fan, H., Liu, J.: Learning with joint cross-document information via multi-task learning for named entity recognition. Inf. Sci. 579, 454–467 (2021)
Article MathSciNet Google Scholar
Jimeno, A., Jimenez-Ruiz, E., Lee, V., Gaudan, S., Berlanga, R., Rebholz-Schuhmann, D.: Assessment of disease named entity recognition on a corpus of annotated sentences. BMC Bioinform. (2008)
Google Scholar
Cabrera-Diego, L.A., Moreno, J.G., Doucet, A.: Using a frustratingly easy domain and tagset adaptation for creating slavic named entity recognition systems. In: Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing (2021)
Google Scholar
Chiu, J.P., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016)
Article Google Scholar
Cucerzan, S., Yarowsky, D.: Language independent named entity recognition combining morphological and contextual evidence. In: Empirical Methods in Natural Language Processing (1999)
Google Scholar
Dniken, P.V., Cieliebak, M.: Transfer learning and sentence level features for named entity recognition on tweets. In: Workshop on Noisy User-Generated Text (2017)
Google Scholar
Feng, Y., Sun, L., Zhang, J.: Early results for Chinese named entity recognition using conditional random fields model, hmm and maximum entropy. In: Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2005. Proceedings of 2005 IEEE International Conference on (2005)
Google Scholar
Jin, G., Chen, X.: The fourth international Chinese language processing bakeoff: Chinese word segmentation, named entity recognition and Chinese POS tagging. In: Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing (2008)
Google Scholar
Kendall, A., Gal, Y., Cipolla, R.: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7482–7491 (2018)
Google Scholar
Kenton, J.D.M.W.C., Toutanova, L.K.: Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, vol. 1, p. 2 (2019)
Google Scholar
Khalifa, M., Shaalan, K.: Character convolutions for Arabic named entity recognition with long short-term memory networks. Comput. Speech Lang. 58(Nov), 335–346 (2019)
Google Scholar
Lee, S.H., Song, Y.K., Kim, H.S.: Named entity recognition using distant supervision and active bagging. J. KIISE 43(2), 269–274 (2016)
Article Google Scholar
Lee, S., Song, Y., Choi, M., Kim, H.: Bagging-based active learning model for named entity recognition with distant supervision. In: International Conference on Big Data & Smart Computing (2016)
Google Scholar
Lin, Y., Chengjie, S., Xiaolong, W., Xuan, W.: Combining self learning and active learning for Chinese named entity recognition. J. Softw. 5(5), 530–537 (2010)
Google Scholar
Luo, J., Jianqiang, D.U., Nie, B., Xiong, W., Jia, H.E., Yang, Y.: TCM named entity recognition based on character vector with bidirectional LSTM-CRF. In: International Conference on eHealth, Telemedicine, and Social Medicine (2019)
Google Scholar
Lyu, C., Chen, B., Ren, Y., Ji, D.: Long short-term memory RNN for biomedical named entity recognition. BMC Bioinform. 18(1), 462 (2017)
Article Google Scholar
Ma, X., Hovy, E.: End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. arXiv preprint arXiv:1603.01354 (2016)
Mesfar, S.: Named entity recognition for Arabic using syntactic grammars. In: International Conference on Applications of Natural Language to Information Systems (2007)
Google Scholar
Mukherjee, S., Awadallah, A.H.: Tinymbert: multi-stage distillation framework for massive multi-lingual NER. CoRR abs/2004.05686 (2020)
Google Scholar
Neves Oliveira, B.S., et al.: HELD: Hierarchical entity-label disambiguation in named entity recognition task using deep learning. Intell. Data Anal. 26(3), 637–657 (2022)
Article Google Scholar
Ning, G., Bai, Y.: Biomedical named entity recognition based on glove-BLSTM-CRF model. J. Comput. Methods Sci. Eng. 3, 1–9 (2020)
Google Scholar
Nozza, D., Manchanda, P., Fersini, E., Palmonari, M., Messina, E.: Learningtoadapt with word embeddings: domain adaptation of named entity recognition systems. Inf. Process. Manag. 58(3), 102537 (2021)
Article Google Scholar
Ouyang, E., Li, Y., Jin, L., Li, Z., Zhang, X.: Exploring N-gram character presentation in bidirectional RNN-CRF for Chinese clinical named entity recognition. In: CCKS: China Conference on Knowledge Graph and Semantic Computing 2017 (2017)
Google Scholar
Patra, R., Saha, S.K.: Utilizing external corpora through kernel function: application in biomedical named entity recognition. Prog. Artif. Intell 9(3), 209–219 (2020)
Article Google Scholar
Peng, N., Dredze, M.: Named entity recognition for Chinese social media with jointly trained embeddings. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 548–554 (2015)
Google Scholar
Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014). http://www.aclweb.org/anthology/D14-1162
Peters, M.E., et al.: Deep contextualized word representations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 2227–2237 (2018)
Google Scholar
Rong, X.: word2vec parameter learning explained (2016)
Google Scholar
Rouhou, A.C., Dhiaf, M., Kessentini, Y., Ben Salem, S.: Transformer-based approach for joint handwriting and named entity recognition in historical document. Pattern Recognit. Lett. 155, 128–134 (2022)
Article Google Scholar
Steinberger, R., Pouliquen, B.: Cross-lingual named entity recognition. Lingvisticae Investigationes 30(1), 135–162 (2007)
Article Google Scholar
Tran, V.C., Nguyen, N.T., Fujita, H., Hoang, D.T., Hwang, D.: A combination of active learning and self-learning for named entity recognition on twitter using conditional random fields. Knowl.-Based Syst. 132(15), 179–187 (2017)
Article Google Scholar
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
Google Scholar
Xu, L., et al.: Cluener 2020: fine-grained named entity recognition dataset and benchmark for Chinese. arXiv preprint arXiv:2001.04351 (2020)
Yin, M., Mou, C., Xiong, K., Ren, J.: Chinese clinical named entity recognition with radical-level feature and self-attention mechanism. J. Biomed. Inform. 98, 103289 (2019)
Article Google Scholar
Yu, K., Kurohashi, S., Liu, H., Nakazawa, T.: Chinese word segmentation and named entity recognition by character tagging. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 146–149 (2006)
Google Scholar
Zhang, Y., Yang, J.: Chinese NER using lattice LSTM. arXiv preprint arXiv:1805.02023 (2018)

Download references

Acknowledgements

This work was supported by the Science and Technology Project of State Grid Zhejiang Electric Power Co., Ltd. (Project number: B311XT220007).

Author information

Authors and Affiliations

State Grid Zhejiang Electric Power Corporation Information and Telecommunication Branch, Hangzhou, China
Hongkai Wang, Jun Feng, Yidan Wang, Sichen Pan & Shuai Zhao
Nanjing Duotuo Intelligent Technology Limited Liability Company, Nanjing, China
Yi Xue

Authors

Hongkai Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jun Feng
View author publications
You can also search for this author in PubMed Google Scholar
Yidan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Sichen Pan
View author publications
You can also search for this author in PubMed Google Scholar
Shuai Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Yi Xue
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yidan Wang .

Editor information

Editors and Affiliations

Zhejiang Gongshang University, Hangzhou, China
Jun Shao
Norwegian University of Science and Technology, Gjøvik, Norway
Sokratis K. Katsikas
Technical University of Denmark, Kongens Lyngby, Denmark
Weizhi Meng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, H., Feng, J., Wang, Y., Pan, S., Zhao, S., Xue, Y. (2024). Enhancing Chinese Named Entity Recognition with Disentangled Expert Knowledge. In: Shao, J., Katsikas, S.K., Meng, W. (eds) Emerging Information Security and Applications. EISA 2023. Communications in Computer and Information Science, vol 2004 . Springer, Singapore. https://doi.org/10.1007/978-981-99-9614-8_6

Download citation

DOI: https://doi.org/10.1007/978-981-99-9614-8_6
Published: 04 January 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9613-1
Online ISBN: 978-981-99-9614-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Enhancing Chinese Named Entity Recognition with Disentangled Expert Knowledge

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Chinese named entity recognition model: integrating label knowledge and lexicon information

TL-NER: A Transfer Learning Model for Chinese Named Entity Recognition

Incorporating Boundary and Category Feature for Nested Named Entity Recognition

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Enhancing Chinese Named Entity Recognition with Disentangled Expert Knowledge

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Chinese named entity recognition model: integrating label knowledge and lexicon information

TL-NER: A Transfer Learning Model for Chinese Named Entity Recognition

Incorporating Boundary and Category Feature for Nested Named Entity Recognition

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation