Abstract
Chinese Named Entity Recognition (NER) requires model identify entity boundaries in the sentence i.e., entity segmentation, and meanwhile assign entities to pre-defined categories, i.e., entity classification. Current NER tasks follows sequence tagging scheme and assign the characters to different labels by considering both segmentation position and entity categories. In such a scheme, the characters in the same entity will be regarded as different classes in the training process according to different positions. In fact, the knowledge of entity segmentation is shared across different entity categories, while entity category knowledge is relatively independent of entity segmentation. Such labeling scheme will lead to the entanglement of these two objectives, hindering the effective knowledge acquisition by the models. To address the entanglement issue and comprehensively extract useful knowledge of two objectives, we propose a novel framework that disentangle the original NER labels into two additional training labels for entity segmentation and entity classification respectively. Then we introduce two dedicated expert models to effectively extract specific knowledge from the disentangled labels. Afterwards, their predictions will be integrated into the original model as auxiliary knowledge, further enhancing the primary NER model learning process. We conduct experiments on three publicly available datasets to demonstrate the effectiveness of our proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Wang, D., Fan, H., Liu, J.: Learning with joint cross-document information via multi-task learning for named entity recognition. Inf. Sci. 579, 454–467 (2021)
Jimeno, A., Jimenez-Ruiz, E., Lee, V., Gaudan, S., Berlanga, R., Rebholz-Schuhmann, D.: Assessment of disease named entity recognition on a corpus of annotated sentences. BMC Bioinform. (2008)
Cabrera-Diego, L.A., Moreno, J.G., Doucet, A.: Using a frustratingly easy domain and tagset adaptation for creating slavic named entity recognition systems. In: Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing (2021)
Chiu, J.P., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016)
Cucerzan, S., Yarowsky, D.: Language independent named entity recognition combining morphological and contextual evidence. In: Empirical Methods in Natural Language Processing (1999)
Dniken, P.V., Cieliebak, M.: Transfer learning and sentence level features for named entity recognition on tweets. In: Workshop on Noisy User-Generated Text (2017)
Feng, Y., Sun, L., Zhang, J.: Early results for Chinese named entity recognition using conditional random fields model, hmm and maximum entropy. In: Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2005. Proceedings of 2005 IEEE International Conference on (2005)
Jin, G., Chen, X.: The fourth international Chinese language processing bakeoff: Chinese word segmentation, named entity recognition and Chinese POS tagging. In: Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing (2008)
Kendall, A., Gal, Y., Cipolla, R.: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7482–7491 (2018)
Kenton, J.D.M.W.C., Toutanova, L.K.: Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, vol. 1, p. 2 (2019)
Khalifa, M., Shaalan, K.: Character convolutions for Arabic named entity recognition with long short-term memory networks. Comput. Speech Lang. 58(Nov), 335–346 (2019)
Lee, S.H., Song, Y.K., Kim, H.S.: Named entity recognition using distant supervision and active bagging. J. KIISE 43(2), 269–274 (2016)
Lee, S., Song, Y., Choi, M., Kim, H.: Bagging-based active learning model for named entity recognition with distant supervision. In: International Conference on Big Data & Smart Computing (2016)
Lin, Y., Chengjie, S., Xiaolong, W., Xuan, W.: Combining self learning and active learning for Chinese named entity recognition. J. Softw. 5(5), 530–537 (2010)
Luo, J., Jianqiang, D.U., Nie, B., Xiong, W., Jia, H.E., Yang, Y.: TCM named entity recognition based on character vector with bidirectional LSTM-CRF. In: International Conference on eHealth, Telemedicine, and Social Medicine (2019)
Lyu, C., Chen, B., Ren, Y., Ji, D.: Long short-term memory RNN for biomedical named entity recognition. BMC Bioinform. 18(1), 462 (2017)
Ma, X., Hovy, E.: End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. arXiv preprint arXiv:1603.01354 (2016)
Mesfar, S.: Named entity recognition for Arabic using syntactic grammars. In: International Conference on Applications of Natural Language to Information Systems (2007)
Mukherjee, S., Awadallah, A.H.: Tinymbert: multi-stage distillation framework for massive multi-lingual NER. CoRR abs/2004.05686 (2020)
Neves Oliveira, B.S., et al.: HELD: Hierarchical entity-label disambiguation in named entity recognition task using deep learning. Intell. Data Anal. 26(3), 637–657 (2022)
Ning, G., Bai, Y.: Biomedical named entity recognition based on glove-BLSTM-CRF model. J. Comput. Methods Sci. Eng. 3, 1–9 (2020)
Nozza, D., Manchanda, P., Fersini, E., Palmonari, M., Messina, E.: Learningtoadapt with word embeddings: domain adaptation of named entity recognition systems. Inf. Process. Manag. 58(3), 102537 (2021)
Ouyang, E., Li, Y., Jin, L., Li, Z., Zhang, X.: Exploring N-gram character presentation in bidirectional RNN-CRF for Chinese clinical named entity recognition. In: CCKS: China Conference on Knowledge Graph and Semantic Computing 2017 (2017)
Patra, R., Saha, S.K.: Utilizing external corpora through kernel function: application in biomedical named entity recognition. Prog. Artif. Intell 9(3), 209–219 (2020)
Peng, N., Dredze, M.: Named entity recognition for Chinese social media with jointly trained embeddings. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 548–554 (2015)
Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014). http://www.aclweb.org/anthology/D14-1162
Peters, M.E., et al.: Deep contextualized word representations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pp. 2227–2237 (2018)
Rong, X.: word2vec parameter learning explained (2016)
Rouhou, A.C., Dhiaf, M., Kessentini, Y., Ben Salem, S.: Transformer-based approach for joint handwriting and named entity recognition in historical document. Pattern Recognit. Lett. 155, 128–134 (2022)
Steinberger, R., Pouliquen, B.: Cross-lingual named entity recognition. Lingvisticae Investigationes 30(1), 135–162 (2007)
Tran, V.C., Nguyen, N.T., Fujita, H., Hoang, D.T., Hwang, D.: A combination of active learning and self-learning for named entity recognition on twitter using conditional random fields. Knowl.-Based Syst. 132(15), 179–187 (2017)
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
Xu, L., et al.: Cluener 2020: fine-grained named entity recognition dataset and benchmark for Chinese. arXiv preprint arXiv:2001.04351 (2020)
Yin, M., Mou, C., Xiong, K., Ren, J.: Chinese clinical named entity recognition with radical-level feature and self-attention mechanism. J. Biomed. Inform. 98, 103289 (2019)
Yu, K., Kurohashi, S., Liu, H., Nakazawa, T.: Chinese word segmentation and named entity recognition by character tagging. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 146–149 (2006)
Zhang, Y., Yang, J.: Chinese NER using lattice LSTM. arXiv preprint arXiv:1805.02023 (2018)
Acknowledgements
This work was supported by the Science and Technology Project of State Grid Zhejiang Electric Power Co., Ltd. (Project number: B311XT220007).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, H., Feng, J., Wang, Y., Pan, S., Zhao, S., Xue, Y. (2024). Enhancing Chinese Named Entity Recognition with Disentangled Expert Knowledge. In: Shao, J., Katsikas, S.K., Meng, W. (eds) Emerging Information Security and Applications. EISA 2023. Communications in Computer and Information Science, vol 2004 . Springer, Singapore. https://doi.org/10.1007/978-981-99-9614-8_6
Download citation
DOI: https://doi.org/10.1007/978-981-99-9614-8_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9613-1
Online ISBN: 978-981-99-9614-8
eBook Packages: Computer ScienceComputer Science (R0)