Abstract
This paper proposes an approach based on the Ontology and transformation-based error-driven learning (TBL) to recognize Chinese proper nouns. Firstly, our approach redefines the label set and tags Chinese words according to the usage of proper nouns and their context, and then it extracts Characteristic Information (CI) of the proper noun from the text and merges them based on the Ontology. Secondly, it tags the training corpus following the new definition of Multi-dimension Attribute Points (MAP), and then extracts rules using the TBL approach. Finally, it recognizes proper nouns by utilizing the rule set and Ontology. The experimental results in our open test show that the precision is 92.5% and the recall is 86.3%.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Yu, H., Zhang, H., Liu, Q.: Recognition of Chinese Organization Name Based on Role Tagging. In: Proc. of 20th Int. Conf. on Computer Processing of Oriental Languages, pp. 79–87 (2003)
Yu, H., Zhang, H., Liu, Q., Lv, X., Shi, S.: Chinese NE Identification Using Cascaded Hidden Markov Model. Journal of Communications 27(2), 87–94 (2006)
Li, L., Huang, D., Mao, T., Xu, X.: Auto Recognition of Person Names from Chinese Texts Based on SVM. Computer Engineering 32(19), 188–210 (2006)
Qian, J., Zhang, Y., Zhang, T.: Research on Chinese Person Name and Location Name Recognition Based on ME Model. Mini-Micro Systems 27(9), 1761–1765 (2006)
Tan, H., Zheng, J., Liu, K.: Design and Realization of Chinese Place Name Automatic Recognition System. Computer Engineering 28(8), 128–129 (2002)
Li, Z., Liu, Y.: Chinese Name Recognition Based on Boundary Templates and Local Frequency. Journal of Chinese Information Processing 20(5), 44–50 (2006)
Lv, Y., Zhao, T., Yang, M., Yu, H., Li, S.: Unknown Chinese Words Resolution by Dynamic Programming. Journal of Chinese Information Processing 15(1), 123–128 (2001)
Manning, C., Schutze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)
Li, P., Zhu, Q., Qian, P.: The Construction of a Multilingual Language Ontology Framework. Journal of Computer Application 27(3), 646–649 (2007)
Brill, E.: Transformation-based Error-drive Learning and Natural Language Processing: a Case Study in Part of Speech Tagging. Computational Linguistic 21(4), 543–565 (1995)
Zhou, M., Wu, J., Wang, C.: A Fast Learning Algorithm for Part of Speech Tagging: An Improvement on Brill’s Transformation-based Algorithm. Chinese Journal of Computer 21(4), 357–366 (1998)
Zhu, Q., Wen, T., Li, P., Qian, P.: Self-Adaptive Chinese Ambiguous Word Segmentation Method Based on Multi-Gram Library. Mini-micro Systems 27(8), 1597–1600 (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, P., Zhu, Q., Wang, L. (2007). Recognizing Chinese Proper Nouns with Transformation-Based Learning and Ontology. In: Basili, R., Pazienza, M.T. (eds) AI*IA 2007: Artificial Intelligence and Human-Oriented Computing. AI*IA 2007. Lecture Notes in Computer Science(), vol 4733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74782-6_74
Download citation
DOI: https://doi.org/10.1007/978-3-540-74782-6_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74781-9
Online ISBN: 978-3-540-74782-6
eBook Packages: Computer ScienceComputer Science (R0)