Abstract
The task of main verb identification is to recognize the predicate-verb in a sentence. This task plays a crucial role in various areas such as knowledge acquisition, text mining, and question answering, and is also an important preprocessing for many applications including sentence pattern analysis and semantic roles identification. This paper proposes a domain-independent bootstrapping method to automatically identify main verbs of sentences from un-annotated domain-specific Chinese unstructured texts. Experimental results in two domains show that the algorithm is promising. As applications of the main verb identification, we have developed a main verb driven approach of extracting domain-specific terms from unstructured text corpus.
The first and third authors are supported by the Program for New Century Excellent Talents in Universities of China and the IPv6 based National Foundation Education Grid (the Model Project of China Next Generation Internet) and Beijing Institute of Technology Basic Research Foundation (grant no.411002). The second author is supported by the Natural Science Foundation (grant no.60273019, 60496326, 60573063, and 60573064), and the National 973 Program (grant no. 2003CB317008 and G1999032701).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zhang, C., Hao, T.: The State of the Art and Difficulties in Automatic Chinese Word Segmentation. Journal of System and Simulation 1, 138–143 (2005)
Koong, H., Soo, V.: Hypothesis Scoring over Theta Grids Information in Parsing Chinese Sentences with Serial Verb Constructions. In: International Conference on Computational Linguistics, Kyoto, Japan, pp. 942–948 (1994)
Luo, Z., et al.: An Approach to the Recognition of Predicates in the Automatic Analysis of Chinese Sentence Patterns. In: Proceedings of 3rd National Computational Linguistics, Beijing, China, pp. 159–164 (1995)
Sui, Z., Yu, S.: The Research on Recognizing the Predicate Head of a Chinese Simple Sentence in EBMT. Journal of Chinese Information Processing 4, 39–46 (1998)
Tan, H.: Center Predicate Recognization for Scientific Article. Journal of Wuhan University 6, 1–3 (2000)
Chen, X., Shi, D.: To Mark Topic and Subject in Chinese Sentences. In: Proceedings of the Fourth National Conference on Computational Linguistics, pp. 102–108 (1997)
Sui, Z., et al.: The Acquisition and Application of the Knowledge for Recognizing the Predicate Head of a Chinese Simple Sentence. Journal of Peking University 223, 221–230 (1998)
Gong, X., Luo, Z., Luo, W.: Recognizing the Predicate Head of Chinese Sentences. Journal of Chinese Information Processing 2, 7–13 (2003)
Ding, B., Huang, C., Huang, D.: Chinese Main Verb Identification: from Specification to Realization. Computational Linguistics and Chinese Language Processing 1, 53–94 (2005)
Huang, B.R., Liao, X.D.: Modern Chinese. High Education Publisher, Beijing (2002)
Lu, S., et al.: Elementary Study of Chinese Grammar. The Commercial Press, Beijing (1999)
Chinese Encyclopedia. Encyclopedia of China Publishing House, Beijing (1998)
Yu, S., et al.: A Dictionary of Contemporary Chinese Grammatical Information. The Tsinghua University Press, Beijing (1998)
Mei, J., et al.: A Dictionary of Synonyms. Shanghai Thesaurus Press, Shanghai (1983)
Bourigault, D.: Lexter: A Natural Language Processing Tool for Terminology Extraction. In: Proceedings of the 7th Euralex International Congress (1996)
Evans, D., et al.: Noun-Phrase Analysis in Unrestricted Text for Information Retrieval. In: Proceedings of the 34th Association for Computational Linguistics, pp. 17–24 (1996)
Chinese Text Segmentation and POS Tagging, http://www.icl.pku.edu.cn
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, C., Cao, C., Niu, Z. (2007). A Bootstrapping Approach for Chinese Main Verb Identification. In: Apolloni, B., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2007. Lecture Notes in Computer Science(), vol 4692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74819-9_72
Download citation
DOI: https://doi.org/10.1007/978-3-540-74819-9_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74817-5
Online ISBN: 978-3-540-74819-9
eBook Packages: Computer ScienceComputer Science (R0)