Abstract
This work deals with the semi-automatic generation of subcategorization frames (SCFs) of Spanish verbs; specifically, given a set of verbs in Spanish and their respective sense, their SCFs are obtained. The acquisition of SCFs in Spanish has been approached in different works: in some the frames are generated manually, while in others they are obtained semi-automatically from a tagged corpus; unfortunately in this case, the results depend on the characteristics of the texts used. The method proposed in this document combines an ontology-based approach (through lexical relations of verbs) and linguistic knowledge (functional class of verbs). The relations among base verbs and other verbs were obtained from the Spanish WordNet ontology, which contains lexical relations among words. Also, the existing relation between the SCF and the functional class of verbs was used to generate the SCFs. In order to evaluate the method the SCFs for 44 base verbs were generated manually, from which 239 SCFs were semi-automatically generated and validated, yielding an accuracy of 89.38%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ushioda, A., Evans, D.A., Gibson, T., Waibel, A.: The Automatic Acquisition of Frequencies of Verb Subcategorization Frames from Tagged Corpora. In: SIGLEX ACL Workshop on The Acquisition of Lexical Knowledge from Text, Columbus, Ohio, pp. 95–106 (1993)
Cervantes, A.: Diseño e Implementación de un Analizador Sintáctico para las Oraciones en Español Usando el Método de Dependencias. Tesis de maestría. Centro Nacional de Investigación y Desarrollo Tecnológico (2005)
Faure, D., Nedellec, C.: Knowledge Acquisition of Predicate Argument Structures from Technical Texts Using Machine Learning: The System ASIUM. In: Proc. of the 11th European Workshop on Knowledge Acquisition, Modeling and Management, pp. 329–334 (1999)
Kingsbury, P., Marcus, M., Palmer, M.: Adding semantic annotation to the Penn Tree-Bank. In: Proc. of the Human Language Technology Conference (HLT), San Diego, CA (2002)
Korhonen, A.: Assigning Verbs to Semantic Classes via WordNet. In: Proc. of the SemaNet 2002: Building and Using Semantic Networks, Taipei, Taiwan, pp. 1–7 (2002)
Sarkar, A., Tripasai, W.: Learning Verb Argument Structure from Minimally Annotated Corpora. In: Proc. of the Int. Conf. on Computational Linguistics, Taipei, Taiwan, pp. 1–8 (2002)
Castellón, I., Alemany, L.A., Tincheva, N.T.: A Procedure to Automatically Enrich Verbal Lexica with Subcategorization Frames. Revista Iberoamericana de Inteligencia Artificial 12(37), 45–53 (2008)
Real Academia Española, Banco de datos (CREA), Corpus de referencia del español actual (2006), http://www.rae.es
Galicia, S.: Análisis Sintáctico Conducido por un Diccionario de Patrones de Manejo Sintáctico para Lenguaje Español. Tesis doctoral. Centro de Investigación en Computación, Instituto Politécnico Nacional (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pazos R, R.A., Martínez F, J.A., González B, J., Morales-Rodríguez, M.L., Galiana B, G.M., Castro H., A. (2008). Ontology-Based Approach for Semi-automatic Generation of Subcategorization Frames for Spanish Verbs. In: Corchado, E., Abraham, A., Pedrycz, W. (eds) Hybrid Artificial Intelligence Systems. HAIS 2008. Lecture Notes in Computer Science(), vol 5271. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87656-4_69
Download citation
DOI: https://doi.org/10.1007/978-3-540-87656-4_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87655-7
Online ISBN: 978-3-540-87656-4
eBook Packages: Computer ScienceComputer Science (R0)