Abstract
In this work the CLASITEX+ system for discovers the most important themes treated in a text written in Spanish or English is presented. This system works on the basis of trees of concepts and find: a) the most frequent concepts in the text, b) the relation between these concepts computing the co-ocurrence into the sentences that conform the text. Also CLASITEX+ can give us a distribution map of the most frequent concepts in the text An important characteristic of the system is the amount of concepts in Spanish and English handled by the system, also the execution time in the document analysis is very acceptable.
Chapter PDF
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Feldman R. and Hirsh H. “Mining association in text in the presence of background knowledge”. In the Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Portland Oregon, August, 343–346 (1996).
Feldman R. and Ido Dagan. “Knowledge Discovery in textual databeses (KDT)”. In the Proceedings of the first Int. Conf. On Data Mining and Knowledge Discovery (KDD95), pp. 112–117, Montreal, Aug 1995. or[3] Flamsteed S. “Saturn, 2004”. Discover, The world of Science, pp. 76, January 1998.
Guzmán A.A. “Finding the main themes in a spanish document”. Journal Expert Systems and Applicacions, Vol. 14, No. 1, January (1998).
Rijsbergen C.J., et al. “Information retrieval”. Second Edition 1979, http://www.dcs.gla.ac.uk/Keith/Preface.html.
Salton G. “Automatic Text Processing: The transformation, Analysis and Retrieval of Information by Computer”. Addison-Wesley Publishing Company (1989).
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Trinidad, J.F.M., Martínez, B.B., Arenas, A.G., Schulcloper, J.R. (1998). Clasitex+: A tool for knowledge discovery from texts. In: Żytkow, J.M., Quafafou, M. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 1998. Lecture Notes in Computer Science, vol 1510. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0094850
Download citation
DOI: https://doi.org/10.1007/BFb0094850
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65068-3
Online ISBN: 978-3-540-49687-8
eBook Packages: Springer Book Archive