Abstract
Search performance can be greatly improved by describing data using Natural Language Processing (NLP) tools to create new metadata for digital libraries. In this paper, a methodology is presented to use a specific domain knowledge to improve user request. This domain knowledge is based on concepts, extracted from the document itself, used as “semantic metadata tags” in order to annotate XML documents. We present the process followed to define and to add new XML semantic metadata into the digital library of scientific theses. Using these new metadata, an ontology is also built to complete the annotation process. Effective retrieval information is obtained by using an intelligent system based on our XML semantic metadata and a domain ontology.
Similar content being viewed by others
References
Abascal, R., Rumpler, B.: Evaluación de herramientas de extracción automática de conceptos dentro de un ambiente de biblioteca digital. Colombian Journal of Computation 6(1) (June 2005) ISSN 1657–2831
Plante, P., Dumas, L., Plante, A.: Nomino version 4.2.22
Thomasson, J.-J.: Schémas XML, Ed. Eyrolles, p. 466 (2002) ISBN: 2-212-11195-9
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abascal, R., Rumpler, B., Berisha-Bohé, S., Pinon, J.M. (2005). A Semantic Structure for Digital Theses Collection Based on Domain Annotations. In: Rauber, A., Christodoulakis, S., Tjoa, A.M. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2005. Lecture Notes in Computer Science, vol 3652. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551362_52
Download citation
DOI: https://doi.org/10.1007/11551362_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28767-4
Online ISBN: 978-3-540-31931-3
eBook Packages: Computer ScienceComputer Science (R0)