Abstract
Structure is an important characteristic of documents. In the document management community it has been realized that there is a need for querying and retrieval of documents based on the structure of the documents. In this paper we propose a knowledge-based system for representation and retrieval of documents. The model on which the system is based is an extension of the description logic model of information retrieval. We discuss the advantages of our model and we show how our model can cope with many of the desirable queries involving the structure of documents.
Part of this work was done while the first author was visiting the Department of Computer Science of the RMIT University in Melbourne, Australia. The first author is supported by grant 95-176 from the Swedish Research Council for Engineering Sciences (TFR).
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Arnold-Moore, T., Fuller, M., Lowe, B., Thom, J., Wilkinson, R., ‘The ELF data model and SGQL query language for structured document databases', Proceedings of the Australasian Database Conference, pp 17–26, Adelaide, Australia, 1995.
Artale, A., Cesarini, F., Grazzini, E., Pippolini, F., Soda, G., ‘Modelling Composition in a Terminological Language Environment', Proceedings of the ECAI Workshop on Parts and Wholes, pp 93–101, Amsterdam, August 1994.
Artale, A., Franconi, E., Guarino, N., Pazzi, L., ‘Part-Whole Relations in ObjectCentered Systems: An Overview', Data and Knowledge Engineering, Vol 20(3), pp 347–383, 1996.
Bertino, E., Rabitti, F., Gibbs, S., ‘Query Processing in a Multimedia Document System', ACM Transactions on Office Informations Systems, Vol 6(1), pp 1–41, 1988.
Blake, G., Conses, P., Kilpeläinen, P., Larson, P., Snider, T., Tompa, F., ‘Text/relational Database Management Systems: Harmonizing SQL and SGML', Proceedings of the International Conference on Applications of Databases, LNCS 819, pp 267–280, Vadstena, Sweden, 1994.
Borgida, A., Brachman, R., McGuinness, D., Resnick, L., ‘CLASSIC: a structural data model for objects', Proceedings of the ACM International Conference on Management of Data-SIGMOD 89, pp 58–67, 1989.
Burkowski, F., ‘Retrieval Activities in a Database Consisting of Heterogeneous Collections of Structured Text', Proceedings of the 15th ACM International Conference on Research and Development in Information Retrieval-SIGIR 92, pp 112–125, Copenhagen, Denmark, 1992.
Christophides, V., Abiteboul, S., Cluet, S., Scholl, M., ‘From Structured Documents to Novel Query Facilities', Proceedings of the ACM International Conference on Management of Data-SIGMOD 94, pp 1–22, 1994.
Franconi, E., ‘A Treatment of Plurals and Plural Qualifications based on a Theory of Collections', Minds and Machines: Special Issue on Knowledge Representation for Natural Language Processing, Vol 3(4), pp 453–474, November 1993.
Goldfarb, C.F., The SGML Handbook, Clarendon Press, Oxford, 1990.
Hors, P., ‘Description logics to specify the part-whole relation', Proceedings of the ECAI Workshop on Parts and Wholes, pp 103–109, Amsterdam, August 1994.
Jang, Y., Patil, R., ‘KOLA: A Knowledge Organization Language', Proceedings of the 13th Symposium on Computer Applications in Medical Care, pp 71–75, 1989.
Kilpeläinen, P., Manilla, H., ‘Retrieval from Hierarchical Texts by Partial Patterns', Proceedings of the 16th ACM International Conference on Research and Development in Information Retrieval-SIGIR 93, pp 214–222, Pittsburgh, PA, USA, 1993.
Lambrix, P., Part-Whole Reasoning in Description Logics, Ph.D. Thesis 448, Department of Computer and Information Science, Linköping University, 1996.
Lambrix, P., Shahmehri, N., Åberg, J., ‘Towards Creating a Knowledge Base for World-Wide Web Documents', Proceedings of the LASTED International Conference on Intelligent Information Systems, Grand Bahama Island, Bahamas, 1997.
MacLeod, I., ‘A Query Language for Retrieving Information from Hierarchic Text Structures', The Computer Journal, Vol 34(3), pp 254–264, 1991.
Meghini, C., Sebastiani, F., Straccia, U., Thanos, C., ‘A Model of Information Retrieval based on a Terminological Logic', Proceedings of the 16th ACM International Conference on Research and Development in Information Retrieval-SIGIR 93, pp 298–307, Pittsburgh, PA, USA, 1993.
Padgham, L., Lambrix, P., ‘A Framework for Part-of Hierarchies in Terminological Logics', Principles of Knowledge Representation and Reasoning: Proceedings of the Fourth International Conference-KR 94, pp 485–496, Bonn, Germany, 1994.
Sacks-Davis, R., Arnold-Moore, T., Zobel, J., ‘Database systems for structured documents', Proceedings of the International Symposium on Advanced Database Technologies and Their Integration, pp 272–283, Nara, Japan, 1994.
Salton, G., McGill, M., Introduction to Modern Information Retrieval, McGrawHill, Tokio, 1983.
Sattler, U., ‘A Concept Language for an Engineering Application with Part-Whole Relations', Proceedings of the International Workshop on Description Logics, pp 119–123, Roma, Italy, 1995.
Sebastiani, F., ‘A Probabilistic Terminological Logic for Modelling Information Retrieval', Proceedings of the 17th ACM International Conference on Research and Development in Information Retrieval-SIGIR 94, pp 122–130, Dublin, Ireland, 1994.
Speel, P.-H., Patil-Schneider, P.F., ‘CLASSIC Extended with Whole-Part Relations', Proceedings of the International Workshop on Description Logics, pp 45–50, Bonn, Germany, 1994.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lambrix, P., Padgham, L. (1997). A description logic model for querying knowledge bases for structured documents. In: Raś, Z.W., Skowron, A. (eds) Foundations of Intelligent Systems. ISMIS 1997. Lecture Notes in Computer Science, vol 1325. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63614-5_7
Download citation
DOI: https://doi.org/10.1007/3-540-63614-5_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63614-4
Online ISBN: 978-3-540-69612-4
eBook Packages: Springer Book Archive