Abstract
Much recent research in database design focuses on persistence models for semistructured data similar to the SGML and XML that humanities digital libraries have long used to encode digital editions of texts. Structure-aware querying promises to simplify the design of such digital repositories by allowing them to store and query texts using a single, unified information model. Using content the Perseus Project has acquired over the past ten years as a test case, we describe the advantages and delimit the problems in managing structure-aware queries over multiple or ambiguous schemas, evaluate the place of markup in digital libraries where much content is automatically generated, and examine the uses for structure-aware query in a system that stores both semistructured content and graph-structured metadata.
A grant from the Digital Libraries Initiative Phase 2 (NSF IIS-9817484) provided support for this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Abiteboul, S., McHugh, J., Rys, M., Vassalos, V., Wiener, J.: Incremental Maintenance for Materialized Views over Semistructured Data. In: Proceedings of the 24th International Conference on Very Large Data Bases (VLDB 1998), pp. 38–49 (1998)
Aguilera, V., Cluet, S., Milo, T., Veltri, P., Vodislav, D.: Views in a Large-Scale XML Repository. VLDB 11 (3), 238–255 (2002)
Boag, S., Chamberlin, D., Fernandez, M.F., Florescu, D., Robie, J., Simeon, J., Stefanescu, M.: XQuery 1.0: An XML Query Language. W3C Working Draft (2002), http://www.w3.org/TR/xquery
Clark, J., DeRose, S.: XML Path Language (XPath) 1.0. W3C Recommendation (1999), http://www.w3.org/TR/xpath
Fiebig, T., Helmer, S., Kanne, C.-C., Moerkotte, G., Neumann, J., Schiele, R., Westmann, T.: Anatomy of a Native XML Base Management System. VLDB 11 (4), 292–314 (2002)
Fuhr, N., Großjohann, K.: XIRQL: A Query Language for Information Retrieval in XML Documents. In: Proceedings of the 24th Annual International Conference on Research and Development in Information Retrieval, pp. 172–180 (2001)
Jagadish, H.V., Al-Khalifa, S., Chapman, A., Lakshmanan, L.V.S., Nierman, A., Paparizos, S., Patel, J.M., Srivastava, D., Wiwatwattana, N., Wu, Y., Yu, C.: TIMBER: A Native XML Database. VLDB 11 (4), 274–291 (2002)
McHugh, J., Abiteboul, S., Goldman, R., Quass, D., Wisdom, J.: Lore: A Database Management System for Semistructured Data. In: ACM SIGMOD International Conference on Management of Data (SIGMOD 1997) SIGMOD Record, vol. 26 (3), pp. 54–66 (1997)
Robie, J., Garshol, L.M., Newcomb, S., Fuchs, M., Miller, L., Brickley, D., Christophides, V., Karvounarakis, G.: The Syntactic Web: Syntax and Semantics on the Web. Markup Languages: Theory and Practice 3 (4), 411–440 (2001)
Smith, D.A., Mahoney, A., Rydberg-Cox, J.A.: Managing XML Documents in an Integrated Digital Library. Markup Languages: Theory and Practice 2 (3), 205–214 (2000)
Smith, D., Crane, G.: Disambiguating Geographic Names in a Historical Digital Library. In: Constantopoulos, P., Sølvberg, I.T. (eds.) ECDL 2001. LNCS, vol. 2163, pp. 127–136. Springer, Heidelberg (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
York, C., Wulfman, C., Crane, G. (2003). Structure-Aware Query for Digital Libraries: Use Cases and Challenges for the Humanities. In: Koch, T., Sølvberg, I.T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2003. Lecture Notes in Computer Science, vol 2769. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45175-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-45175-4_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40726-3
Online ISBN: 978-3-540-45175-4
eBook Packages: Springer Book Archive