Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2095536.2095542acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

A workload-aware approach for optimizing the XML schema design trade-off

Published: 05 December 2011 Publication History

Abstract

In general, the design of XML schemas involves translating conceptual schemas into XML schemas which aim to be: (i) normalized schemas, and (ii) connected structures in order to achieve good performance on queries. However, these requirements address a trade-off because highly connected XML structures allow data redundancy, and normalized schemas generate disconnected XML structures. This paper describes a workload-based approach which balances this trade-off on translating conceptual schemas into XML structures. An experimental study on an XML database shows that our XML schemas provide high query performance on the relevant elements for the workload and, at the same time, low cost of data redundancy on elements that are not relevant for update operations.

References

[1]
M. Arenas and L. Libkin. A normal form for xml documents. In Symposium on Principles of Database Systems, pages 85--96. ACM, 2002.
[2]
D. Barbosa, A. Mendelzon, J. Keenleyside, and K. Lyons. Toxgene: A template-based data generator for xml. In Proc. WebDB, 2002.
[3]
C. Batini, S. Ceri, and S. Navathe. Conceptual Database Design: An Entity-Relationship Approach. The Benjamin/Cummings Publishing Company, 1992.
[4]
L. Bird, A. Goodchild, and T. A. Halpin. Object role modeling and xml-schema. In International Conference on Conceptual Modeling, pages 661--705. Springer, 2000.
[5]
T. Bradford, V. Gritsenko, and K. O'Neill. Apache xindice. http://xml.apache.org/xindice/, 2011.
[6]
C. Curino, E. Jones, Y. Zhang, and S. Madden. Schism: a workload-driven approach to database replication and partitioning. Proc. VLDB Endow., 3:48--57, 2010.
[7]
R. Elmasri, J. Weeldreyer, and A. R. Hevner. The category concept: An extension to the entity-relationship model. In Data Knowledge Engineering, volume 1, pages 75--116, 1985.
[8]
D. Embley, S. Liddle, and S. Kamha. Enterprise modeling with conceptual xml. In International Conference on Conceptual Modeling, pages 150--165, 2004.
[9]
J. Fong and A. F. et. al. Translating relational schema with constraints into xml schema. In International Journal of Software Engineering and Knowledge Engineering, volume 16, pages 201--244, 2006.
[10]
T. Kudrass and T. Krumbein. Rule-based generation of xml schemas from uml class diagrams. In Advances in Databases and Information Systems. Springer, 2003.
[11]
M. Mani. Erex: A conceptual model for xml. In Internation XML Database Symposium, pages 128--142. Springer, 2004.
[12]
W. Y. Mok and D. W. Embley. Generating compact redundancy-free xml documents from conceptual-model hypergraphs. In IEEE Transactions on Knowledge and Data Engineering, volume 18, pages 1082--1096, 2006.
[13]
M. M. Moro, L. Lim, and Y.-C. Chang. Schema advisor for hybrid relational-xml dbms. In SIGMOD '07: ACM SIGMOD international conference on Management of data, pages 959--970. ACM, 2007.
[14]
P. Pigozzo and E. Quintarelli. An algorithm for generating xml schemas from er schemas. In Italian Symposium on Advanced Database Systems, pages 192--199, 2005.
[15]
N. Routledge, L. Bird, and A. Goodchild. Uml and xml schema. In Australian Database Conference, pages 157--166. IEEE, 2002.
[16]
H. Schöning. Tamino - a dbms designed for xml-schema. In International Conference on Data Engineering, pages 149--154. IEEE, 2001.
[17]
R. Schroeder and R. D. S. Mello. Designing xml documents from conceptual schemas and workload information. Multimedia Tools Appl., 43:303--326, July 2009.
[18]
R. Schroeder and R. S. Mello. Improving query performance on xml documents: A workload-driven design approach. In Symposium on Document Engineering, pages 177--186, 2008.
[19]
L. Stephens and A. G. et. al. Constructing consensus ontologies for the semantic web: A conceptual approach. In World Wide Web Journal, volume 7, 2004.
[20]
N. Wiwatwattana and H. J. et. al. Making designer schemas with colors. In ICDE, 2006.
[21]
C. Yu and H. V. Jagadish. Xml schema refinement through redundancy detection and normalization. In The VLDB Journal, volume 17, pages 203--223, 2008.

Cited By

View all

Index Terms

  1. A workload-aware approach for optimizing the XML schema design trade-off

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    iiWAS '11: Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services
    December 2011
    572 pages
    ISBN:9781450307840
    DOI:10.1145/2095536
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 December 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. XML
    2. logical design
    3. redundancy
    4. workload

    Qualifiers

    • Research-article

    Conference

    MoMM '11

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 15 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)KVMod—A Novel Approach to Design Key-Value NoSQL DatabasesInformation10.3390/info1410056314:10(563)Online publication date: 12-Oct-2023
    • (2022)A workload-driven method for designing aggregate-oriented NoSQL databasesData & Knowledge Engineering10.1016/j.datak.2022.102089142(102089)Online publication date: Nov-2022
    • (2020)MortadeloFuture Generation Computer Systems10.1016/j.future.2019.11.032105:C(455-474)Online publication date: 1-Apr-2020
    • (2016)On proposing and evaluating a NoSQL document database logical approachInternational Journal of Web Information Systems10.1108/IJWIS-04-2016-001812:4(398-417)Online publication date: 7-Nov-2016
    • (2015)A workload-driven logical design approach for NoSQL document databasesProceedings of the 17th International Conference on Information Integration and Web-based Applications & Services10.1145/2837185.2837218(1-10)Online publication date: 11-Dec-2015
    • (2012)On evaluating an approach for balancing the trade‐off on XML schema designInternational Journal of Web Information Systems10.1108/174400812112828748:4(371-389)Online publication date: 16-Nov-2012

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media