Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1066157.1066252acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Clio grows up: from research prototype to industrial tool

Published: 14 June 2005 Publication History

Abstract

Clio, the IBM Research system for expressing declarative schema mappings, has progressed in the past few years from a research prototype into a technology that is behind some of IBM's mapping technology. Clio provides a declarative way of specifying schema mappings between either XML or relational schemas. Mappings are compiled into an abstract query graph representation that captures the transformation semantics of the mappings. The query graph can then be serialized into different query languages, depending on the kind of schemas and systems involved in the mapping. Clio currently produces XQuery, XSLT, SQL, and SQL/XML queries. In this paper, we revisit the architecture and algorithms behind Clio. We then discuss some implementation issues, optimizations needed for scalability, and general lessons learned in the road towards creating an industrial-strength tool.

References

[1]
P. Bernstein. Applying Model Management to Classical Meta Data Problems. In CIDR, 2003.
[2]
R. Fagin, P. Kolaitis, L. Popa, and W.-C. Tan. Composing Schema Mappings: Second-Order Dependencies to the Rescue. In PODS, 2004.
[3]
R. Fagin, P. G. Kolaitis, R. J. Miller, and L. Popa. Data Exchange: Semantics and Query Answering. In ICDT, 2003.
[4]
M. Lenzerini. Data Integration: A Theoretical Perspective. In PODS, 2002.
[5]
S. Melnik, P. A. Bernstein, A. Halevy, and E. Rahm. Supporting Executable Mappings in Model Management. In SIGMOD, 2005.
[6]
R. J. Miller, L. M. Haas, and M. A. Hernández. Schema Mapping as Query Discovery. In VLDB, 2000.
[7]
L. Popa, Y. Velegrakis, R. J. Miller, M. A. Hernández, and R. Fagin. Translating Web Data. In VLDB, 2002.
[8]
E. Rahm and P. A. Bernstein. A Survey of Approaches to Automatic Schema Matching. The VLDB Journal, 10(4):334--350, 2001.
[9]
N. C. Shu, B. C. Housel, R. W. Taylor, S. P. Ghosh, and V. Y. Lum. EXPRESS: A Data EXtraction, Processing, and REStructuring System. TODS, 2(2):134--174, 1977.

Cited By

View all
  • (2023)GIO: Generating Efficient Matrix and Frame Readers for Custom Data Formats by ExampleProceedings of the ACM on Management of Data10.1145/35892651:2(1-26)Online publication date: 20-Jun-2023
  • (2022)Fast Left Kan Extensions Using the ChaseJournal of Automated Reasoning10.1007/s10817-022-09634-266:4(805-844)Online publication date: 6-Jul-2022
  • (2021)“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AIProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445518(1-15)Online publication date: 6-May-2021
  • Show More Cited By
  1. Clio grows up: from research prototype to industrial tool

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '05: Proceedings of the 2005 ACM SIGMOD international conference on Management of data
    June 2005
    990 pages
    ISBN:1595930604
    DOI:10.1145/1066157
    • Conference Chair:
    • Fatma Ozcan
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 14 June 2005

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Article

    Conference

    SIGMOD/PODS05
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)18
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 08 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)GIO: Generating Efficient Matrix and Frame Readers for Custom Data Formats by ExampleProceedings of the ACM on Management of Data10.1145/35892651:2(1-26)Online publication date: 20-Jun-2023
    • (2022)Fast Left Kan Extensions Using the ChaseJournal of Automated Reasoning10.1007/s10817-022-09634-266:4(805-844)Online publication date: 6-Jul-2022
    • (2021)“Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AIProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445518(1-15)Online publication date: 6-May-2021
    • (2019)Meta-mappings for schema mapping reuseProceedings of the VLDB Endowment10.14778/3303753.330376112:5(557-569)Online publication date: 1-Jan-2019
    • (2019)Towards Scalable Hybrid StoresProceedings of the 2019 International Conference on Management of Data10.1145/3299869.3319895(1660-1677)Online publication date: 25-Jun-2019
    • (2018)Reflections on Schema Mappings, Data Exchange, and Metadata ManagementProceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3196959.3196991(107-109)Online publication date: 27-May-2018
    • (2018)Active Learning of GAV Schema MappingsProceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3196959.3196974(355-368)Online publication date: 27-May-2018
    • (2018)Executable schema mappings for statistical data processingDistributed and Parallel Databases10.1007/s10619-017-7212-236:2(265-300)Online publication date: 1-Jun-2018
    • (2018)Extraction, Transformation, and LoadingEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_158(1432-1440)Online publication date: 7-Dec-2018
    • (2017)Discovering context-aware conditional functional dependenciesFrontiers of Computer Science: Selected Publications from Chinese Universities10.5555/3128671.312868711:4(688-701)Online publication date: 1-Aug-2017
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media