Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1559845.1559914acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Core schema mappings

Published: 29 June 2009 Publication History
  • Get Citation Alerts
  • Abstract

    Research has investigated mappings among data sources under two perspectives. On one side, there are studies of practical tools for schema mapping generation; these focus on algorithms to generate mappings based on visual specifications provided by users. On the other side, we have theoretical researches about data exchange. These study how to generate a solution - i.e., a target instance - given a set of mappings usually specified as tuple generating dependencies. However, despite the fact that the notion of a core of a data exchange solution has been formally identified as an optimal solution, there are yet no mapping systems that support core computations. In this paper we introduce several new algorithms that contribute to bridge the gap between the practice of mapping generation and the theory of data exchange. We show how, given a mapping scenario, it is possible to generate an executable script that computes core solutions for the corresponding data exchange problem. The algorithms have been implemented and tested using common runtime engines to show that they guarantee very good performances, orders of magnitudes better than those of known algorithms that compute the core as a post-processing step.

    References

    [1]
    B. Alexe, W. Tan, and Y. Velegrakis. Comparing and Evaluating Mapping Systems with STBenchmark. Proc. of the VLDB Endowment, 1(2):1468--1471, 2008.
    [2]
    Y. An, A. Borgida, R. Miller, and J. Mylopoulos. A Semantic Approach to Discovering Schema Mapping Expressions. In Proc. of ICDE, pages 206--215, 2007.
    [3]
    C. Beeri and M. Vardi. A Proof Procedure for Data Dependencies. J. of the ACM, 31(4):718--741, 1984.
    [4]
    P. Bohannon, E. Elnahrawy, W. Fan, and M. Flaster. Putting Context into Schema Matching. In Proc. of VLDB, pages 307--318. VLDB Endowment, 2006.
    [5]
    A. Bonifati, G. Mecca, A. Pappalardo, S. Raunich, and G. Summa. Schema Mapping Verification: The Spicy Way. In Proc. of EDBT, pages 85--96, 2008.
    [6]
    L. Bravo, W. Fan, and S. Ma. Extending Dependencies with Conditions. In Proc. of VLDB, pages 243--254, 2007.
    [7]
    L. Cabibbo. On Keys, Foreign Keys and Nullable Attributes in Relational Mapping Systems. In Proc. of EDBT, pages 263--274, 2009.
    [8]
    L. Chiticariu. Computing the Core in Data Exchange: Algorithmic Issues. MS Project Report, 2005. Unpublished manuscript.
    [9]
    R. Fagin, P. Kolaitis, R. Miller, and L. Popa. Data exchange: Semantics and query answering. Theor. Comput. Sci., 336(1):89--124, 2005.
    [10]
    R. Fagin, P. Kolaitis, A. Nash, and L. Popa. Towards a Theory of Schema-Mapping Optimization. In Proc. of ACM PODS, pages 33--42, 2008.
    [11]
    R. Fagin, P. Kolaitis, and L. Popa. Data Exchange: Getting to the Core. ACM TODS, 30(1):174--210, 2005.
    [12]
    A. Fuxman, M. A. Hernández, C. T. Howard, R. J. Miller, P. Papotti, and L. Popa. Nested Mappings: Schema Mapping Reloaded. In Proc. of VLDB, pages 67--78, 2006.
    [13]
    G. Gottlob and A. Nash. Efficient Core Computation in Data Exchange. J. of the ACM, 55(2):1--49, 2008.
    [14]
    T. J. Green, G. Karvounarakis, Z. G. Ives, and V. Tannen. Update Exchange with Mappings and Provenance. In Proc. of VLDB, pages 675--686, 2007.
    [15]
    P. Hell and J. Nešetřil. The Core of a Graph. Discrete Mathematics, 109(1-3):117--126, 1992.
    [16]
    A. Y. Levy, A. O. Mendelzon, Y. Sagiv, and D. Srivastava. Answering queries using views. In PODS, pages 95--104, 1995.
    [17]
    R. J. Miller, L. M. Haas, and M. A. Hernandez. Schema Mapping as Query Discovery. In Proc. of VLDB, pages 77--99, 2000.
    [18]
    L. Popa, Y. Velegrakis, R. J. Miller, M. A. Hernandez, and R. Fagin. Translating Web Data. In Proc. of VLDB, pages 598--609, 2002.
    [19]
    A. Raffio, D. Braga, S. Ceri, P. Papotti, and M. A. Hernández. Clip: a Visual Language for Explicit Schema Mappings. In Proc. of ICDE, pages 30--39, 2008.
    [20]
    V. Savenkov and R. Pichler. Towards practical feasibility of core computation in data exchange. In Proc. of LPAR, pages 62--78, 2008.
    [21]
    B. ten Cate, L. Chiticariu, P. Kolaitis, and W. C. Tan. Laconic Schema Mappings: Computing Core Universal Solutions by Means of SQL Queries. Unpublished manuscript -http://arxiv.org/abs/0903.1953, March 2009.
    [22]
    L. L. Yan, R. J. Miller, L. M. Haas, and R. Fagin. Data Driven Understanding and Refinement of Schema Mappings. In Proc. of ACM SIGMOD, pages 485--496, 2001.

    Cited By

    View all

    Index Terms

    1. Core schema mappings

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGMOD '09: Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
      June 2009
      1168 pages
      ISBN:9781605585512
      DOI:10.1145/1559845
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 29 June 2009

      Permissions

      Request permissions for this article.

      Check for updates

      Badges

      Author Tags

      1. core computation
      2. data exchange
      3. schema mappings

      Qualifiers

      • Research-article

      Conference

      SIGMOD/PODS '09
      Sponsor:
      SIGMOD/PODS '09: International Conference on Management of Data
      June 29 - July 2, 2009
      Rhode Island, Providence, USA

      Acceptance Rates

      Overall Acceptance Rate 785 of 4,003 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 11 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Incomplete Data and Data Dependencies in Relational Databases10.1007/978-3-031-01893-0Online publication date: 2-Mar-2022
      • (2021)Towards Knowledge Exchange: State-of-the-Art and Open ProblemsSOFSEM 2021: Theory and Practice of Computer Science10.1007/978-3-030-67731-2_2(13-27)Online publication date: 25-Jan-2021
      • (2020)Dataset Discovery in Data Lakes2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00067(709-720)Online publication date: Apr-2020
      • (2020)Schema matching based on SQL statementsDistributed and Parallel Databases10.1007/s10619-019-07268-938:1(193-226)Online publication date: 1-Mar-2020
      • (2017)A Framework for User-Driven Mapping Discovery in Rich Spaces of Heterogeneous DataOn the Move to Meaningful Internet Systems. OTM 2017 Conferences10.1007/978-3-319-69459-7_27(399-417)Online publication date: 21-Oct-2017
      • (2015)The iBench integration metadata generatorProceedings of the VLDB Endowment10.14778/2850583.28505869:3(108-119)Online publication date: 1-Nov-2015
      • (2015)Schema matching based on position of attribute in query statementKnowledge-Based Systems10.1016/j.knosys.2014.11.00575:C(41-51)Online publication date: 1-Feb-2015
      • (2014)Optimizing the chaseProceedings of the VLDB Endowment10.14778/2733085.27330937:14(1869-1880)Online publication date: 1-Oct-2014
      • (2013)Building a Dynamic, Large-Scale Spatio-temporal Vector Database to Support a National Spatial Data Infrastructure in ChinaGIScience & Remote Sensing10.2747/1548-1603.47.1.13547:1(135-162)Online publication date: 15-May-2013
      • (2013)Semantic-Based MappingsProceedings of the 32nd International Conference on Conceptual Modeling - Volume 821710.1007/978-3-642-41924-9_22(255-269)Online publication date: 11-Nov-2013
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media