Using schematically heterogeneous structures

Published: 01 June 1998

Abstract

Schematic heterogeneity arises when information that is represented as data under one schema is represented within the schema (as metadata) in another. It is an important class of heterogeneity that arises frequently when integrating legacy data in federated or data warehousing applications. Traditional query languages and view mechanisms are insufficient for reconciling and translating data between schematically heterogeneous schemas. Higher-order query languages, which permit quantification over schema labels, have been proposed to permit querying and restructuring of data between schematically disparate schemas. We extend this work by considering how these languages can be used in practice. Specifically, we consider a restricted class of higher-order views and show the power of these views in integrating legacy structures. Our results provide insights into the properties of restructuring transformations required to resolve schematic discrepancies. In addition, we show how the use of these views permits schema browsing and new forms of data independence that are important for global information systems. Furthermore, these views provide a framework for integrating semi-structured and unstructured queries, such as keyword searches, into a structured querying environment. We show how these views can be used with minimal extensions to existing query engines. We give conditions under which a higher-order view is usable for answering a query and provide query translation algorithms.
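To make the notion of schematic heterogeneity concrete, the following is a minimal Python sketch and not the paper's view language or algorithms. It assumes a hypothetical stock-price dataset in which ticker symbols are data values under one schema and attribute names under another. The relation contents and the helper names `data_to_metadata` and `metadata_to_data` are illustrative assumptions.

```python
from collections import defaultdict

# Hypothetical "tall" schema: ticker symbols are stored as data values.
TALL = [
    {"date": "1998-06-01", "ticker": "IBM", "price": 117.0},
    {"date": "1998-06-01", "ticker": "HP", "price": 62.5},
    {"date": "1998-06-02", "ticker": "IBM", "price": 118.5},
]


def data_to_metadata(rows, key, label, value):
    """Promote the values of column `label` to attribute names.

    Produces one output row per distinct value of `key`.
    """
    wide = defaultdict(dict)
    for row in rows:
        wide[row[key]][row[label]] = row[value]
    return [{key: k, **attrs} for k, attrs in sorted(wide.items())]


def metadata_to_data(rows, key, label, value):
    """Demote every attribute name other than `key` to a data value."""
    tall = []
    for row in rows:
        # Iterating over the attribute names themselves is the step that
        # a first-order query over a fixed schema cannot express.
        for attr, v in row.items():
            if attr != key:
                tall.append({key: row[key], label: attr, value: v})
    return tall


# "Wide" schema: the same information, with tickers as attribute names.
WIDE = data_to_metadata(TALL, "date", "ticker", "price")
ROUND_TRIP = metadata_to_data(WIDE, "date", "ticker", "price")
```

The loop over `row.items()` in `metadata_to_data` ranges over schema labels rather than over values, which is, loosely, the kind of quantification the abstract attributes to higher-order query languages. In this sketch the two representations carry the same facts, so the round trip recovers the original rows.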

Published In

ACM SIGMOD Record, Volume 27, Issue 2
June 1998
595 pages
ISSN: 0163-5808
DOI: 10.1145/276305

  • SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data
    June 1998
    599 pages
    ISBN: 0897919955
    DOI: 10.1145/276304

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 June 1998
Published in SIGMOD Volume 27, Issue 2

Qualifiers

  • Article

Cited By

  • (2021) Towards Knowledge Exchange: State-of-the-Art and Open Problems. SOFSEM 2021: Theory and Practice of Computer Science, pages 13-27. DOI: 10.1007/978-3-030-67731-2_2. Online publication date: 11-Jan-2021
  • (2020) Knowledge translation. Proceedings of the VLDB Endowment, 13:12, pages 2018-2032. DOI: 10.14778/3407790.3407806. Online publication date: 14-Sep-2020
  • (2014) 6.7.4 XMInt: An XML Mediated Integration Model. INCOSE International Symposium, 11:1, pages 1163-1170. DOI: 10.1002/j.2334-5837.2001.tb02423.x. Online publication date: 4-Nov-2014
  • (2008) Data exchange with data-metadata translations. Proceedings of the VLDB Endowment, 1:1, pages 260-273. DOI: 10.14778/1453856.1453888. Online publication date: 1-Aug-2008
  • (2008) An XML Schema integration and query mechanism system. Data & Knowledge Engineering, 65:2, pages 266-303. DOI: 10.1016/j.datak.2007.09.008. Online publication date: 1-May-2008
  • (2007) Querying through a user interface. Data & Knowledge Engineering, 63:3, pages 774-794. DOI: 10.1016/j.datak.2007.04.007. Online publication date: 1-Dec-2007
  • (2007) One-to-many data transformations through data mappers. Data & Knowledge Engineering, 62:3, pages 483-503. DOI: 10.1016/j.datak.2006.08.011. Online publication date: 1-Sep-2007
  • (2006) An ontology based approach to the integration of entity-relationship schemas. Data & Knowledge Engineering, 58:3, pages 299-326. DOI: 10.1016/j.datak.2005.07.005. Online publication date: 1-Sep-2006
  • (2005) Interoperable e-learning ontologies using model correspondences. Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems, pages 1179-1189. DOI: 10.1007/11575863_141. Online publication date: 31-Oct-2005
  • (2004) Incremental Maintenance of Schema-Restructuring Views in SchemaSQL. IEEE Transactions on Knowledge and Data Engineering, 16:9, pages 1096-1111. DOI: 10.1109/TKDE.2004.42. Online publication date: 1-Sep-2004
