research-article

The Piazza Peer Data Management System

Published: 01 July 2004

Abstract

Intuitively, data management and data integration tools should be well suited for exchanging information in a semantically meaningful way. Unfortunately, they suffer from two significant problems: they typically require a comprehensive schema design before they can be used to store or share information, and they are difficult to extend because schema evolution is heavyweight and may break backward compatibility. As a result, many small-scale data sharing tasks are more easily facilitated by non-database-oriented tools that have little support for semantics. The goal of the peer data management system (PDMS) is to address this need: we propose a decentralized, easily extensible data management architecture in which any user can contribute new data, schema information, or even mappings between other peers' schemas. PDMSs represent a natural step beyond data integration systems, replacing their single logical schema with an interlinked collection of semantic mappings between peers' individual schemas. This paper describes several aspects of the Piazza PDMS, including the schema mediation formalism, query answering and optimization algorithms, and the relevance of PDMSs to the Semantic Web.

Published In

IEEE Transactions on Knowledge and Data Engineering, Volume 16, Issue 7
July 2004
95 pages

Publisher

IEEE Educational Activities Department

United States

Author Tags

  1. Peer data management
  2. Web
  3. data integration
  4. databases
  5. schema mediation

Cited By

  • (2020) Data coordination. Proceedings of the VLDB Endowment 4:11 (831-842). DOI: 10.14778/3402707.3402722. Online publication date: 3-Jun-2020
  • (2015) Mapping evolution of dynamic web ontologies. Information Sciences: an International Journal 303:C (101-119). DOI: 10.1016/j.ins.2014.12.040. Online publication date: 10-May-2015
  • (2014) TransactiveDB. Proceedings of the VLDB Endowment 7:14 (1977-1980). DOI: 10.14778/2733085.2733102. Online publication date: 1-Oct-2014
  • (2014) Curriculum data enrichment with ontologies. Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics (WIMS14) (1-6). DOI: 10.1145/2611040.2611070. Online publication date: 2-Jun-2014
  • (2014) Data-centric intelligent information integration: from concepts to automation. Journal of Intelligent Information Systems 43:3 (437-462). DOI: 10.1007/s10844-014-0340-5. Online publication date: 1-Dec-2014
  • (2014) Integration operators for generating RDF/OWL-based user defined mediator views in a grid environment. Journal of Intelligent Information Systems 43:1 (1-32). DOI: 10.1007/s10844-013-0300-5. Online publication date: 1-Aug-2014
  • (2014) Cooperation across Multiple Healthcare Clinics on the Cloud. Proceedings of the 14th IFIP WG 6.1 International Conference on Distributed Applications and Interoperable Systems - Volume 8460 (82-88). DOI: 10.1007/978-3-662-43352-2_7. Online publication date: 3-Jun-2014
  • (2013) Query optimization in information integration. Acta Informatica 50:4 (257-287). DOI: 10.1007/s00236-013-0179-1. Online publication date: 1-Jun-2013
  • (2012) Partial Evaluation for Distributed XPath Query Processing and Beyond. ACM Transactions on Database Systems 37:4 (1-43). DOI: 10.1145/2389241.2389251. Online publication date: 1-Dec-2012
  • (2012) Towards law-aware semantic cloud policies with exceptions for data integration and protection. Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics (1-12). DOI: 10.1145/2254129.2254162. Online publication date: 13-Jun-2012