Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1416691.1416702acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmobicaseConference Proceedingsconference-collections
research-article

Building a PDMS infrastructure for XML data sharing with SUNRISE

Published: 25 March 2008 Publication History

Abstract

Semantic support for data representation as well as a flexible machine-readable format have made XML the de facto standard for Internet applications semantic interoperability. Its applicability is primarily evident in realities where actors are heterogeneous data sources which interact each other for data sharing purposes. This is exactly the scenario envisioned by Peer Data Management Systems (PDMSs), where autonomous sources (peers) model their local data according to a schema, and are connected in a peer-to-peer network by means of pairwise semantic mappings between the peers' own schemas. One of the main challenges in such a semantically heterogeneous environment is concerned with query processing when dealing with the inherent semantic approximations occurring in the data.
In this paper we present an instantiation of SUNRISE (System for Unified Network Routing, Indexing and Semantic Exploration) for XML data sources. SUNRISE is a complete PDMS infrastructure which extends each peer with functionalities for capturing the semantic approximation originating from schema heterogeneity and exploiting it for a semantically driven network organization and query routing.

References

[1]
K. Aberer, P. Cudré-Mauroux, M. Hauswirth, and T. V. Pelt. GridVine: Building Internet-Scale Semantic Overlay Networks. In Proc. of ISWC, 2004.
[2]
M. Arenas, V. Kantere, A. Kementsietsidis, I. Kiringa, R. Miller, and J. Mylopoulos. The Hyperion Project: from Data Integration to Data Coordination. SIGMOD Record, 32(3):53--58, 2003.
[3]
D. Aumueller, H. H. Do, S. Massmann, and E. Rahm. Schema and Ontology Matching with COMA++. In SIGMOD Conference, pages 906--908, 2005.
[4]
T. Berners-Lee, J. Hendler, and O. Lassila. The Semantic Web. Scientific American, May 2001.
[5]
A. Crespo and H. Garcia-Molina. Routing Indices for Peer-to-Peer Systems. In Proc. of ICDCS, 2002.
[6]
A. Crespo and H. Garcia-Molina. Semantic Overlay Networks for P2P Systems. In Proc. of the 3rd AP2PC Workshop, pages 1--13, 2004.
[7]
H. Do and E. Rahm. COMA - A System for Flexible Combination of Schema Matching Approaches. In Proc. of the 28th VLDB, pages 610--621, 2002.
[8]
C. Doulkeridis, K. Nørvåg, and M. Vazirgiannis. DESENT: Decentralized and Distributed Semantic Overlay Generation in P2P Networks. IEEE J. on Selected Areas in Comm., 25(1):25--34, 2007.
[9]
A. Halevy, Z. Ives, J. Madhavan, P. Mork, D. Suciu, and I. Tatarinov. The Piazza Peer Data Management System. IEEE TKDE, 16(7): 787--798, 2004.
[10]
M. A. Hernández, R. J. Miller, and L. M. Haas. Clio: A Semi-Automatic Tool For Schema Mapping. In SIGMOD Conference, page 607, 2001.
[11]
G. Koloniari and E. Pitoura. Content-Based Routing of Path Queries in Peer-to-Peer Systems. In Proc. of the 9th EDBT Conf., pages 29--47, 2004.
[12]
M. Li, W.-C. Lee, and A. Sivasubramaniam. Semantic Small World: An Overlay Network for Peer-to-Peer Search. In Proc. of the 12th IEEE ICNP, 2004.
[13]
S. Lodi, F. Mandreoli, R. Martoglia, W. Penzo, and S. Sassatelli. Semantic Peer, Here are the Neighbors You Want! In Accepted for publication at the 11th EDBT Conf., 2008.
[14]
J. Madhavan, P. A. Bernstein, and E. Rahm. Generic Schema Matching with Cupid. In Proc. of the 27th VLDB, pages 49--58, 2001.
[15]
F. Mandreoli, R. Martoglia, W. Penzo, and S. Sassatelli. SRI: Exploiting Semantic Information for Effective Query Routing in a PDMS. In Proc. of the WIDM (in conj. with CIKM), pages 19--26, 2006.
[16]
F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli, and G. Villani. SRI@work: Efficient and Effective Routing Strategies in a PDMS. In Proc. of WISE, 2007.
[17]
F. Mandreoli, R. Martoglia, W. Penzo, S. Sassatelli, and G. Villani. SUNRISE: Exploring PDMS Networks with Semantic Routing Indexes. In Proc. of ESWC, 2007.
[18]
F. Mandreoli, R. Martoglia, and E. Ronchetti. Versatile Structural Disambiguation for Semantic-aware Applications. In Proc. of CIKM, 2005.
[19]
F. Mandreoli, R. Martoglia, and E. Ronchetti. STRIDER: a Versatile System for Structural Disambiguation. In Proc. of EDBT, 2006.
[20]
F. Mandreoli, R. Martoglia, and P. Tiberio. Approximate Query Answering for a Heterogeneous XML Document Base. In Proc. of WISE, 2004.
[21]
S. Melnik, H. Garcia-Molina, and E. Rahm. Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching. In Proc. of the 18th ICDE, 2002.
[22]
R. Miller, L. Haas, and M. Hernández. Schema Mapping as Query Discovery. In Proc. of VLDB, 2000.
[23]
W. Nejdl, B. Wolf, C. Qu, S. Decker, M. Sintek, A. Naeve, M. Nilsson, M. Palmér, and T. Risch. EDUTELLA: A P2P Networking Infrastructure Based on RDF. In Proc. of the 11th WWW Conf., 2002.
[24]
J. Parreira, S. Michel, and G. Weikum. P2PDating: Real Life Inspired Semantic Overlay Networks for Web Search. Inf. Proc. & Manag., 43(3):643--664, 2007.
[25]
I. Tatarinov and A. Halevy. Efficient Query Reformulation in Peer Data Management Systems. In Proc. of the 2004 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD 2004), 2004.
[26]
C. Yu and H. Jagadish. Schema Summarization. In Proc. of the 32nd VLDB Conf., pages 319--330, 2006.

Cited By

View all
  • (2017)From Data Integration to Big Data IntegrationA Comprehensive Guide Through the Italian Database Research Over the Last 25 Years10.1007/978-3-319-61893-7_3(43-59)Online publication date: 31-May-2017
  • (2013)Privacy Preserving Query Answering in Peer Data Management SystemsProceedings of the 2013 IEEE 33rd International Conference on Distributed Computing Systems Workshops10.1109/ICDCSW.2013.46(64-69)Online publication date: 8-Jul-2013
  • (2010)Leveraging Semantic Approximations in Heterogeneous XML Data Sharing Networks: The SUNRISE ApproachSoft Computing in XML Data Management10.1007/978-3-642-14010-5_12(315-350)Online publication date: 2010

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DataX '08: Proceedings of the 2008 EDBT workshop on Database technologies for handling XML information on the web
March 2008
76 pages
ISBN:9781595939661
DOI:10.1145/1416691
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 March 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. PDMS
  2. XML
  3. network organization
  4. query processing
  5. semantics

Qualifiers

  • Research-article

Conference

EDBT '08

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 06 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2017)From Data Integration to Big Data IntegrationA Comprehensive Guide Through the Italian Database Research Over the Last 25 Years10.1007/978-3-319-61893-7_3(43-59)Online publication date: 31-May-2017
  • (2013)Privacy Preserving Query Answering in Peer Data Management SystemsProceedings of the 2013 IEEE 33rd International Conference on Distributed Computing Systems Workshops10.1109/ICDCSW.2013.46(64-69)Online publication date: 8-Jul-2013
  • (2010)Leveraging Semantic Approximations in Heterogeneous XML Data Sharing Networks: The SUNRISE ApproachSoft Computing in XML Data Management10.1007/978-3-642-14010-5_12(315-350)Online publication date: 2010

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media