research-article

WebContent: efficient P2P Warehousing of web data

Published: 01 August 2008

Abstract

We present the WebContent platform for managing distributed repositories of XML and semantic Web data. The platform integrates various data processing building blocks (crawling, translation, semantic annotation, full-text search, structured XML querying, and semantic querying), presented as Web services, into a large-scale, efficient platform. Calls to these services are combined inside ActiveXML [8] documents, which are XML documents that include service calls. An ActiveXML optimizer is used to: (i) efficiently distribute computations among sites; (ii) perform XQuery-specific optimizations by leveraging an algebraic XQuery optimizer; and (iii) given an XML query, choose among several distributed indices the one most appropriate for answering the query.

References

[1] S. Abiteboul, Z. Abrams, S. Haar, and T. Milo. Diagnosis of asynchronous discrete event systems: Datalog to the rescue! In PODS, 2005.
[2] S. Abiteboul, O. Benjelloun, B. Cautis, I. Manolescu, T. Milo, and N. Preda. Lazy query evaluation for Active XML. In SIGMOD, 2004.
[3] S. Abiteboul, A. Bonifati, G. Cobéna, I. Manolescu, and T. Milo. Dynamic XML documents with distribution and replication. In SIGMOD, 2003.
[4] S. Abiteboul, I. Manolescu, N. Polyzotis, N. Preda, and C. Sun. XML processing in DHT networks. In ICDE, 2008.
[5] S. Abiteboul, I. Manolescu, and S. Zoupanos. OptimAX: Efficient Support for Data-Intensive Mash-Ups (demo). In ICDE, 2008.
[6] S. Abiteboul, I. Manolescu, and S. Zoupanos. OptimAX: Optimizing Distributed ActiveXML Applications. In ICWE, 2008.
[7] P. Adjiman, F. Goasdoué, and M.-C. Rousset. SomeRDFS in the semantic web. Journal on Data Semantics, 8, 2007.
[8] ActiveXML home page. Available at http://www.activexml.net.
[9] Business Process Execution Language for Web Services. www.ibm.com/developerworks/library/ws-bpel.
[10] D. Chappell. Enterprise Service Bus. O'Reilly, 2004.
[11] F. Dabek, B. Zhao, P. Druschel, J. Kubiatowicz, and I. Stoica. Towards a common API for structured P2P overlays. In IPTPS, 2003.
[12] F. Dragan, G. Gardarin, and L. Yeh. Pathfinder: Indexing and querying XML data in a P2P system. In WTAS, 2006.
[13] L. Galanis, Y. Wang, S. Jeffery, and D. DeWitt. Locating data sources in large distributed systems. In VLDB, 2003.
[14] SPARQL query language for RDF. http://www.w3.org/TR/rdf-sparql-query/.
[15] N. Travers, T. Dang-Ngoc, and T. Liu. TGV: A tree graph view for modeling untyped XQuery. In DASFAA, 2007.
[16] P. Valduriez and T. Ozsu. Principles of Distributed Database Systems. Prentice Hall, 1999.
[17] W3C. WSDL: Web Services Definition Language 1.1.
[18] W3C. SOAP version 1.2 part 1: Messaging framework (second edition), 2007.
[19] P. Wu, Y. Sismanis, and B. Reinwald. Towards keyword-driven analytical processing. In SIGMOD, 2007.
[20] Exist: Open source native XML database. Available at http://exist.sourceforge.net, 2004.
[21] MonetDB database system with XQuery front-end. Available at http://monetdb.cwi.nl/XQuery, 2007.
[22] WebContent, the Semantic Web platform (rntl project). www.webcontent.fr.

Published In

Proceedings of the VLDB Endowment  Volume 1, Issue 2
August 2008
461 pages

Publisher

VLDB Endowment

Publication History

Published: 01 August 2008
Published in PVLDB Volume 1, Issue 2

Qualifiers

  • Research-article

Article Metrics

  • Downloads (last 12 months): 1
  • Downloads (last 6 weeks): 0

Reflects downloads up to 24 Dec 2024.

Cited By

  • (2010) Brown dwarf. Proceedings of the 19th ACM international conference on Information and knowledge management, pp. 1945-1946. doi:10.1145/1871437.1871777. Online publication date: 26-Oct-2010.
  • (2010) Distributing the power of OLAP. Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, pp. 324-327. doi:10.1145/1851476.1851521. Online publication date: 21-Jun-2010.
  • (2010) Efficient updates for a shared nothing analytics platform. Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, pp. 1-6. doi:10.1145/1779599.1779606. Online publication date: 26-Apr-2010.
  • (2009) Non-conservative extension of a peer in a P2P inference system. AI Communications 22:4, pp. 211-233. doi:10.5555/1662603.1662604. Online publication date: 1-Dec-2009.
