Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1951365.1951435acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

A probabilistic XML merging tool

Published: 21 March 2011 Publication History

Abstract

This demonstration paper presents a probabilistic XML data merging tool, that represents the outcome of semi-structured document integration as a probabilistic tree. The system is fully automated and integrates methods to evaluate the uncertainty (modeled as probability values) of the result of the merge. It is based on the two-way tree-merge technique and an uncertain data model defined using probabilistic event variables. The resulting probabilistic repository can be queried using a subset of the XPath query language. The demonstration application is based on revisions of the Wikipedia encyclopedia: a Wikipedia article is no longer considered as the latest valid revision but as the merge of all possible revisions, some of which are uncertain.

References

[1]
S. Abiteboul, B. Kimelfeld, Y. Sagiv, and P. Senellart. On the expressiveness of probabilistic XML models. VLDB Journal, 18(5):1041--1064, 2009.
[2]
B. T. Adler and L. de Alfaro. A content-driven reputation system for the wikipedia. In Proc. WWW, Banff, Canada, May 2007.
[3]
M. Benedikt, E. Kharlamov, D. Olteanu, and P. Senellart. Probabilistic XML via Markov chains. Proceedings of the VLDB Endowment, 3(1), Aug. 2010. Presented at the VLDB 2010 conference, Singapore.
[4]
E. Kharlamov, W. Nutt, and P. Senellart. Updating probabilistic XML. In Proc. Updates in XML, Lausanne, Switzerland, Mar. 2010.
[5]
B. Kimelfeld, Y. Kosharovsky, and Y. Sagiv. Query evaluation over probabilistic XML. VLDB Journal, 18(5):1117--1140, 2009.
[6]
R. La Fontaine. Merging XML files: A new approach providing intelligent merge of XML data sets. In Proc. XML Europe, Barcelona, Spain, May 2002.
[7]
T. Lindholm. A three-way merge for XML documents. In Proc. DocEng, Milwaukee, WI, USA, Oct. 2004.
[8]
M. van Keulen, A. de Keijzer, and W. Alink. A probabilistic XML approach to data integration. In Proc. ICDE, Tokyo, Japan, Apr. 2005.

Cited By

View all
  • (2018)Enabling lock-free concurrent workers over temporal graphs composed of multiple time-seriesProceedings of the 33rd Annual ACM Symposium on Applied Computing10.1145/3167132.3167255(1054-1061)Online publication date: 9-Apr-2018
  • (2014)Using versioned trees, change detection and node identity for three-way XML mergingComputer Science - Research and Development10.1007/s00450-013-0253-5Online publication date: 29-Nov-2014
  • (2013)Uncertain version control in open collaborative editing of tree-structured documentsProceedings of the 2013 ACM symposium on Document engineering10.1145/2494266.2494277(27-36)Online publication date: 10-Sep-2013
  • Show More Cited By

Index Terms

  1. A probabilistic XML merging tool

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    EDBT/ICDT '11: Proceedings of the 14th International Conference on Extending Database Technology
    March 2011
    587 pages
    ISBN:9781450305280
    DOI:10.1145/1951365

    Sponsors

    • Microsoft Research: Microsoft Research

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 March 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. XML merge
    2. probabilistic XML
    3. tree merge

    Qualifiers

    • Research-article

    Conference

    EDBT/ICDT '11
    Sponsor:
    • Microsoft Research
    EDBT/ICDT '11: EDBT/ICDT '11 joint conference
    March 21 - 24, 2011
    Uppsala, Sweden

    Acceptance Rates

    Overall Acceptance Rate 7 of 10 submissions, 70%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 06 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)Enabling lock-free concurrent workers over temporal graphs composed of multiple time-seriesProceedings of the 33rd Annual ACM Symposium on Applied Computing10.1145/3167132.3167255(1054-1061)Online publication date: 9-Apr-2018
    • (2014)Using versioned trees, change detection and node identity for three-way XML mergingComputer Science - Research and Development10.1007/s00450-013-0253-5Online publication date: 29-Nov-2014
    • (2013)Uncertain version control in open collaborative editing of tree-structured documentsProceedings of the 2013 ACM symposium on Document engineering10.1145/2494266.2494277(27-36)Online publication date: 10-Sep-2013
    • (2013)Optimizing approximations of DNF query lineage in probabilistic XMLProceedings of the 2013 IEEE International Conference on Data Engineering (ICDE 2013)10.1109/ICDE.2013.6544869(721-732)Online publication date: 8-Apr-2013
    • (2013)Probabilistic XML: Models and ComplexityAdvances in Probabilistic Databases for Uncertain Information Management10.1007/978-3-642-37509-5_3(39-66)Online publication date: 2013
    • (2011)Towards a version control model with uncertain dataProceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management10.1145/2065003.2065013(43-50)Online publication date: 28-Oct-2011
    • (2011)Efficient query evaluation over probabilistic XML with long-distance dependenciesProceedings of the 2011 Joint EDBT/ICDT Ph.D. Workshop10.1145/1966874.1966880(32-37)Online publication date: 25-Mar-2011

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media