Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Making Aggregation Work in Uncertain and Probabilistic Databases

Published: 01 August 2011 Publication History

Abstract

We describe how aggregation is handled in the Trio system for uncertain and probabilistic data. Because “exact” aggregation in uncertain databases can produce exponentially sized results, we provide three alternatives: a low bound on the aggregate value, a high bound on the value, and the expected value. These variants return a single result instead of a set of possible results, and they are generally efficient to compute for both full-table and grouped aggregation queries. We provide formal definitions and semantics and a description of our open source implementation for single-table aggregation queries. We study the performance and scalability of our algorithms through experiments over a large synthetic data set. We also provide some preliminary results on aggregations over joins.

Cited By

View all
  • (2025)FastPDB: Towards Bag-Probabilistic Queries at Interactive SpeedsProceedings of the ACM on Management of Data10.1145/37096913:1(1-25)Online publication date: 11-Feb-2025
  • (2023)Efficient Approximation of Certain and Possible Answers for Ranking and Window Queries over Uncertain DataProceedings of the VLDB Endowment10.14778/3583140.358315116:6(1346-1358)Online publication date: 20-Apr-2023
  • (2023)Queries with aggregate functions over fuzzy RDF dataThe Journal of Supercomputing10.1007/s11227-023-05235-x79:13(14780-14807)Online publication date: 9-Apr-2023
  • Show More Cited By
  1. Making Aggregation Work in Uncertain and Probabilistic Databases

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image IEEE Transactions on Knowledge and Data Engineering
    IEEE Transactions on Knowledge and Data Engineering  Volume 23, Issue 8
    August 2011
    158 pages

    Publisher

    IEEE Educational Activities Department

    United States

    Publication History

    Published: 01 August 2011

    Author Tags

    1. Database management
    2. query processing.

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)FastPDB: Towards Bag-Probabilistic Queries at Interactive SpeedsProceedings of the ACM on Management of Data10.1145/37096913:1(1-25)Online publication date: 11-Feb-2025
    • (2023)Efficient Approximation of Certain and Possible Answers for Ranking and Window Queries over Uncertain DataProceedings of the VLDB Endowment10.14778/3583140.358315116:6(1346-1358)Online publication date: 20-Apr-2023
    • (2023)Queries with aggregate functions over fuzzy RDF dataThe Journal of Supercomputing10.1007/s11227-023-05235-x79:13(14780-14807)Online publication date: 9-Apr-2023
    • (2021)Efficient Uncertainty Tracking for Complex Queries with Attribute-level BoundsProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452791(528-540)Online publication date: 9-Jun-2021
    • (2017)Reengineering Probabilistic Relational Databases with Fuzzy Probability Measures into XML ModelJournal of Database Management10.4018/JDM.201707010228:3(26-47)Online publication date: 1-Jul-2017
    • (2017)Probabilistic object deputy model for uncertain data and lineage managementData & Knowledge Engineering10.1016/j.datak.2017.03.005109:C(70-84)Online publication date: 1-May-2017
    • (2012)Aggregation in probabilistic databases via knowledge compilationProceedings of the VLDB Endowment10.14778/2140436.21404455:5(490-501)Online publication date: 1-Jan-2012

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media