Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1516360.1516487acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article
Free access

Estimating aggregates in time-constrained approximate queries in Oracle

Published: 24 March 2009 Publication History

Abstract

The concept of time-constrained SQL queries was introduced to address the problem of long-running SQL queries. A key approach adopted for supporting time-constrained SQL queries is to use sampling to reduce the amount of data that needs to be processed, thereby allowing completion of the query in the specified time constraint. However, sampling does make the query results approximate and hence requires the system to estimate the values of the expressions (especially aggregates) occurring in the select list. Thus, coming up with estimates for aggregates is crucial for time-constrained approximate SQL queries to be useful, which is the focus of this paper. Specifically, we address the problem of estimating commonly occurring aggregates (namely, SUM, COUNT, AVG, MEDIAN, MIN, and MAX) in time-constrained approximate queries. We give both point and interval estimates for SUM, COUNT, AVG, and MEDIAN using Bernoulli sampling for various type of queries, including join processing with cross product sampling. For MIN (MAX), we give the confidence level that the proportion 100γ% of the population will exceed the MIN (or be less than the MAX) obtained from the sampled data.

References

[1]
P. J. Haas, J. F. Naughton, S. Seshadri, A. N. Swami, "Selectivity and Cost Estimation for Joins Based on Random Sampling," J. Comput. Syst. Sci. 52(3), pp. 550--569, 1996.
[2]
W.-C. Hou, G. Özsoyoglu, B. K. Taneja, "Processing Aggregate Relational Queries with Hard Time Constraints," SIGMOD 1989, pp. 68--77.
[3]
Y. Hu, S. Sundara, J. Srinivasan, "Supporting Time-Constrained SQL Queries in Oracle," VLDB 2007, pp. 1207--1218.
[4]
G. S. Manku, S. Rajagopalan, B. G. Lindsay, "Approximate Medians and other Quantiles in One Pass and with Limited Memory," SIGMOD 1998, pp. 426--435.

Cited By

View all
  • (2023)Efficient Diversification for Recommending Aggregate Data VisualizationsIEEE Access10.1109/ACCESS.2023.328345711(62261-62280)Online publication date: 2023
  • (2018)DiVEProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3271744(1123-1132)Online publication date: 17-Oct-2018
  • (2018)Optimally Leveraging Density and Locality for Exploratory Browsing and SamplingProceedings of the Workshop on Human-In-the-Loop Data Analytics10.1145/3209900.3209903(1-7)Online publication date: 10-Jun-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
EDBT '09: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
March 2009
1180 pages
ISBN:9781605584225
DOI:10.1145/1516360
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 March 2009

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

EDBT/ICDT '09
EDBT/ICDT '09: EDBT/ICDT '09 joint conference
March 24 - 26, 2009
Saint Petersburg, Russia

Acceptance Rates

Overall Acceptance Rate 7 of 10 submissions, 70%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)29
  • Downloads (Last 6 weeks)6
Reflects downloads up to 10 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Efficient Diversification for Recommending Aggregate Data VisualizationsIEEE Access10.1109/ACCESS.2023.328345711(62261-62280)Online publication date: 2023
  • (2018)DiVEProceedings of the 27th ACM International Conference on Information and Knowledge Management10.1145/3269206.3271744(1123-1132)Online publication date: 17-Oct-2018
  • (2018)Optimally Leveraging Density and Locality for Exploratory Browsing and SamplingProceedings of the Workshop on Human-In-the-Loop Data Analytics10.1145/3209900.3209903(1-7)Online publication date: 10-Jun-2018
  • (2017)Database LearningProceedings of the 2017 ACM International Conference on Management of Data10.1145/3035918.3064013(587-602)Online publication date: 9-May-2017
  • (2014)ABSProceedings of the 2014 ACM SIGMOD International Conference on Management of Data10.1145/2588555.2594532(1067-1070)Online publication date: 18-Jun-2014
  • (2014)The analytical bootstrapProceedings of the 2014 ACM SIGMOD International Conference on Management of Data10.1145/2588555.2588579(277-288)Online publication date: 18-Jun-2014
  • (2010)An experimental study of time-constrained aggregate queriesProceedings of the 13th International Conference on Extending Database Technology10.1145/1739041.1739123(669-674)Online publication date: 22-Mar-2010

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media