Generating thousand benchmark queries in seconds

Published: 31 August 2004

Abstract

The combination of an exponential growth in the amount of data managed by a typical business intelligence system and the increased competitiveness of a global economy has propelled decision support systems (DSS) from the role of exploratory tools employed by a few visionary companies to become a core requirement for a competitive enterprise. That same maturation has often resulted in a selection process that requires an ever more critical system evaluation and selection to be completed in an increasingly short period of time. While there have been some advances in the generation of data sets for system evaluation (see [3]), the quantification of query performance has often relied on models and methodologies that were developed for systems that were more simplistic, less dynamic, and less central to a successful business. In this paper we present QGEN, a flexible, high-level query generator optimized for decision support system evaluation. QGEN is able to generate arbitrary query sets, which conform to a selected statistical profile without requiring that the queries be statically defined or disclosed prior to testing. Its novel design links query syntax with abstracted data distributions, enabling users to parameterize their query workload to match an emerging access pattern or data set modification. This results in query sets that retain comparability for system comparisons while reflecting the inherent dynamism of operational systems, and which provide a broad range of syntactic and semantic coverage, while remaining focused on appropriate commonalities within a particular evaluation process or business segment.
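The idea of linking query syntax to data distributions can be illustrated with a minimal sketch. It is not QGEN's template language or implementation: the table and column names, the `$year` and `$category` placeholders, the category weights, and the `generate_queries` helper are all assumptions made for illustration. The sketch only shows how a single query template, combined with a seeded random source and a weighted value distribution, can yield a large, reproducible query set without any query being statically defined in advance.

```python
import random
from string import Template

# Hypothetical query template. $year and $category are placeholder names
# chosen for this sketch; they are not QGEN syntax.
QUERY_TEMPLATE = Template(
    "SELECT item_id, SUM(sales_price) AS revenue "
    "FROM store_sales "
    "WHERE sale_year = $year AND category = '$category' "
    "GROUP BY item_id"
)

# Assumed data distribution: category values with relative weights.
CATEGORY_WEIGHTS = {"Books": 5, "Music": 3, "Sports": 2}


def generate_queries(count, seed):
    """Return `count` query strings; the same seed reproduces the same set."""
    rng = random.Random(seed)  # private generator, so runs are repeatable
    categories = list(CATEGORY_WEIGHTS)
    weights = list(CATEGORY_WEIGHTS.values())
    queries = []
    for _ in range(count):
        # Draw substitution values: uniform year, weighted category.
        year = rng.randint(1998, 2002)
        category = rng.choices(categories, weights=weights, k=1)[0]
        queries.append(QUERY_TEMPLATE.substitute(year=year, category=category))
    return queries


if __name__ == "__main__":
    for query in generate_queries(3, seed=42):
        print(query)
```

Changing the weights or the year range changes the workload profile without touching the template, and publishing only the seed after a test run would let others regenerate the same queries.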

References

[1] Slutz, D. Massive Stochastic Testing of SQL. Proc. 24th Int. Conf. Very Large Data Bases (VLDB), 1998.
[2] ISO/IEC 9075. Database Language SQL. International Standard ISO/IEC 9075:1992, American National Standard X3.135-1992, ANSI, New York, NY 10036, November 1992.
[3] Kimball, R. The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses. John Wiley & Sons, 1996.
[4] Poess, M. and Floyd, C. New TPC Benchmarks for Decision Support and Web Commerce. ACM SIGMOD Record, Vol. 29, No. 4 (Dec. 2000).
[5] Poess, M., Smith, B., Kollár, L., and Larson, P. TPC-DS: Taking Decision Support Benchmarking to the Next Level. SIGMOD Conference 2002.
[6] Stephens, J. and Poess, M. Mudd: A Multi-Dimensional Data Generator. WOSP 2004.
[7] Transaction Processing Performance Council (TPC). TPC Benchmark D (Decision Support), May 1995. http://www.tpc.org/tpcd/spec/tpcd_current.pdf
[8] Transaction Processing Performance Council (TPC). TPC-H Specification Version 2.1.0, August 2003. http://www.tpc.org/tpch/spec/tpch2.1.0.pdf
[9] Transaction Processing Performance Council (TPC). TPC-R Specification Version 2.1.0, August 2003. http://www.tpc.org/tpcr/spec/tpcr2.1.0.pdf
[10] US Census Bureau. Unadjusted and Adjusted Estimates of Monthly Retail and Food Services Sales by Kinds of Business: 2001, Department stores (excl.L.D) 4521. http://www.census.gov/mrts/www/data/html/nsal01.html

Published In

VLDB '04: Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
August 2004
1380 pages

Sponsors

  • VLDB Endowment: Very Large Database Endowment

Publisher

VLDB Endowment

Qualifiers

  • Article

Cited By

  • (2020) Consistent synchronization schemes for workload replay. Proceedings of the VLDB Endowment 4:12 (1225-1236). DOI: 10.14778/3402755.3402757. Online publication date: 3-Jun-2020
  • (2017) gMark. IEEE Transactions on Knowledge and Data Engineering 29:4 (856-869). DOI: 10.1109/TKDE.2016.2633993. Online publication date: 1-Apr-2017
  • (2016) A semantic-aware data generator for ETL workflows. Concurrency and Computation: Practice & Experience 28:4 (1016-1040). DOI: 10.1002/cpe.3028. Online publication date: 25-Mar-2016
  • (2013) REDACT: preventing database deadlocks from application-based transactions. Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering (591-594). DOI: 10.1145/2491411.2494594. Online publication date: 18-Aug-2013
  • (2013) Preventing database deadlocks in applications. Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering (356-366). DOI: 10.1145/2491411.2491412. Online publication date: 18-Aug-2013
  • (2013) Variations of the star schema benchmark to test the effects of data skew on query performance. Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering (361-372). DOI: 10.1145/2479871.2479927. Online publication date: 21-Apr-2013
  • (2010) Automated SQL query generation for systematic testing of database engines. Proceedings of the 25th IEEE/ACM International Conference on Automated Software Engineering (329-332). DOI: 10.1145/1858996.1859063. Online publication date: 20-Sep-2010
  • (2010) Using the optimizer to generate an effective regression suite. Proceedings of the Third International Workshop on Testing Database Systems (1-6). DOI: 10.1145/1838126.1838131. Online publication date: 7-Jun-2010
  • (2010) A framework for testing DBMS features. The VLDB Journal — The International Journal on Very Large Data Bases 19:2 (203-230). DOI: 10.1007/s00778-009-0157-y. Online publication date: 1-Apr-2010
  • (2008) Oracle database replay. Proceedings of the 2008 ACM SIGMOD international conference on Management of data (1159-1170). DOI: 10.1145/1376616.1376732. Online publication date: 9-Jun-2008
