Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1516360.1516441acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article
Free access

Fair, effective, efficient and differentiated scheduling in an enterprise data warehouse

Published: 24 March 2009 Publication History

Abstract

A typical online Business Intelligence (BI) workload consists of a combination of short, less intensive queries, along with long, resource intensive queries. As such, the longest queries in a typical BI workload may take several orders of magnitude more time to execute, compared with the shortest queries in the workload. This makes it challenging to design a good Mixed Workload Scheduler (MWS). In this paper we first define the design criteria that make a 'good' MWS. We then use these criteria to design rFEED, a MWS that is fair, effective, efficient, and differentiated. We simulate real workloads and compare our rFEED MWS with models of the current best of breed commercial systems. We show that the rFEED MWS works extremely well.

References

[1]
N. Bansal and K. Pruhs. Server scheduling in the lp norm: a rising tide lifts all boat. In STOC, pages 242--250, 2003.
[2]
L. Becchetti, S. Leonardi, A. Marchetti-Spaccamela, and K. Pruhs. Online weighted flow time and deadline scheduling. J. Discrete Algorithms, 4(3):339--352, 2006.
[3]
A. Bedekar, S. Borst, K. Ramanan, P. Whiting, and E. Yeh. Downlink scheduling in CDMA data networks. GLOBECOM, 5:2653--2657, 1999.
[4]
M. A. Bender, S. Chakrabarti, and S. Muthukrishnan. Flow and Stretch Metrics for Scheduling Continuous Job Streams. In SODA, pages 270--279, 1998.
[5]
M. A. Bender, S. Muthukrishnan, and R. Rajaraman. Improved algorithms for stretch scheduling. In SODA, pages 762--771, 2002.
[6]
P. Brucker. Scheduling Algorithms. Springer-Verlag New York, Inc., Secaucus, NJ, USA, 1995.
[7]
Carrie Ballinger. The Wild World of Mixed Workload: Priorities and resources learn to get along. Teradata Magazine Online. http://www.teradata.com/t/go.aspx/?id=114533.
[8]
B. Chandramouli, C. N. Bond, S. Babu, and J. Yang. Query suspend and resume. In SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pages 557--568, New York, NY, USA, 2007. ACM.
[9]
C. Chekuri and S. Khanna. Approximation schemes for preemptive weighted flow time. In STOC, pages 297--305, 2002.
[10]
C. Chekuri, S. Khanna, and A. Zhu. Algorithms for minimizing weighted flow time. In STOC, pages 84--93, 2001.
[11]
L. Cherkasova and T. Rokicki. Alpha Message Scheduling for Packet-Switched Interconnects. Technical Report HPL-94-71, HP Labs, August 1994.
[12]
M. Crovella, R. Frangioso, and M. Harchol-Balter. Connection scheduling in web servers. In USENIX Symposium on Internet Technologies and Systems, 1999.
[13]
E. J. Friedman and S. G. Henderson. Fairness and efficiency in web server protocols. In SIGMETRICS, pages 229--237, 2003.
[14]
J. R. M. Hosking and J. F. Wallis. Parameter and quantile estimation for the generalized pareto distribution. Technometrics, 29(3):339--349, 1987.
[15]
IBM. DB2 Query Patroller. http://www-306.ibm.com/software/data/db2/querypatroller/.
[16]
B. Kalyanasundaram, K. Pruhs, and M. Velauthapillai. Scheduling Broadcasts in Wireless Networks. In Proceedings of the 8th Annual European Symposium on Algorithms (ESA), pages 290--301, 2000.
[17]
A. Legrand, A. Su, and F. Vivien. Minimizing the stretch when scheduling flows of biological requests. In SPAA, pages 103--112, 2006.
[18]
J. Leung, L. Kelly, and J. H. Anderson. Handbook of Scheduling: Algorithms, Models, and Performance Analysis. CRC Press, Inc., Boca Raton, FL, USA, 2004.
[19]
A. Mehta, C. Gupta, S. Wang, and U. Dayal. rfeed: A mixed workload scheduler for enterprise data warehouses. In ICDE '09, Accepted, 2009.
[20]
S. Muthukrishnan, R. Rajaraman, A. Shaheen, and J. E. Gehrke. Online scheduling to minimize average stretch. In Proceedings of the 40th Annual Symposium on Foundations of Computer Science (FOCS), page 433, 1999.
[21]
V. Paxson. End-to-end internet packet dynamics. IEEE/ACM Trans. Netw., 7(3):277--292, 1999.
[22]
B. Schroeder, M. Harchol-Balter, A. Iyengar, E. M. Nahum, and A. Wierman. How to Determine a Good Multi-Programming Level for External Scheduling. In ICDE, page 60, 2006.
[23]
M. A. Sharaf, P. K. Chrysanthis, A. Labrinidis, and K. Pruhs. Efficient scheduling of heterogeneous continuous queries. In VLDB, pages 511--522, 2006.

Cited By

View all
  • (2022)LSched: A Workload-Aware Learned Query Scheduler for Analytical Database SystemsProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526158(1228-1242)Online publication date: 10-Jun-2022
  • (2021)Database isolation by schedulingProceedings of the VLDB Endowment10.14778/3461535.346153714:9(1467-1480)Online publication date: 22-Oct-2021
  • (2021)Fairness in rankings and recommendations: an overviewThe VLDB Journal10.1007/s00778-021-00697-y31:3(431-458)Online publication date: 2-Oct-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
EDBT '09: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
March 2009
1180 pages
ISBN:9781605584225
DOI:10.1145/1516360
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 March 2009

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

EDBT/ICDT '09
EDBT/ICDT '09: EDBT/ICDT '09 joint conference
March 24 - 26, 2009
Saint Petersburg, Russia

Acceptance Rates

Overall Acceptance Rate 7 of 10 submissions, 70%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)47
  • Downloads (Last 6 weeks)12
Reflects downloads up to 10 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)LSched: A Workload-Aware Learned Query Scheduler for Analytical Database SystemsProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526158(1228-1242)Online publication date: 10-Jun-2022
  • (2021)Database isolation by schedulingProceedings of the VLDB Endowment10.14778/3461535.346153714:9(1467-1480)Online publication date: 22-Oct-2021
  • (2021)Fairness in rankings and recommendations: an overviewThe VLDB Journal10.1007/s00778-021-00697-y31:3(431-458)Online publication date: 2-Oct-2021
  • (2019)Real-Time Supply Chain Simulation: A Big Data-Driven Approach2019 Winter Simulation Conference (WSC)10.1109/WSC40007.2019.9004717(548-559)Online publication date: Dec-2019
  • (2018)Workload Management in Database Management Systems: A TaxonomyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.276704430:7(1386-1402)Online publication date: 1-Jul-2018
  • (2017)Adaptive Concurrent Query Execution Framework for an Analytical In-Memory Database System2017 IEEE International Congress on Big Data (BigData Congress)10.1109/BigDataCongress.2017.13(23-30)Online publication date: Jun-2017
  • (2016)Cheap data analytics using cold storage devicesProceedings of the VLDB Endowment10.14778/2994509.29945219:12(1029-1040)Online publication date: 1-Aug-2016
  • (2015)Dynamic Query Prioritization for In-Memory DatabasesIn Memory Data Management and Analysis10.1007/978-3-319-13960-9_5(56-68)Online publication date: 14-Jan-2015
  • (2014)Sharing-Aware Scheduling of Web ServicesWeb Technologies and Applications10.1007/978-3-319-11116-2_14(153-164)Online publication date: 2014
  • (2014)Concurrent Execution of Mixed Enterprise Workloads on In-Memory DatabasesDatabase Systems for Advanced Applications10.1007/978-3-319-05810-8_9(126-140)Online publication date: 2014
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media