Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

WiSeDB: a learning-based workload management advisor for cloud databases

Published: 01 June 2016 Publication History

Abstract

Workload management for cloud databases deals with the tasks of resource provisioning, query placement, and query scheduling in a manner that meets the application's performance goals while minimizing the cost of using cloud resources. Existing solutions have approached these three challenges in isolation while aiming to optimize a single performance metric. In this paper, we introduce WiSeDB, a learning-based framework for generating holistic workload management solutions customized to application-defined performance goals and workload characteristics. Our approach relies on supervised learning to train cost-effective decision tree models for guiding query placement, scheduling, and resource provisioning decisions. Applications can use these models for both batch and online scheduling of incoming workloads. A unique feature of our system is that it can adapt its offline model to stricter/looser performance goals with minimal re-training. This allows us to present to the application alternative workload management strategies that address the typical performance vs. cost trade-off of cloud services. Experimental results show that our approach has very low training overhead while offering low cost strategies for a variety of performance metrics and workload characteristics.

References

[1]
Amazon Web Services, http://aws.amazon.com/.
[2]
Microsoft Azure Services, http://www.microsoft.com/azure/.
[3]
PostgreSQL database, http://www.postgresql.org/.
[4]
The TPC-H benchmark, http://www.tpc.org/tpch/.
[5]
Weka 3, http://cs.waikato.ac.nz/ml/weka/.
[6]
K. D. Ba et al. Sublinear time algorithms for earth mover's distance. TCS '11.
[7]
U. V. Catalyurek et al. Integrated data placement and task assignment for scientific workflows in clouds. In DIDC '11.
[8]
Y. Chi et al. iCBS: Incremental cost-based scheduling under piecewise linear SLAs. In VLDB '11.
[9]
Y. Chi et al. SLA-tree: A framework for efficiently supporting SLA-based decisions in cloud computing. In EDBT '11.
[10]
J. Duggan et al. Contender: A resource modeling approach for concurrent query performance prediction. In EDBT '14.
[11]
J. Duggan et al. Performance prediction for concurrent database workloads. In SIGMOD '11.
[12]
A. J. Elmore et al. Characterizing tenant behavior for placement and crisis mitigation in multitenant DBMSs. In SIGMOD '13.
[13]
P. E. Greenwood et al. A guide to chi-squared testing, volume 280. 1996.
[14]
P. Hart et al. A formal basis for the heuristic determination of minimum cost paths. SSC '68.
[15]
V. Jalaparti et al. Bridging the tenant-provider gap in cloud services. In SoCC '12.
[16]
S. Koenig et al. A new principle for incremental heuristic search: Theoretical results. In ICAPS '06.
[17]
W. Lang et al. Towards multi-tenant performance SLOs. In ICDE '14.
[18]
Z. Liu et al. PMAX: Tenant placement in multitenant databases for profit maximization. In EDBT '13.
[19]
H. Mahmoud et al. CloudOptimizer: Multi-tenancy for I/O-bound OLAP workloads. In EDBT '13.
[20]
R. Marcus et al. WiSeDB: A learning-based workload management advisor for cloud databases. Technical Report, arXiv.org.
[21]
R. Marcus et al. Workload management for cloud databases via machine learning. In CloudDM '16.
[22]
S. Martello et al. Knapsack Problems: Algorithms and Computer Implementations. 1990.
[23]
V. Narasayya et al. SQLVM: Performance isolation in multi-tenant relational database-as-a-service. In CIDR '13.
[24]
J. Ortiz et al. Changing the face of database cloud services with personalized service level agreements. In CIDR '15.
[25]
M. Poess et al. New TPC benchmarks for decision support and web commerce. SIGMOD '00.
[26]
S. Tozer et al. Q-Cop: Avoiding bad query mixes to minimize client timeouts under heavy loads. In ICDE '10.
[27]
V. Vazirani. Approximation algorithms. 2013.
[28]
P. Xiong et al. ActiveSLA: A profit-oriented admission control framework for database-as-a-service providers. In SoCC '11.
[29]
P. Xiong et al. Intelligent management of virtualized resources for database systems in cloud environment. In ICDE '11.

Cited By

View all
  • (2024)A Spark Optimizer for Adaptive, Fine-Grained Parameter TuningProceedings of the VLDB Endowment10.14778/3681954.368202117:11(3565-3579)Online publication date: 1-Jul-2024
  • (2024)Intelligent Scaling in Amazon RedshiftCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653394(269-279)Online publication date: 9-Jun-2024
  • (2024)Stage: Query Execution Time Prediction in Amazon RedshiftCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653391(280-294)Online publication date: 9-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 9, Issue 10
June 2016
132 pages
ISSN:2150-8097
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 June 2016
Published in PVLDB Volume 9, Issue 10

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Spark Optimizer for Adaptive, Fine-Grained Parameter TuningProceedings of the VLDB Endowment10.14778/3681954.368202117:11(3565-3579)Online publication date: 1-Jul-2024
  • (2024)Intelligent Scaling in Amazon RedshiftCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653394(269-279)Online publication date: 9-Jun-2024
  • (2024)Stage: Query Execution Time Prediction in Amazon RedshiftCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653391(280-294)Online publication date: 9-Jun-2024
  • (2024)Flux: Decoupled Auto-Scaling for Heterogeneous Query Workload in Alibaba AnalyticDBCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3653381(255-268)Online publication date: 9-Jun-2024
  • (2023)Real-Time Workload Pattern Analysis for Large-Scale Cloud DatabasesProceedings of the VLDB Endowment10.14778/3611540.361155716:12(3689-3701)Online publication date: 1-Aug-2023
  • (2023)Auto-WLM: Machine Learning Enhanced Workload Management in Amazon RedshiftCompanion of the 2023 International Conference on Management of Data10.1145/3555041.3589677(225-237)Online publication date: 4-Jun-2023
  • (2022)Fine-grained modeling and optimization for intelligent resource management in big data processingProceedings of the VLDB Endowment10.14778/3551793.355185515:11(3098-3111)Online publication date: 29-Sep-2022
  • (2021)Toto – Benchmarking the Efficiency of a Cloud ServiceProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3457555(2543-2556)Online publication date: 9-Jun-2021
  • (2019)NashDBProceedings of the VLDB Endowment10.14778/3352063.335207712:12(1830-1833)Online publication date: 1-Aug-2019
  • (2019)Plan-structured deep neural network models for query performance predictionProceedings of the VLDB Endowment10.14778/3342263.334264612:11(1733-1746)Online publication date: 1-Jul-2019
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media