Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Public Access

An Experimental Performance Evaluation of Autoscalers for Complex Workflows

Published: 10 April 2018 Publication History

Abstract

Elasticity is one of the main features of cloud computing allowing customers to scale their resources based on the workload. Many autoscalers have been proposed in the past decade to decide on behalf of cloud customers when and how to provision resources to a cloud application based on the workload utilizing cloud elasticity features. However, in prior work, when a new policy is proposed, it is seldom compared to the state-of-the-art, and is often compared only to static provisioning using a predefined quality of service target. This reduces the ability of cloud customers and of cloud operators to choose and deploy an autoscaling policy, as there is seldom enough analysis on the performance of the autoscalers in different operating conditions and with different applications. In our work, we conduct an experimental performance evaluation of autoscaling policies, using as application model workflows, a popular formalism for automating resource management for applications with well-defined yet complex structures. We present a detailed comparative study of general state-of-the-art autoscaling policies, along with two new workflow-specific policies. To understand the performance differences between the seven policies, we conduct various experiments and compare their performance in both pairwise and group comparisons. We report both individual and aggregated metrics. As many workflows have deadline requirements on the tasks, we study the effect of autoscaling on workflow deadlines. Additionally, we look into the effect of autoscaling on the accounted and hourly based charged costs, and we evaluate performance variability caused by the autoscaler selection for each group of workflow sizes. Our results highlight the trade-offs between the suggested policies, how they can impact meeting the deadlines, and how they perform in different operating conditions, thus enabling a better understanding of the current state-of-the-art.

References

[1]
B. Abbott et al. 2008. Search for gravitational waves from binary inspirals in S3 and S4 LIGO data. Phys. Rev. D 77 (2008), 062002.
[2]
Saeid Abrishami, Mahmoud Naghibzadeh, and Dick H. J. Epema. 2012. Cost-driven scheduling of grid workflows using partial critical paths. IEEE TPDS 23 (2012), 1400--1414.
[3]
Ahmed Ali-Eldin, Johan Tordsson, and Erik Elmroth. 2012. An adaptive hybrid elasticity controller for cloud infrastructures. In Proceedings of IEEE NOMS.
[4]
Ahmed Ali-Eldin et al. 2012. Efficient provisioning of bursty scientific workloads on the cloud using adaptive elasticity control. In Proceedings of ScienceCloud Workshop.
[5]
Ahmed Ali-Eldin et al. 2013. Workload Classification for Efficient Auto-Scaling of Cloud Resources. Technical Report. Umeå University, Lund University.
[6]
Guillermo A. Alvarez et al. 2001. Minerva: An automated resource provisioning tool for large-scale storage systems. ACM TOCS 19 (2001), 483--518.
[7]
Hamid Arabnejad and Jorge Barbosa. 2012. Fairness resource sharing for dynamic workflow scheduling on heterogeneous systems. In Proceedings of IEEE ISPA.
[8]
Shishir Bharathi et al. 2008. Characterization of scientific workflows. In Proceedings of WORKS Workshop.
[9]
Eun-Kyu Byun et al. 2011. Cost optimized provisioning of elastic resources for application workflows. FGCS 27 (2011), 1011--1026.
[10]
Jeffrey S. Chase et al. 2001. Managing energy and server resources in hosting centers. In Proceedings of ACM SIGOPS.
[11]
T. C. Chieu et al. 2009. Dynamic scaling of web applications in a virtualized cloud computing environment. In Proceedings of IEEE ICEBE.
[12]
Artem M. Chirkin et al. 2017. Execution time estimation for workflow scheduling. FGCS 75 (2017), 376--387.
[13]
Reginald Cushing et al. 2011. Prediction-based auto-scaling of scientific workflows. In Proceedings of MGC Workshop.
[14]
Rafael Ferreira Da Silva et al. 2015. Online task resource consumption prediction for scientific workflows. Parallel Process. Lett. 25 (2015).
[15]
Herbert A. David. 1987. Ranking from unbalanced paired-comparison data. Biometrika 74 (1987), 432--436.
[16]
Elias De Coninck et al. 2016. Dynamic auto-scaling and scheduling of deadline constrained service workloads on IaaS clouds. JSS 118 (2016), 101--114.
[17]
Christina Delimitrou and Christos Kozyrakis. 2014. Quasar: Resource-efficient and QoS-aware cluster management. ACM SIGPLAN Notices 49 (2014), 127--144.
[18]
Qiu Dishan et al. 2013. A dynamic scheduling method of earth-observing satellites by employing rolling horizon strategy. Sci. World J. 2013 (2013).
[19]
Tim Dornemann, Ernst Juhnke, and Bernd Freisleben. 2009. On-demand resource provisioning for BPEL workflows using amazon’s elastic compute cloud. In Proceedings of IEEE/ACM CCGrid.
[20]
Lipu Fei et al. 2014. KOALA-C: A task allocator for integrated multicluster and multicloud environments. In Proceedings of IEEE Cluster.
[21]
Dror G. Feitelson. 2015. Workload Modeling for Computer Systems Performance Evaluation. Cambridge University Press.
[22]
Hector Fernandez, Guillaume Pierre, and Thilo Kielmann. 2014. Autoscaling web applications in heterogeneous cloud infrastructures. In Proceedings of IEEE IC2E.
[23]
Philip J. Fleming and John J. Wallace. 1986. How not to lie with statistics: The correct way to summarize benchmark results. ACM Commun. 29 (1986), 218--221.
[24]
Anshul Gandhi et al. 2012. Autoscale: Dynamic, robust capacity management for multi-tier data centers. ACM TOCS 30 (2012), 14:1--14:26.
[25]
Thomas Heinis et al. 2005. Design and evaluation of an autonomic workflow engine. In Proceedings of IEEE ICAC.
[26]
Nikolas Herbst et al. 2016. Ready for Rain? A View from SPEC Research on the Future of Cloud Metrics. Technical Report. SPEC Research Group, Cloud Working Group.
[27]
Nikolas Herbst, Samuel Kounev, and Ralf Reussner. 2013. Elasticity in cloud computing: What it is, and What it is Not. In Proceedings of ICAC.
[28]
Alexey Ilyushkin et al. 2017. An experimental performance evaluation of autoscaling policies for complex workflows. In Proceedings of ACM/SPEC ICPE.
[29]
Alexey Ilyushkin, Bogdan Ghit, and Dick Epema. 2015. Scheduling workloads of workflows with unknown task runtimes. In Proceedings of IEEE/ACM CCGrid.
[30]
Alexandru Iosup et al. 2011. Performance analysis of cloud computing services for many-tasks scientific computing. Proceedings of IEEE TPDPS 22 (2011), 931--945.
[31]
Waheed Iqbal et al. 2011. Adaptive resource provisioning for read intensive multi-tier applications in the cloud. FGCS 27 (2011), 871--879.
[32]
Mohammad Islam et al. 2012. Oozie: Towards a scalable workflow management system for hadoop. In Proceedings of the ACM SIGMOD Workshop SWEET.
[33]
Joseph C. Jacob et al. 2010. Montage: An astronomical image mosaicking toolkit. Astrophys. Source Code Libr. 1 (2010), 10036.
[34]
Gideon Juve et al. 2017. Synthetic workflow generators. Retrieved from https://github.com/pegasus-isi/WorkflowGenerator.
[35]
Jonathan Livny. 2012. Bioinformatic discovery of bacterial regulatory RNAs using SIPHT. In Bacterial Regulatory RNA. Springer, 3--14.
[36]
Dionysios Logothetis et al. 2010. Stateful bulk processing for incremental analytics. In Proceedings of SoCC.
[37]
Tania Lorido-Botran et al. 2014. A review of auto-scaling techniques for elastic applications in cloud environments. J. Grid Comput. 12 (2014), 559--592.
[38]
Uri Lublin and Dror G. Feitelson. 2003. The workload on parallel supercomputers: Modeling the characteristics of rigid jobs. JPDC 63 (2003), 1105--1122.
[39]
Maciej Malawski et al. 2012. Cost-and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds. In Proceedings of ACM/IEEE Conference on Supercomputing.
[40]
Maciej Malawski et al. 2015. Scheduling multilevel deadline-constrained scientific workflows on clouds based on cost optimization. Sci. Program. 2015 (2015).
[41]
Ming Mao and Marty Humphrey. 2011. Auto-scaling to minimize cost and meet application deadlines in cloud workflows. In Proceedings of ACM/IEEE Conference on Supercomputing.
[42]
Ming Mao and Marty Humphrey. 2013. Scaling and scheduling to maximize application performance within budget constraints in cloud workflows. In Proceedings of IEEE IPDPS.
[43]
Athanasios Naskos et al. 2015. Dependable horizontal scaling based on probabilistic model checking. In Proceedings of IEEE/ACM CCGrid.
[44]
Simon Ostermann et al. 2008. On the characteristics of grid workflows. In Proceedings of the CoreGRID Integration Workshop.
[45]
A. V. Papadopoulos et al. 2016. PEAS: A performance evaluation framework for auto-scaling strategies in cloud applications, tail response time modeling, and control for interactive cloud services. ACM TOMPECS 1 (2016), 15:1--15:31.
[46]
Mayank Pundir et al. 2016. Supporting on-demand elasticity in distributed graph processing. In Proceedings of IEEE IC2E.
[47]
Mustafizur Rahman, Xiaorong Li, and Henry Palit. 2011. Hybrid heuristic for scheduling data analytics workflow applications in hybrid cloud environment. In Proceedings of IEEE IPDPSW.
[48]
Simon Spinner et al. 2015. Evaluating approaches to resource demand estimation. Perform. Eval. 92 (2015), 51--71.
[49]
Domenico Talia. 2013. Toward cloud-based big-data analytics. IEEE Comput. Sci. 2 (2013), 98--101.
[50]
Ian J. Taylor et al. 2014. Workflows for e-Science: Scientific Workflows for Grids. Springer.
[51]
Sachin Tilloo. 2017. Running arbitrary DAG-based workflows in the cloud. Retrieved from http://www.ebaytechblog.com/2016/04/05/running-arbitrary-dag-based-workflows-in-the-cloud.
[52]
Bhuvan Urgaonkar et al. 2005. An analytical model for multi-tier internet services and its applications. In Proceedings of ACM SIGMETRICS.
[53]
Bhuvan Urgaonkar et al. 2008. Agile dynamic provisioning of multi-tier internet applications. ACM TAAS 3 (2008), 1:1--1:39.
[54]
Naga Vydyanathan et al. 2008. A duplication based algorithm for optimizing latency under throughput constraints for streaming workflows. In Proceedings of ICPP.
[55]
Katherine Yelick et al. 2011. The Magellan report on cloud computing for science. U.S. Department of Energy, Washington, DC, Tech. Rep (2011).

Cited By

View all
  • (2022)A bi-metric autoscaling approach for n-tier web applications on kubernetesFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-021-0118-116:3Online publication date: 1-Jun-2022
  • (2021)Performance Analysis of the IOTA DAG-Based Distributed LedgerACM Transactions on Modeling and Performance Evaluation of Computing Systems10.1145/34851886:3(1-20)Online publication date: 2-Dec-2021
  • (2021)A Survey on Resilience in the IoTACM Computing Surveys10.1145/346251354:7(1-39)Online publication date: 17-Sep-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Modeling and Performance Evaluation of Computing Systems
ACM Transactions on Modeling and Performance Evaluation of Computing Systems  Volume 3, Issue 2
Special Issue on ICPE 2017 and Regular Papers
June 2018
114 pages
ISSN:2376-3639
EISSN:2376-3647
DOI:10.1145/3199681
  • Editors:
  • Sem Borst,
  • Carey Williamson
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 April 2018
Accepted: 01 November 2017
Revised: 01 October 2017
Received: 01 June 2017
Published in TOMPECS Volume 3, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Autoscaling
  2. benchmarking
  3. elasticity
  4. metrics
  5. scientific workflows

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • Swedish Government’s strategic research project eSSENCE
  • Dutch projects Vidi MagnaData and KIEM KIESA
  • Research Group of the Standard Performance Evaluation Corporation (SPEC)
  • Swedish Research Council (VR) project Cloud Control
  • Commit and the Commit projects IV-E and Commissioner
  • NSF
  • German Research Foundation (DFG)

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)116
  • Downloads (Last 6 weeks)27
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)A bi-metric autoscaling approach for n-tier web applications on kubernetesFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-021-0118-116:3Online publication date: 1-Jun-2022
  • (2021)Performance Analysis of the IOTA DAG-Based Distributed LedgerACM Transactions on Modeling and Performance Evaluation of Computing Systems10.1145/34851886:3(1-20)Online publication date: 2-Dec-2021
  • (2021)A Survey on Resilience in the IoTACM Computing Surveys10.1145/346251354:7(1-39)Online publication date: 17-Sep-2021
  • (2021)Methodological Principles for Reproducible Performance Evaluation in Cloud ComputingIEEE Transactions on Software Engineering10.1109/TSE.2019.292790847:8(1528-1543)Online publication date: 1-Aug-2021
  • (2021)Predictive Autoscaling of Microservices Hosted in Fog Microdata CenterIEEE Systems Journal10.1109/JSYST.2020.299751815:1(1275-1286)Online publication date: Mar-2021
  • (2021)WIRE: Resource-efficient Scaling with Online Prediction for DAG-based Workflows2021 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/Cluster48925.2021.00025(35-46)Online publication date: Sep-2021
  • (2020)Self-Adaptive Data Processing to Improve SLOs for Dynamic IoT WorkloadsComputers10.3390/computers90100129:1(12)Online publication date: 14-Feb-2020
  • (2020)Effective Elastic Scaling of Deep Learning Workloads2020 28th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)10.1109/MASCOTS50786.2020.9285954(1-8)Online publication date: 17-Nov-2020
  • (2020)An Experimental Evaluation of the Kubernetes Cluster Autoscaler in the Cloud2020 IEEE International Conference on Cloud Computing Technology and Science (CloudCom)10.1109/CloudCom49646.2020.00002(17-24)Online publication date: Dec-2020
  • (2020)Evaluation of cloud autoscaling strategies under different incoming workload patternsConcurrency and Computation: Practice and Experience10.1002/cpe.566732:17Online publication date: 9-Jan-2020
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media