Horizontal Scaling in Cloud Using Contextual Bandits

Delande, David; Stolf, Patricia; Feraud, Raphaël; Pierson, Jean-Marc; Bottaro, André

doi:10.1007/978-3-030-85665-6_18

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12820))

Included in the following conference series:

European Conference on Parallel Processing

2097 Accesses
2 Citations

Abstract

One characteristic of the Cloud is elasticity: it provides the ability to adapt resources allocated to applications as needed at runtime. This capacity relies on scaling and scheduling. In this article online horizontal scaling is studied. The aim is to determine dynamically applications deployment parameters and to adjust them in order to respect a Quality of Service level without any human parameters tuning. This work focuses on CaaS (container-based) environments and proposes an algorithm based on contextual bandits (HSLinUCB). Our proposal has been evaluated on a simulated platform and on a real Kubernetes’s platform. The comparison has been done with several baselines: threshold based auto-scaler, Q-Learning, and Deep Q-Learning. The results show that HSLinUCB gives very good results compared to other baselines, even when used without any training period.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

An Auto-Scaling Cloud Controller Using Fuzzy Q-Learning - Implementation in OpenStack

CloudAIBus: a testbed for AI based cloud computing environments

Article 06 June 2024

Enhancing Machine Learning-Based Autoscaling for Cloud Resource Orchestration

Article Open access 19 October 2024

Notes

References

Abbasi-Yadkori, Y., Pal, D., Szepesvari, C.: Improved algorithms for linearstochastic bandits. In: NIPS (2011)
Google Scholar
Abdullah, M., Iqbal, W., Bukhari, F.: Containers vs virtual machines for auto-scaling multi-tier applications under dynamically increasing workloads. In: Intelligent Technologies and Applications (2019)
Google Scholar
Al-Dhuraibi, Y., Paraiso, F., Djarallah, N., Merle, P.: Elasticity in cloud computing: State of the art and research challenges. IEEE Trans. Serv, Comput. 11(2), 430–447 (2018). https://doi.org/10.1109/TSC.2017.2711009
Ayimba, C., Casari, P., Mancuso, V.: SQLR: short term memory q-learning for elastic provisioning. CoRR (2019)
Google Scholar
Barrett, E., Howley, E., Duggan, J.: Applying reinforcement learning towards automating resource allocation and application scalability in the cloud. Concurr. Comput. Pract, Exp 25(12), 1656–1674 (2013)
Google Scholar
Cano, I., et al.: ADARES: Adaptive resource management for virtual machines. arXiv (2018)
Google Scholar
Coutinho, E.F., de Carvalho Sousa, F.R., Rego, P.A.L., Gomes, D.G., de Souza, J.N.: Elasticity in cloud computing: a survey. annals of telecommunications - annales des télécommunications, pp. 289–309 (2014). https://doi.org/10.1007/s12243-014-0450-7
Dutreilh, X., Kirgizov, S., Melekhova, O., Malenfant, J., Rivierre, N., Truck, I.: Using reinforcement learning for autonomic resource allocation in clouds: Towards a fully automated workflow. In: ICAS (2011)
Google Scholar
Gari, Y., Monge, D.A., Pacini, E., Mateos, C., Garino, C.G.: Reinforcement learning-based autoscaling of workflows in the cloud: A survey. CoRR (2020)
Google Scholar
Hwang, K., Bai, X., Shi, Y., Li, M., Chen, W., Wu, Y.: Cloud performance modeling with benchmark evaluation of elastic scaling strategies. IEEE Trans. Parallel Distrib. Syst. 27(1), 130–143 (2016)
Article Google Scholar
Jin, Y., Bouzid, M., Kostadinov, D., Aghasaryan, A.: Model-free resource management of cloud-based applications using reinforcement learning. In: ICIN (2018)
Google Scholar
Khatua, S., Ghosh, A., Mukherjee, N.: Optimizing the utilization of virtual resources in cloud environment. In: VECIMS (2010)
Google Scholar
Li, L., Chu, W., Langford, J., Schapire, R.E.: A contextual-bandit approach to personalized news article recommendation. In: WWW (2010)
Google Scholar
Lorido-Botran, T., Miguel-Alonso, J., Lozano, J.A.: A review of auto-scaling techniques for elastic applications in cloud environments. J. Grid Comput. 1–34 (2014). https://doi.org/10.1007/s10723-014-9314-7
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Article Google Scholar
Nguyen, H., Shen, Z., Gu, X., Subbiah, S., Wilkes, J.: AGILE: Elastic distributed resource scaling for infrastructure-as-a-service. In: ICAC (2013)
Google Scholar
Nikravesh, A.Y., Ajila, S.A., Lung, C.: Towards an autonomic auto-scaling prediction system for cloud resource provisioning. In: SEAMS (2015)
Google Scholar
Pascual, J.A., Lozano, J.A., Miguel-Alonso, J.: Effects of reducing VMs management times on elastic applications. J. Grid Comput. 518(7540), 529–533 (2018)
Google Scholar
Qu, C., Calheiros, R.N., Buyya, R.: Auto-scaling web applications in clouds: a taxonomy and survey. ACM Comput. Surv. 51(4), 1–33 (2018)
Article Google Scholar
Schuler, L., Jamil, S., Kühl, N.: AI-based resource allocation: Reinforcement learning for adaptive auto-scaling in serverless environments. arXiv (2020)
Google Scholar
Shariffdeen, R.S., Munasinghe, D.T.S.P., Bhathiya, H.S., Bandara, U.K.J.U., Bandara, H.M.N.D.: Adaptive workload prediction for proactive auto scaling in PaaS systems. In: CloudTech (2016)
Google Scholar
Singh, P., Gupta, P., Jyoti, K., Nayyar, A.: Research on auto-scaling of web applications in cloud: Survey, trends and future directions. Pract. Experience Scalable Comput. 20(2), 399–432 (2019)
Article Google Scholar
Sutton, R.S., Barto, A.G.: Introduction to Reinforcement Learning. MIT Press, Cambridge (1998)
Google Scholar
Tadakamalla, V., Menasce, D.A.: Model-driven elasticity control for multi-server queues under traffic surges in cloud environments. In: ICAC (2018)
Google Scholar
Toslali, M., Parthasarathy, S., Oliveira, F., Coskun, A.K.: JACKPOT: Online experimentation of cloud microservices. In: HotCloud (2020)
Google Scholar
Urdaneta, G., Pierre, G., van Steen, M.: Wikipedia workload analysis for decentralized hosting. Comput. Netw. 53(11), 1830–1845 (2009)
Article Google Scholar
Wei, Y., Kudenko, D., Liu, S., Pan, L., Wu, L., Meng, X.: A reinforcement learning based auto-scaling approach for SaaS providers in dynamic cloud environment. Math. Prob. Eng. 2019, 11 p. (2019). Article ID 5080647. https://doi.org/10.1155/2019/5080647
Xu, H., Liu, Y., Lau, W.C., Zeng, T., Guo, J., Liu, A.X.: Online resource allocation with machine variability: a bandit perspective. IEEE/ACM Trans. Networking 28(5), 2243–2256 (2020). https://doi.org/10.1109/TNET.2020.3006906

Download references

Author information

Authors and Affiliations

Orange Labs, Lannion, France
David Delande, Raphaël Feraud & André Bottaro
IRIT, Université de Toulouse, 31062, Toulouse, France
David Delande, Patricia Stolf & Jean-Marc Pierson

Authors

David Delande
View author publications
You can also search for this author in PubMed Google Scholar
Patricia Stolf
View author publications
You can also search for this author in PubMed Google Scholar
Raphaël Feraud
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Marc Pierson
View author publications
You can also search for this author in PubMed Google Scholar
André Bottaro
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to David Delande .

Editor information

Editors and Affiliations

Universidade de Lisboa, Lisbon, Portugal
Leonel Sousa
Universidade de Lisboa, Lisbon, Portugal
Nuno Roma
Universidade de Lisboa, Lisbon, Portugal
Pedro Tomás

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Delande, D., Stolf, P., Feraud, R., Pierson, JM., Bottaro, A. (2021). Horizontal Scaling in Cloud Using Contextual Bandits. In: Sousa, L., Roma, N., Tomás, P. (eds) Euro-Par 2021: Parallel Processing. Euro-Par 2021. Lecture Notes in Computer Science(), vol 12820. Springer, Cham. https://doi.org/10.1007/978-3-030-85665-6_18

Download citation

DOI: https://doi.org/10.1007/978-3-030-85665-6_18
Published: 25 August 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-85664-9
Online ISBN: 978-3-030-85665-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Horizontal Scaling in Cloud Using Contextual Bandits

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

An Auto-Scaling Cloud Controller Using Fuzzy Q-Learning - Implementation in OpenStack

CloudAIBus: a testbed for AI based cloud computing environments

Enhancing Machine Learning-Based Autoscaling for Cloud Resource Orchestration

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Horizontal Scaling in Cloud Using Contextual Bandits

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

An Auto-Scaling Cloud Controller Using Fuzzy Q-Learning - Implementation in OpenStack

CloudAIBus: a testbed for AI based cloud computing environments

Enhancing Machine Learning-Based Autoscaling for Cloud Resource Orchestration

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation