Abstract
One characteristic of the Cloud is elasticity: it provides the ability to adapt resources allocated to applications as needed at runtime. This capacity relies on scaling and scheduling. In this article online horizontal scaling is studied. The aim is to determine dynamically applications deployment parameters and to adjust them in order to respect a Quality of Service level without any human parameters tuning. This work focuses on CaaS (container-based) environments and proposes an algorithm based on contextual bandits (HSLinUCB). Our proposal has been evaluated on a simulated platform and on a real Kubernetes’s platform. The comparison has been done with several baselines: threshold based auto-scaler, Q-Learning, and Deep Q-Learning. The results show that HSLinUCB gives very good results compared to other baselines, even when used without any training period.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
References
Abbasi-Yadkori, Y., Pal, D., Szepesvari, C.: Improved algorithms for linearstochastic bandits. In: NIPS (2011)
Abdullah, M., Iqbal, W., Bukhari, F.: Containers vs virtual machines for auto-scaling multi-tier applications under dynamically increasing workloads. In: Intelligent Technologies and Applications (2019)
Al-Dhuraibi, Y., Paraiso, F., Djarallah, N., Merle, P.: Elasticity in cloud computing: State of the art and research challenges. IEEE Trans. Serv, Comput. 11(2), 430–447 (2018). https://doi.org/10.1109/TSC.2017.2711009
Ayimba, C., Casari, P., Mancuso, V.: SQLR: short term memory q-learning for elastic provisioning. CoRR (2019)
Barrett, E., Howley, E., Duggan, J.: Applying reinforcement learning towards automating resource allocation and application scalability in the cloud. Concurr. Comput. Pract, Exp 25(12), 1656–1674 (2013)
Cano, I., et al.: ADARES: Adaptive resource management for virtual machines. arXiv (2018)
Coutinho, E.F., de Carvalho Sousa, F.R., Rego, P.A.L., Gomes, D.G., de Souza, J.N.: Elasticity in cloud computing: a survey. annals of telecommunications - annales des télécommunications, pp. 289–309 (2014). https://doi.org/10.1007/s12243-014-0450-7
Dutreilh, X., Kirgizov, S., Melekhova, O., Malenfant, J., Rivierre, N., Truck, I.: Using reinforcement learning for autonomic resource allocation in clouds: Towards a fully automated workflow. In: ICAS (2011)
Gari, Y., Monge, D.A., Pacini, E., Mateos, C., Garino, C.G.: Reinforcement learning-based autoscaling of workflows in the cloud: A survey. CoRR (2020)
Hwang, K., Bai, X., Shi, Y., Li, M., Chen, W., Wu, Y.: Cloud performance modeling with benchmark evaluation of elastic scaling strategies. IEEE Trans. Parallel Distrib. Syst. 27(1), 130–143 (2016)
Jin, Y., Bouzid, M., Kostadinov, D., Aghasaryan, A.: Model-free resource management of cloud-based applications using reinforcement learning. In: ICIN (2018)
Khatua, S., Ghosh, A., Mukherjee, N.: Optimizing the utilization of virtual resources in cloud environment. In: VECIMS (2010)
Li, L., Chu, W., Langford, J., Schapire, R.E.: A contextual-bandit approach to personalized news article recommendation. In: WWW (2010)
Lorido-Botran, T., Miguel-Alonso, J., Lozano, J.A.: A review of auto-scaling techniques for elastic applications in cloud environments. J. Grid Comput. 1–34 (2014). https://doi.org/10.1007/s10723-014-9314-7
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Nguyen, H., Shen, Z., Gu, X., Subbiah, S., Wilkes, J.: AGILE: Elastic distributed resource scaling for infrastructure-as-a-service. In: ICAC (2013)
Nikravesh, A.Y., Ajila, S.A., Lung, C.: Towards an autonomic auto-scaling prediction system for cloud resource provisioning. In: SEAMS (2015)
Pascual, J.A., Lozano, J.A., Miguel-Alonso, J.: Effects of reducing VMs management times on elastic applications. J. Grid Comput. 518(7540), 529–533 (2018)
Qu, C., Calheiros, R.N., Buyya, R.: Auto-scaling web applications in clouds: a taxonomy and survey. ACM Comput. Surv. 51(4), 1–33 (2018)
Schuler, L., Jamil, S., Kühl, N.: AI-based resource allocation: Reinforcement learning for adaptive auto-scaling in serverless environments. arXiv (2020)
Shariffdeen, R.S., Munasinghe, D.T.S.P., Bhathiya, H.S., Bandara, U.K.J.U., Bandara, H.M.N.D.: Adaptive workload prediction for proactive auto scaling in PaaS systems. In: CloudTech (2016)
Singh, P., Gupta, P., Jyoti, K., Nayyar, A.: Research on auto-scaling of web applications in cloud: Survey, trends and future directions. Pract. Experience Scalable Comput. 20(2), 399–432 (2019)
Sutton, R.S., Barto, A.G.: Introduction to Reinforcement Learning. MIT Press, Cambridge (1998)
Tadakamalla, V., Menasce, D.A.: Model-driven elasticity control for multi-server queues under traffic surges in cloud environments. In: ICAC (2018)
Toslali, M., Parthasarathy, S., Oliveira, F., Coskun, A.K.: JACKPOT: Online experimentation of cloud microservices. In: HotCloud (2020)
Urdaneta, G., Pierre, G., van Steen, M.: Wikipedia workload analysis for decentralized hosting. Comput. Netw. 53(11), 1830–1845 (2009)
Wei, Y., Kudenko, D., Liu, S., Pan, L., Wu, L., Meng, X.: A reinforcement learning based auto-scaling approach for SaaS providers in dynamic cloud environment. Math. Prob. Eng. 2019, 11 p. (2019). Article ID 5080647. https://doi.org/10.1155/2019/5080647
Xu, H., Liu, Y., Lau, W.C., Zeng, T., Guo, J., Liu, A.X.: Online resource allocation with machine variability: a bandit perspective. IEEE/ACM Trans. Networking 28(5), 2243–2256 (2020). https://doi.org/10.1109/TNET.2020.3006906
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Delande, D., Stolf, P., Feraud, R., Pierson, JM., Bottaro, A. (2021). Horizontal Scaling in Cloud Using Contextual Bandits. In: Sousa, L., Roma, N., Tomás, P. (eds) Euro-Par 2021: Parallel Processing. Euro-Par 2021. Lecture Notes in Computer Science(), vol 12820. Springer, Cham. https://doi.org/10.1007/978-3-030-85665-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-85665-6_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-85664-9
Online ISBN: 978-3-030-85665-6
eBook Packages: Computer ScienceComputer Science (R0)