Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

A Taxonomy of Performance Forecasting Systems in the Serverless Cloud Computing Environments

  • Chapter
  • First Online:
Serverless Computing: Principles and Paradigms

Abstract

The Serverless Clouds Computing environment (or platform) manages the resource management of its respective clients who generally submit their respective applications as sets of functions (tasks). A client may submit his application as a set of tasks (functions) or as a monolithic task (single function). Each set of functions (tasks) compiled in the form of Directed Acyclic Graph (DAG), where each node is a function representing a fine-grained task and each edge represents a dependency among two functions. The decisions made through performance forecasting systems (PFS) or resource forecasting engines are of immense importance to such resource management systems. However, the forecasting of future resources is a complex problem. Several of PFS projects span over several computer resources in several dimensions. The most of the PFS projects have already been designed for performance forecasting of resources on the Distributed Computing Environments such as Peer-Peer, Queue systems, Clusters, Grids, Virtual machine organizations and Cloud systems and therefore in software engineering point of view, the new code can be written to integrate their forecasting services on the Serverless (Edge) Clouds platforms. In this chapter the taxonomy for describing the PFS architecture is discussed. The taxonomy is used to classify and identify approaches which are followed in the implementation of the existing PFSs in the Distributed Computing Environments and to realise their adaptation in the Serverless (Edge) Cloud Computing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Andrzejak A, Graupner S, Plantikow S (2006) Predicting resource demand in dynamic utility computing environments. In: International conference on autonomic and autonomous systems (ICAS'06), Santa Clara, USA, July 2006

    Google Scholar 

  2. Andrzejak A, Domingues P, Silva L (2006) Predicting machine availabilities in desktop pools. In: IEEE/IFIP network operations & management symposium (NOMS 2006), Vancouver, Canada, April 2006

    Google Scholar 

  3. Aristotle 323 BC. Metaphysics

    Google Scholar 

  4. Ayodele AO, Rao J, Boult TE (2015) Performance measurement and interference profiling in multi-tenant clouds. In: 2015 IEEE 8th international conference on cloud computing, pp 941–949

    Google Scholar 

  5. Badia RM, Labarta J, Gimenez J, Escalé AF (2003) DIMEMAS: predicting MPI applications behavior in grid environments. In: Workshop on grid applications and programming tools (GGF8), June 2003

    Google Scholar 

  6. Baldini I, Chang K, Chang P, Flink S, Ishakian V, Michell N, Muthusamy V, Rabbah R, Slominsky A, Sutter P (2017) Serverless computing: current trends and open problems. Res Adv Cloud Comput

    Google Scholar 

  7. Barnes BJ, Rountree B, Lowenthal DK, Reeves J, Supinski B, Schulz M (2008) A regression-based approach to scalability prediction. In: 22nd international conference on supercomputing (ICS ‘08). ACM, Kos, Greece

    Google Scholar 

  8. Bell WH, Cameron DG, Capozza L, Millar P, Stockinger K, Zini F (2002) Design of a replica optimization service. EU Data-Grid Project, Geneva, Switzerland

    Google Scholar 

  9. Bhattacharjee A, Chhokra AD, Kang Z, Sun H, Gokhale A, Karsai G (2019) BARISTA: efficient and scalable serverless serving system for deep learning prediction services

    Google Scholar 

  10. Boza EF, Abad CL, Villavicencio M, Quimba S, Plaza JA (2017) Reserved, on demand or serverless: model-based simulations for cloud budget planning. In: 2017 IEEE second Ecuador technical chapters meeting (ETCM), pp 1–6

    Google Scholar 

  11. Calheiros RN, Masoumi E, Ranjan R, Buyya R (2015) Workload prediction using ARIMA model and its impact on cloud applications’ QoS. IEEE Trans Cloud Comput 3:449–458

    Article  Google Scholar 

  12. Carvalho M, Miceli R, Maciel Jr PD, Brasileiro F, Lopes R (2010) Predicting the quality of service of a peer-to-peer desktop grid. In: 10th IEEE/ACM international conference on cluster, cloud and grid computing (CCGrid). IEEE Comput. Soc., Melbourne, Australia

    Google Scholar 

  13. Cordingly R, Shu W, Lloid W (2020) Predicting performance and cost of serverless computing functions with SAAF. In: IEEE 6th international conference on cloud and big data computing

    Google Scholar 

  14. Coregrid 2006. Review of performance prediction models and solutions. Institute on Resource Management and Scheduling

    Google Scholar 

  15. Das A, Imai S, Patterson S, Wittie MP (2020) Performance optimization for edge-cloud serverless platforms via dynamic task placement. In: 2020 20th IEEE/ACM international symposium on cluster, cloud and internet computing (CCGRID), 11–14 May 2020, pp 41–50

    Google Scholar 

  16. Das A, Leaf A, Varela CA, Patterson S (2020) Skedulix: hybrid cloud scheduling for cost-efficient execution of serverless applications. In: 2020 IEEE 13th international conference on cloud computing (CLOUD), 19–23 Oct 2020, pp 609–618

    Google Scholar 

  17. Desprez F, Quinson M, Suter F (2002) Dynamic performance forecasting for network-enabled servers in a heterogeneous environment. In: International conference on parallel & distributed processing techniques & applications (PDPTA), Las Vegas, USA

    Google Scholar 

  18. Dinda PA (1999) The statistical properties of host load. Sci Program 7:211–229

    Google Scholar 

  19. Dinda PA (2002) Online prediction of running time of tasks. Clust Comput 5:225–236

    Article  Google Scholar 

  20. Dinda PA, O’Hallaron DR (2000) Host load prediction using linear models. Clust Comput 3:265–280

    Article  Google Scholar 

  21. Dinda PA (2000) Resource signal prediction and its application to real time scheduling advisors. PhD, Carnegie Mellon University, USA

    Google Scholar 

  22. Downey AB (1997) Predicting queue times on space-sharing parallel computers. In: 11th international symposium on parallel processing, Geneva, Switzerland, 1997, pp 209–218

    Google Scholar 

  23. Downey AB, Feitelson DG (1999) The elusive goal of workload characterisation. Perform Eval Rev 26:14–29

    Article  Google Scholar 

  24. Duan R, Nadeem F, Wang J, Zhang Y, Prodan R, Fahringer T (2009) A hybrid intelligent method for performance modeling and prediction of workflow activities in grids. In: 9th IEEE/ACM international symposium on cluster computing and the grid (CCGRID ‘09), Shanghai, China

    Google Scholar 

  25. Eyupoglu C (2019) Big data in cloud computing and Internet of Things. In: 2019 3rd international symposium on multidisciplinary studies and innovative technologies (ISMSIT), 11–13 Oct 2019, pp 1–5

    Google Scholar 

  26. Faerman M, Su A, Wolski R, Berman F (1999) Adaptive performance prediction for distributed data-intensive applications. In: ACM/IEEE international conference on super computing, Portland, OR, USA, 1999 (CDROM), Article No. 36

    Google Scholar 

  27. Fahringer T, Jugravu A, Pllana S, Prodan R, Seragiotto C Jr, Truong HL (2005) ASKALON: a tool set for cluster and grid computing. Concurr Comput: Pract Exp 17:143–169

    Article  Google Scholar 

  28. Farley B, Juels A, Varadarajan V, Ristenpart T, Bowers KD, Swift MM (2012) More for your money: exploiting performance heterogeneity in public clouds. In: Proceedings of the third ACM symposium on cloud computing. Association for Computing Machinery, San Jose, CA

    Google Scholar 

  29. Feitelson DG (2002) Workload modelling for performance evaluation. School of Computer Science & Engineering, Hebrew University, Jerusalem, Israel

    Google Scholar 

  30. Feng L, Kudva P, Silva DD, Hu J (2018) Exploring serverless computing for neural network training. In: 2018 IEEE 11th international conference on cloud computing (CLOUD), 2–7 July 2018, pp 334–341

    Google Scholar 

  31. Fotouhi M, Chen D, Lloyd WJ (2019) Function-as-a-Service application service composition: implications for a natural language processing application. In: Proceedings of the 5th international workshop on serverless computing. Association for Computing Machinery, Davis, CA, USA

    Google Scholar 

  32. Glimcher L, Agrawal G (2007) A performance prediction framework for grid-based data mining application. In: International parallel and disributed processing symposium (IPDPS)

    Google Scholar 

  33. Gruber R, Tran TM (2004) Parameterisation to Tailor commodity clusters to applications. EPFL Supercomput Rev 14:12–17

    MATH  Google Scholar 

  34. Gunasekaran JR, Thinakaran P, Kandemir MT, Urgaonkar B, Kesidis G, Das C (2019) Spock: exploiting serverless functions for SLO and cost aware resource procurement in public cloud. In: 2019 IEEE 12th international conference on cloud computing (CLOUD), 8–13 July 2019, pp 199–208

    Google Scholar 

  35. Gunasekaran JR, Thinakaran P, Nachiappan NC, Kandemir MT, Das CR (2020) Fifer: tackling resource underutilization in the serverless era. In: Proceedings of the 21st international middleware conference. Association for Computing Machinery, Delft, Netherlands

    Google Scholar 

  36. Hendrickson S, Sturdevant S, Harter T, Venkataramani V, Arpaci-Dusseau AC, Arpaci-Dusseau RH (2016) Serverless computation with openLambda. In: Proceedings of the 8th USENIX conference on hot topics in cloud computing. USENIX Association, Denver, CO

    Google Scholar 

  37. Hsieh S-Y, Liu C-S, Buyya R, Zomaya AY (2020) Utilization-prediction-aware virtual machine consolidation approach for energy-efficient cloud data centers. J Parallel Distrib Comput 139:99–109

    Article  Google Scholar 

  38. Jonas E, Schleier-Smith J, Sreekanti V, Tsai C-C, Khandelwal A, Pu Q, Shankar V, Carreira J, Krauth K, Yadwadkar NJ, Gonzalez J, Popa RA, Stoica I, Patterson DA (2019) Cloud programming simplified: a Berkeley view on serverless computing. arXiv:1902.03383

  39. Kim YK, Hoseinyfarahabady MR, Lee YC, Zomaya AY (2020) Automated fine-grained CPU cap control in serverless computing platform. IEEE Trans Parallel Distrib Syst 31:2289–2301

    Article  Google Scholar 

  40. Kumar J, Singh AK, Buyya R (2020) Ensemble learning based predictive framework for virtual machine resource request prediction. Neurocomputing 397:20–30

    Article  Google Scholar 

  41. Kurowski K, Oleksiak A, Nabrzyski J, Kwiecien A, Wojtkiewicz M, Dyczkowski M, Guim F, Corbalan J, Labarta J (2005) Multi-criteria grid resource management using performance prediction techniques. In: CoreGrid integration workshop, Pisa, Italy, Nov 2005

    Google Scholar 

  42. Li H, Groep D, Wolters L (2007) Mining performance data for metascheduling decision support in the grid. Futur Gener Comput Syst 23:92–99

    Article  Google Scholar 

  43. Li H, Sun J, Sun BL (2009) Financial distress prediction based on OR-CBR in the principle of K-nearest neighbors. Expert Syst Appl 36:643–659

    Article  Google Scholar 

  44. Lloyd W, Ramesh S, Chinthalapati S, Ly L, Pallickara S (2018) Serverless computing: an investigation of factors influencing microservice performance. In: IEEE international conference on cloud engineering (IC2E 2018)

    Google Scholar 

  45. Mahgoub AY, Shankar K, Mitra S, Klimovic A, Chaterji S, Bagchi S (2021) SONIC: application-aware data passing for chained serverless applications. In: USENIX annual technical conference, 2021

    Google Scholar 

  46. Mahmoudi N, Khazaei H (2020) Performance modeling of serverless computing platforms. IEEE Trans Cloud Comput 10:2834–2847

    Google Scholar 

  47. Malawski M, Gajek A, Zima A, Balis B, Figiela K (2020) Serverless execution of scientific workflows: experiments with HyperFlow, AWS Lambda and Google Cloud Functions. Future Gener Comput Syst 110:502–514

    Article  Google Scholar 

  48. Matsunaga A, Fortes JAB (2010) On the use of machine learning to predict the time and resources consumed by applications. In: 10th IEEE/ACM international conference on cluster, cloud and grid computing (CCGRID). IEEE Comput. Soc., Melbourne, VIC, Australia

    Google Scholar 

  49. Minh TN, Wolters L (2010) Using historical data to predict application runtimes on backfilling parallel systems. In: 18th Euromicro conference on parallel, distributed and network-based processing (PDP ‘10). IEEE Comp. Soc., Pisa, Italy

    Google Scholar 

  50. Nikravesh AY, Ajila SA, Lung C-H (2017) An autonomic prediction suite for cloud resource provisioning. J Cloud Comput 6:3

    Article  Google Scholar 

  51. Nudd GR, Kerbyson DJ, Panaefstathiou E, Perry SC, Harper JS, Ewilcox DV (2000) Pace—a toolset for the performance prediction of parallel and distributed systems. High Perform Comput Appl 14:228–252

    Article  Google Scholar 

  52. Nurmi DC, Brevik J, Wolski R (2007) QBETS: queue bounds estimation from time series. In: SIGMETRICS ‘07. International conference on measurement & modeling of computer systems. Springer, San Diego, CA, USA

    Google Scholar 

  53. Oakes E, Yang L, Houck K, Harter T, Arpaci-Dusseau AC, Arpaci-Dusseau RH (2017) Pipsqueak: lean lambdas with large libraries. In: IEEE 37th international conference on distributed computing systems workshops (ICDCSW 2017)

    Google Scholar 

  54. Ou Z, Zhuang H, Lukyanenko A, Nurminen JK, Hui P, Mazalov V, Ylä-Jääski A (2013) Is the same instance type created equal? Exploiting heterogeneity of public clouds. IEEE Trans Cloud Comput 1:201–214

    Article  Google Scholar 

  55. Palmer N, Sherman M, Wang Y, Just S (2015) Scaling to build the consolidated audit trail: a financial services application of Google Cloud Bigtable

    Google Scholar 

  56. Prodan R (2007) Specification and runtime workflow support in the ASKALON grid environment. Sci Program 15:193–211

    Google Scholar 

  57. Rehman MS, Sakr MF (2010) Initial findings for provisioning variation in cloud computing. In: 2010 IEEE second international conference on cloud computing technology and science, 30 Nov–3 Dec 2010, pp 473–479

    Google Scholar 

  58. Rodero I, Guim F, Corbalán J, Labarta J (2005) eNANOS: coordinated scheduling in grid environments. In: Parallel computing: current & future issues of high-end computing, Parco, 2005

    Google Scholar 

  59. Sanjay HA, Vadhiyar S (2008) Performance modeling of parallel applications for grid scheduling. Parallel Distrib Comput 68:1135–1145

    Article  MATH  Google Scholar 

  60. Sanjay HA, Vadhiyar S (2009) A strategy for scheduling tightly-coupled parallel applications on clusters. Concurr Comput: Pract Exp 21:2491–2517

    Article  Google Scholar 

  61. Schad J, Dittrich J, Quiané-Ruiz J-A (2010) Runtime measurements in the cloud: observing, analyzing, and reducing variance. Proc VLDB Endow 3:460–471

    Article  Google Scholar 

  62. Scheuner J, Leitner P (2020) Function-as-a-Service performance evaluation: a multivocal literature review. J Syst Softw 170:110708

    Article  Google Scholar 

  63. Seneviratne S, Levy DC, Hong W, De Silva LC, Hu J (2021) Introduction of the new Os kernel internals for the new metrics for the performance prediction on the distributed computing environments. In: Petrova VM (ed) Advances in engineering research. Nova Science and Technology, New York

    Google Scholar 

  64. Seneviratne S (2009) A framework for load profile prediction for grid computing. PhD, Sydney University

    Google Scholar 

  65. Seneviratne S, Levy D, Rajkumar B (2015) Taxonomy of performance prediction systems for parallel and distributed computing systems. In: Barbosa JG (ed) Grid computing techniques and future prospects. Nova Science Publishers, New York

    Google Scholar 

  66. Seneviratne S, Levy DC (2011) Task profiling model for load profile prediction. Futur Gener Comput Syst 27:245–255

    Article  Google Scholar 

  67. Seneviratne S, Witharana S, Toosi AN (2019) Adapting the machine learning grid prediction models for forecasting of resources on the clouds. In: 2019 advances in science and engineering technology international conferences (ASET), Dubai, United Arab Emirates

    Google Scholar 

  68. Singhvi A, Houck K, Balasubramanian A, Shaikh MD, Venkataraman S, Akella A (2019) Archipelago: a scalable low-latency serverless platform. arXiv:1911.09849

    Google Scholar 

  69. Smith W, Foster I, Taylor V (2004) Predicting application run times with historical information. Parallel Distrib Comput 64:1007–1016

    Article  MATH  Google Scholar 

  70. Smith W, Taylor V, Foster I (1999) Using runtime predictions to estimate queue wait times and improve scheduler performance. In: International workshop on job scheduling strategies for parallel processing, San Juan, Puerto Rico, 1999. Springer, pp 202–219

    Google Scholar 

  71. Sodhi S, Subhlok J, Xu Q (2008) Performance prediction with skeletons. Clust Comput 11:151–165

    Article  Google Scholar 

  72. Song B (2005) Workload modelling for parallel computers. PhD, University of Dortmund

    Google Scholar 

  73. Song B, Ernemann C, Yahyapour R (2004) Parallel computer workload modelling with Markov chains. In: International conference on job schdeduling strategies for parallel processing. Springer, NY, USA

    Google Scholar 

  74. Spillner J, Mateos C, Monge DA (2018) FaaSter, Better, Cheaper: the prospect of serverless scientific computing and HPC. In: High performance computing. Springer, Cham, pp 154–168

    Google Scholar 

  75. Steffenel LA (2005) LaPIe: Communications Collectives Adaptées aux Grilles de Calcul. PhD, INPG

    Google Scholar 

  76. Steffenel LA, Mounie G (2008) A framework for adaptive collective communications for heterogeneous hierarchical computing systems. Comput Syst Sci 74:1082–1093

    Article  MathSciNet  MATH  Google Scholar 

  77. Stockinger K, Stockinger H, Dutka L, Slota R, Nikolow D, Kitowski J (2003) Access cost estimation for unified grid storage systems. In: Fourth international workshop on grid computing (GRID03), Phoenix, AZ, USA

    Google Scholar 

  78. Ullah QZ, Shahzad H, Khan GM (2017) Adaptive resource utilization prediction system for infrastructure as a service cloud. Comput Intell Neurosci 2017

    Google Scholar 

  79. Vazhkudai S, Schopf JM (2003) Using regression techniques to predict large data transfers. High Perform Comput Appl 17:249–268

    Article  Google Scholar 

  80. Verboven S, Hellinckx P, Arickx F, Broeckhove J (2008) Runtime prediction based grid scheduling of parameter sweep jobs. In: IEEE international conference of Asia-Pacific services computing (APSCC), Yilan, Taiwan

    Google Scholar 

  81. Verma M, Gangadharan GR, Narendra NC, Ravi V, Inamdar V, Ramachandran L, Calheiros RN, Buyya R (2016) Dynamic resource demand prediction and allocation in multi-tenant service clouds. Concurr Comput: Pract Exp 28:4429–4442

    Article  Google Scholar 

  82. Villamizar M, Garcés O, Ochoa L, Castro H, Salamanca L, Verano M, Casallas R, Gil S, Valencia C, Zambrano A, Lang M (2016) Infrastructure cost comparison of running web applications in the cloud using AWS lambda and monolithic and microservice architectures. In: 2016 16th IEEE/ACM international symposium on cluster, cloud and grid computing (CCGrid), 16–19 May 2016, pp 179–182

    Google Scholar 

  83. Wang L, Zhang M, Li Y, Ristenpart T, Swift M (2018) Peeking behind the curtains of serverless platforms. In: USENIX annual technical confeference

    Google Scholar 

  84. Wolski R (1998) Dynamically forecasting network performance using the network weather service. Clust Comput 1:119–132

    Article  Google Scholar 

  85. Wolski R, Spring N, Hayes J (2000) Predicting the CPU availability of time-shared Unix systems on the computational grid. Clust Comput 3:293–301

    Article  Google Scholar 

  86. Wolski R, Spring N, Hayes J (1999) The network weather service: a distributed resource performance forecasting service for metacomputing. Future Gener Comput Syst 15:757–768

    Article  Google Scholar 

  87. Wu Y, Hwang K, Yuan Y, Zheng W (2010) Adaptive workload prediction of grid performance in confidence windows. IEEE Trans Parallel Distrib Syst 21:925–938

    Article  Google Scholar 

  88. Yan M, Castro P, Cheng P, Ishakian V (2016) Building a chatbot with serverless computing. In: Proceedings of the 1st international workshop on mashups of things and APIs

    Google Scholar 

  89. Yang L, Schopf JM, Foster I (2003) Homeostatic and tendency-based CPU load predictions. In: 17th international parallel and distributed processing symposium (IPDPS 2003), Los Alamitos, CA, USA, 2003. IEEE CD-ROM, p 9

    Google Scholar 

  90. Yang L, Schopf JM, Foster I (2005) Improving parallel data transfer times using predicted variances in shared networks. In: Fifth IEEE international symposium on cluster computing and the grid (CCGRID05). IEEE Computer Society, Washington, DC, USA

    Google Scholar 

  91. Zhang M, Krintz C, Wolski R (2021) Edge-adaptable serverless acceleration for machine learning Internet of Things applications. Softw: Pract Exp 51:1852–1867

    Google Scholar 

Download references

Acknowledgements

We thank Prof David Abramson of the Queensland University (who was the former head of the school of Computer Science & Engineering, Monash University) and Redmond Barry Distinguished Prof Rajkumar Buyya of the Melbourne University for their invaluable guidance to Dr Sena Seneviratne in the fields of the Distributed Computer Systems.

We acknowledge that some of the classification methods which are applied on the prediction algorithms of the Serverless Cloud Computing in this chapter had been previously used by Dr Sena Seneviratne in the field of Grid computing in the book chapter namely “Taxonomy of Performance Prediction Systems for Parallel and Distributed Computing Systems” which was published in 2015, In: BARBOSA, J. G. (ed.) Grid Computing Techniques and Future Prospects. New York: Nova Science Publishers. We acknowledge that he had submitted to ARXIV repository a similar article namely “A Taxonomy of Performance Prediction Systems in the Parallel and Distributed Computing Grids”. Further, we have used the same classification techniques in the paper namely “Taxonomy & Survey of Performance Prediction Systems For the Distributed Systems Including the Clouds” which was presented by him at 2021 IEEE International Conference CPSCom which was held in Melbourne, Australia. Further, we acknowledge that some of the Machine Learning Algorithms which are contained in this chapter had been discussed by him in the paper namely “Adapting the Machine Learning Grid Prediction Models for Forecasting of Resources on the Clouds” and presented at the 2019 IEEE conference namely “Advances in Science and Engineering Technology” which was held in Dubai, United Arab Emirates.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sena Seneviratne .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Seneviratne, S., Levy, D.C., De Silva, L.C. (2023). A Taxonomy of Performance Forecasting Systems in the Serverless Cloud Computing Environments. In: Krishnamurthi, R., Kumar, A., Gill, S.S., Buyya, R. (eds) Serverless Computing: Principles and Paradigms. Lecture Notes on Data Engineering and Communications Technologies, vol 162. Springer, Cham. https://doi.org/10.1007/978-3-031-26633-1_4

Download citation

Publish with us

Policies and ethics