Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3620678.3624783acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads

Published: 31 October 2023 Publication History

Abstract

This paper releases and analyzes two new Huawei cloud serverless traces. The traces span a period of over 7 months with over 1.4 trillion function invocations combined. The first trace is derived from Huawei's internal workloads and contains detailed per-second statistics for 200 functions running across multiple Huawei cloud data centers. The second trace is a representative workload from Huawei's public FaaS platform. This trace contains per-minute arrival rates for over 5000 functions running in a single Huawei data center. We present the internals of a production FaaS platform by characterizing resource consumption, cold-start times, programming languages used, periodicity, per-second versus per-minute burstiness, correlations, and popularity. Our findings show that there is considerable diversity in how serverless functions behave: requests vary by up to 9 orders of magnitude across functions, with some functions executed over 1 billion times per day; scheduling time, execution time and cold-start distributions vary across 2 to 4 orders of magnitude and have very long tails; and function invocation counts demonstrate strong periodicity for many individual functions and on an aggregate level. Our analysis also highlights the need for further research in estimating resource reservations and time-series prediction to account for the huge diversity in how serverless functions behave.

References

[1]
Alexandru Agache, Marc Brooker, Alexandra Iordache, Anthony Liguori, Rolf Neugebauer, Phil Piwonka, and Diana-Maria Popa. 2020. Firecracker: Lightweight Virtualization for Serverless Applications. In NSDI, Vol. 20. 419--434.
[2]
Ahmed Ali-Eldin, Oleg Seleznjev, Sara Sjöstedt-de Luna, Johan Tordsson, and Erik Elmroth. 2014. Measuring cloud workload burstiness. In 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing. IEEE, 566--572.
[3]
Apache OpenWhisk. 2016. Open Source Serverless Cloud Platform. https://openwhisk.apache.org/.
[4]
Martin F Arlitt and Carey L Williamson. 1996. Web server workload characterization: The search for invariants. ACM SIGMETRICS Performance Evaluation Review 24, 1 (1996), 126--137.
[5]
Berk Atikoglu, Yuehai Xu, Eitan Frachtenberg, Song Jiang, and Mike Paleczny. 2012. Workload analysis of a large-scale key-value store. In Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems. 53--64.
[6]
Konstantinos Benidis, Syama Sundar Rangapuram, Valentin Flunkert, Yuyang Wang, Danielle Maddix, Caner Turkmen, Jan Gasthaus, Michael Bohlke-Schneider, David Salinas, Lorenzo Stella, François-Xavier Aubet, Laurent Callot, and Tim Januschowski. 2022. Deep Learning for Time Series Forecasting: Tutorial and Literature Survey. ACM Comput. Surv. 55, 6, Article 121 (dec 2022), 36 pages. https://doi.org/10.1145/3533382
[7]
Benjamin Carver, Jingyuan Zhang, Ao Wang, Ali Anwar, Panruo Wu, and Yue Cheng. 2020. Wukong: A Scalable and Locality-Enhanced Framework for Serverless Parallel Computing. In Proceedings of the 11th ACM Symposium on Cloud Computing (Virtual Event, USA) (SoCC '20). Association for Computing Machinery, New York, NY, USA, 1--15. https://doi.org/10.1145/3419111.3421286
[8]
Cristian Challu, Kin G Olivares, Boris N Oreshkin, Federico Garza, Max Mergenthaler-Canseco, and Artur Dubrawski. 2022. N-hits: Neural hierarchical interpolation for time series forecasting. arXiv preprint arXiv:2201.12886 (2022).
[9]
Eli Cortez, Anand Bonde, Alexandre Muzio, Mark Russinovich, Marcus Fontoura, and Ricardo Bianchini. 2017. Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms. In Proceedings of the 26th Symposium on Operating Systems Principles (Shanghai, China) (SOSP '17). Association for Computing Machinery, New York, NY, USA, 153--167. https://doi.org/10.1145/3132747.3132772
[10]
Mark E Crovella and Azer Bestavros. 1997. Self-similarity in World Wide Web traffic: Evidence and possible causes. IEEE/ACM Transactions on networking 5, 6 (1997), 835--846.
[11]
Abhimanyu Das, Weihao Kong, Andrew Leach, Shaan K Mathur, Rajat Sen, and Rose Yu. 2023. Long-term Forecasting with TiDE: Time-series Dense Encoder. Transactions on Machine Learning Research (2023). https://openreview.net/forum?id=pCbC3aQB5W
[12]
Nilanjan Daw, Umesh Bellur, and Purushottam Kulkarni. 2020. Xanadu: Mitigating cascading cold starts in serverless function chain deployments. In Proceedings of the 21st International Middleware Conference. 356--370.
[13]
Dror G Feitelson. 2015. Workload modeling for computer systems performance evaluation. Cambridge University Press.
[14]
Joseph M. Hellerstein, Jose M. Faleiro, Joseph Gonzalez, Johann Schleier-Smith, Vikram Sreekanti, Alexey Tumanov, and Chenggang Wu. 2019. Serverless Computing: One Step Forward, Two Steps Back. In 9th Biennial Conference on Innovative Data Systems Research, CIDR 2019, Asilomar, CA, USA, January 13-16, 2019, Online Proceedings. www.cidrdb.org. http://cidrdb.org/cidr2019/papers/p119-hellerstein-cidr19.pdf
[15]
Julien Herzen, Francesco Lässig, Samuele Giuliano Piazzetta, Thomas Neuer, Léo Tafti, Guillaume Raille, Tomas Van Pottelbergh, Marek Pasieka, Andrzej Skrodzki, Nicolas Huguenin, et al. 2022. Darts: User-friendly modern machine learning for time series. The Journal of Machine Learning Research 23, 1 (2022), 5442--5447.
[16]
Hansika Hewamalage, Christoph Bergmeir, and Kasun Bandara. 2022. Global models for time series forecasting: A Simulation study. Pattern Recognition 124 (2022), 108441. https://doi.org/10.1016/j.patcog.2021.108441
[17]
Abhinav Jangda, Donald Pinckney, Yuriy Brun, and Arjun Guha. 2019. Formal foundations of serverless computing. Proceedings of the ACM on Programming Languages 3, OOPSLA (2019), 1--26.
[18]
Knative. 2020. Knative Serverless Platform. https://knative.dev/docs/.
[19]
"AWS Lambda". 2020. Coca-Cola Freestyle Launches Touchless Fountain Experience in 100 Days Using AWS Lambda. https://aws.amazon.com/solutions/case-studies/coca-cola-freestyle/.
[20]
Zijun Li, Linsong Guo, Quan Chen, Jiagan Cheng, Chuhao Xu, Deze Zeng, Zhuo Song, Tao Ma, Yong Yang, Chao Li, et al. 2022. Help Rather Than Recycle: Alleviating Cold Startup in Serverless Computing Through Inter-Function Container Sharing. In 2022 USENIX Annual Technical Conference (USENIX ATC 22). 69--84.
[21]
Ashraf Mahgoub, Edgardo Barsallo Yi, Karthick Shankar, Eshaan Minocha, Sameh Elnikety, Saurabh Bagchi, and Somali Chaterji. 2022. Wisefuse: Workload characterization and dag transformation for serverless workflows. Proceedings of the ACM on Measurement and Analysis of Computing Systems 6, 2 (2022), 1--28.
[22]
Frank San Miguel. 2021. The Netflix Cosmos Platform. https://netflixtechblog.com/the-netflix-cosmos-platform-35c14d9351ad.
[23]
Yuqi Nie, Nam H. Nguyen, Phanwadee Sinthong, and Jayant Kalagnanam. 2023. A Time Series is Worth 64 Words: Long-term Forecasting with Transformers. In International Conference on Learning Representations.
[24]
Openshift. 2021. OpenShift Serverless overview. https://docs.openshift.com/serverless/1.29/about/about-serverless.html.
[25]
Vladislav Petkov, Ram Rajagopal, and Katia Obraczka. 2013. Characterizing per-application network traffic using entropy. ACM Transactions on Modeling and Computer Simulation (TOMACS) 23, 2 (2013), 1--25.
[26]
"Microsoft Research". 2019. Azure Public Dataset. https://github.com/Azure/AzurePublicDataset/tree/master.
[27]
Joshua S Richman, Douglas E Lake, and J Randall Moorman. 2004. Sample entropy. In Methods in enzymology. Vol. 384. Elsevier, 172--184.
[28]
J Riihijarvi, Matthias Wellens, and P Mahonen. 2009. Measuring complexity and predictability in networks with multiscale entropy analysis. In IEEE INFOCOM 2009. IEEE, 1107--1115.
[29]
Mohammad Shahrad, Rodrigo Fonseca, Inigo Goiri, Gohar Chaudhry, Paul Batum, Jason Cooke, Eduardo Laureano, Colby Tresness, Mark Russinovich, and Ricardo Bianchini. 2020. Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider. In 2020 USENIX Annual Technical Conference (USENIX ATC 20). USENIX Association, 205--218. https://www.usenix.org/conference/atc20/presentation/shahrad
[30]
Zhiming Shen, Qin Jia, Gur-Eyal Sela, Ben Rainero, Weijia Song, Robbert van Renesse, and Hakim Weatherspoon. 2016. Follow the sun through the clouds: Application migration for geographically shifting workloads. In Proceedings of the Seventh ACM Symposium on Cloud Computing. 141--154.
[31]
Shelby Thomas, Lixiang Ao, Geoffrey M Voelker, and George Porter. 2020. Particle: ephemeral endpoints for serverless networking. In Proceedings of the 11th ACM Symposium on Cloud Computing. 16--29.
[32]
Muhammad Tirmazi, Adam Barker, Nan Deng, Md Ehtesam Haque, Zhijing Gene Qin, Steven Hand, Mor Harchol-Balter, and John Wilkes. 2020. Borg: the Next Generation. In EuroSys'20. Heraklion, Crete.
[33]
Oskar Triebe, Hansika Hewamalage, Polina Pilyugina, Nikolay Laptev, Christoph Bergmeir, and Ram Rajagopal. 2021. Neuralprophet: Explainable forecasting at scale. arXiv preprint arXiv:2111.15397 (2021).
[34]
Raphael Vallat. 2018. AntroPy: entropy and complexity of (EEG) time-series in Python. https://github.com/raphaelvallat/antropy.
[35]
Abhishek Verma, Luis Pedrosa, Madhukar Korupolu, David Oppenheimer, Eric Tune, and John Wilkes. 2015. Large-scale cluster management at Google with Borg. In Proceedings of the Tenth European Conference on Computer Systems. 1--17.
[36]
Ao Wang, Shuai Chang, Huangshi Tian, Hongqi Wang, Haoran Yang, Huiba Li, Rui Du, and Yue Cheng. 2021. Faasnet: Scalable and fast provisioning of custom serverless container runtimes at alibaba cloud function compute. In 2021 USENIX Annual Technical Conference (USENIX ATC 21).
[37]
Liang Wang, Mengyuan Li, Yinqian Zhang, Thomas Ristenpart, and Michael Swift. 2018. Peeking behind the curtains of serverless platforms. In 2018 USENIX Annual Technical Conference (USENIX ATC 18). 133--146.
[38]
Qizhen Weng, Wencong Xiao, Yinghao Yu, Wei Wang, Cheng Wang, Jian He, Yong Li, Liping Zhang, Wei Lin, and Yu Ding. 2022. MLaaS in the wild: Workload analysis and scheduling in Large-Scale heterogeneous GPU clusters. In 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22). USENIX Association, 945--960.
[39]
Haixu Wu, Tengge Hu, Yong Liu, Hang Zhou, Jianmin Wang, and Mingsheng Long. 2022. TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis. arXiv preprint arXiv:2210.02186 (2022).
[40]
Hongliang Yu, Dongdong Zheng, Ben Y Zhao, and Weimin Zheng. 2006. Understanding user behavior in large-scale video-on-demand systems. ACM SIGOPS Operating Systems Review 40, 4 (2006), 333--344.
[41]
Minchen Yu, Tingjia Cao, Wei Wang, and Ruichuan Chen. 2023. Following the data, not the function: Rethinking function orchestration in serverless computing. In 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23). 1489--1504.
[42]
Ailing Zeng, Muxi Chen, Lei Zhang, and Qiang Xu. 2022. Are transformers effective for time series forecasting? arXiv preprint arXiv:2205.13504 (2022).
[43]
Yanqi Zhang, Íñigo Goiri, Gohar Irfan Chaudhry, Rodrigo Fonseca, Sameh Elnikety, Christina Delimitrou, and Ricardo Bianchini. 2021. Faster and cheaper serverless computing on harvested resources. In Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles. 724--739.

Cited By

View all
  • (2024)YuanRong: A Production General-purpose Serverless System for Distributed Applications in the CloudProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672216(843-859)Online publication date: 4-Aug-2024
  • (2024)Serverless Confidential Containers: Challenges and OpportunitiesProceedings of the 2nd Workshop on SErverless Systems, Applications and MEthodologies10.1145/3642977.3652097(32-40)Online publication date: 22-Apr-2024
  • (2024)Serverless? RISC more!Proceedings of the 2nd Workshop on SErverless Systems, Applications and MEthodologies10.1145/3642977.3652095(15-24)Online publication date: 22-Apr-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SoCC '23: Proceedings of the 2023 ACM Symposium on Cloud Computing
October 2023
624 pages
ISBN:9798400703874
DOI:10.1145/3620678
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 October 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cloud
  2. datasets
  3. neural networks
  4. serverless
  5. time series

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SoCC '23
Sponsor:
SoCC '23: ACM Symposium on Cloud Computing
October 30 - November 1, 2023
CA, Santa Cruz, USA

Acceptance Rates

Overall Acceptance Rate 169 of 722 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1,036
  • Downloads (Last 6 weeks)85
Reflects downloads up to 14 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)YuanRong: A Production General-purpose Serverless System for Distributed Applications in the CloudProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672216(843-859)Online publication date: 4-Aug-2024
  • (2024)Serverless Confidential Containers: Challenges and OpportunitiesProceedings of the 2nd Workshop on SErverless Systems, Applications and MEthodologies10.1145/3642977.3652097(32-40)Online publication date: 22-Apr-2024
  • (2024)Serverless? RISC more!Proceedings of the 2nd Workshop on SErverless Systems, Applications and MEthodologies10.1145/3642977.3652095(15-24)Online publication date: 22-Apr-2024
  • (2024)Do Predictors for Resource Overcommitment Even Predict?Proceedings of the 4th Workshop on Machine Learning and Systems10.1145/3642970.3655838(153-160)Online publication date: 22-Apr-2024
  • (2024)FaaSRail: Employing Real Workloads to Generate Representative Load for Serverless ResearchProceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing10.1145/3625549.3658684(214-226)Online publication date: 3-Jun-2024
  • (2024)Incendio: Priority-Based Scheduling for Alleviating Cold Start in Serverless ComputingIEEE Transactions on Computers10.1109/TC.2024.338606373:7(1780-1794)Online publication date: Jul-2024
  • (2024)HarmonyBatch: Batching multi-SLO DNN Inference with Heterogeneous Serverless Functions2024 IEEE/ACM 32nd International Symposium on Quality of Service (IWQoS)10.1109/IWQoS61813.2024.10682915(1-10)Online publication date: 19-Jun-2024

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media