research-article

Open access

Towards Latency-Aware Linux Scheduling for Serverless Workloads

Authors:

Al Amjad Tawfiq Isstaif,

Richard MortierAuthors Info & Claims

SESAME '23: Proceedings of the 1st Workshop on SErverless Systems, Applications and MEthodologies

Pages 19 - 26

https://doi.org/10.1145/3592533.3592807

Published: 08 May 2023 Publication History

Abstract

A key principle in the design of the Linux kernel's Completely Fair Scheduler (CFS) is fairness: all running tasks receive a minimum time slice during every scheduling period, ensuring that none starve. However, this may lead to a significant number of context switches when a server is overloaded with a large number of colocated tasks, which may cause significant degradation in server performance. Unfortunately, this situation is exactly what we found when hosting serverless-style workloads which typically consist of a large number of short-lived, CPU-bound functions sharing resources. We propose modifying the Linux CFS to mitigate this problem by giving priority to the long tail of least loaded functions. These are the functions which are mostly idle and only run occasionally for a short while after being triggered unexpectedly. The large number of such functions in serverless environments means that prioritising them helps drain contended CPU run queues, reducing the total overhead due to context switching, thereby improving the performance not only of the prioritised functions but other functions as well. We implement this policy in the Linux kernel scheduler and demonstrate how it integrates well with Knative, an open source Kubernetes-based serverless framework. Given contention scenarios synthesised from real-world traces, our modified CFS can introduce a 5--30% increase in attainment of latency targets.

References

[1]

15 February 2013. Amazon EMR Serverless supports larger worker sizes to run more compute and memory-intensive workloads. https://aws.amazon.com/blogs/big-data/amazon-emr-serverless-supports-larger-worker-sizes-to-run-more-compute-and-memory-intensive-workloads/.

[2]

2023. Knative. https://https://knative.dev/.

[3]

8 February 2021. Messaging Fanout Pattern for Serverless Architectures Using Amazon SNS. https://aws.amazon.com/blogs/compute/messaging-fanout-pattern-for-serverless-architectures-using-amazon-sns/.

[4]

April 15, 2015. Load tracking in the scheduler. https://lwn.net/Articles/639543/.

[5]

August 24, 2022. For the love of god, stop using CPU limits on Kubernetes. https://home.robusta.dev/blog/stop-using-cpu-limits.

[6]

February 8, 2021. Comparing Serverless Performance for CPU Bound Tasks. https://blog.cloudflare.com/serverless-performance-with-cpu-bound-tasks/.

[7]

February 8, 2021. The burstable CFS bandwidth controller. https://lwn.net/Articles/844976/.

[8]

July 2, 2007. CFS group scheduling. https://lwn.net/Articles/240474/.

[9]

March 17, 2022. Control Group v2, The Linux Kernel Documentation. https://docs.kernel.org/admin-guide/cgroup-v2.html.

[10]

March 17, 2022. Improved response times with latency nice. https://lwn.net/Articles/887842/.

[11]

March 17, 2022. introduce cpu.headroom knob to cpu controller (Linux patch). https://lore.kernel.org/lkml/[email protected]/.

[12]

March 17, 2022. Kubernetes Resource Bin Packing, Kubernetes Documentation. https://kubernetes.io/docs/concepts/scheduling-eviction/resource-bin-packing/.

[13]

March 17, 2022. Kubernetes Vertical Pod Autoscaler. https://github.com/kubernetes/autoscaler/tree/master/vertical-pod-autoscaler.

[14]

November 26, 2019. Fixing SCHED IDLE. https://lwn.net/Articles/805317/.

[15]

Alexandru Agache, Marc Brooker, Alexandra Iordache, Anthony Liguori, Rolf Neugebauer, Phil Piwonka, and Diana-Maria Popa. 2020. Firecracker: Lightweight Virtualization for Serverless Applications. In 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20). USENIX Association, Santa Clara, CA, 419--434. https://www.usenix.org/conference/nsdi20/presentation/agache

[16]

M. Aslanpour, A. Toosi, C. Cicconetti, B. Javadi, Peter Sbarski, D. Taibi, M. Assunção, S. S. Gill, R. Gaire, and S. Dustdar. 2021. Serverless Edge Computing: Vision and Challenges. 2021 Australasian Computer Science Week Multiconference (2021).

[17]

Shuang Chen, Christina Delimitrou, and José F. Martínez. 2019. PARTIES: QoS-Aware Resource Partitioning for Multiple Interactive Services. In Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '19). Association for Computing Machinery, New York, NY, USA, 107--120.

Digital Library

[18]

Simon Eismann, Joel Scheuner, Erwin Van Eyk, Maximilian Schwinger, Johannes Grohmann, Nikolas Roman Herbst, Cristina L. Abad, and Alexandru Iosup. 2020. A Review of Serverless Use Cases and their Characteristics. ArXiv abs/2008.11110 (2020).

[19]

Yuqi Fu, Li Liu, Haoliang Wang, Yue Cheng, and Songqing Chen. 2022. SFS: smart OS scheduling for serverless functions. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC '22). IEEE Press, Dallas, Texas, 1--16.

[20]

Alexander Fuerst and Prateek Sharma. 2022. Locality-Aware Load-Balancing For Serverless Clusters. In Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing (Minneapolis, MN, USA) (HPDC '22). Association for Computing Machinery, New York, NY, USA, 227--239.

Digital Library

[21]

Arpan Gujarati, Sameh Elnikety, Yuxiong He, Kathryn S. McKinley, and Björn B. Brandenburg. 2017. Swayam: Distributed Autoscaling to Meet SLAs of Machine Learning Inference Services with Resource Efficiency. In Proceedings of the 18th ACM/IFIP/USENIX Middleware Conference (Las Vegas, Nevada) (Middleware '17). Association for Computing Machinery, New York, NY, USA, 109--120.

Digital Library

[22]

Weiwei Jia, Jiyuan Zhang, Jianchen Shan, Jing Li, and Xiaoning Ding. 2022. Achieving Low Latency in Public Edges by Hiding Workloads Mutual Interference. In Proceedings of the 13th Symposium on Cloud Computing (San Francisco, California) (SoCC '22). Association for Computing Machinery, New York, NY, USA, 477--492.

Digital Library

[23]

Kostis Kaffes, Neeraja Jayant Yadwadkar, and Christos Kozyrakis. 2021. Practical Scheduling for Real-World Serverless Computing. ArXiv abs/2111.07226 (2021).

[24]

Kostis Kaffes, Neeraja J. Yadwadkar, and Christos Kozyrakis. 2022. Hermod: principled and practical scheduling for serverless functions. In Proceedings of the 13th Symposium on Cloud Computing. ACM, San Francisco California, 289--305.

Digital Library

[25]

Jeongchul Kim and Kyungyong Lee. 2019. FunctionBench: A Suite of Workloads for Serverless Cloud Function Service. 2019 IEEE 12th International Conference on Cloud Computing (CLOUD) (2019), 502--504.

[26]

Baptiste Lepers, Redha Gouicem, Damien Carver, Jean-Pierre Lozi, Nicolas Palix, Maria-Virginia Aponte, Willy Zwaenepoel, Julien Sopena, Julia Lawall, and Gilles Muller. 2020. Provable multicore schedulers with Ipanema: application to work conservation. In Proceedings of the Fifteenth European Conference on Computer Systems (EuroSys '20). Association for Computing Machinery, New York, NY, USA, 1--16.

Digital Library

[27]

Junfeng Li, Sameer G. Kulkarni, K. K. Ramakrishnan, and Dan Li. 2019. Understanding Open Source Serverless Platforms: Design Considerations and Performance. In Proceedings of the 5th International Workshop on Serverless Computing (Davis, CA, USA) (WOSC '19). Association for Computing Machinery, New York, NY, USA, 37--42.

Digital Library

[28]

Zijun Li, Jiagan Cheng, Quan Chen, Eryu Guan, Zizheng Bian, Yi Tao, Bin Zha, Qiang Wang, Weidong Han, and Minyi Guo. 2022. {RunD}: A Lightweight Secure Container Runtime for High-density Deployment and High-concurrency Startup in Serverless Computing. 53--68. https://www.usenix.org/conference/atc22/presentation/li-zijun-rund

[29]

David Lo, Liqun Cheng, Rama Govindaraju, Parthasarathy Ranganathan, and Christos Kozyrakis. 2015. Heracles: improving resource efficiency at scale. ACM SIGARCH Computer Architecture News 43, 3S (June 2015), 450--462.

Digital Library

[30]

Jean-Pierre Lozi, Baptiste Lepers, Justin Funston, Fabien Gaud, Vivien Quéma, and Alexandra Fedorova. 2016. The Linux scheduler: a decade of wasted cores. In Proceedings of the Eleventh European Conference on Computer Systems (EuroSys '16). Association for Computing Machinery, New York, NY, USA, 1--16.

Digital Library

[31]

Ashraf Mahgoub, Edgardo Barsallo Yi, Karthick Shankar, Sameh Elnikety, Somali Chaterji, and Saurabh Bagchi. 2022. ORION and the Three Rights: Sizing, Bundling, and Prewarming for Serverless DAGs. In 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22). USENIX Association, Carlsbad, CA, 303--320. https://www.usenix.org/conference/osdi22/presentation/mahgoub

[32]

Viyom Mittal, Shixiong Qi, Ratnadeep Bhattacharya, Xiaosu Lyu, Junfeng Li, Sameer G. Kulkarni, Dan Li, Jinho Hwang, K. K. Ramakrishnan, and Timothy Wood. 2021. Mu: An Efficient, Fair and Responsive Serverless Framework for Resource-Constrained Edge Clouds. In Proceedings of the ACM Symposium on Cloud Computing (Seattle, WA, USA) (SoCC '21). Association for Computing Machinery, New York, NY, USA, 168--181.

Digital Library

[33]

Djob Mvondo, Antonio Barbalace, Alain Tchana, and Gilles Muller. 2021. Tell Me When You Are Sleepy and What May Wake You Up!. In Proceedings of the ACM Symposium on Cloud Computing (Seattle, WA, USA) (SoCC '21). Association for Computing Machinery, New York, NY, USA, 562--569.

Digital Library

[34]

Hai Nguyen, Chaojie Zhang, Zhujun Xiao, and A. Chien. 2019. Real-time Server-less: Enabling Application Performance Guarantees. Proceedings of the 5th International Workshop on Serverless Computing (2019).

Digital Library

[35]

Krzysztof Rzadca, Pawel Findeisen, Jacek Swiderski, Przemyslaw Zych, Przemyslaw Broniek, Jarek Kusmierek, Pawel Nowak, Beata Strack, Piotr Witusowski, Steven Hand, and John Wilkes. 2020. Autopilot: Workload Autoscaling at Google. In Proceedings of the Fifteenth European Conference on Computer Systems (Heraklion, Greece) (EuroSys '20). Association for Computing Machinery, New York, NY, USA, Article 16, 16 pages.

Digital Library

[36]

Arjun Singhvi, Arjun Balasubramanian, Kevin Houck, Mohammed Danish Shaikh, Shivaram Venkataraman, and Aditya Akella. 2021. Atoll: A Scalable Low-Latency Serverless Platform. In Proceedings of the ACM Symposium on Cloud Computing (Seattle, WA, USA) (SoCC '21). Association for Computing Machinery, New York, NY, USA, 138--152.

Digital Library

[37]

R. Smith. 2007. An Overview of the Tesseract OCR Engine. In Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02 (ICDAR '07). IEEE Computer Society, USA, 629--633.

Digital Library

[38]

Huangshi Tian, Suyi Li, Ao Wang, Wei Wang, Tianlong Wu, and Haoran Yang. 2022. Owl: Performance-Aware Scheduling for Resource-Efficient Function-as-a-Service Cloud. In Proceedings of the 13th Symposium on Cloud Computing (San Francisco, California) (SoCC '22). Association for Computing Machinery, New York, NY, USA, 78--93.

Digital Library

[39]

Bin Wang, A. Ali-Eldin, and P. Shenoy. 2021. LaSS: Running Latency Sensitive Serverless Computations at the Edge. Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing (2021).

Digital Library

[40]

Yanan Yang, Laiping Zhao, Yiming Li, Huanyu Zhang, Jie Li, Mingyang Zhao, Xingzhen Chen, and Keqiu Li. 2022. INFless: a native serverless system for low-latency, high-throughput inference. In Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '22). Association for Computing Machinery, New York, NY, USA, 768--781.

Digital Library

[41]

Yanqi Zhang, Íñigo Goiri, Gohar Irfan Chaudhry, Rodrigo Fonseca, Sameh Elnikety, Christina Delimitrou, and Ricardo Bianchini. 2021. Faster and Cheaper Serverless Computing on Harvested Resources. In Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles (Virtual Event, Germany) (SOSP '21). Association for Computing Machinery, New York, NY, USA, 724--739.

Digital Library

[42]

Laiping Zhao, Yanan Yang, Yiming Li, Xian Zhou, and Keqiu Li. 2021. Understanding, predicting and scheduling serverless workloads under partial interference. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. ACM, St. Louis Missouri, 1--15.

Digital Library

Cited By

Pfandzelter TDhakal AFrachtenberg EChalamalasetti SEmmot DHogade NEnriquez RRattihalli GBermbach DMilojicic D(2023)Kernel-as-a-ServiceProceedings of the 24th International Middleware Conference10.1145/3590140.3629115(192-206)Online publication date: 27-Nov-2023
https://dl.acm.org/doi/10.1145/3590140.3629115

Index Terms

Towards Latency-Aware Linux Scheduling for Serverless Workloads
1. Computer systems organization
  1. Architectures
    1. Distributed architectures
      1. Cloud computing
2. Software and its engineering
  1. Software organization and properties
    1. Contextual software domains
      1. Operating systems
        Process management
        Scheduling

Recommendations

Scheduler activations for interference-resilient SMP virtual machine scheduling
Middleware '17: Proceedings of the 18th ACM/IFIP/USENIX Middleware Conference

The wide adoption of SMP virtual machines (VMs) and resource consolidation present challenges to efficiently executing multi-threaded programs in the cloud. An important problem is the semantic gaps between the guest OS and the hypervisor. The well-...
Optimizing the live migration of virtual machine by CPU scheduling

Live migration has been proposed to reduce the downtime for migrated VMs by pre-copying the generated run-time memory state files from the original host to the migration destination host. However, if the rate for such a dirty memory generation is high, ...
Container-based real-time scheduling in the Linux kernel
Special Issue on Embedded Operating Systems Workshops (EWiLi'17 and EWiLi'18)

In recent years, there has been a growing interest in supporting component-based software development of complex real-time embedded systems. Techniques such as machine virtualisation have emerged as interesting mechanisms to enhance the security of ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SESAME '23: Proceedings of the 1st Workshop on SErverless Systems, Applications and MEthodologies

May 2023

64 pages

ISBN:9798400701856

DOI:10.1145/3592533

Co-chairs:
Dmitrii Ustiugov
NTU Singapore
,
Rodrigo Bruno
University of Lisbon
,
Pedro Fonseca
Purdue University
,
Boris Grot
University of Edinburgh
,
Antonio Barbalace
University of Edinburgh

Copyright © 2023 Owner/Author(s).

This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

SIGOPS: ACM Special Interest Group on Operating Systems

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 May 2023

Check for updates

Author Tags

Qualifiers

Research-article

Conference

SESAME '23

Sponsor:

SIGOPS

SESAME '23: 1st Workshop on SErverless Systems, Applications and MEthodologies

May 8, 2023

Rome, Italy

Upcoming Conference

EuroSys '25

Sponsor:
sigops

Twentieth European Conference on Computer Systems

March 30 - April 3, 2025

Rotterdam , Netherlands

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
730
Total Downloads

Downloads (Last 12 months)442
Downloads (Last 6 weeks)57

Reflects downloads up to 15 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Pfandzelter TDhakal AFrachtenberg EChalamalasetti SEmmot DHogade NEnriquez RRattihalli GBermbach DMilojicic D(2023)Kernel-as-a-ServiceProceedings of the 24th International Middleware Conference10.1145/3590140.3629115(192-206)Online publication date: 27-Nov-2023
https://dl.acm.org/doi/10.1145/3590140.3629115

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents