research-article

Public Access

Occupy the cloud: distributed computing for the 99%

Authors:

Shivaram Venkataraman,

Benjamin RechtAuthors Info & Claims

SoCC '17: Proceedings of the 2017 Symposium on Cloud Computing

Pages 445 - 451

https://doi.org/10.1145/3127479.3128601

Published: 24 September 2017 Publication History

Abstract

Distributed computing remains inaccessible to a large number of users, in spite of many open source platforms and extensive commercial offerings. While distributed computation frameworks have moved beyond a simple map-reduce model, many users are still left to struggle with complex cluster management and configuration tools, even for running simple embarrassingly parallel jobs. We argue that stateless functions represent a viable platform for these users, eliminating cluster management overhead, fulfilling the promise of elasticity. Furthermore, using our prototype implementation, PyWren, we show that this model is general enough to implement a number of distributed computing models, such as BSP, efficiently. Extrapolating from recent trends in network bandwidth and the advent of disaggregated storage, we suggest that stateless functions are a natural fit for data processing in future computing environments.

References

[1]

Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., et al. Tensorflow: A system for large-scale machine learning. In OSDI (2016).

Digital Library

[2]

Armbrust, M., Fox, A., Griffith, R., Joseph, A. D., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I., et al. A view of cloud computing. CACM 53, 4 (2010), 50--58.

Digital Library

[3]

Asanovic, K., and Patterson, D. Firebox: A hardware building block for 2020 warehouse-scale computers. In FAST (2014).

[4]

Serverless Reference Architecture: MapReduce. https://github.com/awslabs/lambda-refarch-mapreduce.

[5]

Canny, J., and Zhao, H. Big data analytics with small footprint: Squaring the cloud. In KDD (2013).

Digital Library

[6]

Carriero, N., and Gelernter, D. Linda in context. CACM 32, 4 (Apr. 1989).

Digital Library

[7]

cloudpickle: Extended pickling support for python objects. https://github.com/cloudpipe/cloudpickle.

[8]

Douze, M., Jégou, H., Sandhawalia, H., Amsaleg, L., and Schmid, C. Evaluation of gist descriptors for web-scale image search. In ACM International Conference on Image and Video Retrieval (2009).

Digital Library

[9]

IEEE P802.3ba, 40Gb/s and 100Gb/s Ethernet Task Force. http://www.ieee802.org/3/ba/.

[10]

Fang, L., Nguyen, K., Xu, G., Demsky, B., and Lu, S. Interruptible tasks: Treating memory pressure as interrupts for highly scalable data-parallel programs. In SOSP (2015).

Digital Library

[11]

Fouladi, S., Wahby, R. S., Shacklett, B., Balasubramaniam, K. V., Zeng, W., Bhalerao, R., Sivaraman, A., Porter, G., and Winstein, K. Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads. In NSDI (2017).

[12]

G. Ananthanarayanan, A. Ghodsi, S. Shenker, I. Stoica. Disk-Locality in Datacenter Computing Considered Irrelevant. In Proc. HotOS (2011).

[13]

Gao, P. X., Narayan, A., Karandikar, S., Carreira, J., Han, S., Agarwal, R., Ratnasamy, S., and Shenker, S. Network requirements for resource disaggregation. In OSDI (2016).

Digital Library

[14]

Han, S., Egi, N., Panda, A., Ratnasamy, S., Shi, G., and Shenker, S. Network support for resource disaggregation in next-generation datacenters. In HotNets (2013).

Digital Library

[15]

Han, S., and Ratnasamy, S. Large-scale computation not at the cost of expressiveness. In HotOS (2013).

[16]

Hendrickson, S., Sturdevant, S., Harter, T., Venkataramani, V., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. Serverless computation with OpenLambda. In HotCloud (2016).

Digital Library

[17]

Herodotou, H., Lim, H., Luo, G., Borisov, N., Dong, L., Cetin, F. B., and Babu, S. Starfish: A self-tuning system for big data analytics. In CIDR (2011).

[18]

Hettrick, S., Antonioletti, M., Carr, L., Chue Hong, N., Crouch, S., De Roure, D., Emsley, I., Goble, C., Hay, A., Inupakutika, D., Jackson, M., Nenadic, A., Parkinson, T., Parsons, M. I., Pawlik, A., Peru, G., Proeme, A., Robinson, J., and Sufi, S. Uk research software survey 2014. Dec. 2014.

[19]

Hindman, B., Konwinski, A., Zaharia, M., Ghodsi, A., Joseph, A., Katz, R., Shenker, S., and Stoica, I. Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center. In Proc. NSDI (2011).

Digital Library

[20]

HP The Machine: Our vision for the Future of Computing. https://www.labs.hpe.com/the-machine.

[21]

Isard, M., Prabhakaran, V., Currey, J., Wieder, U., Talwar, K., and Goldberg, A. Quincy: Fair Scheduling for Distributed Computing Clusters. In Proc. SOSP (2009), pp. 261--276.

Digital Library

[22]

Lagar-Cavilla, H. A., Whitney, J. A., Scannell, A. M., Patchin, P., Rumble, S. M., de Lara, E., Brudno, M., and Satyanarayanan, M. Snowflock: Rapid virtual machine cloning for cloud computing. In EuroSys (2009).

Digital Library

[23]

Li, M., Andersen, D. G., Park, J. W., Smola, A. J., Ahmed, A., Josifovski, V., Long, J., Shekita, E. J., and Su, B.-Y. Scaling distributed machine learning with the parameter server. In OSDI (2014).

Digital Library

[24]

McAuley, J., Targett, C., Shi, Q., and Van Den Hengel, A. Image-based recommendations on styles and substitutes. In SIGIR (2015).

Digital Library

[25]

McSherry, F., Isard, M., and Murray, D. G. Scalability! but at what COST? In HotOS (2015).

[26]

Momcheva, I., and Tollerud, E. Software Use in Astronomy: an Informal Survey. arXiv 1507.03989 (2015).

[27]

Nightingale, E. B., Elson, J., Fan, J., Hofmann, O., Howell, J., and Suzue, Y. Flat datacenter storage. In OSDI (2012).

Digital Library

[28]

Niu, F., Recht, B., Re, C., and Wright, S. Hogwild: A lock-free approach to parallelizing stochastic gradient descent. In NIPS (2011).

Digital Library

[29]

Oliva, A., and Torralba, A. Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of computer vision 42, 3 (2001), 145--175.

[30]

O'Malley, O. TeraByte Sort on Apache Hadoop. http://sortbenchmark.org/YahooHadoop.pdf.

[31]

OpenWhisk. https://developer.ibm.com/openwhisk/.

[32]

Ousterhout, K., Panda, A., Rosen, J., Venkataraman, S., Xin, R., Ratnasamy, S., Shenker, S., and Stoica, I. The case for tiny tasks in compute clusters. In HotOS (2013).

Digital Library

[33]

Ousterhout, K., Wendell, P., Zaharia, M., and Stoica, I. Sparrow: distributed, low latency scheduling. In SOSP (2013).

Digital Library

[34]

Peng, D., and Dabek, F. Large-scale incremental processing using distributed transactions and notifications. In OSDI (2010).

Digital Library

[35]

Power, R., and Li, J. Piccolo: Building fast, distributed programs with partitioned tables. In OSDI (2010).

Digital Library

[36]

Redis server side scripting. https://redis.io/commands/eval.

[37]

Redis benchmarks. https://redis.io/topics/benchmarks.

[38]

Rumble, S. M., Ongaro, D., Stutsman, R., Rosenblum, M., and Ousterhout, J. K. It's Time for Low Latency. In Proc. HotOS (2011).

Digital Library

[39]

Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., Berg, A. C., and Li, F.-F. ImageNet Large Scale Visual Recognition Challenge. IJCV 115, 3 (2015), 211--252.

[40]

Schwarzkopf, M., Konwinski, A., Abd-El-Malek, M., and Wilkes, J. Omega: flexible, scalable schedulers for large compute clusters. In Proc. EuroSys (2013).

Digital Library

[41]

Scott, C. Latency trends. http://colin-scott.github.io/blog/2012/12/24/latency-trends/.

[42]

Shvachko, K., Kuang, H., Radia, S., and Chansler, R. The Hadoop Distributed File System. In Mass storage systems and technologies (MSST) (2010).

Digital Library

[43]

Sort Benchmark. http://sortbenchmark.org.

[44]

Tuning Java Garbage Collection for Apache Spark Applications. https://goo.gl/SIWlqx.

[45]

Tuning Spark. https://spark.apache.org/docs/latest/tuning.html#garbage-collection-tuning.

[46]

Vavilapalli, V. K., Murthy, A. C., Douglas, C., Agarwal, S., Konar, M., Evans, R., Graves, T., Lowe, J., Shah, H., Seth, S., et al. Apache Hadoop YARN: Yet another resource negotiator. In SoCC (2013).

Digital Library

[47]

Venkataraman, S., Yang, Z., Franklin, M., Recht, B., and Stoica, I. Ernest: Efficient performance prediction for large-scale advanced analytics. In NSDI (2016).

Digital Library

[48]

X1 instances. https://aws.amazon.com/ec2/instance-types/x1/.

[49]

Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., McCauley, M., Franklin, M., Shenker, S., and Stoica, I. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. In Proc. NSDI (2011).

Cited By

Lu XLi NYuan LZhang J(2024)Enhancing Resource Utilization Efficiency in Serverless Education: A Stateful Approach with RofuseElectronics10.3390/electronics1311216813:11(2168)Online publication date: 2-Jun-2024
https://doi.org/10.3390/electronics13112168
Deka RGhosh ANanda SBarik RSaikia M(2024)Smart Healthcare System in Server-Less Environment: Concepts, Architecture, Challenges, Future DirectionsComputers10.3390/computers1304010513:4(105)Online publication date: 19-Apr-2024
https://doi.org/10.3390/computers13040105
Das SPeter RZhang XRay S(2024)FunDa: Towards Serverless Data Analytics and In Situ Query ProcessingProceedings of the International Workshop on Big Data in Emergent Distributed Environments10.1145/3663741.3664788(1-6)Online publication date: 9-Jun-2024
https://dl.acm.org/doi/10.1145/3663741.3664788
Show More Cited By

Index Terms

Occupy the cloud: distributed computing for the 99%
1. Computer systems organization
  1. Architectures
    1. Distributed architectures
      1. Cloud computing
2. Computing methodologies
  1. Distributed computing methodologies
    1. Distributed programming languages

Recommendations

Serverless Data Analytics in the IBM Cloud
Middleware '18: Proceedings of the 19th International Middleware Conference Industry

Unexpectedly, the rise of serverless computing has also collaterally started the "democratization" of massive-scale data parallelism. This new trend heralded by PyWren pursues to enable untrained users to execute single-machine code in the cloud at ...
Distributed Double Machine Learning with a Serverless Architecture
ICPE '21: Companion of the ACM/SPEC International Conference on Performance Engineering

This paper explores serverless cloud computing for double machine learning. Being based on repeated cross-fitting, double machine learning is particularly well suited to exploit the high level of parallelism achievable with serverless computing. It ...
Exploring the cost and performance benefits of AWS step functions using a data processing pipeline
UCC '21: Proceedings of the 14th IEEE/ACM International Conference on Utility and Cloud Computing

In traditional cloud computing, dedicated hardware is substituted by dynamically allocated, utility-oriented resources such as virtualized servers. While cloud services are following the pay-as-you-go pricing model, resources are billed based on ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SoCC '17: Proceedings of the 2017 Symposium on Cloud Computing

September 2017

672 pages

ISBN:9781450350280

DOI:10.1145/3127479

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 September 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Badges

Best Paper

Author Tags

Qualifiers

Research-article

Funding Sources

Amazon Web Services
ONR
VMware
Amazon
NSF
Ericsson
GE
Intel
IBM
Microsoft
Huawei
DHS
Ant Financial
CapitalOne
Google
Sloan Research Fellowship

Conference

SoCC '17

Sponsor:

SoCC '17: ACM Symposium on Cloud Computing

September 24 - 27, 2017

California, Santa Clara

Acceptance Rates

Overall Acceptance Rate 169 of 722 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

305
Total Citations
View Citations
4,420
Total Downloads

Downloads (Last 12 months)597
Downloads (Last 6 weeks)36

Reflects downloads up to 01 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Lu XLi NYuan LZhang J(2024)Enhancing Resource Utilization Efficiency in Serverless Education: A Stateful Approach with RofuseElectronics10.3390/electronics1311216813:11(2168)Online publication date: 2-Jun-2024
https://doi.org/10.3390/electronics13112168
Deka RGhosh ANanda SBarik RSaikia M(2024)Smart Healthcare System in Server-Less Environment: Concepts, Architecture, Challenges, Future DirectionsComputers10.3390/computers1304010513:4(105)Online publication date: 19-Apr-2024
https://doi.org/10.3390/computers13040105
Das SPeter RZhang XRay S(2024)FunDa: Towards Serverless Data Analytics and In Situ Query ProcessingProceedings of the International Workshop on Big Data in Emergent Distributed Environments10.1145/3663741.3664788(1-6)Online publication date: 9-Jun-2024
https://dl.acm.org/doi/10.1145/3663741.3664788
Chen QQian JChe YLin ZWang JZhou JSong LLiang YWu JZheng WLiu WLi LLiu FTan KSekar VYu MSeneviratne AVeitch D(2024)YuanRong: A Production General-purpose Serverless System for Distributed Applications in the CloudProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672216(843-859)Online publication date: 4-Aug-2024
https://dl.acm.org/doi/10.1145/3651890.3672216
Garcia Lopez PSlominski AMetzler BBerhendt MShillaker S(2024)Serverless End Game: Disaggregation enabling TransparencyProceedings of the 2nd Workshop on SErverless Systems, Applications and MEthodologies10.1145/3642977.3652094(9-14)Online publication date: 22-Apr-2024
https://dl.acm.org/doi/10.1145/3642977.3652094
Basu Roy RTiwari D(2024)StarShip: Mitigating I/O Bottlenecks in Serverless Computing for Scientific WorkflowsProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/36390288:1(1-29)Online publication date: 21-Feb-2024
https://dl.acm.org/doi/10.1145/3639028
Triendl SRistov SBalsamo SKnottenbelt WAbad CShang W(2024)Peeking Behind the Serverless Implementations and Deployments of the Montage WorkflowCompanion of the 15th ACM/SPEC International Conference on Performance Engineering10.1145/3629527.3651420(196-203)Online publication date: 7-May-2024
https://dl.acm.org/doi/10.1145/3629527.3651420
Yu HBasu Roy RFontenot CTiwari DLi JZhang HWang HPark STsafrir DMUSUVATHI MGupta RAbu-Ghazaleh N(2024)RainbowCake: Mitigating Cold-starts in Serverless with Layer-wise Container Caching and SharingProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3617232.3624871(335-350)Online publication date: 27-Apr-2024
https://dl.acm.org/doi/10.1145/3617232.3624871
Zeng QHou KLeng XChen YChua TNgo CKa-Wei Lee RKumar RLauw H(2024)DirectFaaS: A Clean-Slate Network Architecture for Efficient Serverless Chain CommunicationsProceedings of the ACM Web Conference 202410.1145/3589334.3645333(2759-2767)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589334.3645333
Schmidt CBoissonnet TDohle JBernhardt KFerrando‐May EWernet TNitschke RKunis SWeidtkamp‐Peters S(2024)A practical guide to bioimaging research data management in core facilitiesJournal of Microscopy10.1111/jmi.13317294:3(350-371)Online publication date: 16-May-2024
https://doi.org/10.1111/jmi.13317
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents