
Toward a cloud computing research agenda

Published: 20 June 2009

Abstract

The 2008 LADIS workshop on Large Scale Distributed Systems brought together leaders from the commercial cloud computing community with researchers working on a variety of topics in distributed computing. The dialog yielded some surprises: some hot research topics seem to be of limited near-term importance to the cloud builders, while some of their practical challenges seem to pose new questions to us as systems researchers. This brief note summarizes our impressions.

Published In

ACM SIGACT News, Volume 40, Issue 2
June 2009
91 pages
ISSN: 0163-5700
DOI: 10.1145/1556154

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Column
