Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3094405.3094407acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article
Free access

The Grand CRU Challenge

Published: 11 August 2017 Publication History

Abstract

One of the main objectives of any cluster management system is the maximization of cluster resource utilization (CRU). In this paper, we argue that there is a dilemma underlying the challenge of maximizing CRU, as soon as network resources enter the picture. In contrast to local resources which can be handled in a more isolated fashion, global network resources are namely shared, and their allocation is intertwined with that of local resources. For effective resource management, either applications thus have to learn more about the infrastructure, or the resource manager has to understand application semantics -- both options violate the separation of applications from the underlying infrastructure strived for by resource managers. This paper makes the case for a resource management system that addresses the dilemma, and presents first ideas.

References

[1]
Hitesh Ballani, Paolo Costa, Thomas Karagiannis, and Ant Rowstron. 2011. Towards predictable datacenter networks. In ACM SIGCOMM.
[2]
D. Xie et al. 2012. The only constant is change: incorporating time-varying network reservations in data centers. In ACM SIGCOMM.
[3]
Andrew D Ferguson, Peter Bodik, Srikanth Kandula, Eric Boutin, and Rodrigo Fonseca. 2012. Jockey: guaranteed job latency in data parallel clusters. In Proceedings of the 7th ACM european conference on Computer Systems. ACM, 99--112.
[4]
Carlo Fuerst, Stefan Schmid, Lalith Suresh, and Paolo Costa. 2015. Kraken: Towards Elastic Performance Guarantees in Multi-tenant Data Centers. In Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. ACM, 433--434.
[5]
Carlo Fuerst, Stefan Schmid, Lalith Suresh, and Paolo Costa. 2016. Kraken: Online and Elastic Resource Reservations for Multi-tenant Datacenters. In Proc. 35th IEEE Conference on Computer Communications (INFOCOM).
[6]
Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D Joseph, Randy H Katz, Scott Shenker, and Ion Stoica. 2011. Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center. In NSDI, Vol. 11. 22--22.
[7]
Virajith Jalaparti, Hitesh Ballani, Paolo Costa, Thomas Karagiannis, and Ant Rowstron. 2012. Bridging the tenant-provider gap in cloud services. In Proc. 3rd ACM Symposium on Cloud Computing (SOCC). ACM.
[8]
Jeffrey C Mogul and Lucian Popa. 2012. What we talk about when we talk about cloud network performance. ACM SIGCOMM Computer Communication Review 42, 5 (2012), 44--48.
[9]
Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, and John Wilkes. 2013. Omega: flexible, scalable schedulers for large compute clusters. In Proceedings of the 8th ACM European Conference on Computer Systems. ACM, 351--364.
[10]
Vinod Kumar Vavilapalli, Arun C Murthy, Chris Douglas, Sharad Agarwal, Mahadev Konar, Robert Evans, Thomas Graves, Jason Lowe, Hitesh Shah, Siddharth Seth, et al. 2013. Apache hadoop yarn: Yet another resource negotiator. In Proceedings of the 4th annual Symposium on Cloud Computing. ACM, 5.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HotConNet '17: Proceedings of the Workshop on Hot Topics in Container Networking and Networked Systems
August 2017
52 pages
ISBN:9781450350587
DOI:10.1145/3094405
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cloud computing
  2. cluster scheduling
  3. resource manager architecture

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Predictable Big Data Analytics (PreLytics) Aalborg University
  • German Research Foundation (DFG) as part of project B2 within the Collaborative Research Center (CRC) 1053 - MAKI - Multi-Mechanisms Adaptation for the Future Internet
  • ERC grant

Conference

SIGCOMM '17
Sponsor:
SIGCOMM '17: ACM SIGCOMM 2017 Conference
August 25, 2017
CA, Los Angeles, USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 347
    Total Downloads
  • Downloads (Last 12 months)61
  • Downloads (Last 6 weeks)16
Reflects downloads up to 03 Feb 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media