Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3343180.3343187acmotherconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article

Ether: Providing both Interactive Service and Fairness in Multi-Tenant Datacenters

Published: 17 August 2019 Publication History

Abstract

Multi-tenant datacenters and cloud networks must provide both isolation and interactive service to tenant applications, many of which are sensitive to tail flow completion times. Network operators must also ensure high utilization of network capacity to reduce cost. Existing approaches that statically partition network capacity, in either time or space, provide good isolation but suffer from under-utilization. Existing schemes that dynamically allocate capacity to tenants incur either decreased fairness or high tail flow completion times. To overcome these limitations, we propose Ether. Ether is able to overcome these limitations because it can prioritize bursty flows during short congestion episodes while still ensuring fairness at long timescales. In this paper, we present a preliminary design of Ether and discuss its feasibility in today's programmable switches. Our evaluations show that, at high loads, Ether achieves 23% improvement in tail flow completion times (FCT) when compared with idealized fair queueing (FQ) while still providing similar fairness as FQ. In contrast, pFabric, which optimizes FCT, worsens fairness by a factor of 1.8 when compared with Ether.

References

[1]
{n.d.}. Amazon Virtual Private Cloud. https://aws.amazon.com/vpc.
[2]
{n.d.}. NS-3 network simulator. http://www.nsnam.org/.
[3]
{n.d.}. Portable Switch Architecture (PSA). https://p4.org/specs/.
[4]
Mohammad Al-Fares et al. 2008. A scalable, commodity data center network architecture. In SIGCOMM.
[5]
Mohammad Alizadeh et al. 2010. Data Center TCP (DCTCP). In SIGCOMM.
[6]
Mohammad Alizadeh et al. 2013. pfabric: Minimal near-optimal data-center transport. In SIGCOMM.
[7]
Mohammad Alizadeh et al. 2014. CONGA: Distributed congestion-aware load balancing for datacenters. In SIGCOMM.
[8]
Wei Bai et al. 2015. Information-agnostic Flow Scheduling for Commodity Data Centers. In NSDI.
[9]
Hitesh Ballani et al. 2011. Towards predictable datacenter networks. In SIGCOMM.
[10]
Mosharaf Chowdhury et al. 2016. {HUG}: Multi-Resource Fairness for Correlated and Elastic Demands. In NSDI.
[11]
David D Clark et al. 1992. Supporting real-time applications in an integrated services packet network: Architecture and mechanism. In SIGCOMM.
[12]
Nandita Dukkipati et al. 2005. Processor Sharing Flows in the Internet. In IWQoS.
[13]
Peter X Gao et al. 2015. phost: Distributed near-optimal datacenter transport over commodity network fabric. In CoNEXT.
[14]
Shuihai Hu, Wei Bai, Kai Chen, Chen Tian, Ying Zhang, and Haitao Wu. 2018. Providing bandwidth guarantees, work conservation and low latency simultaneously in the cloud. IEEE Transactions on Cloud Computing (2018).
[15]
Rajendra K Jain et al. 1984. A quantitative measure of fairness and discrimination. Eastern Research Laboratory, Digital Equipment Corporation (1984).
[16]
Keon Jang et al. 2015. Silo: Predictable Message Latency in the Cloud. In SIGCOMM.
[17]
Vimalkumar Jeyakumar et al. 2013. EyeQ: Practical network performance isolation at the edge. In NSDI.
[18]
David Lo et al. 2015. Heracles: Improving Resource Efficiency at Scale. In ISCA.
[19]
Radhika Mittal et al. 2016. Universal packet scheduling. In NSDI.
[20]
Jayaram Mudigonda et al. 2011. NetLord: A Scalable Multi-tenant Network Architecture for Virtualized Datacenters. In SIGCOMM.
[21]
Lucian Popa et al. 2013. ElasticSwitch: Practical Work-conserving Bandwidth Guarantees for Cloud Computing. In SIGCOMM.
[22]
Lucian Popa, Praveen Yalagandula, Sujata Banerjee, Jeffrey C Mogul, Yoshio Turner, and Jose Renato Santos. 2013. Elasticswitch: Practical work-conserving bandwidth guarantees for cloud computing. In ACM SIGCOMM Computer Communication Review, Vol. 43. ACM, 351--362.
[23]
Hamed Rezaei et al. 2018. Slytherin: Dynamic, network-assisted prioritization of tail packets in datacenter networks. In ICCCN.
[24]
Naveen Kr Sharma et al. 2018. Approximating fair queueing on reconfigurable switches. In NSDI.
[25]
Balajee Vamanan et al. 2015. Timetrader: Exploiting latency tail to save datacenter energy for online search. In MICRO.
[26]
Luping Wang, Wei Wang, and Bo Li. 2018. Utopia: Near-optimal coflow scheduling with isolation guarantee. In IEEE INFOCOM 2018-IEEE Conference on Computer Communications. IEEE, 891--899.
[27]
David Zats et al. 2012. DeTail: reducing the flow completion time tail in datacenter networks. In SIGCOMM.
[28]
Qiao Zhang et al. 2017. High-resolution Measurement of Data Center Microbursts. In IMC.

Cited By

View all
  • (2022)ADA: Arithmetic Operations with Adaptive TCAM Population in Programmable Switches2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS54860.2022.00044(1-11)Online publication date: Jul-2022
  • (2020)ResQueue: A Smarter Datacenter Flow SchedulerProceedings of The Web Conference 202010.1145/3366423.3380012(2599-2605)Online publication date: 20-Apr-2020
  • (2020)A compressed-sensing-based compressor for ECGBiomedical Engineering Letters10.1007/s13534-020-00148-710:2(299-307)Online publication date: 6-Feb-2020
  • Show More Cited By

Index Terms

  1. Ether: Providing both Interactive Service and Fairness in Multi-Tenant Datacenters

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        APNet '19: Proceedings of the 3rd Asia-Pacific Workshop on Networking
        August 2019
        104 pages
        ISBN:9781450376358
        DOI:10.1145/3343180
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 17 August 2019

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Datacenter
        2. Fairness
        3. Multi-tenant datacenter
        4. Optimal FCT

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        APNet '19

        Acceptance Rates

        Overall Acceptance Rate 50 of 118 submissions, 42%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)6
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 13 Sep 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2022)ADA: Arithmetic Operations with Adaptive TCAM Population in Programmable Switches2022 IEEE 42nd International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS54860.2022.00044(1-11)Online publication date: Jul-2022
        • (2020)ResQueue: A Smarter Datacenter Flow SchedulerProceedings of The Web Conference 202010.1145/3366423.3380012(2599-2605)Online publication date: 20-Apr-2020
        • (2020)A compressed-sensing-based compressor for ECGBiomedical Engineering Letters10.1007/s13534-020-00148-710:2(299-307)Online publication date: 6-Feb-2020
        • (2019)WardProceedings of the 15th International Conference on emerging Networking EXperiments and Technologies10.1145/3360468.3366780(34-36)Online publication date: 9-Dec-2019
        • (2019)Compression of Speech Signals using Kronecker Enhanced Compressive Sensing Method2019 5th Iranian Conference on Signal Processing and Intelligent Systems (ICSPIS)10.1109/ICSPIS48872.2019.9066024(1-6)Online publication date: Dec-2019

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media