Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3106989.3107003acmotherconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article

Congestion Control for High-speed Extremely Shallow-buffered Datacenter Networks

Published: 03 August 2017 Publication History

Abstract

The link speed in datacenters is growing fast, from 1Gbps to 100Gbps. However, the buffer size of commodity switches increases slowly, thus significantly outpaced by the link speed. In such extremely shallow-buffered datacenter networks, prior TCP/ECN solutions suffer from either excessive packet losses or significant throughput degradation. Motivated by this, we introduce BCC, a simple yet effective solution with only one more configuration (shared buffer ECN/RED) at commodity switches. BCC operates based on real-time shared buffer utilization. When the buffer is abundant, BCC delivers both high throughput and low packet loss rate. When it becomes scarce, BCC triggers shared buffer ECN/RED to prevent packet losses at the cost of sacrificing a small amount of throughput. Our preliminary results show that BCC maintains low packet loss rate persistently while only slightly degrading throughput when the buffer becomes insufficient. Compared to current practice, BCC achieves up to 94.4% lower 99th percentile completion time for small flows while only degrading large flows by up to 2.8%.

References

[1]
Arista 7060CX-32 and 7260CX-64. https://people.ucsc.edu/~warner/Bufs/7060CX.html.
[2]
Broadcom Tomahawk. https://people.ucsc.edu/~warner/Bufs/tomahawk.
[3]
IEEE DCB. 802.1Qbb - Priority-based Flow Control. http://www.ieee802.org/1/pages/802.1bb.html.
[4]
User Manual of Arista EOS version 4.15.0F. https://www.arista.com/assets/data/docs/Manuals/EOS-4.15.0F-Manual.pdf.
[5]
Mohammad Alizadeh and et al. Data center TCP (DCTCP). In SIGCOMM 2010.
[6]
Mohammad Alizadeh, Adel Javanmard, and Balaji Prabhakar. Analysis of DCTCP: stability, convergence, and fairness. In SIGMETRICS 2011.
[7]
Guido Appenzeller, Isaac Keslassy, and Nick McKeown. Sizing Router Buffers. In SIGCOMM 2004.
[8]
Wei Bai, Li Chen, Kai Chen, and Haitao Wu. Enabling ECN in Multi-Service Multi-Queue Data Centers. In NSDI 2016.
[9]
Abhijit K. Choudhury and Ellen L. Hahne. 1998. Dynamic Queue Length Thresholds for Shared-memory Packet Switches. IEEE/ACM Trans. Netw. 6, 2 (April 1998), 130--140.
[10]
Sally Floyd and Van Jacobson. 1993. Random early detection gateways for congestion avoidance. IEEE/ACM Trans. Netw. (1993), 397--413.
[11]
Peter X. Gao and et al. pHost: Distributed Near-optimal Datacenter Transport over Commodity Network Fabric. In CoNEXT 2015.
[12]
Chuanxiong Guo and et al. Pingmesh: A Large-Scale System for Data Center Network Latency Measurement and Analysis. In SIGCOMM 2015.
[13]
Chuanxiong Guo, Haitao Wu, Zhong Deng, Gaurav Soni, Jianxi Ye, Jitu Padhye, and Marina Lipshteyn. RDMA over Commodity Ethernet at Scale. In SIGCOMM 2016.
[14]
Chi-Yao Hong, Matthew Caesar, and P Godfrey. Finishing flows quickly with preemptive scheduling. In SIGCOMM 2012.
[15]
Radhika Mittal and et al. TIMELY: RTT-based Congestion Control for the Datacenter. In SIGCOMM 2015.
[16]
Ali Munir, Ihsan A Qazi, Zartash A Uzmi, Aisha Mushtaq, Saad N Ismail, M Safdar Iqbal, and Basma Khan. Minimizing flow completion times in data centers. In INFOCOM 2013.
[17]
Jonathan Perry, Hari Balakrishnan, and Devavrat Shah. Flow-tune: Flowlet Control for Datacenter Networks. In NSDI 2017.
[18]
Jonathan Perry, Amy Ousterhout, Hari Balakrishnan, Deverat Shah, and Hans Fugal. Fastpass: A Centralized "Zero-queue" Datacenter Network. In SIGCOMM 2014.
[19]
Kadangode Ramakrishnan, Sally Floyd, and David Black. 2001. RFC 3168: The addition of explicit congestion notification (ECN) to IP. (2001).
[20]
Balajee Vamanan, Jahangir Hasan, and TN Vijaykumar. Deadline-aware datacenter tcp (d2tcp). In SIGCOMM 2012.
[21]
Haitao Wu, Jiabo Ju, Guohan Lu, Chuanxiong Guo, Yongqiang Xiong, and Yongguang Zhang. Tuning ECN for data center networks. In CoNEXT 2012.
[22]
Yibo Zhu, Monia Ghobadi, Vishal Misra, and Jitendra Padhye. ECN or Delay: Lessons Learnt from Analysis of DCQCN and TIMELY. In CoNEXT 2016.

Cited By

View all
  • (2024)Opportunistic Packet Forwarding for Proactive Transport in Datacenters2024 IFIP Networking Conference (IFIP Networking)10.23919/IFIPNetworking62109.2024.10619903(1-9)Online publication date: 3-Jun-2024
  • (2024)FlowSail: Fine-Grained and Practical Flow Control for Datacenter NetworksIEEE/ACM Transactions on Networking10.1109/TNET.2024.340661332:5(3916-3928)Online publication date: Oct-2024
  • (2024)Load Balancing With Multi-Level Signals for Lossless Datacenter NetworksIEEE/ACM Transactions on Networking10.1109/TNET.2024.336633632:3(2736-2748)Online publication date: Jun-2024
  • Show More Cited By

Index Terms

  1. Congestion Control for High-speed Extremely Shallow-buffered Datacenter Networks

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    APNet '17: Proceedings of the First Asia-Pacific Workshop on Networking
    August 2017
    127 pages
    ISBN:9781450352444
    DOI:10.1145/3106989
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 August 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Buffer
    2. Data center networks
    3. ECN
    4. TCP

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    APNet'17
    APNet'17: First Asia-Pacific Workshop on Networking
    August 3 - 4, 2017
    Hong Kong, China

    Acceptance Rates

    Overall Acceptance Rate 50 of 118 submissions, 42%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)22
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Opportunistic Packet Forwarding for Proactive Transport in Datacenters2024 IFIP Networking Conference (IFIP Networking)10.23919/IFIPNetworking62109.2024.10619903(1-9)Online publication date: 3-Jun-2024
    • (2024)FlowSail: Fine-Grained and Practical Flow Control for Datacenter NetworksIEEE/ACM Transactions on Networking10.1109/TNET.2024.340661332:5(3916-3928)Online publication date: Oct-2024
    • (2024)Load Balancing With Multi-Level Signals for Lossless Datacenter NetworksIEEE/ACM Transactions on Networking10.1109/TNET.2024.336633632:3(2736-2748)Online publication date: Jun-2024
    • (2024)A Hybrid Solution to Provide End-to-End Flow Control and Congestion Management in High-Performance Interconnection Networks2024 IEEE 24th International Symposium on Cluster, Cloud and Internet Computing (CCGrid)10.1109/CCGrid59990.2024.00011(8-17)Online publication date: 6-May-2024
    • (2024)Towards fine-grained load balancing with dynamical flowlet timeout in datacenter networksComputer Networks10.1016/j.comnet.2024.110867(110867)Online publication date: Oct-2024
    • (2024)HAECN: Hierarchical Automatic ECN Tuning with Ultra-Low Overhead in Datacenter NetworksAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0798-0_19(324-343)Online publication date: 1-Mar-2024
    • (2024)Adaptive Routing for Datacenter Networks Using Ant Colony OptimizationAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0798-0_17(290-309)Online publication date: 1-Mar-2024
    • (2024)Deep Reinforcement Learning Based Load Balancing for Heterogeneous Traffic in Datacenter NetworksAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0798-0_16(270-289)Online publication date: 1-Mar-2024
    • (2024)Enabling Traffic-Differentiated Load Balancing for Datacenter NetworksAlgorithms and Architectures for Parallel Processing10.1007/978-981-97-0798-0_15(250-269)Online publication date: 1-Mar-2024
    • (2023) Flash : Joint Flow Scheduling and Congestion Control in Data Center Networks IEEE Transactions on Cloud Computing10.1109/TCC.2021.312951111:1(1038-1049)Online publication date: 1-Jan-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media