Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2535372.2535375acmconferencesArticle/Chapter ViewAbstractPublication PagesconextConference Proceedingsconference-collections
research-article

Per-packet load-balanced, low-latency routing for clos-based data center networks

Published: 09 December 2013 Publication History
  • Get Citation Alerts
  • Abstract

    Clos-based networks including Fat-tree and VL2 are being built in data centers, but existing per-flow based routing causes low network utilization and long latency tail. In this paper, by studying the structural properties of Fat-tree and VL2, we propose a per-packet round-robin based routing algorithm called Digit-Reversal Bouncing (DRB). DRB achieves perfect packet interleaving. Our analysis and simulations show that, compared with random-based load-balancing algorithms, DRB results in smaller and bounded queues even when traffic load approaches 100%, and it uses smaller re-sequencing buffer for absorbing out-of-order packet arrivals. Our implementation demonstrates that our design can be readily implemented with commodity switches. Experiments on our testbed, a Fat-tree with 54 servers, confirm our analysis and simulations, and further show that our design handles network failures in 1-2 seconds and has the desirable graceful performance degradation property.

    References

    [1]
    M. Al-Fares, A. Loukissas, and A. Vahdat. A Scalable, Commodity Data Center Network Architecture. In SIGCOMM, 2008.
    [2]
    M. Al-Fares, S. Radhakrishnan, B. Raghavan, N. Huang, and A. Vahdat. Hedera: Dynamic Flow Scheduling for Data Center Networks. In NSDI, 2010.
    [3]
    M. Alizadeh, A. Greenberg, D. Maltz, J. Padhye, P. Patel, B. Prabhakar, S. Sengupta, and M. Sridharan. Data Center TCP (DCTCP). In SIGCOMM, 2010.
    [4]
    M. Alizadeh, A. Kabbani, T. Edsall, B. Prabhakar, A. Vahdat, and M. Yasuda. Less is More: Trading a little Bandwidth for Ultra-Low Latency in the Data Center. In NSDI, 2012.
    [5]
    Amazon EC2. http://aws.amazon.com/ec2/.
    [6]
    T. Benson, A. Anand, A. Akella, and M. Zhang. MicroTE: Fine Grained Traffic Engineering for Data Centers. In CoNEXT, 2011.
    [7]
    B. Calder, J. Wang, A. Ogus, N. Nilakantan, A. Skjolsvold, S. McKelvie, Y. Xu, S. Srivastav, J. Wu, H. Simitci, et al. Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency. In SOSP, 2011.
    [8]
    Cisco. Per-packet load balancing. http://www.cisco.com/en/US/docs/ios/12 0s/feature/guide/pplb.html.
    [9]
    C. Clos. A Study of Nonblocking Switching Networks. Bell Syst. Tech. J., 32(2), 1953.
    [10]
    J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI, 2004.
    [11]
    A. Dixit, P. Prakash, Y. C. Hu, and R. R. Kompella. On the Impact of Packet Spraying in Data Center Networks. In INFOCOM, 2013.
    [12]
    A. Greenberg, N. Jain, S. Kandula, C. Kim, P. Lahiri, D. Maltz, P. Patel, and S. Sengupta. VL2: A Scalable and Flexible Data Center Network. In SIGCOMM, 2009.
    [13]
    C. Guo, G. Lu, D. Li, H. Wu, X. Zhang, Y. Shi, C. Tian, Y. Zhang, and S. Lu. BCube: A High Performance, Server-centric Network Architecture for Modular Data Centers. In SIGCOMM, 2009.
    [14]
    C. Guo, H. Wu, K. Tan, L. Shi, Y. Zhang, and S. Lu. DCell: A Scalable and Fault Tolerant Network Structure for Data Centers. In SIGCOMM, 2008.
    [15]
    J. Hamilton. 42: the answer to the ultimate question of life, the universe, and everything, Nov 2011.
    [16]
    T. Hoff. Latency is Everywhere and it Costs You Sales - How to Crush it, July 2009. http://highscalability.com/latency-everywhere-and-it-costs-you-sales-how-crush-it.
    [17]
    C. Hong, M. Caesar, and P. B. Godfrey. Finishing Flows Quickly with Preemptive Scheduling. In SIGCOMM, 2012.
    [18]
    R. Kohavi and R. Longbotham. Online Epxeriments: Lessons Learned. IEEE Computer, September 2007.
    [19]
    S. Mahapatra and X. Yuan. Load Balancing Mechanisms in Data Center Networks. In CEWIT, Sept 2010.
    [20]
    R. N. Mysore, A. Pamboris, N. Farrington, N. Huang, P. Miri, S. Radhakrishnan, V. Subramanya, and A. Vahdat. PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric. In SIGCOMM, 2009.
    [21]
    Juniper Networks. Overview of per-packet load balancing. http://www.juniper.net/techpubs/en US/junos11.2/topics/concept/policy-per-packet-load-balancing-overview.html.
    [22]
    C. Perkins. IP Encapsulation within IP, Oct 1996. RFC2003.
    [23]
    C. Raiciu, S. Barre, C. Pluntke, A. Greenhalgh, D. Wischik, and M. Handley. Improving Datacenter Performance and Robustness with Multipath TCP. In SIGCOMM, 2011.
    [24]
    S. Sen, D. Shue, S. Ihm, and M. J. Freedman. Scalable, Optimal Flow Routing in Datacenters via Local Link Balancing. In CoNEXT, 2013.
    [25]
    D. Thaler and C. Hopps. Multipath Issues in Unicast and Multicast Next-Hop Selection, Nov 2000. RFC 2991.
    [26]
    C. Wilson, H. Ballani, T. Karagiannis, and A. Rowstron. Better Never than Late: Meeting Deadlines in Datacenter Networks. In SIGCOMM, 2011.
    [27]
    X. Wu, D. Turner, C. Chen, D. Maltz, X. Yang, L. Yuan, and M. Zhang. NetPilot: Automating Datacenter Network Failure Mitigation. In SIGCOMM, 2012.
    [28]
    D. Zats, T. Das, P. Mohan, D Borthakur, and R. Katz. DeTail: Reducing the Flow Completion Time Tail in Datacenter Networks. In SIGCOMM, 2012.

    Cited By

    View all
    • (2024)Fine-grained load balancing with proactive prediction and adaptive rerouting in data centerJournal of High Speed Networks10.3233/JHS-23000330:1(83-96)Online publication date: 10-Jan-2024
    • (2024)Halflife: An Adaptive Flowlet-based Load Balancer with Fading Timeout in Data Center NetworksProceedings of the Nineteenth European Conference on Computer Systems10.1145/3627703.3650062(66-81)Online publication date: 22-Apr-2024
    • (2024)BurstBalancer: Do Less, Better Balance for Large-Scale Data Center TrafficIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.329545435:6(932-949)Online publication date: Jun-2024
    • Show More Cited By

    Index Terms

    1. Per-packet load-balanced, low-latency routing for clos-based data center networks

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CoNEXT '13: Proceedings of the ninth ACM conference on Emerging networking experiments and technologies
      December 2013
      454 pages
      ISBN:9781450321013
      DOI:10.1145/2535372
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 09 December 2013

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. load balance routing
      2. low latency

      Qualifiers

      • Research-article

      Conference

      CoNEXT '13
      Sponsor:
      CoNEXT '13: Conference on emerging Networking Experiments and Technologies
      December 9 - 12, 2013
      California, Santa Barbara, USA

      Acceptance Rates

      CoNEXT '13 Paper Acceptance Rate 44 of 226 submissions, 19%;
      Overall Acceptance Rate 198 of 789 submissions, 25%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)77
      • Downloads (Last 6 weeks)9

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Fine-grained load balancing with proactive prediction and adaptive rerouting in data centerJournal of High Speed Networks10.3233/JHS-23000330:1(83-96)Online publication date: 10-Jan-2024
      • (2024)Halflife: An Adaptive Flowlet-based Load Balancer with Fading Timeout in Data Center NetworksProceedings of the Nineteenth European Conference on Computer Systems10.1145/3627703.3650062(66-81)Online publication date: 22-Apr-2024
      • (2024)BurstBalancer: Do Less, Better Balance for Large-Scale Data Center TrafficIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.329545435:6(932-949)Online publication date: Jun-2024
      • (2024)Yuz: Improving Performance of Cluster-Based Services by Near-L4 Session-Persistent Load BalancingIEEE Transactions on Network and Service Management10.1109/TNSM.2023.334196421:2(1929-1942)Online publication date: Apr-2024
      • (2024)Load Profiling via In-Band Flow Classification and P4 With HowdahIEEE Transactions on Network and Service Management10.1109/TNSM.2023.329972921:1(295-309)Online publication date: Feb-2024
      • (2024)Alarm: An Adaptive Routing Algorithm Based on One-Way Delay for InfinibandIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.338229511:4(3653-3666)Online publication date: Jul-2024
      • (2024)HG: Leveraging Hybrid Switching Granularity to Balance Heterogeneous Data Center Traffic Load for Cloud-Based Industrial ApplicationsIEEE Transactions on Industrial Informatics10.1109/TII.2024.337021520:6(8416-8427)Online publication date: Jun-2024
      • (2024)CanaryFuture Generation Computer Systems10.1016/j.future.2023.10.010152:C(70-82)Online publication date: 1-Mar-2024
      • (2023)Load-optimization in Reconfigurable Data-center Networks: Algorithms and Complexity of Flow RoutingACM Transactions on Modeling and Performance Evaluation of Computing Systems10.1145/35972008:3(1-30)Online publication date: 18-Jul-2023
      • (2023)Dependable Virtualized Fabric on Programmable Data PlaneIEEE/ACM Transactions on Networking10.1109/TNET.2022.322461731:4(1748-1764)Online publication date: Aug-2023
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media