
Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems: Necessary and Sufficient Conditions

Published: 17 December 2019

Abstract

In this paper, we consider a load balancing system under a general pull-based policy. In particular, each arrival is randomly dispatched to any server whose queue length is below a threshold; if no such server exists, then the arrival is randomly assigned to any server. We are interested in the fundamental relationship between the threshold and the delay performance of the system in heavy traffic. To this end, we first establish the following necessary condition to guarantee heavy-traffic delay optimality: the threshold needs to grow to infinity as the exogenous arrival rate approaches the boundary of the capacity region (i.e., the load intensity approaches one), but the growth rate should be slower than a polynomial function of the mean number of tasks in the system. As a special case of this result, we directly show that the delay performance of the popular pull-based policy Join-Idle-Queue (JIQ) is not heavy-traffic optimal, but is strictly better than that of random routing. We further show that a sufficient condition for heavy-traffic delay optimality is that the threshold grows logarithmically with the mean number of tasks in the system. This result directly resolves a generalized version of the conjecture by Kelly and Laws.
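
The dispatching rule described in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the function names `dispatch` and `simulate`, the strict inequality used for "below a threshold" (under which a threshold of 1 selects idle servers, in the spirit of JIQ), and the discrete-time model with Bernoulli arrivals and services are all assumptions made here for illustration.

```python
import random
from typing import List


def dispatch(queue_lengths: List[int], threshold: int, rng: random.Random) -> int:
    """Pick a server index for one arrival under a threshold pull-based rule.

    Assumption: "below a threshold" is read as strictly less than `threshold`.
    """
    # Servers that currently qualify for a pull (queue length below threshold).
    below = [i for i, q in enumerate(queue_lengths) if q < threshold]
    # If no server qualifies, fall back to uniform random routing over all servers.
    candidates = below if below else list(range(len(queue_lengths)))
    return rng.choice(candidates)


def simulate(num_servers: int, arrival_prob: float, service_prob: float,
             threshold: int, steps: int, seed: int = 0) -> float:
    """Return the time-averaged total queue length in a toy slotted model.

    Assumed model (hypothetical, for illustration only): at most one arrival
    per slot with probability `arrival_prob`; each busy server completes one
    task per slot with probability `service_prob`.
    """
    rng = random.Random(seed)
    queues = [0] * num_servers
    total = 0
    for _ in range(steps):
        if rng.random() < arrival_prob:
            queues[dispatch(queues, threshold, rng)] += 1
        for i in range(num_servers):
            if queues[i] > 0 and rng.random() < service_prob:
                queues[i] -= 1
        total += sum(queues)
    return total / steps


if __name__ == "__main__":
    # Load intensity here is arrival_prob / (num_servers * service_prob) = 0.95.
    for r in (0, 1, 3, 6):
        avg = simulate(num_servers=4, arrival_prob=0.76, service_prob=0.2,
                       threshold=r, steps=200_000)
        print(f"threshold={r}: mean tasks in system ~ {avg:.2f}")
```

In this sketch a threshold of 0 never matches any server, so the rule reduces to random routing; raising the threshold lets one compare, informally, how the mean number of tasks responds as the load intensity is pushed toward one.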

References

[1] Hong Chen and Heng-Qing Ye. 2012. Asymptotic optimality of balanced routing. Operations Research 60, 1 (2012), 163-179.
[2] Atilla Eryilmaz and R. Srikant. 2012. Asymptotically tight steady-state queue length bounds implied by drift conditions. Queueing Systems 72, 3-4 (2012), 311-359.
[3] G. Foschini and J. Salz. 1978. A basic dynamic routing problem and diffusion. IEEE Transactions on Communications 26, 3 (1978), 320-327.
[4] F. P. Kelly and C. N. Laws. 1993. Dynamic routing in open queueing networks: Brownian models, cut constraints and resource pooling. Queueing Systems 13, 1-3 (1993), 47-86.
[5] Yi Lu, Qiaomin Xie, Gabriel Kliot, Alan Geller, James R. Larus, and Albert Greenberg. 2011. Join-Idle-Queue: A novel load balancing algorithm for dynamically scalable web services. Performance Evaluation 68, 11 (2011), 1056-1071.
[6] Siva Theja Maguluri, R. Srikant, and Lei Ying. 2014. Heavy traffic optimal resource allocation algorithms for cloud computing clusters. Performance Evaluation 81 (2014), 20-39.
[7] Alexander L. Stolyar. 2015. Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems 80, 4 (2015), 341-361.
[8] Xingyu Zhou, Jian Tan, and Ness Shroff. 2018. Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems: Necessary and Sufficient Conditions. Proceedings of the ACM on Measurement and Analysis of Computing Systems 2, 3 (2018), 41.
[9] Xingyu Zhou, Fei Wu, Jian Tan, Yin Sun, and Ness Shroff. 2017. Designing Low-Complexity Heavy-Traffic Delay-Optimal Load Balancing Schemes: Theory to Algorithms. Proceedings of the ACM on Measurement and Analysis of Computing Systems 1, 2 (2017), 39.

Cited By

  • (2023) Distributed Dispatching in the Parallel Server Model. IEEE/ACM Transactions on Networking 31:4 (1521-1534). DOI: 10.1109/TNET.2022.3220931. Online publication date: Aug-2023
  • (2021) Self-Learning Threshold-Based Load Balancing. INFORMS Journal on Computing 34:1 (39-54). DOI: 10.1287/ijoc.2021.1100. Online publication date: 16-Sep-2021

Published In

ACM SIGMETRICS Performance Evaluation Review  Volume 47, Issue 1
June 2019
100 pages
ISSN:0163-5999
DOI:10.1145/3376930
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 December 2019
Published in SIGMETRICS Volume 47, Issue 1

Author Tags

  1. heavy-traffic delay optimality
  2. load balancing
  3. necessary and sufficient conditions
  4. pull-based

Qualifiers

  • Research-article
