Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2745844.2745849acmconferencesArticle/Chapter ViewAbstractPublication PagesmetricsConference Proceedingsconference-collections
research-article

Power of d Choices for Large-Scale Bin Packing: A Loss Model

Published: 15 June 2015 Publication History

Abstract

We consider a system of $N$ parallel servers, where each server consists of B units of a resource. Jobs arrive at this system according to a Poisson process, and each job stays in the system for an exponentially distributed amount of time. Each job may request different units of the resource from the system. The goal is to understand how to route arriving jobs to the servers to minimize the probability that an arriving job does not find the required amount of resource at the server, i.e., the goal is to minimize blocking probability. The motivation for this problem arises from the design of cloud computing systems in which the jobs are virtual machines (VMs) that request resources such as memory from a large pool of servers. In this paper, we consider power-of-d-choices routing, where a job is routed to the server with the largest amount of available resource among d ≥ 2 randomly chosen servers. We consider a fluid model that corresponds to the limit as N goes to infinity and provide an explicit upper bound for the equilibrium blocking probability. We show that the upper bound exhibits different behavior as B goes to infinity depending on the relationship between the total traffic intensity λ and B. In particular, if (B -- λ)/√λ → α, the upper bound is doubly exponential in √λ and if (B -- λ)/logd λ → β, β > 1, the upper bound is exponential in λ. Simulation results show that the blocking probability, even for small B, exhibits qualitatively different behavior in the two traffic regimes. This is in contrast with the result for random routing, where the blocking probability scales as O(1/√λ) even if (B -- λ)/√λ → α.

References

[1]
Amazon EC2. http://aws.amazon.com/ec2/.
[2]
V. Anantharam and M. Benchekroun. A technique for computing sojourn times in large networks of interacting queues. Probability in the Engineering and Informational Sciences, 7(04):441--464, 1993.
[3]
Google App Engine. https://cloud.google.com/appengine/docs?csw=1.
[4]
Y. Azar, A. Z. Broder, A. R. Karlin, and E. Upfal. Balanced allocations. SIAM J. Comput., 29(1):180--200, Sept. 1999.
[5]
Azure. http://azure.microsoft.com/en-us/.
[6]
N. Bansal, A. Caprara, and M. Sviridenko. A new approximation method for set covering problems, with applications to multidimensional bin packing. SIAM Journal on Computing, 39(4):1256--1278, 2010.
[7]
A. A. Borovkov. Stochastic Processes in Queueing Theory. Springer, 1976.
[8]
M. Bramson, Y. Lu, and B. Prabhakar. Asymptotic independence of queues under randomized load balancing. Queueing Systems, 71(3):247--292, 2012.
[9]
J. Csirik, D. S. Johnson, C. Kenyon, J. B. Orlin, P. W. Shor, and R. R. Weber. On the sum-of-squares algorithm for bin packing. J. ACM, 53(1):1--65, Jan. 2006.
[10]
C. Graham. Chaoticity on path space for a queueing network with selection of the shortest queue among several. Journal of Applied Probability, 37(1):198--211, 2000.
[11]
V. Gupta and A. Radovanovic. Online stochastic bin packing, 2012.
[12]
J. Kaufman. Blocking in a shared resource environment. Communications, IEEE Transactions on, 29(10):1474--1481, Oct 1981.
[13]
T. G. Kurtz. Approximation of Population Processes. Society for Industrial and Applied Mathematics, 1981.
[14]
M. Luczak and C. McDiarmid. On the maximum queue length in the supermarket model. The Annals of Probability, 34(2):493--527, 2006.
[15]
S. Maguluri, R. Srikant, and L. Ying. Stochastic models of load balancing and scheduling in cloud computing clusters. In Proc. of IEEE INFOCOM, pages 702--710, Mar 2012.
[16]
S. T. Maguluri, R. Srikant, and L. Ying. Heavy traffic optimal resource allocation algorithms for cloud computing clusters. In Proc. of the 24th International Teletraffic Congress, pages 25:1--25:8, 2012.
[17]
X. Meng, V. Pappas, and L. Zhang. Improving the scalability of data center networks with traffic-aware virtual machine placement. In Proc. of IEEE INFOCOM, pages 1154--1162, Piscataway, NJ, USA, 2010.
[18]
M. Mitzenmacher. The power of two choices in randomized load balancing. PhD thesis, UC Berkeley, 1996.
[19]
M. Mitzenmacher. Studying balanced allocations with differential equations. Combinatorics, Probability and Computing, 8(5):473--482, Sept. 1999.
[20]
A. Mukhopadhyay and R. R. Mazumdar. Analysis of load balancing in large heterogeneous processor sharing systems. ArXiv preprint arXiv:1311.5806, 2013.
[21]
A. Mukhopadhyay, R. R. Mazumdar, and F. Guillemin. Static versus dynamic user assignment to cloud resources. 2015.
[22]
J. W. Roberts. A service system with heterogeneous user requirement. In G. Pujolle, editor, Performance of Data Communications Systems and Their Applications, 1981.
[23]
R. Srikant and W. Whitt. Simulation run lengths to estimate blocking probabilities. ACM Trans. Model. Comput. Simul., 6(1):7--52, Jan. 1996.
[24]
A. L. Stolyar. An infinite server system with general packing constraints. ArXiv preprint arXiv:1205.4271, 2012.
[25]
A. L. Stolyar and Y. Zhong. An infinite server system with general packing constraints: Asymptotic optimality of a greedy randomized algorithm. In Proc. 53th Annu. Allerton Conf. Commun., Control Comput., pages 575--582, Oct 2013.
[26]
A. L. Stolyar and Y. Zhong. A large-scale service system with packing constraints: Minimizing the number of occupied servers. SIGMETRICS Perform. Eval. Rev., 41(1):41--52, June 2013.
[27]
J. N. Tsitsiklis and K. Xu. On the power of (even a little) resource pooling. Stochastic Systems, 2(1):1--66, 2012.
[28]
N. D. Vvedenskaya, R. L. Dobrushin, and F. I. Karpelevich. Queueing system with selection of the shortest of two queues: An asymptotic approach. Probl. Peredachi Inf., 32(1):20--34, 1996.
[29]
L. Wang, F. Zhang, A. V. Vasilakos, C. Hou, and Z. Liu. Joint virtual machine assignment and traffic engineering for green data center networks. SIGMETRICS Perform. Eval. Rev., 41(3):107--112, Jan. 2014.
[30]
W. Whitt. Heavy-traffic approximations for service systems with blocking. AT&T Bell Laboratories Technical Journal, 63(5):689--708, 1984.
[31]
L. Ying, R. Srikant, and X. Kang. The power of slightly more than one sample in randomized load balancing. In Proc. of IEEE INFOCOM, 2015.

Cited By

View all
  • (2022)Power of Random Choices Made Efficient for Fog ComputingIEEE Transactions on Cloud Computing10.1109/TCC.2020.296844310:2(1130-1141)Online publication date: 1-Apr-2022
  • (2021)Rosella: A Self-Driving Distributed Scheduler for Heterogeneous Clusters2021 17th International Conference on Mobility, Sensing and Networking (MSN)10.1109/MSN53354.2021.00073(446-453)Online publication date: Dec-2021
  • (2020) Power-of- d -Choices with Memory: Fluid Limit and Optimality Mathematics of Operations Research10.1287/moor.2019.1014Online publication date: 7-Jan-2020
  • Show More Cited By

Index Terms

  1. Power of d Choices for Large-Scale Bin Packing: A Loss Model

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGMETRICS '15: Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems
      June 2015
      488 pages
      ISBN:9781450334860
      DOI:10.1145/2745844
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 15 June 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. fluid limit analysis
      2. loss model
      3. randomized algorithms
      4. resource allocation
      5. virtual machine assignment

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      SIGMETRICS '15
      Sponsor:

      Acceptance Rates

      SIGMETRICS '15 Paper Acceptance Rate 32 of 239 submissions, 13%;
      Overall Acceptance Rate 459 of 2,691 submissions, 17%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)22
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 02 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Power of Random Choices Made Efficient for Fog ComputingIEEE Transactions on Cloud Computing10.1109/TCC.2020.296844310:2(1130-1141)Online publication date: 1-Apr-2022
      • (2021)Rosella: A Self-Driving Distributed Scheduler for Heterogeneous Clusters2021 17th International Conference on Mobility, Sensing and Networking (MSN)10.1109/MSN53354.2021.00073(446-453)Online publication date: Dec-2021
      • (2020) Power-of- d -Choices with Memory: Fluid Limit and Optimality Mathematics of Operations Research10.1287/moor.2019.1014Online publication date: 7-Jan-2020
      • (2019)Delay Asymptotics and Bounds for Multi-Task Parallel JobsACM SIGMETRICS Performance Evaluation Review10.1145/3308897.330890146:3(2-7)Online publication date: 25-Jan-2019
      • (2019) Insensitivity of the mean field limit of loss systems under SQ( d ) routeing Advances in Applied Probability10.1017/apr.2019.4151:4(1027-1066)Online publication date: 15-Nov-2019
      • (2019)Delay asymptotics and bounds for multitask parallel jobsQueueing Systems: Theory and Applications10.1007/s11134-018-09597-591:3-4(207-239)Online publication date: 1-Apr-2019
      • (2018)Asymptotics of insensitive load balancing and blocking phasesQueueing Systems: Theory and Applications10.1007/s11134-017-9559-588:3-4(243-278)Online publication date: 1-Apr-2018
      • (2017)Mean-Field Analysis of Loss Models with Mixed-Erlang Distributions under Power-of-d Routing2017 29th International Teletraffic Congress (ITC 29)10.23919/ITC.2017.8064362(250-258)Online publication date: Sep-2017
      • (2017)Stein's Method for Mean Field Approximations in Light and Heavy Traffic RegimesProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/30844491:1(1-27)Online publication date: 13-Jun-2017
      • (2017)Delay Versus Stickiness Violation Trade-Offs for Load Balancing in Large-Scale Data Centers2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)10.1109/MASCOTS.2017.33(63-72)Online publication date: Sep-2017
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media