Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleApril 2024
Load Balancing with Job-Size Testing: Performance Improvement or Degradation?
ACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS), Volume 9, Issue 2Article No.: 8, Pages 1–27https://doi.org/10.1145/3651154In the context of decision making under explorable uncertainty, scheduling with testing is a powerful technique used in the management of computer systems to improve performance via better job-dispatching decisions. Upon job arrival, a scheduler may run ...
- research-articleApril 2024
Reinforcement learning in a birth and death process: breaking the dependence on the state space
NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsNovember 2022, Article No.: 1051, Pages 14464–14474In this paper, we revisit the regret of undiscounted reinforcement learning in MDPs with a birth and death structure. Specifically, we consider a controlled queue with impatient jobs and the main objective is to optimize a trade-off between energy ...
- articleMarch 2022
Optimal Speed Profile of a DVFS Processor under Soft Deadlines
ACM SIGMETRICS Performance Evaluation Review (SIGMETRICS), Volume 49, Issue 3December 2021, Pages 71–72https://doi.org/10.1145/3529113.3529139Minimizing the energy consumption of embedded systems with real-time execution constraints is becoming more and more important. More functionalities and better performance/ cost tradeoffs are expected from such systems because of the increased use of ...
- research-articleDecember 2021
Optimal speed profile of a DVFS processor under soft deadlines
Performance Evaluation (PEVA), Volume 152, Issue CDec 2021https://doi.org/10.1016/j.peva.2021.102245AbstractWe consider a Dynamic Voltage and Frequency Scaling (DVFS) processor executing jobs with obsolescence deadlines: A job becomes obsolete and is removed from the system if it is not completed before its deadline. The objective is to ...
-
- research-articleNovember 2021
Stability and Optimization of Speculative Queueing Networks
IEEE/ACM Transactions on Networking (TON), Volume 30, Issue 2Pages 911–922https://doi.org/10.1109/TNET.2021.3128778We provide a queueing-theoretic framework for job replication schemes based on the principle “<italic>replicate a job as soon as the system detects it as a straggler</italic>”. This is called job <italic>speculation</italic>. Recent works ...
- research-articleAugust 2020
Power-of-d-Choices with Memory: Fluid Limit and Optimality
Mathematics of Operations Research (MOOR), Volume 45, Issue 3August 2020, Pages 862–888https://doi.org/10.1287/moor.2019.1014In multiserver distributed queueing systems, the access of stochastically arriving jobs to resources is often regulated by a dispatcher, also known as a load balancer. A fundamental problem consists in designing a load-balancing algorithm that minimizes ...
- research-articleApril 2020
Combining Size-Based Load Balancing with Round-Robin for Scalable Low Latency
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 31, Issue 4April 2020, Pages 886–896https://doi.org/10.1109/TPDS.2019.2950621When dispatching jobs to parallel servers, or queues, the highly scalable round-robin (RR) scheme reduces the variance of interarrival times at all queues to a great extent but has no impact on the variances of service processes. Contrariwise, size-...
- research-articleNovember 2019
Asymptotically Optimal Size-Interval Task Assignments
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 30, Issue 11Nov. 2019, Pages 2422–2433https://doi.org/10.1109/TPDS.2019.2920121Size-based routing provides robust strategies to improve the performance of computer and communication systems with highly variable workloads because it is able to isolate small jobs from large ones in a static manner. The basic idea is that each server ...
- articleDecember 2017
Asymptotically optimal open-loop load balancing
Queueing Systems: Theory and Applications (QSYS), Volume 87, Issue 3-4December 2017, Pages 245–267https://doi.org/10.1007/s11134-017-9547-9In many distributed computing systems, stochastically arriving jobs need to be assigned to servers with the objective of minimizing waiting times. Many existing dispatching algorithms are basically included in the SQ(d) framework: Upon arrival of a job, ...
- research-articleAugust 2017
The Economics of the Cloud
ACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS), Volume 2, Issue 4Article No.: 18, Pages 1–23https://doi.org/10.1145/3086574This article proposes a model to study the interaction of price competition and congestion in the cloud computing marketplace. Specifically, we propose a three-tier market model that captures a marketplace with users purchasing services from Software-as-...
- articleJanuary 2016
Decentralized Proportional Load Balancing
SIAM Journal on Applied Mathematics (SJAM), Volume 76, Issue 12016, Pages 391–410https://doi.org/10.1137/140969361Load balancing is a powerful technique commonly used in communication and computer networks to improve system performance, robustness and fairness. In this paper, we consider a general model capturing the performance of communication and computer ...
- research-articleNovember 2014
The economics of the cloud: price competition and congestion
ACM SIGecom Exchanges (SIGECOM), Volume 13, Issue 1June 2014, Pages 58–63https://doi.org/10.1145/2692375.2692380This letter provides an overview of our recent work studying the impacts of price competition and congestion in the cloud marketplace. Specifically, we discuss a three-tier market model that studies a vertical marketplace where users purchase services ...
- extended-abstractApril 2014
- articleJanuary 2014
Efficiency of simulation in monotone hyper-stable queueing networks
Queueing Systems: Theory and Applications (QSYS), Volume 76, Issue 1January 2014, Pages 51–72https://doi.org/10.1007/s11134-013-9357-7We consider Jackson queueing networks with finite buffer constraints (JQN) and analyze the efficiency of sampling from their stationary distribution. In the context of exact sampling, the monotonicity structure of JQNs ensures that such efficiency is of ...
- articleOctober 2013
Heavy-traffic revenue maximization in parallel multiclass queues
Performance Evaluation (PEVA), Volume 70, Issue 10October, 2013, Pages 806–821https://doi.org/10.1016/j.peva.2013.08.008Motivated by revenue maximization in server farms with admission control, we investigate the optimal scheduling in parallel processor-sharing queues. Incoming customers are distinguished in multiple classes and we define revenue as a weighted sum of ...
- articleAugust 2013
Closed Queueing Networks Under Congestion: Nonbottleneck Independence and Bottleneck Convergence
Mathematics of Operations Research (MOOR), Volume 38, Issue 308 2013, Pages 469–491https://doi.org/10.1287/moor.1120.0583We analyze the behavior of closed multiclass product-form queueing networks when the number of customers grows to infinity and remains proportionate on each route or class. First, we focus on the stationary behavior and prove the conjecture that the ...
- articleDecember 2011
The price of forgetting in parallel and non-observable queues
Performance Evaluation (PEVA), Volume 68, Issue 12December, 2011, Pages 1291–1311https://doi.org/10.1016/j.peva.2011.07.023We consider a broker-based network of non-observable parallel queues and analyze the minimum expected response time and the optimal routing policy when the broker has the memory of its previous routing decisions. We provide lower bounds on the minimum ...
- articleNovember 2011
Competition yields efficiency in load balancing games
Performance Evaluation (PEVA), Volume 68, Issue 11November, 2011, Pages 986–1001https://doi.org/10.1016/j.peva.2011.07.005We study a nonatomic congestion game with N parallel links, with each link under the control of a profit maximizing provider. Within this 'load balancing game', each provider has the freedom to set a price, or toll, for access to the link and seeks to ...
- articleNovember 2011
Energy-aware capacity scaling in virtualized environments with performance guarantees
Performance Evaluation (PEVA), Volume 68, Issue 11November, 2011, Pages 1207–1221https://doi.org/10.1016/j.peva.2011.07.004We investigate the trade-off between performance and power consumption in servers hosting virtual machines running IT services. The performance behavior of such servers is modeled through Generalized Processor Sharing (GPS) queues enhanced with a green ...