Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
×
This paper proposes an interference-aware scheduler for heterogeneous inference serving systems, reducing the latency degradation from co-location interference.
Apr 26, 2021 · Our preliminary results show that our interference-aware scheduler achieves 2× lower latency degradation than a commonly used least-loaded ...
People also ask
Contention-aware datacenter scheduling: Sharing system resources to increase utilization results in interference, and performance degradation [15,18,41,46], and ...
The evaluation results show that TRACON can achieve up to 50% improvement on application runtime, and up to 80% on I/O throughput for data-intensive ...
Jan 5, 2024 · Our experimental results show that our proposed Interference-aware Intelligent Scheduling (IAIS) method can achieve up to 39% and 70% throughput ...
Interference-aware scheduling determines which links should transmit at which time slots so that all packets transmitted by the scheduled.
In this work, we present IADA, a full-fledged dynamic interference-aware cloud scheduling architecture for latency-sensitive workloads.
We then present a model of interference, which can be used for more effective application scheduling, as demonstrated by real-world experiments. 1. Introduction.
At the core of our interference-aware scheduling is to understand the compute resource requirement prior to job execution, and perform job placement with ...
In this paper we empirically evaluate the interference rate of communication on computation via measurements on a single processor communicating on a ...