Cited By
- Hong S, Kim Y, Nam J, Kim S (2024). On the Analysis of Inter-Relationship between Auto-Scaling Policy and QoS of FaaS Workloads. Sensors 24:12 (3774). doi:10.3390/s24123774. Online publication date: 10-Jun-2024.
- Trihinas D, Michael P, Symeonides M (2024). Evaluating DL Model Scaling Trade-Offs During Inference via an Empirical Benchmark Analysis. Future Internet 16:12 (468). doi:10.3390/fi16120468. Online publication date: 13-Dec-2024.
- Dai Y, Pan R, Iyer A, Li K, Netravali R, Witchel E, Arpaci-Dusseau A, Rossbach C, Keeton K (2024). Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving. Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles (607-623). doi:10.1145/3694715.3695963. Online publication date: 4-Nov-2024.