Cited By
View all- Ahmad SGuan HSitaraman RMencagli GDazzi PLowenthal DBadia R(2024)Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy ScalingProceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing10.1145/3625549.3658688(267-280)Online publication date: 3-Jun-2024
- Singh MRachuri SCao BSharma ABhumireddy VBronzino FDas SGandhi AJain S(2024)OVIDA: Orchestrator for Video Analytics on Disaggregated Architecture2024 IEEE/ACM Symposium on Edge Computing (SEC)10.1109/SEC62691.2024.00019(135-148)Online publication date: 4-Dec-2024
- Beaumont ODavid JEyraud-Dubois LThibault S(2024)Exploiting Processor Heterogeneity to Improve Throughput and Reduce Latency for Deep Neural Network Inference2024 IEEE 36th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)10.1109/SBAC-PAD63648.2024.00012(37-48)Online publication date: 13-Nov-2024