Cited By
- Wu J, Wang L, Jin Q, Liu F (2024). Graft: Efficient Inference Serving for Hybrid Deep Learning With SLO Guarantees via DNN Re-Alignment. IEEE Transactions on Parallel and Distributed Systems 35:2 (280-296). DOI: 10.1109/TPDS.2023.3340518. Online publication date: 1-Feb-2024.
- Li W, Hacid H, Almazrouei E, Debbah M (2023). A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques. AI 4:3 (729-786). DOI: 10.3390/ai4030039. Online publication date: 13-Sep-2023.
- Jiang S, Huang T, Yu B, Ho T (2023). SNICIT: Accelerating Sparse Neural Network Inference via Compression at Inference Time on GPU. Proceedings of the 52nd International Conference on Parallel Processing (51-61). DOI: 10.1145/3605573.3605625. Online publication date: 13-Sep-2023.