Cited By
View all- Gu TFei JCanini M(2024)OmNICCL: Zero-cost Sparse AllReduce with Direct Cache Access and SmartNICsProceedings of the 2024 SIGCOMM Workshop on Networks for AI Computing10.1145/3672198.3673804(75-83)Online publication date: 4-Aug-2024
- Ming ZHu YZhou WZheng XYao CFeng DMencagli GDazzi PLowenthal DBadia R(2024)ADTopk: All-Dimension Top-k Compression for High-Performance Data-Parallel DNN TrainingProceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing10.1145/3625549.3658678(135-147)Online publication date: 3-Jun-2024
- Lim JKwon YHwang RMaeng KSuh ERhu MTsafrir DMusuvathi MGupta RAbu-Ghazaleh N(2024)LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation ModelsProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3620665.3640384(616-630)Online publication date: 27-Apr-2024
- Show More Cited By