Cited By
View all- Liu JZhai YZhao GXu HFang JZeng ZZhu YChua TNgo CKa-Wei Lee RKumar RLauw H(2024)InArt: In-Network Aggregation with Route Selection for Accelerating Distributed TrainingProceedings of the ACM Web Conference 202410.1145/3589334.3645394(2879-2889)Online publication date: 13-May-2024
- Tairin SShen HIyer A(2024)Proactive, Accuracy-aware Straggler Mitigation in Machine Learning Clusters2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW63119.2024.00204(1196-1198)Online publication date: 27-May-2024