Cited By
- Liu W, Li M, Tan G, Jia W (2025). Mario: Near Zero-cost Activation Checkpointing in Pipeline Parallelism. Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, pp. 197-211. DOI: 10.1145/3710848.3710878. Online publication date: 28-Feb-2025
- Zhao P, Zhang H, Fu F, Nie X, Liu Q, Yang F, Peng Y, Jiao D, Li S, Xue J, Tao Y, Cui B (2025). MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training. Proceedings of the ACM on Management of Data, 3:1, pp. 1-28. DOI: 10.1145/3709703. Online publication date: 11-Feb-2025
- Alabed S, Belov D, Chrzaszcz B, Franco J, Grewe D, Maclaurin D, Molloy J, Natan T, Norman T, Pan X, Paszke A, Rink N, Schaarschmidt M, Sitdikov T, Swietlik A, Vytiniotis D, Wee J, Eeckhout L, Smaragdakis G, Liang K, Sampson A, Kim M, Rossbach C (2025). PartIR: Composing SPMD Partitioning Strategies for Machine Learning. Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, pp. 794-810. DOI: 10.1145/3669940.3707284. Online publication date: 30-Mar-2025