Cited By
View all- Chen RLu GWang YZhang RHu ZMiao YCai ZLeng JGuo M(2025)BAFT: bubble-aware fault-tolerant framework for distributed DNN training with hybrid parallelismFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-023-3401-519:1Online publication date: 1-Jan-2025
- Zhang KLiu XYang HFeng TYang XLiu YLuan ZQian D(2024)Jigsaw: Accelerating SpMM with Vector Sparsity on Sparse Tensor CoreProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673108(1124-1134)Online publication date: 12-Aug-2024
- Liu XZheng XYang HLuan ZQian DLee IChabbi MSteuwer M(2024)Tetris: Accelerating Sparse Convolution by Exploiting Memory Reuse on GPUProceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming10.1145/3627535.3638471(229-242)Online publication date: 2-Mar-2024
- Show More Cited By