Cited By
- Carvalho M, Simitsis A, Queralt A, Romero O (2024). Workload Placement on Heterogeneous CPU-GPU Systems. Proceedings of the VLDB Endowment 17:12, 4241-4244. doi:10.14778/3685800.3685845. Online publication date: 8-Nov-2024.
- Maurya A, Ye J, Rafique M, Cappello F, Nicolae B, Schiavoni V, Edinger J, Cao J, Jin Z (2024). Deep Optimizer States: Towards Scalable Training of Transformer Models using Interleaved Offloading. Proceedings of the 25th International Middleware Conference, 404-416. doi:10.1145/3652892.3700781. Online publication date: 2-Dec-2024.
- Wei Y, Du J, Jiang J, Shi X, Zhang X, Huang D, Xiao N, Lu Y (2024). APTMoE: Affinity-Aware Pipeline Tuning for MoE Models on Bandwidth-Constrained GPU Nodes. Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, 1-14. doi:10.1109/SC41406.2024.00096. Online publication date: 17-Nov-2024.