Cited By
View all- Lee YYu YPark YDoerfert JGrosser TLeather HSadayappan P(2025)CUrator: An Efficient LLM Execution Engine with Optimized Integration of CUDA LibrariesProceedings of the 23rd ACM/IEEE International Symposium on Code Generation and Optimization10.1145/3696443.3708944(209-224)Online publication date: 1-Mar-2025
- Xie AHu YWang YLi ZGao YCheng Z(2025)GTA: Generating high-performance tensorized program with dual-task schedulingJournal of Systems Architecture10.1016/j.sysarc.2025.103359160(103359)Online publication date: Mar-2025
- Zhang CDong RWang HZhong RChen JZhai JBagchi SZhang Y(2024)MAGPYProceedings of the 2024 USENIX Conference on Usenix Annual Technical Conference10.5555/3691992.3692034(683-698)Online publication date: 10-Jul-2024
- Show More Cited By