Cited By
- Zhong R, Jin Y, Zhang C, Lei K, Li S, Zhai J (2025). FlashTensor: Optimizing Tensor Programs by Leveraging Fine-grained Tensor Property. Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, pp. 183-196. doi:10.1145/3710848.3710864. Online publication date: 28-Feb-2025.
- Canesche M, do Rosario V, Borin E, Quintão Pereira F, Kluss D, Achour S, Palsberg J (2025). Fusion of Operators of Computational Graphs via Greedy Clustering: The XNNC Experience. Proceedings of the 34th ACM SIGPLAN International Conference on Compiler Construction, pp. 117-127. doi:10.1145/3708493.3712689. Online publication date: 25-Feb-2025.
- Ma Z, Wang H, Xing J, Huang S, Zheng L, Zhang C, Cao H, Huang K, Zhai M, Tang S, Wang P, Zhai J, Doerfert J, Grosser T, Leather H, Sadayappan P (2025). IntelliGen: Instruction-Level Auto-tuning for Tensor Program with Monotonic Memory Optimization. Proceedings of the 23rd ACM/IEEE International Symposium on Code Generation and Optimization, pp. 107-122. doi:10.1145/3696443.3708967. Online publication date: 1-Mar-2025.