Cited By
View all- Xia CZhao JSun QWang ZWen YYu TFeng XCui HTsafrir DMUSUVATHI MGupta RAbu-Ghazaleh N(2024)Optimizing Deep Learning Inference via Global Analysis and Tensor ExpressionsProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3617232.3624858(286-301)Online publication date: 27-Apr-2024
- Zhang ZChen YHe BZhang Z(2023)NIOT: A Novel Inference Optimization of Transformers on Modern CPUsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.326953034:6(1982-1995)Online publication date: 1-Jun-2023
- Jeon BPark SLiao PXu SChen TJia ZKloeckner AMoreira J(2022)CollageProceedings of the International Conference on Parallel Architectures and Compilation Techniques10.1145/3559009.3569651(517-529)Online publication date: 8-Oct-2022
- Show More Cited By