Cited By
View all- Cui C(2024)Acceleration of Tensor-Product Operations with Tensor CoresACM Transactions on Parallel Computing10.1145/369546611:4(1-24)Online publication date: 9-Sep-2024
- Schieffer GDe Medeiros DFaj JMarathe APeng I(2024)On the Rise of AMD Matrix Cores: Performance, Power Efficiency, and Programmability2024 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)10.1109/ISPASS61541.2024.00022(132-143)Online publication date: 5-May-2024
- Valero-Lara PJorquera ILui FVetter J(2023)Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor CoresProceedings of the SC '23 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis10.1145/3624062.3624084(179-186)Online publication date: 12-Nov-2023
- Show More Cited By