Optimizing CUDA code by kernel fusion: application on BLAS
Crossref DOI link: https://doi.org/10.1007/s11227-015-1483-z
Published Online: 2015-07-22
Published Print: 2015-10
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Filipovič, Jiří http://orcid.org/0000-0002-5703-9673
Madzin, Matúš
Fousek, Jan
Matyska, Luděk
Funding for this research was provided by:
Ministry of Education, Youth and Sports (CZ) (ED3.2.00/08.0144)
Ministry of Education, Youth and Sports (CZ) (CZ.1.07/2.3.00/30.0037)
Text and Data Mining valid from 2015-07-22