Article

LSRB-CSR: A Low Overhead Storage Format for SpMV on the GPU Systems

Authors:

Lifeng Liu,

Meilin Liu,

Chongjun Wang,

Jun WangAuthors Info & Claims

ICPADS '15: Proceedings of the 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS)

Pages 733 - 741

https://doi.org/10.1109/ICPADS.2015.97

Published: 14 December 2015 Publication History

Abstract

Sparse matrix vector multiplication (SpMV) is a basic building block of many scientific applications. Several GPU accelerated SpMV algorithms for the CSR format suffer from workload unbalance for irregular matrices. In this paper, we propose a new auxiliary array assisted CSR format called local segmented reduction based CSR (LSRB-CSR), which enables synchronization free preprocessing and efficient SpMV algorithm with the light weight auxiliary arrays. It is efficient for both regular matrices and irregular matrices with tiny preprocessing overhead. We compare our LSRB-CSR based SpMV algorithm with the CSR-based SpMV from cuSPARSE, the SpMV algorithm based on segmented reduction adopted by CUDPP library, and the CSR5-based SpMV algorithm for both regular and irregular sparse matrices. Compared to cuSparse, our LSRB-CSR based SpMV algorithm could improve the performance by 26% on regular matrices and up to 4750% on irregular matrices. Compared to CUDPP, our LSRB-CSR based SpMV algorithm could improve the average SpMV performance by 210% on regular matrices and 250% on irregular matrices. Our LSRB-CSR based SpMV algorithm has comparable performance as the CSR5 based SpMV algorithm for regular matrices, and achieves better performance over the CSR5 based SpMV algorithm for irregular matrices. Experimental results show that the conversion overhead from the CSR to the LSRB-CSR is only 1/10 of the overhead from the CSR to the CSR5 on average.

Cited By

View all

Berger GDufrechou EEzzatti P(2023)Sparse Matrix-Vector Product for the bmSparse Matrix Format in GPUsEuro-Par 2023: Parallel Processing Workshops10.1007/978-3-031-50684-0_19(246-256)Online publication date: 28-Aug-2023
https://dl.acm.org/doi/10.1007/978-3-031-50684-0_19
Zhang JGruenwald L(2018)Regularizing irregularityProceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3210259.3210263(1-8)Online publication date: 10-Jun-2018
https://dl.acm.org/doi/10.1145/3210259.3210263

Recommendations

Optimizing partitioned CSR-based SpGEMM on the Sunway TaihuLight
Abstract
General sparse matrix-sparse matrix (SpGEMM) multiplication is one of the basic kernels in a great many applications. Several works focus on various optimizations for SpGEMM. To fully exploit the powerful computing capability of the Sunway ...
CSR: Core Surprise Removal in Commodity Operating Systems
ASPLOS '16

One of the adverse effects of shrinking transistor sizes is that processors have become increasingly prone to hardware faults. At the same time, the number of cores per die rises. Consequently, core failures can no longer be ruled out, and future ...
CSR: Core Surprise Removal in Commodity Operating Systems
ASPLOS'16

One of the adverse effects of shrinking transistor sizes is that processors have become increasingly prone to hardware faults. At the same time, the number of cores per die rises. Consequently, core failures can no longer be ruled out, and future ...

Comments

Information & Contributors

Information

Published In

ICPADS '15: Proceedings of the 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS)

December 2015

857 pages

ISBN:9780769557854

Publisher

IEEE Computer Society

United States

Publication History

Published: 14 December 2015

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 18 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Berger GDufrechou EEzzatti P(2023)Sparse Matrix-Vector Product for the bmSparse Matrix Format in GPUsEuro-Par 2023: Parallel Processing Workshops10.1007/978-3-031-50684-0_19(246-256)Online publication date: 28-Aug-2023
https://dl.acm.org/doi/10.1007/978-3-031-50684-0_19
Zhang JGruenwald L(2018)Regularizing irregularityProceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3210259.3210263(1-8)Online publication date: 10-Jun-2018
https://dl.acm.org/doi/10.1145/3210259.3210263

Abstract

Cited By

Recommendations

Optimizing partitioned CSR-based SpGEMM on the Sunway TaihuLight

CSR: Core Surprise Removal in Commodity Operating Systems

CSR: Core Surprise Removal in Commodity Operating Systems

Comments

Information

Published In

Publisher

Publication History

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Share

Share this Publication link

Share on social media

Affiliations