article

Cache-Oblivious Sparse Matrix-Vector Multiplication by Using Sparse Matrix Partitioning Methods

Authors:

A. N. Yzelman,

Rob H. BisselingAuthors Info & Claims

SIAM Journal on Scientific Computing, Volume 31, Issue 4

Pages 3128 - 3154

https://doi.org/10.1137/080733243

Published: 01 July 2009 Publication History

Abstract

In this article, we introduce a cache-oblivious method for sparse matrix-vector multiplication. Our method attempts to permute the rows and columns of the input matrix using a recursive hypergraph-based sparse matrix partitioning scheme so that the resulting matrix induces cache-friendly behavior during sparse matrix-vector multiplication. Matrices are assumed to be stored in row-major format, by means of the compressed row storage (CRS) or its variants incremental CRS and zig-zag CRS. The zig-zag CRS data structure is shown to fit well with the hypergraph metric used in partitioning sparse matrices for the purpose of parallel computation. The separated block-diagonal (SBD) form is shown to be the appropriate matrix structure for cache enhancement. We have implemented a run-time cache simulation library enabling us to analyze cache behavior for arbitrary matrices and arbitrary cache properties during matrix-vector multiplication within a $k$-way set-associative idealized cache model. The results of these simulations are then verified by actual experiments run on various cache architectures. In all these experiments, we use the Mondriaan sparse matrix partitioner in one-dimensional mode. The savings in computation time achieved by our matrix reorderings reach up to 50 percent, in the case of a large link matrix.

Cited By

View all

Lu YLiu WMohror KArnold DBadia R(2023)DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector MultiplicationProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607051(1-14)Online publication date: 12-Nov-2023
https://dl.acm.org/doi/10.1145/3581784.3607051
Trotter JEkmekçibaşı SLangguth JTorun TDüzakın EIlic AUnat DMohror KArnold DBadia R(2023)Bringing Order to Sparsity: A Sparse Matrix Reordering Study on Multicore CPUsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607046(1-13)Online publication date: 12-Nov-2023
https://dl.acm.org/doi/10.1145/3581784.3607046
Helal ALaukemann JChecconi FTithi JRanadive TPetrini FChoi JZhou HMoreira JMueller FEtsion Y(2021)ALTOProceedings of the 35th ACM International Conference on Supercomputing10.1145/3447818.3461703(404-416)Online publication date: 3-Jun-2021
https://dl.acm.org/doi/10.1145/3447818.3461703
Show More Cited By

Recommendations

Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
SPAA '09: Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures

This paper introduces a storage format for sparse matrices, called compressed sparse blocks (CSB), which allows both Ax and A,x to be computed efficiently in parallel, where A is an n×n sparse matrix with nnzen nonzeros and x is a dense n-vector. Our ...
Two-dimensional cache-oblivious sparse matrix-vector multiplication

In earlier work, we presented a one-dimensional cache-oblivious sparse matrix-vector (SpMV) multiplication scheme which has its roots in one-dimensional sparse matrix partitioning. Partitioning is often used in distributed-memory parallel computing for ...
Hypergraph Partitioning Based Models and Methods for Exploiting Cache Locality in Sparse Matrix-Vector Multiplication

Sparse matrix-vector multiplication (SpMxV) is a kernel operation widely used in iterative linear solvers. The same sparse matrix is multiplied by a dense vector repeatedly in these solvers. Matrices with irregular sparsity patterns make it difficult to ...

Comments

Information & Contributors

Information

Published In

cover image SIAM Journal on Scientific Computing

SIAM Journal on Scientific Computing Volume 31, Issue 4

June 2009

797 pages

ISSN:1064-8275

Issue’s Table of Contents

Publisher

Society for Industrial and Applied Mathematics

United States

Publication History

Published: 01 July 2009

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

17
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 27 Jul 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Lu YLiu WMohror KArnold DBadia R(2023)DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector MultiplicationProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607051(1-14)Online publication date: 12-Nov-2023
https://dl.acm.org/doi/10.1145/3581784.3607051
Trotter JEkmekçibaşı SLangguth JTorun TDüzakın EIlic AUnat DMohror KArnold DBadia R(2023)Bringing Order to Sparsity: A Sparse Matrix Reordering Study on Multicore CPUsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607046(1-13)Online publication date: 12-Nov-2023
https://dl.acm.org/doi/10.1145/3581784.3607046
Helal ALaukemann JChecconi FTithi JRanadive TPetrini FChoi JZhou HMoreira JMueller FEtsion Y(2021)ALTOProceedings of the 35th ACM International Conference on Supercomputing10.1145/3447818.3461703(404-416)Online publication date: 3-Jun-2021
https://dl.acm.org/doi/10.1145/3447818.3461703
Mouna MBellatreche LBoustia NDesai BCho W(2020)HYRAQProceedings of the 24th Symposium on International Database Engineering & Applications10.1145/3410566.3410582(1-10)Online publication date: 12-Aug-2020
https://dl.acm.org/doi/10.1145/3410566.3410582
Alappat CBasermann ABishop AFehske HHager GSchenk OThies JWellein G(2020)A Recursive Algebraic Coloring Technique for Hardware-efficient Symmetric Sparse Matrix-vector MultiplicationACM Transactions on Parallel Computing10.1145/33997327:3(1-37)Online publication date: 29-Jun-2020
https://dl.acm.org/doi/10.1145/3399732
Jiang PHong CAgrawal GGupta RShen X(2020)A novel data transformation and execution strategy for accelerating sparse matrix multiplication on GPUsProceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming10.1145/3332466.3374546(376-388)Online publication date: 19-Feb-2020
https://dl.acm.org/doi/10.1145/3332466.3374546
Hong CSukumaran-Rajam ANisa ISingh KSadayappan PHollingsworth JKeidar I(2019)Adaptive sparse tiling for sparse matrix multiplicationProceedings of the 24th Symposium on Principles and Practice of Parallel Programming10.1145/3293883.3295712(300-314)Online publication date: 16-Feb-2019
https://dl.acm.org/doi/10.1145/3293883.3295712
Abubaker NAkbudak KAykanat C(2019)Spatiotemporal Graph and Hypergraph Partitioning Models for Sparse Matrix-Vector Multiplication on Many-Core ArchitecturesIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2018.286472930:2(445-458)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1109/TPDS.2018.2864729
Sicheng Li Wang YWujie Wen Wang YYiran Chen Hai Li (2016)A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)10.1145/2966986.2966987(1-6)Online publication date: 7-Nov-2016
https://dl.acm.org/doi/10.1145/2966986.2966987
Buono DPetrini FChecconi FLiu XQue XLong CTuan T(2016)Optimizing Sparse Matrix-Vector Multiplication for Large-Scale Data AnalyticsProceedings of the 2016 International Conference on Supercomputing10.1145/2925426.2926278(1-12)Online publication date: 1-Jun-2016
https://dl.acm.org/doi/10.1145/2925426.2926278
Show More Cited By

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

Cited By

Recommendations

Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks

Two-dimensional cache-oblivious sparse matrix-vector multiplication

Hypergraph Partitioning Based Models and Methods for Exploiting Cache Locality in Sparse Matrix-Vector Multiplication

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations