Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/ICPP.2011.41guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A Scalable Tridiagonal Solver for GPUs

Published: 13 September 2011 Publication History

Abstract

We present the design and evaluation of a scalable tridiagonal solver targeted for GPU architectures. We observed that two distinct steps are required to solve a large tridiagonal system in parallel: 1) breaking down a problem into multiple sub problems each of which is independent of other, and 2) solving the sub problems using an efficient algorithm. We propose a hybrid method of tiled parallel cyclic reduction(tiled PCR) and thread-level parallel Thomas algorithm(p-Thomas). Algorithm transition from tiled PCR to p-Thomas is determined by input system size and hardware capability in order to achieve optimal performance. The proposed method is scalable as it can cope with various input system sizes by properly adjusting algorithmtrasition point. Our method on a NVidia GTX480 shows up to 8.3x and 49x speedups over multithreaded and sequential MKL implementations on a 3.33GHz Intel i7 975 in double precision, respectively.

Cited By

View all
  • (2023)Tridigpu: A GPU Library for Block Tridiagonal and Banded Linear Equation SystemsACM Transactions on Parallel Computing10.1145/358037310:1(1-33)Online publication date: 29-Mar-2023
  • (2022)A new GPU-based corrected explicit-implicit domain decomposition scheme for convection-dominated diffusion problemsComputers & Mathematics with Applications10.1016/j.camwa.2022.08.015123:C(184-203)Online publication date: 1-Oct-2022
  • (2021)Tridiagonal GPU Solver with Scaled Partial Pivoting at Maximum BandwidthProceedings of the 50th International Conference on Parallel Processing10.1145/3472456.3472484(1-10)Online publication date: 9-Aug-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICPP '11: Proceedings of the 2011 International Conference on Parallel Processing
September 2011
796 pages
ISBN:9780769545103

Publisher

IEEE Computer Society

United States

Publication History

Published: 13 September 2011

Author Tags

  1. GPGPU
  2. GPU Computing
  3. Tridiagonal solver
  4. Tridiagonal systems

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Tridigpu: A GPU Library for Block Tridiagonal and Banded Linear Equation SystemsACM Transactions on Parallel Computing10.1145/358037310:1(1-33)Online publication date: 29-Mar-2023
  • (2022)A new GPU-based corrected explicit-implicit domain decomposition scheme for convection-dominated diffusion problemsComputers & Mathematics with Applications10.1016/j.camwa.2022.08.015123:C(184-203)Online publication date: 1-Oct-2022
  • (2021)Tridiagonal GPU Solver with Scaled Partial Pivoting at Maximum BandwidthProceedings of the 50th International Conference on Parallel Processing10.1145/3472456.3472484(1-10)Online publication date: 9-Aug-2021
  • (2019)Accelerating reduction and scan using tensor core unitsProceedings of the ACM International Conference on Supercomputing10.1145/3330345.3331057(46-57)Online publication date: 26-Jun-2019
  • (2019)Tree Partitioning ReductionACM Transactions on Mathematical Software10.1145/332873145:3(1-26)Online publication date: 8-Aug-2019
  • (2019)Parallel prefix operations on GPUThe Journal of Supercomputing10.1007/s11227-018-2676-z75:3(1510-1523)Online publication date: 1-Mar-2019
  • (2017)Research on tridiagonal matrix solver design based on a combination of processorsComputers and Electrical Engineering10.1016/j.compeleceng.2017.07.01462:C(1-16)Online publication date: 1-Aug-2017
  • (2016)FinParACM Transactions on Architecture and Code Optimization10.1145/289835413:2(1-27)Online publication date: 27-Jun-2016
  • (2014)A hierarchical tridiagonal system solver for heterogenous supercomputersProceedings of the 5th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems10.1109/ScalA.2014.12(69-76)Online publication date: 16-Nov-2014
  • (2013)Efficient heterogeneous execution on large multicore and accelerator platformsJournal of Parallel and Distributed Computing10.1016/j.jpdc.2013.07.01273:12(1578-1591)Online publication date: 1-Dec-2013
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media