Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1198555.1198795acmconferencesArticle/Chapter ViewAbstractPublication PagessiggraphConference Proceedingsconference-collections
Article

Linear algebra operators for GPU implementation of numerical algorithms

Published: 31 July 2005 Publication History

Abstract

In this work, the emphasis is on the development of strategies to realize techniques of numerical computing on the graphics chip. In particular, the focus is on the acceleration of techniques for solving sets of algebraic equations as they occur in numerical simulation. We introduce a framework for the implementation of linear algebra operators on programmable graphics processors (GPUs), thus providing the building blocks for the design of more complex numerical algorithms. In particular, we propose a stream model for arithmetic operations on vectors and matrices that exploits the intrinsic parallelism and efficient communication on modern GPUs. Besides performance gains due to improved numerical computations, graphics algorithms benefit from this model in that the transfer of computation results to the graphics processor for display is avoided. We demonstrate the effectiveness of our approach by implementing direct solvers for sparse matrices, and by applying these solvers to multi-dimensional finite difference equations, i.e. the 2D wave equation and the incompressible Navier-Stokes equations.

References

[1]
Anderson, E., Bai, Z., Bischof, C., Blackford, S., Demmel, J., Dongarra, J., Du Croz, J., Greenbaum, A., Hammarling, S., McKenney, A., and Sorensen, D. 1999. LAPACK Users' Guide, third ed. Society for Industrial and Applied Mathematics, Philadelphia, PA.
[2]
ATI, 2003. Sample effects on the ATI graphics cards. http://www.ati.com/developer/techpapers.html.
[3]
Baraff, D., and Witkin, A. 1998. Large steps in cloth simulation. Computer Graphics SIGGRAPH 98 Proceedings, 43--54.
[4]
Bolz, J., Farmer, I., Grinspun, E., and Schroeder, P. 2003. Sparse matrix solvers on the GPU: Conjugate gradients and multigrid. Computer Graphics SIGGRAPH 03 Proceedings.
[5]
Chen, J., and Da Vitoria Lobo, N. 1995. Towards interactive-rate simulation of fluids with moving obstacles using Navier-Stokes equations. Graphical Models and Image Processing 57, 2.
[6]
Curtis, C., Anderson, S., Seims, J., Fleischer, F., and Salesin, D. 1997. Computer-generated watercolor. Computer Graphics SIGGRAPH 97 Proceedings.
[7]
Debunne, G., Desbrun, M., M.-P., C., and Barr, A. 2001. Dynamic real-time deformations using space and time adaptive sampling. In Computer Graphics SIGGRAPH 01 Proceedings.
[8]
Desbrun, M., Meyer, M., Schroeder, P., and Barr, A. 1999. Implicit fairing of irregular meshes using diffusion and curvature flow. In Computer Graphics SIGGRAPH 99 Proceedings, 317--324.
[9]
Dongarra, J., Du Croz, J., Hammarling, S., and Hanson, R. 1988. An extended set of FORTRAN basic linear algebra subprograms. ACM Transactions on Mathematical Software 14, 1--17.
[10]
Dongarra, J., Du Croz, J., Hammarling, S., and Hanson, R. 1990. A set of level 3 basic linear algebra subprograms,. ACM Transactions on Mathematical Software 16, 1--17.
[11]
Elder, G. 2002. Radeon 9700. In Proceedings Eurographics/SIGGRAPH Workshop on Graphics Hardware 2002.
[12]
Fedkiw, R., Stam, J., and Jensen, H. 2001. Visual simulation of smoke. Computer Graphics SIGGRAPH 01 Proceedings, 15--22.
[13]
Foster, N., and Fedkiw, R. 2001. Practical animation of liquids. Computer Graphics SIGGRAPH 01 Proceedings, 23--30.
[14]
Foster, N., and Metaxas, D. 1996. Realistic animation of liquids. Graphical Models and Image Processing 58, 5, 471--483.
[15]
Harris, M., Coombe, G., Scheuermann, T., and Lastra, A. 2002. Physically-based visual simulation on graphics hardware. In Proceedings Eurographics/SIGGRAPH Workshop on Graphics Hardware 2002.
[16]
Hart, J. 2001. Perlin noise pixel shaders. In Proceedings Eurographics/SIGGRAPH Workshop on Graphics Hardware 2001.
[17]
Heidrich, W., Westermann, R., Seidel, H.-P., and Ertl, T. 1999. Applications of pixel textures in visualization and realistic image synthesis. In ACM Symposium on Interactive 3D Graphics, 110--119.
[18]
Hillesland, K., Molinov, S., and Grzeszczuk, R. 2003. Nonlinear Optimization Framework for Image-Based Modelling on Programmable Graphics Hardware. Computer Graphics SIGGRAPH 03 Proceedings.
[19]
Hopf, M., and Ertl, T. 1999. Accelerating 3D convolution using graphics hardware. In Proceedings IEEE Visualization'99, 471--474.
[20]
Hopf, M., and Ertl, T. 2000. Hardware accelerated wavelet transformations. In Proceedings EG/IEEE TCVG Symposium on Visualization VisSym '00, 93--103.
[21]
Jobard, B., Erlebacher, G., and Hussaini, Y. 2000. Lagrangian-Eulerian advection of noise and dye textures for unsteady flow visualization. In Proceedings IEEE Visualization'00, 110--118.
[22]
Kaas, M., and Miller, G. 1990. Rapid, stable fluid dynamics for computer graphics. Computer Graphics SIGGRAPH 90 Proceedings, 49--57.
[23]
Larsen, E. S., and McAllister, D. 2001. Fast matrix multiplies using graphics hardware. In Proceedings Supercomputing 2001.
[24]
Lindholm, E., Kilgard, M., and Moreton, H. 2001. A user-programmable vertex engine. Computer Graphics SIGGRAPH 01 Proceedings.
[25]
Microsoft, 2002. DirectX9 SDK. http://www.microsoft.com/DirectX.
[26]
Montrym, J., and Moreton, H. 2002. GeForce4. In Proceedings Eurographics/SIGGRAPH Workshop on Graphics Hardware 2002.
[27]
NVidia, 2002. nvidia OpenGL game of life. http://www.nvidia.com/view.asp?IO=ogl-gameoflife.
[28]
NVidia, 2003. Sample effects on the nVIDIA graphics cards. http://developer.nvidia.com/view.asp?PAGE=papers.
[29]
Olano, M., and Lastra, A. 1998. A shading-language on graphics hardware. Computer Graphics SIGGRAPH 98 Proceedings, 159--168.
[30]
Press, W., Teukolsky, S., Vetterling, W., and Flannery, B. 2002. Numerical Recipes in C++: The Art of Scientific Computing. Cambridge University Press.
[31]
Purcell, T., Buck, I., Mark, W., and Hanrahan, P. 2002. Ray tracing on programmable graphics hardware. Computer Graphics SIGGRAPH 98 Proceedings, 703--712.
[32]
Stam, J. 1999. Stable fluids. Computer Graphics SIGGRAPH 99 Proceedings, 121--128.
[33]
Strzodka, R., and Rumpf, M. 2001. Nonlinear diffusion in graphics hardware. In Proceedings EG/IEEE TCVG Symposium on Visualization 2001, 75--84.
[34]
Strzodka, R., and Rumpf, M. 2001. Using graphics cards for quantized FEM computations. In Proceedings VIIP 2001, 98--107.
[35]
Thompson, C., Hahn, S., and Oskin, M. 2002. Using modern graphics architectures for general-purpose computing: A framework and analysis. Proceedings of 35th International Symposium on Microarchitecture (MICRO-35).
[36]
Weiskopf, D., Hopf, M., and Ertl, T. 2001. Hardware-accelerated visualization of time-varying 2D and 3D vector fields by texture advection via programmable per-pixel operations. In Proceedings Workshop on Vision, Modeling, and Visualization VMV'01, 439--446.
[37]
Weiskopf, D., Hopf, M., and Ertl, T. 2002. Hardware-accelerated Lagrangian-Eulerian texture advection for 2D flow visualization. In Proceedings Workshop on Vision, Modeling, and Visualization VMV '02.

Cited By

View all
  • (2024)A Novel Method for Regional Prospecting Based on Modern 3D GraphicsMinerals10.3390/min1404035414:4(354)Online publication date: 28-Mar-2024
  • (2024)NetCL: A Unified Programming Framework for In-Network ComputingSC24: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC41406.2024.00051(1-20)Online publication date: 17-Nov-2024
  • (2024)An Investigation into Fault Detection and Correction in GPU Pipelines with a Hybrid XOR Approach2024 IEEE 15th Latin America Symposium on Circuits and Systems (LASCAS)10.1109/LASCAS60203.2024.10506130(1-5)Online publication date: 27-Feb-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGGRAPH '05: ACM SIGGRAPH 2005 Courses
July 2005
7157 pages
ISBN:9781450378338
DOI:10.1145/1198555
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 July 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graphics hardware
  2. numerical simulation

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 1,822 of 8,601 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)68
  • Downloads (Last 6 weeks)5
Reflects downloads up to 25 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Novel Method for Regional Prospecting Based on Modern 3D GraphicsMinerals10.3390/min1404035414:4(354)Online publication date: 28-Mar-2024
  • (2024)NetCL: A Unified Programming Framework for In-Network ComputingSC24: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC41406.2024.00051(1-20)Online publication date: 17-Nov-2024
  • (2024)An Investigation into Fault Detection and Correction in GPU Pipelines with a Hybrid XOR Approach2024 IEEE 15th Latin America Symposium on Circuits and Systems (LASCAS)10.1109/LASCAS60203.2024.10506130(1-5)Online publication date: 27-Feb-2024
  • (2024)An Investigation into Fault Detection and Correction in GPU Pipelines with a Hybrid XOR Approach2024 IEEE 15th Latin America Symposium on Circuits and Systems (LASCAS)10.1109/LASCAS60203.2024.10506122(1-5)Online publication date: 27-Feb-2024
  • (2024)GPU-Accelerated Flow Simulations on Unstructured Grids Using a Multi-colored Gauss-Seidel Method2023 Asia-Pacific International Symposium on Aerospace Technology (APISAT 2023) Proceedings10.1007/978-981-97-3998-1_56(657-671)Online publication date: 2-Jul-2024
  • (2024)Accelerator CardsParallel C++10.1007/978-3-031-54369-2_17(185-186)Online publication date: 3-Feb-2024
  • (2023)Evaluating an XOR-based Hybrid Fault Tolerance Technique to Detect Faults in GPU Pipelines2023 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)10.1109/ISVLSI59464.2023.10238657(1-6)Online publication date: 20-Jun-2023
  • (2022)Accelerated 2-D Real-Time Refraction-Corrected Transcranial Ultrasound ImagingIEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control10.1109/TUFFC.2022.318960069:9(2599-2610)Online publication date: Sep-2022
  • (2021)Efficient multi-GPU shared memory via automatic optimization of fine-grained transfersProceedings of the 48th Annual International Symposium on Computer Architecture10.1109/ISCA52012.2021.00020(139-152)Online publication date: 14-Jun-2021
  • (2021)A high-performance batched matrix multiplication framework for GPUs under unbalanced input distributionThe Journal of Supercomputing10.1007/s11227-021-03936-9Online publication date: 21-Jun-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media