Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A GPU-based framework for finite element analysis of elastoplastic problems

Published: 05 March 2023 Publication History

Abstract

Elastoplasticity is observed in a wide range of materials like metals that have real-world applications. The design and optimization process of such materials depends strongly on elastoplastic analysis for the prediction of displacement and stress. However, elastoplastic simulation is computationally expensive and often requires the use of parallel computers in real-world applications like crashworthiness and metal forming. This paper presents a novel parallel framework for finite element analysis of elastoplastic problems on massively parallel Graphics Processing Units (GPUs) architecture. We propose GPU-based parallel algorithms for all expensive steps in elastoplastic analysis, namely the computation of elemental matrices and their assembly, the computation of stress using the well-known radial-return method and the computation of internal force vectors and their assembly. Since GPUs have limited memory, assembly is done directly into a sparse storage format that can be seamlessly integrated with a GPU-based linear solver. The proposed algorithms are optimized for efficient memory access and fine-grain parallelism and prefer computation over data storage and reuse. In the proposed framework, all the computations are performed on the GPU and expensive data transfers to the CPU are avoided to achieve the best performance. Numerical experiments are conducted over three benchmark examples in three dimensions (3D) considering 8-noded hexahedral elements to demonstrate the performance of the proposed framework. The comparison of execution timings with sequential CPU implementation reveals speedups in the range 20.4×–69.7× for computation of elemental matrices and assembly, 47.2×–66.1× for computation of stress using radial-return method, 53.7×–67.3× for computation of internal force vectors and their assembly. A comparison of wall-clock timings shows 1.4× to 7.2× speedup by the proposed GPU implementation. The proposed framework is able to solve up to 5.1 million degrees of freedom (DOFs) elastoplasticity problem on a single GPU.

References

[1]
Valberg HS Applied Metal Forming: Including FEM Analysis 2010 New York Cambridge University Press
[2]
Xia L, Shih CF, and Hutchinson JW A computational approach to ductile crack growth under large scale yielding conditions J Mech Phys Solids 1995 43 3 389-413
[3]
Gautam SS and Dixit PM Numerical simulation of ductile fracture in cylindrical tube impacted against a rigid surface Int J Damage Mech 2012 21 3 341-371
[4]
Deng D, Murakawa H, and Liang W Numerical simulation of welding distortion in large structures Comput Method Appl Mech Eng 2007 196 45 4613-4627
[5]
Jones N Structural Impact 2011 2 New York Cambridge University Press
[6]
Hong Y, Wang L, Zhang J, and Gao Z 3D elastoplastic model for fine-grained gassy soil considering the gas-dependent yield surface shape and stress-dilatancy J Eng Mech 2020 146 5 04020037
[7]
de Souza Neto EA, Perić D, and Owen DRJ Computational methods for plasticity: theory and applications 2008 John Wiley & Sons Ltd
[8]
Simo JC and Hughes TJR Computational inelasticity 1998 New York Springer
[9]
Dunne F and Petrinic N Introduction to computational plasticity 2005 Oxford Oxford University Press
[10]
Kim N-H Introduction to nonlinear finite element analysis 2015 New York Springer
[11]
Vi F, Mocellin K, Digonnet H, Perchat E, and Fourment L Hybrid parallel multigrid preconditioner based on automatic mesh coarsening for 3D metal forming simulations Int J Numer Method Eng 2018 114 6 598-618
[12]
Bhardwaj M, Pierson K, Reese G, Walsh T, Day D, Alvin K, Peery J, Farhat C, Lesoinne M (2002) Salinas: A scalable software for high-performance structural and solid mechanics simulations. In: SC’02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, pp. 35–35. IEEE
[13]
Yusa Y, Okada H, Yamada T, and Yoshimura S Scalable parallel elastic - plastic finite element analysis using a quasi-Newton method with a balancing domain decomposition preconditioner Comput Mech 2018 62 6 1563-1581
[14]
Balay S, Abhyankar S, Adams MF, Benson S, Brown J, Brune P, Buschelman K, Constantinescu EM, Dalcin L, Dener A, Eijkhout V, Gropp WD, Hapla V, Isaac T, Jolivet P, Karpeev D, Kaushik D, Knepley MG, Kong F, Kruger S, May DA, McInnes LC, Mills RT, Mitchell L, Munson T, Roman JE, Rupp K, Sanan P, Sarich J, Smith BF, Zampini S, Zhang H, Zhang H, Zhang J (2021) PETSc Web page. https://petsc.org/
[15]
Saad Y Iterative methods for sparse linear systems 2003 2 Philadelphia, USA Society for Industrial and Applied Mathematics
[16]
Czarnul P, Proficz J, and Drypczewski K Survey of methodologies, approaches, and challenges in parallel programming using high-performance computing systems Sci Program 2020
[17]
Ding K, Qin Q-H, Cardew-Hall M, and Kalyanasundaram S Efficient parallel algorithms for elastic-plastic finite element analysis Comput Mech 2008 41 4 563-578
[18]
Markopoulos A, Hapla V, Cermak M, and Fusek M Massively parallel solution of elastoplasticity problems with tens of millions of unknowns using Permoncube and FLLOP packages Appl Math Comput 2015 267 698-710
[19]
Irina D, Matsuoka S, Toshio E (2011) GPU-based approach for elastic-plastic deformation simulations. Technical Report 12, Information Processing Society of Japan (IPSJ)
[20]
Khalevitsky YV, Burmasheva NV, Konovalov AV, and Partin AS Comparative study of Krylov subspace method implementations for a GPU cluster in elastoplastic problems AIP Conf Proc 2016 1785 1
[21]
He G, Wang H, Huang G, Liu H, and Li G A parallel elastoplastic reanalysis based on GPU platform Int J Comput Method 2017 14 05 1750051
[22]
Prabhune BC and Suresh K A fast matrix-free elasto-plastic solver for predicting residual stresses in additive manufacturing Comput-Aided Des 2020 123
[23]
Wyser E, Alkhimenkov Y, Jaboyedoff M, and Podladchikov YY An explicit GPU-based material point method solver for elastoplastic problems (ep2-3de v1.0) Geosci Model Dev 2021 14 12 7749-7774
[24]
Macioł P, Płaszewski P, and Banaś K 3D finite element numerical integration on GPUs Procedia Comput Sci 2010 1 1 1093-1100
[25]
Sanfui S, Sharma D (2017) A two-kernel based strategy for performing assembly in FEA on the graphics processing unit. In: Advances in Mechanical, Industrial, Automation and Management Systems (AMIAMS), 2017 International Conference On, pp. 1–9. IEEE
[26]
Kiran U, Sharma D, and Gautam SS GPU-warp based finite element matrices generation and assembly using coloring method J Comput Des Eng 2018 6 4 705-718
[27]
Sanfui S and Sharma D A three-stage graphics processing unit-based finite element analyses matrix generation strategy for unstructured meshes Int J Numer Method Eng 2020 121 17 3824-3848
[28]
Cecka C, Lew AJ, and Darve E Assembly of finite element methods on graphics processors Int J Numer Methods Eng 2011 85 5 640-669
[29]
Sanfui S and Sharma D Symbolic and numeric kernel division for graphics processing unit-based finite element analysis assembly of regular meshes with modified sparse storage formats J Comput Inform Sci Eng 2021
[30]
Li R and Saad Y GPU-accelerated preconditioned iterative linear solvers J Supercomput 2013 63 2 443-466
[31]
Anzt H, Gates M, Dongarra J, Kreutzer M, Wellein G, and Köhler M Preconditioned krylov solvers on GPUs Parall Comput 2017 68 32-44
[32]
Kiran U, Gautam SS, and Sharma D GPU-based matrix-free finite element solver exploiting symmetry of elemental matrices Computing 2020 102 9 1941-1965
[33]
Ratnakar SK, Sanfui S, and Sharma D Graphics processing unit-based element-by-element strategies for accelerating topology optimization of three-dimensional continuum structures using unstructured all-hexahedral mesh J Comput Inform Sci Eng 2021
[34]
Dixit PM and Dixit US Plasticity: fundamentals and applications 2015 Boca Raton, Florida CRC Press
[35]
Dalton S, Bell N, Olson L, Garland M (2014) Cusp: Generic Parallel Algorithms for Sparse Matrix and Graph Computations. Version 0.5.0. http://cusplibrary.github.io/
[36]
Anzt H, Cojean T, Flegar G, Göbel F, Grützmacher T, Nayak P, Ribizel T, Tsai YM, and Quintana-Ortí ES Ginkgo: a modern linear operator algebra framework for high performance computing ACM Trans Math Softw 2022
[37]
Bell N, Hoberock J (2012) Thrust: A productivity-oriented library for CUDA. In: Hwu, W.-m.W. (ed.) GPU Computing Gems Jade Edition. Applications of GPU Computing Series, pp. 359–371. Morgan Kaufmann, Boston.
[38]
Georgescu S, Chow P, and Okuda H GPU acceleration for fem-based structural analysis Arch Comput Method Eng 2013 20 2 111-121
[39]
Corporation N (2022) NVIDIA CUDA C++ Programming Guide. Version 12.0. https://docs.nvidia.com/cuda/pdf/CUDA_C_Programming_Guide.pdf
[40]
Systèmes D (2017) ABAQUS 2017. Documentation. Dassault Systèmes, Rhode Island, Rhode Island

Cited By

View all
  • (2024)An efficient framework for matrix-free SpMV computation on GPU for elastoplastic problemsMathematics and Computers in Simulation10.1016/j.matcom.2023.09.016216:C(318-346)Online publication date: 1-Feb-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Computing
Computing  Volume 105, Issue 8
Aug 2023
218 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 05 March 2023
Accepted: 17 February 2023
Received: 20 June 2022

Author Tags

  1. Elastoplasticity
  2. Finite element method
  3. GPU computing
  4. Parallel computing
  5. J2-Plasticity

Author Tags

  1. 74S05
  2. 74C05
  3. 65Y05

Qualifiers

  • Research-article

Funding Sources

  • SERB, India

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)An efficient framework for matrix-free SpMV computation on GPU for elastoplastic problemsMathematics and Computers in Simulation10.1016/j.matcom.2023.09.016216:C(318-346)Online publication date: 1-Feb-2024

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media