Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/978-3-030-90539-2_8guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Nonlinear Mode Decomposition and Reduced-Order Modeling for Three-Dimensional Cylinder Flow by Distributed Learning on Fugaku

Published: 24 June 2021 Publication History

Abstract

Nonlinear modes of the three-dimensional flow field around a cylinder were extracted by distributed learning on Fugaku. Mode decomposition is an approach used to decompose flow fields into physically important flow structures known as modes. In this study, convolutional neural network-based mode decomposition was applied to the three-dimensional flow field. However, because this process is costly in terms of calculation and memory usage for even a small flow field problem, the enormous computational and memory resources of the supercomputer Fugaku were employed. A hybrid parallelism method combining the distribution of network structure (model parallelism) and the input data (data parallelism) using up to 10,500 nodes on Fugaku was employed for learning. Further, we constructed a reduced-order model to predict the time evolution of latent vector, using the long short-term memory networks. Finally, we compared the reproduced flow field of the model with that of the original full-order model. In addition, we evaluated the execution performance of the learning process. Using a single core memory group, the whole learning process indicates a value of 129.50 GFLOPS being achieved, 7.57% of the single-precision floating-point arithmetic peak performance. Notably, the convolution calculation for backward-propagation achieved 1103.09 GFLOPS, which is 65.39% of the peak. Furthermore, with the weak scaling test, the whole learning process indicates 72.9% with 25,250 nodes (1,212,000 cores) relative to 750 nodes, the sustained performance is 7.8 PFLOPS. In particular, the convolution calculation for backward-propagation indicates a result of 113 PFLOPS (66.2% of the peak performance).

References

[1]
Taira K et al. Modal analysis of fluid flows: an overview AIAA J. 2017 55 12 4013-4041
[2]
Jolliffe IT Principal Component Analysis, Springer Series in Statistics 2002 2 Heidelberg Springer
[3]
Lumley, J.L.: The structure of inhomogeneous turbulent flows. In: Yaglom, A.M., Tatarski, V.I. (Eds.) Atmospheric Turbulence and Wave Propagation, pp. 166–178. Nauka (1967)
[4]
Holmes P, Lumley JL, Berkooz G, and Rowley CW Turbulence, Coherent Structures, Dynamical Systems and Symmetry 2012 2 Cambridge Cambridge University Press
[5]
Chatterjee A An introduction to the proper orthogonal decomposition Curr. Sci. 2000 78 7 808-817
[6]
Berkooz G, Holmes P, and Lumley JL The proper orthogonal decomposition in the analysis of turbulent flows Annu. Rev. Fluid Mech. 1993 25 539-575
[7]
Taira K Proper orthogonal decomposition in fluid flow analysis: 1 introduction Nagare 2011 30 115-124 (in Japanese)
[8]
Aubry N, Holmes P, Lumley JL, and Stone E The dynamics of coherent structures in the wall region of a turbulent boundary layer J. Fluid Mech. 1988 192 115-173
[9]
Noack BR, Morzynski M, and Tadmor G Reduced-Order Modelling for Flow Control 2011 Heidelberg Springer
[10]
Taira K et al. Modal analysis of fluid flows: applications and outlook AIAA J. 2020 58 3 998-1022
[11]
Kim, Y., Choi, Y., Widemann, D., Zohdi, T.: Efficient nonlinear manifold reduced order model (2020). https://arxiv.org/pdf/2011.07727.pdf
[12]
Murata T, Fukami K, and Fukagata K Nonlinear mode decomposition with convolutional neural networks for fluid dynamics J. Fluid Mech. 2020 882 A13
[13]
Hasegawa K, Fukami K, Murata T, and Fukagata K Machine-learning-based reduced-order modeling for unsteady flows around bluff bodies of various shapes Theor. Comput. Fluid Dyn. 2020 34 4 367-383
[14]
Hochreiter S and Schmidhuber J Long short-term memory Neural Comput. 1997 9 1735-1780
[16]
Patton, R.M., et al.: 167-PFlops deep learning for electron microscopy: from learning physics to atomic manipulation. In: SC18: International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, TX, USA, pp. 638–648 (2018).
[17]
Kurth, T., et al.: Exascale deep learning for climate analytics. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2018, p. 51. IEEE Press, NJ, USA (2018)
[18]
Yang, L., et al.: Highly-scalable, Physics-Informed GANs for Learning Solutions of Stochastic PDEs. In: 2019 IEEE/ACM Third Workshop on Deep Learning on Supercomputers (DLS), pp. 1–11, November 2019
[19]
Jia, W., et al.: Pushing the limit of molecular dynamics with ab initio accuracy to 100 million atoms with machine learning. In: 2020 SC20: International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Atlanta, GA, US, pp. 1–14 (2020).
[20]
Yoshida T Fujitsu high performance CPU for the post-K computer Hot Chips 2018 30 1-22
[21]
RIKEN Center for Computational Science, Specifications Post-K (Fugaku) Information. https://postk-web.r-ccs.riken.jp/spec.html
[22]
TOP500.org.: TOP500 Supercomputer Sites (2020)
[23]
RIKEN Center for Computational Science, Post-K (Fugaku) Information. https://postk-web.r-ccs.riken.jp/perf.html
[24]
RIKEN: Japan’s Fugaku retains title as the world’s fastest supercomputer. https://www.riken.jp/en/news_pubs/news/2020/20201117_2/index.html
[25]
RIKEN Center for Computational Science, Outline of the Development of the Supercomputer Fugaku. https://www.r-ccs.riken.jp/en/fugaku/project/outline
[26]
OneAPI Deep Neural Network Library (oneDNN). https://01.org/oneDNN
[27]
Deep Neural Network Library for AArch64. https://github.com/fujitsu/dnnl_aarch64
[28]
Paszke, A., et al.: Automatic differentiation in PyTorch. In: NIPS-W (2017)
[29]
Jansson N, Bale R, Onishi K, and Tsubokura M CUBE: A scalable framework for large-scale industrial simulations Int. J. High. Perform. Comput. App. 2019 33 4 678-698
[30]
Onishi K and Tsubokura M Topology-free immersed boundary method for incompressible turbulence flows: an aerodynamic simulation for “dirty” CAD geometry Comput. Method Appl. Mech. Eng. 2021
[31]
Nakahashi, K.: Building-cube method for flow problems with broadband characteristic length. In: Armfield, S., Morgan, P., Srinivas, K. (Eds) Computational Fluid Dynamics, 77–81. Springer, Berlin Heidelberg (2003).
[32]
Onishi, K., Obayashi, S., Nakahashi, K., Tsubokura, M.: Use of the immersed boundary method within the building cube method and its application to real vehicle CAD Data. In: 21st AIAA Computational Fluid 1029 Dynamics Conference. American Institute of Aeronautics and Astronautics, San Diego, CA (2013).
[33]
Peskin CS The immersed boundary method Acta. Numerica. 2002 11 1032 479-517
[34]
Bhalla APS, Bale R, Griffith BE, and Patankar NA A unified mathematical framework and an adaptive numerical method for fluid-structure interaction with rigid, deforming, and elastic bodies J. Comput. Phys. 2013 250 1 446-476
[35]
Hinton GE and Salakhutdinov RR Reducing the dimensionality of data with neural networks Science 2006 313 5786 504-507
[36]
Olah, C.: Understanding LSTM Networks. http://colah.github.io/posts/2015-08-Understanding-LSTMs/. Accessed 04 Apr 2021
[37]
Nakamura T, Fukami K, Hasegawa K, Nabae Y, and Fukagata K Convolutional neural network and long short-term memory based reduced order surrogate for minimal turbulent channel flow Phys. Fluids 2021 33 025116

Index Terms

  1. Nonlinear Mode Decomposition and Reduced-Order Modeling for Three-Dimensional Cylinder Flow by Distributed Learning on Fugaku
              Index terms have been assigned to the content through auto-classification.

              Recommendations

              Comments

              Information & Contributors

              Information

              Published In

              cover image Guide Proceedings
              High Performance Computing: ISC High Performance Digital 2021 International Workshops, Frankfurt am Main, Germany, June 24 – July 2, 2021, Revised Selected Papers
              Jun 2021
              518 pages
              ISBN:978-3-030-90538-5
              DOI:10.1007/978-3-030-90539-2
              • Editors:
              • Heike Jagode,
              • Hartwig Anzt,
              • Hatem Ltaief,
              • Piotr Luszczek

              Publisher

              Springer-Verlag

              Berlin, Heidelberg

              Publication History

              Published: 24 June 2021

              Author Tags

              1. Distributed learning
              2. Supercomputer Fugaku
              3. Mode decomposition
              4. Three-dimensional flow field
              5. Reduced-order model
              6. Long short-term memory networks (LSTMs)
              7. Computational fluid dynamics (CFD)

              Qualifiers

              • Article

              Contributors

              Other Metrics

              Bibliometrics & Citations

              Bibliometrics

              Article Metrics

              • 0
                Total Citations
              • 0
                Total Downloads
              • Downloads (Last 12 months)0
              • Downloads (Last 6 weeks)0
              Reflects downloads up to 08 Feb 2025

              Other Metrics

              Citations

              View Options

              View options

              Figures

              Tables

              Media

              Share

              Share

              Share this Publication link

              Share on social media