1 Introduction
Why it is important
HPC = ?
NO
Examples:
62%
65%
85%
74%
https://en.wikichip.org/wiki/ibm/microarchitectures/power9
Today
https://www.arm.com/blogs/blueprint/fujitsu-a64fx-arm
https://www.top500.org/news/fugaku-holds-top-spot-exascale-remains-elusive/
Cloud and High Performance Computing
101
MFLOPS/W KW
DEVELOPMENT
TOOLS
DEVELOPERS
Development tools
GNU gcc – free
Intel Parallel Studio XE – formerly from 699 US$, now free (oneAPI)
PGI Accelerator Workstation – formerly from 759 US$, now free as well
NVIDIA HPC SDK – free
Raw numbers or real performance?
64 GB main memory
nVidia GTX 580: memory bus 192 GB/s (1581 Gflops)
nVidia GTX 580: memory bus 192 GB/s (198 Gflops)
0 < Efficiency ≤ 1
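One way to make the bound on efficiency concrete is to define it as achieved performance divided by peak performance, and to estimate what is attainable with a roofline-style bound. The sketch below is an illustration, not something taken from the slides: the roofline model, the function names, and the SAXPY-like kernel are assumptions. Only the 192 GB/s and 1581 Gflops figures come from the GTX 580 numbers above.

```python
def attainable_gflops(peak_gflops, bandwidth_gbs, flops_per_byte):
    """Roofline-style upper bound: the kernel is limited either by the
    compute peak or by how fast the memory bus can feed it."""
    return min(peak_gflops, bandwidth_gbs * flops_per_byte)


def efficiency(achieved_gflops, peak_gflops):
    """Fraction of the peak actually delivered, in the range (0, 1]."""
    if not 0 < achieved_gflops <= peak_gflops:
        raise ValueError("achieved performance must be in (0, peak]")
    return achieved_gflops / peak_gflops


# Figures quoted in the slides for the nVidia GTX 580.
PEAK_GFLOPS = 1581.0
BANDWIDTH_GBS = 192.0

# Assumed kernel (hypothetical example): y[i] = a * x[i] + y[i] in single
# precision, i.e. 2 flops per element and 12 bytes moved
# (read x, read y, write y).
SAXPY_FLOPS_PER_BYTE = 2 / 12

bound = attainable_gflops(PEAK_GFLOPS, BANDWIDTH_GBS, SAXPY_FLOPS_PER_BYTE)
print(f"memory-bound ceiling: {bound:.1f} Gflops")
print(f"best-case efficiency: {efficiency(bound, PEAK_GFLOPS):.3f}")
```

Under these assumptions the kernel is limited by the memory bus rather than by the arithmetic units, which is why the raw peak figure alone says little about real performance.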
Figure: chart reporting GFLOPs, MFLOPs/$ (whole system) and MFLOPs/$ (comp. units) for the following versions: Sequential - gcc; Sequential - Intel; OpenMP - Intel - 1 cpu; OpenMP - Intel - 2 cpu; MPI - Intel - 1 cpu; MPI - Intel - 2 cpu; CUDA; OpenACC; CUDA - fastmath; OpenACC - fastmath; CUDA reference.
Other labels legible in the figure: 35, $ 4,000, $ 200, 20.
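The cost and power metrics named in these slides (MFLOPs/$ and MFLOPS/W) are simple ratios, and a small sketch may help show how the "whole system" and "comp. units" variants can differ. The function names and all numeric values below are hypothetical and are not read from the chart. The reading of "whole system" as the price of the full machine and "comp. units" as the price of the processor or accelerator alone is also an assumption.

```python
def mflops_per_dollar(gflops, price_usd):
    """Sustained performance per unit of cost, in MFLOPs/$."""
    return gflops * 1000.0 / price_usd


def mflops_per_watt(gflops, power_watts):
    """Sustained performance per unit of power, in MFLOPS/W."""
    return gflops * 1000.0 / power_watts


# Hypothetical measurement: the same sustained performance is divided
# by two different prices, giving two very different figures of merit.
sustained_gflops = 400.0        # hypothetical measured performance
whole_system_price = 4000.0     # hypothetical price of the full machine (US$)
compute_unit_price = 500.0      # hypothetical price of the accelerator alone (US$)
power_draw = 500.0              # hypothetical power draw (W)

print("MFLOPs/$ (whole system):", mflops_per_dollar(sustained_gflops, whole_system_price))
print("MFLOPs/$ (comp. units): ", mflops_per_dollar(sustained_gflops, compute_unit_price))
print("MFLOPS/W:               ", mflops_per_watt(sustained_gflops, power_draw))
```

Which denominator is appropriate depends on the question being asked: the compute-unit price suits an upgrade of an existing machine, the whole-system price suits a new purchase.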
Today
https://developer.nvidia.com/embedded/jetson-modules
Example: Functional Magnetic Resonance Imaging
The science on that is pretty well established. They knew how to take the
data that was coming from the MRI, and they could compute on it and
create a model of what’s going on inside the brain. But in 2012, when we
started the project, they estimated it would take 44 years on their cluster
they parallelized their code and saw huge increases in performance
But they also looked at it algorithmically with machine learning and AI,
They put it all together and ended up with a 10,000X increase in
performance. They went from something requiring a supercomputing
project at a national lab to something that could be done clinically inside
a hospital in a couple of minutes.
https://www.princeton.edu/news/2017/02/23/princeton-intel-collaboration-breaks-new-ground-studies-brain
https://www.hpcwire.com/2017/06/08/code-modernization-bringing-codes-parallel-age/
Again on SW
https://www.nextplatform.com/2021/12/06/stacking-up-amd-mi200-versus-nvidia-a100-compute-engines/?mc_cid=11aeb90192&mc_eid=e50c89e962
Again on SW
Libraries
Conclusions
The efficient exploitation of current heterogeneous HPC solutions
requires a good understanding of HW and SW features (architectures,
instruction sets, SDKs, …)
Not only HW
Skilled developers
State-of-the-art software libraries and programming tools.