cuda_mode_lecture2
ch. 1-3
CUDA-MODE
Lecture 2
Agenda for Lecture 2
• 1: Introduction
• 2: Heterogeneous data parallel computing
• 3: Multidimensional grids and data
Ch 1: Introduction
• motivation: GPU go brrr, more FLOPS please
• Why? Simulation & world-models (games, weather, proteins, robotics)
• Bigger models are smarter -> AGI (prevent wars, fix climate, cure cancer)
• GPUs are the backbone of modern deep learning
• classic software: sequential programs
• the trend toward higher CPU clock rates slowed in 2003: energy consumption & heat dissipation
• multi-core CPUs emerged
• developers had to learn multi-threading (deadlocks, races etc.)
The Power Wall
[Figure: log-scale plot of transistors (thousands) and frequency (MHz) over the years 1970 to 2020]
Input Vector y:
Output Vector z:
...
- row-major:    - column-major:
  A B C           A D G
  D E F           B E H
  G H I           C F I
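
The two layouts above can be sketched in a few lines of Python. This is a minimal illustration, not code from the lecture: it assumes the logical matrix has rows A B C / D E F / G H I, and that reading each grid left to right, top to bottom gives the order of elements in linear memory. The helper names `row_major_index` and `col_major_index` are hypothetical.

```python
# Assumed logical 3x3 matrix (rows A B C / D E F / G H I).
matrix = [["A", "B", "C"],
          ["D", "E", "F"],
          ["G", "H", "I"]]
n_rows, n_cols = 3, 3


def row_major_index(row, col, n_rows, n_cols):
    # Elements of one row are adjacent in memory.
    return row * n_cols + col


def col_major_index(row, col, n_rows, n_cols):
    # Elements of one column are adjacent in memory.
    return col * n_rows + row


# Flatten the same matrix into linear memory under both conventions.
row_major = [None] * (n_rows * n_cols)
col_major = [None] * (n_rows * n_cols)
for r in range(n_rows):
    for c in range(n_cols):
        row_major[row_major_index(r, c, n_rows, n_cols)] = matrix[r][c]
        col_major[col_major_index(r, c, n_rows, n_cols)] = matrix[r][c]

print("".join(row_major))  # ABCDEFGHI
print("".join(col_major))  # ADGBEHCFI
```

C and CUDA C use the row-major convention for multidimensional arrays, so a kernel that flattens a 2D index itself would typically compute `row * width + col`.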