Lecture 30 GPU Programming Loop Parallelism
Lecture 30 GPU Programming Loop Parallelism
No Loop‐carried dependence
CKV
Detecting and Enhancing Loop Level Parallelism
Loop‐carried dependence
for (i=1;i<100;i=i+1) {
Y[i] = Y[i‐1] + Y[i];
}