Fundamentals of Accelerated Computing With CUDA Python
This workshop teaches you the fundamental tools and techniques for running GPU-accelerated Python applications using
CUDA® and the Numba compiler. You’ll work through dozens of hands-on coding exercises and, at the end of the training,
implement a new workflow to accelerate a fully functional linear algebra program originally designed for CPUs, observing
impressive performance gains. After the workshop ends, you’ll have additional resources to help you create new GPU-
accelerated applications on your own.
Learning Objectives
At the conclusion of the workshop, you’ll understand the fundamental tools and techniques for GPU-accelerating
Python applications with CUDA and Numba and be able to:
> GPU-accelerate NumPy ufuncs with a few lines of code.
> Configure code parallelization using the CUDA thread hierarchy.
> Write custom CUDA device kernels for maximum performance and flexibility.
> Use memory coalescing and on-device shared memory to increase CUDA kernel bandwidth.
Duration: 8 hours
Price: Contact us for pricing. During the workshop, each participant will have dedicated access to a
fully configured, GPU-accelerated workstation in the cloud.
Prerequisites: Basic Python competency, including familiarity with variable types, loops, conditional
statements, functions, and array manipulations. NumPy competency, including the use of
ndarrays and ufuncs. No previous knowledge of CUDA programming is required.
Certificate: Upon successful completion of the assessment, participants will receive an NVIDIA
DLI certificate to recognize their subject matter competency and support professional
career growth.
Hardware/software Desktop or laptop computer capable of running the latest version of Chrome or Firefox.
requirements: Each participant will be provided with dedicated access to a fully configured, GPU-accelerated
workstation in the cloud.
Languages: English
Custom CUDA Kernels in Python with Numba (120 mins)
> Learn CUDA’s parallel thread hierarchy and how to extend parallel program possibilities.
> Launch massively parallel, custom CUDA kernels on the GPU.
> Utilize CUDA atomic operations to avoid race conditions during parallel execution.
Break (15 mins)
RNG, Multidimensional Grids, and Shared Memory for CUDA Python with Numba (120 mins)
> Use xoroshiro128+ RNG to support GPU-accelerated Monte Carlo methods.
> Learn multidimensional grid creation and how to work in parallel on 2D matrices.
> Leverage on-device shared memory to promote memory coalescing while reshaping 2D matrices.
Final Review (15 mins)
> Review key learnings and wrap up questions.
> Complete the assessment to earn a certificate.
> Take the workshop survey.
© 2021 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo, and CUDA are trademarks and/or registered
trademarks of NVIDIA Corporation in the U.S. and other countries. All other trademarks and copyrights are the property of
their respective owners. Jul21