Pytorch Tutorial 1

Uploaded by

Da HUANG


Machine Learning

Pytorch Tutorial
TA : 曾元（Yuan Tseng）
2022.02.18
Outline
● Background: Prerequisites & What is Pytorch?
● Training & Testing Neural Networks in Pytorch
● Dataset & Dataloader
● Tensors
● torch.nn: Models, Loss Functions
● torch.optim: Optimization
● Save/load models
Prerequisites
● We assume you are already familiar with…
1. Python3
■ if-else, loop, function, file IO, class, ...
■ refs: link1, link2, link3
2. Deep Learning Basics
■ Prof. Lee’s 1st & 2nd lecture videos from last year
■ ref: link1, link2

Some knowledge of NumPy will also be useful!

What is PyTorch?
● A machine learning framework in Python.
● Two main features:
○ N-dimensional Tensor computation (like NumPy) on GPUs
○ Automatic differentiation for training deep neural networks
Training Neural Networks

[Diagram: Training consists of three steps: Define Neural Network → Loss Function → Optimization Algorithm]

More info about the training process in last year's lecture video.
Training & Testing Neural Networks

[Diagram: Training → Validation → Testing]

Guide for training/validation/testing can be found here.

Training & Testing Neural Networks - in Pytorch
Step 1. Load Data: torch.utils.data.Dataset & torch.utils.data.DataLoader

[Diagram: Load Data → Training → Validation → Testing]

Dataset & Dataloader
● Dataset: stores data samples and expected values
● Dataloader: groups data in batches, enables multiprocessing

● dataset = MyDataset(file)
● dataloader = DataLoader(dataset, batch_size, shuffle=True)

Set shuffle=True for training and shuffle=False for testing.

More info about batches and shuffling here.

Dataset & Dataloader
from torch.utils.data import Dataset, DataLoader

class MyDataset(Dataset):
    def __init__(self, file):
        # Read data & preprocess
        self.data = ...

    def __getitem__(self, index):
        # Returns one sample at a time
        return self.data[index]

    def __len__(self):
        # Returns the size of the dataset
        return len(self.data)
Dataset & Dataloader
dataset = MyDataset(file)

dataloader = DataLoader(dataset, batch_size=5, shuffle=False)

[Diagram: the DataLoader calls __getitem__(0), __getitem__(1), ..., __getitem__(4) on the Dataset and groups the 5 returned samples (batch_size = 5) into one mini-batch]
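As a minimal sketch of how a Dataset and a DataLoader fit together, the example below builds a small in-memory dataset and prints the shape of each mini-batch. The class name ToyDataset and its synthetic data are assumptions made for illustration; they are not part of the original slides.

```python
import torch
from torch.utils.data import Dataset, DataLoader


class ToyDataset(Dataset):
    """Hypothetical dataset: 12 samples, each with 3 features and a scalar target."""

    def __init__(self, n_samples=12):
        # Synthetic data stands in for "read data & preprocess"
        self.x = torch.arange(n_samples * 3, dtype=torch.float32).reshape(n_samples, 3)
        self.y = self.x.sum(dim=1)  # target = sum of the features

    def __getitem__(self, index):
        # Return one (input, target) pair
        return self.x[index], self.y[index]

    def __len__(self):
        return len(self.x)


dataset = ToyDataset()
dataloader = DataLoader(dataset, batch_size=5, shuffle=False)

# 12 samples with batch_size=5 give batches of 5, 5 and 2 samples
batch_shapes = [(tuple(x.shape), tuple(y.shape)) for x, y in dataloader]
print(batch_shapes)
```

The last batch is smaller because 12 is not a multiple of 5; DataLoader keeps it by default.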
Tensors
● High-dimensional matrices (arrays)

○ 1-D tensor, e.g. audio
○ 2-D tensor, e.g. black & white images
○ 3-D tensor, e.g. RGB images
Tensors – Shape of Tensors
● Check with .shape

Shape        Dimensions
(5, )        dim 0 = 5
(3, 5)       dim 0 = 3, dim 1 = 5
(4, 5, 3)    dim 0 = 4, dim 1 = 5, dim 2 = 3

Note: dim in PyTorch == axis in NumPy

Tensors – Creating Tensors
● Directly from data (list or numpy.ndarray)

x = torch.tensor([[1., -1.], [-1., 1.]])
Result: tensor([[ 1., -1.],
                [-1.,  1.]])

x = torch.from_numpy(np.array([[1., -1.], [-1., 1.]]))
(same values; from_numpy keeps the dtype of the NumPy array)

● Tensor of constant zeros & ones (the argument is the shape)

x = torch.zeros([2, 2])
Result: tensor([[0., 0.],
                [0., 0.]])

x = torch.ones([1, 2, 5])
Result: tensor([[[1., 1., 1., 1., 1.],
                 [1., 1., 1., 1., 1.]]])
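The following sketch ties tensor creation to the .shape attribute discussed earlier. The variable names and the chosen shapes are illustrative assumptions.

```python
import numpy as np
import torch

a = torch.tensor([[1., -1.], [-1., 1.]])   # from a nested list of floats
b = torch.from_numpy(np.zeros((3, 5)))     # from a NumPy array (keeps its float64 dtype)
c = torch.ones([4, 5, 3])                  # shape given as a list

print(a.shape, a.dtype)
print(b.shape, b.dtype)
print(c.shape, c.dim())                    # c.dim() is the number of dimensions
```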
Tensors – Common Operations
Common arithmetic functions are supported, such as:

● Addition: z = x + y
● Subtraction: z = x - y
● Power: y = x.pow(2)
● Summation: y = x.sum()
● Mean: y = x.mean()
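A small worked example of these operations on concrete values may help; the input tensors here are arbitrary and chosen only for illustration.

```python
import torch

x = torch.tensor([1., 2., 3.])
y = torch.tensor([4., 5., 6.])

added = x + y          # element-wise addition: [5., 7., 9.]
diff = x - y           # element-wise subtraction: [-3., -3., -3.]
squared = x.pow(2)     # element-wise power: [1., 4., 9.]
total = x.sum()        # sum over all elements: 6.
mean = x.mean()        # mean over all elements: 2.

print(added, diff, squared, total.item(), mean.item())
```

Addition, subtraction and power act element by element, while sum and mean reduce the whole tensor to a single value unless a dim argument is given.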
Tensors – Common Operations
● Transpose: transpose two specified dimensions

>>> x = torch.zeros([2, 3])
>>> x.shape
torch.Size([2, 3])
>>> x = x.transpose(0, 1)
>>> x.shape
torch.Size([3, 2])
Tensors – Common Operations
● Squeeze: remove the specified dimension with length = 1

>>> x = torch.zeros([1, 2, 3])
>>> x.shape
torch.Size([1, 2, 3])
>>> x = x.squeeze(0)    # dim = 0
>>> x.shape
torch.Size([2, 3])
Tensors – Common Operations
● Unsqueeze: expand a new dimension

>>> x = torch.zeros([2, 3])
>>> x.shape
torch.Size([2, 3])
>>> x = x.unsqueeze(1)    # dim = 1
>>> x.shape
torch.Size([2, 1, 3])
Tensors – Common Operations
● Cat: concatenate multiple tensors

>>> x = torch.zeros([2, 1, 3])
>>> y = torch.zeros([2, 3, 3])
>>> z = torch.zeros([2, 2, 3])
>>> w = torch.cat([x, y, z], dim=1)
>>> w.shape
torch.Size([2, 6, 3])

The sizes along dim 1 add up (1 + 3 + 2 = 6); all other dimensions must match.
more operators: https://pytorch.org/docs/stable/tensors.html
Tensors – Data Type
● Using different data types for model and data will cause errors.

Data type                  dtype          tensor
32-bit floating point      torch.float    torch.FloatTensor
64-bit integer (signed)    torch.long     torch.LongTensor

See the official documentation for more information on data types.
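To illustrate the kind of error a dtype mismatch causes, here is a minimal sketch that feeds integer data to a layer with float parameters and then fixes it by converting the input. The layer size and input values are assumptions, and the exact error message depends on the PyTorch version.

```python
import torch
import torch.nn as nn

layer = nn.Linear(3, 1)                 # parameters are torch.float (float32) by default
x_long = torch.tensor([[1, 2, 3]])      # integer data is stored as torch.long (int64)

try:
    layer(x_long)                       # long input, float weights
    mismatch_raised = False
except RuntimeError as err:
    mismatch_raised = True
    print("dtype mismatch:", err)

x_float = x_long.float()                # convert the data to torch.float
out = layer(x_float)
print(out.dtype)
```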

Tensors – PyTorch vs. NumPy
● Similar attributes

PyTorch      NumPy
x.shape      x.shape
x.dtype      x.dtype

See the official documentation for more information on data types.

ref: https://github.com/wkentaro/pytorch-for-numpy-users
Tensors – PyTorch vs. NumPy
● Many functions have the same names as well

PyTorch               NumPy
x.reshape / x.view    x.reshape
x.squeeze()           x.squeeze()
x.unsqueeze(1)        np.expand_dims(x, 1)

ref: https://github.com/wkentaro/pytorch-for-numpy-users
Tensors – Device
● Tensors & modules are computed on the CPU by default

Use .to() to move tensors to appropriate devices.

● CPU
x = x.to('cpu')
● GPU
x = x.to('cuda')
Tensors – Device (GPU)
● Check if your computer has an NVIDIA GPU

torch.cuda.is_available()

● Multiple GPUs: specify 'cuda:0', 'cuda:1', 'cuda:2', ...

● Why use GPUs?

○ Parallel computing with more cores for arithmetic calculations
○ See "What is a GPU and do you need one in deep learning?"
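A common way to combine the availability check with .to() is sketched below. The variable name device is a convention assumed here; the code falls back to the CPU when no GPU is found.

```python
import torch

# Pick the GPU when one is available, otherwise stay on the CPU
device = 'cuda' if torch.cuda.is_available() else 'cpu'

x = torch.zeros([2, 3]).to(device)
print(x.device)
```

Writing code against a device variable like this lets the same script run on machines with and without a GPU.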
Tensors – Gradient Calculation
>>> x = torch.tensor([[1., 0.], [-1., 1.]], requires_grad=True)
>>> z = x.pow(2).sum()
>>> z.backward()
>>> x.grad
tensor([[ 2.,  0.],
        [-2.,  2.]])

See here to learn about gradient calculation.
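The value of x.grad in this example can be checked by hand. The short derivation below uses x_{ij} for the entries of x, a notation introduced here for illustration.

```latex
z = \sum_{i}\sum_{j} x_{ij}^{2}
\quad\Longrightarrow\quad
\frac{\partial z}{\partial x_{ij}} = 2\,x_{ij}

x = \begin{bmatrix} 1 & 0 \\ -1 & 1 \end{bmatrix}
\quad\Longrightarrow\quad
\frac{\partial z}{\partial x} = 2x = \begin{bmatrix} 2 & 0 \\ -2 & 2 \end{bmatrix}
```

This matches the tensor that z.backward() stores in x.grad.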

Training & Testing Neural Networks – in Pytorch
Step 2. Define Neural Network: torch.nn.Module

[Diagram: Load Data → Define Neural Network, Loss Function, Optimization Algorithm → Training → Validation → Testing]
torch.nn – Network Layers
● Linear Layer (Fully-connected Layer)

nn.Linear(in_features, out_features)

Input tensor (*, 32) → nn.Linear(32, 64) → Output tensor (*, 64)

* can be any shape (but the last dimension of the input must be 32)

e.g. (10, 32), (10, 5, 32), (1, 1, 3, 32), ...
torch.nn – Network Layers
● Linear Layer (Fully-connected Layer)

ref: last year's lecture video

torch.nn – Neural Network Layers
● Linear Layer (Fully-connected Layer)

[Diagram: a fully-connected layer mapping 32 inputs x1, x2, x3, ..., x32 to 64 outputs y1, y2, y3, ..., y64]

W × x + b = y, where W has shape (64 × 32)
torch.nn – Network Parameters
● Linear Layer (Fully-connected Layer)

>>> layer = torch.nn.Linear(32, 64)
>>> layer.weight.shape
torch.Size([64, 32])
>>> layer.bias.shape
torch.Size([64])

W × x + b = y, where W has shape (64 × 32)
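As a sanity check on the relation between the layer's parameters and its output, the sketch below compares the output of nn.Linear with a manual computation. The input shape (10, 5, 32) is an arbitrary choice; any leading dimensions should work as long as the last one is 32.

```python
import torch
import torch.nn as nn

layer = nn.Linear(32, 64)
x = torch.randn(10, 5, 32)                     # last dimension must be 32

y = layer(x)
# weight has shape (64, 32), so it is transposed for a batched x
y_manual = x @ layer.weight.T + layer.bias

print(y.shape)
print(torch.allclose(y, y_manual, atol=1e-5))
```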
torch.nn – Non-Linear Activation Functions
● Sigmoid Activation

nn.Sigmoid()

● ReLU Activation

nn.ReLU()

See here to learn about why we need activation functions.
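To make the two activations concrete, this sketch applies them to a few hand-picked values (the inputs are illustrative only).

```python
import torch
import torch.nn as nn

x = torch.tensor([-2.0, 0.0, 2.0])

sig = nn.Sigmoid()(x)    # squashes every value into the range (0, 1); sigmoid(0) = 0.5
relu = nn.ReLU()(x)      # replaces negative values with 0: [0., 0., 2.]

print(sig)
print(relu)
```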

torch.nn – Build your own neural network
import torch.nn as nn

class MyModel(nn.Module):
    def __init__(self):
        # Initialize your model & define layers
        super(MyModel, self).__init__()
        self.net = nn.Sequential(
            nn.Linear(10, 32),
            nn.Sigmoid(),
            nn.Linear(32, 1)
        )

    def forward(self, x):
        # Compute output of your NN
        return self.net(x)
torch.nn – Build your own neural network
The following two definitions are equivalent.

Version 1, using nn.Sequential:

import torch.nn as nn

class MyModel(nn.Module):
    def __init__(self):
        super(MyModel, self).__init__()
        self.net = nn.Sequential(
            nn.Linear(10, 32),
            nn.Sigmoid(),
            nn.Linear(32, 1)
        )

    def forward(self, x):
        return self.net(x)

Version 2, with each layer stored as a separate attribute:

import torch.nn as nn

class MyModel(nn.Module):
    def __init__(self):
        super(MyModel, self).__init__()
        self.layer1 = nn.Linear(10, 32)
        self.layer2 = nn.Sigmoid()
        self.layer3 = nn.Linear(32, 1)

    def forward(self, x):
        out = self.layer1(x)
        out = self.layer2(out)
        out = self.layer3(out)
        return out
Training & Testing Neural Networks – in Pytorch
Step 3. Loss Function: torch.nn.MSELoss, torch.nn.CrossEntropyLoss, etc.

[Diagram: Load Data → Define Neural Network, Loss Function, Optimization Algorithm → Training → Validation → Testing]
torch.nn – Loss Functions
● Mean Squared Error (for regression tasks)

criterion = nn.MSELoss()

● Cross Entropy (for classification tasks)

criterion = nn.CrossEntropyLoss()

● loss = criterion(model_output, expected_value)
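The sketch below evaluates both loss functions on small hand-made inputs so the results can be verified by hand. The tensors are illustrative assumptions. Note that nn.CrossEntropyLoss expects raw scores (logits) of shape (batch, classes) and integer class indices as targets.

```python
import math

import torch
import torch.nn as nn

# Regression: mean of the squared errors, (0 + 0 + 4) / 3
mse = nn.MSELoss()
pred = torch.tensor([1.0, 2.0, 3.0])
target = torch.tensor([1.0, 2.0, 5.0])
mse_loss = mse(pred, target)

# Classification: 4 samples, 3 classes, targets are class indices (torch.long)
ce = nn.CrossEntropyLoss()
logits = torch.zeros(4, 3)              # equal scores for every class
labels = torch.tensor([0, 2, 1, 0])
ce_loss = ce(logits, labels)            # uniform prediction gives log(3)

print(mse_loss.item(), ce_loss.item(), math.log(3))
```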

Training & Testing Neural Networks – in Pytorch
Step 4. Optimization Algorithm: torch.optim

[Diagram: Load Data → Define Neural Network, Loss Function, Optimization Algorithm → Training → Validation → Testing]
torch.optim
● Gradient-based optimization algorithms that adjust network
parameters to reduce error. (See Adaptive Learning Rate lecture video)

● E.g. Stochastic Gradient Descent (SGD)

torch.optim.SGD(model.parameters(), lr, momentum = 0)

torch.optim
optimizer = torch.optim.SGD(model.parameters(), lr, momentum = 0)

● For every batch of data:

1. Call optimizer.zero_grad() to reset gradients of model parameters.
2. Call loss.backward() to backpropagate gradients of prediction loss.
3. Call optimizer.step() to adjust model parameters.

See the official documentation for more optimization algorithms.
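The three calls can be traced on a single parameter tensor. In this minimal sketch the tensor w and the loss sum(w^2) are stand-ins for a real model and loss, chosen so the update is easy to verify by hand.

```python
import torch

w = torch.tensor([1.0, -2.0], requires_grad=True)   # stand-in for model parameters
optimizer = torch.optim.SGD([w], lr=0.1)

optimizer.zero_grad()        # 1. reset gradients
loss = (w ** 2).sum()
loss.backward()              # 2. backpropagate: w.grad = 2 * w = [2., -4.]
optimizer.step()             # 3. update: w <- w - lr * w.grad = [0.8, -1.6]

print(w.grad, w)
```

Without momentum, plain SGD subtracts the learning rate times the gradient from each parameter.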

Training & Testing Neural Networks – in Pytorch

Step 5. Entire Procedure

[Diagram: Load Data → Define Neural Network, Loss Function, Optimization Algorithm → Training → Validation → Testing]
Neural Network Training Setup

dataset = MyDataset(file)                               # read data via MyDataset
tr_set = DataLoader(dataset, 16, shuffle=True)          # put dataset into DataLoader
model = MyModel().to(device)                            # construct model and move to device (cpu/cuda)
criterion = nn.MSELoss()                                # set loss function
optimizer = torch.optim.SGD(model.parameters(), 0.1)    # set optimizer

Neural Network Training Loop
for epoch in range(n_epochs):                 # iterate n_epochs
    model.train()                             # set model to train mode
    for x, y in tr_set:                       # iterate through the dataloader
        optimizer.zero_grad()                 # set gradient to zero
        x, y = x.to(device), y.to(device)     # move data to device (cpu/cuda)
        pred = model(x)                       # forward pass (compute output)
        loss = criterion(pred, y)             # compute loss
        loss.backward()                       # compute gradient (backpropagation)
        optimizer.step()                      # update model with optimizer

Neural Network Validation Loop
model.eval()                                      # set model to evaluation mode
total_loss = 0
for x, y in dv_set:                               # iterate through the dataloader
    x, y = x.to(device), y.to(device)             # move data to device (cpu/cuda)
    with torch.no_grad():                         # disable gradient calculation
        pred = model(x)                           # forward pass (compute output)
        loss = criterion(pred, y)                 # compute loss
    total_loss += loss.cpu().item() * len(x)      # accumulate loss
avg_loss = total_loss / len(dv_set.dataset)       # compute averaged loss

Neural Network Testing Loop
model.eval()                          # set model to evaluation mode
preds = []
for x in tt_set:                      # iterate through the dataloader
    x = x.to(device)                  # move data to device (cpu/cuda)
    with torch.no_grad():             # disable gradient calculation
        pred = model(x)               # forward pass (compute output)
        preds.append(pred.cpu())      # collect prediction
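The setup, training and validation steps can be put together into one runnable script. The sketch below is only an illustration under several assumptions: the data is synthetic (a noise-free linear function), TensorDataset replaces a custom Dataset, a single nn.Linear layer replaces MyModel, and the helper validate() and the 160/40 split are hypothetical choices.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

torch.manual_seed(0)
device = 'cuda' if torch.cuda.is_available() else 'cpu'

# Synthetic regression data: y = 2 * x1 - 3 * x2 + 1
x_all = torch.randn(200, 2)
y_all = (2 * x_all[:, 0] - 3 * x_all[:, 1] + 1).unsqueeze(1)
tr_set = DataLoader(TensorDataset(x_all[:160], y_all[:160]), batch_size=16, shuffle=True)
dv_set = DataLoader(TensorDataset(x_all[160:], y_all[160:]), batch_size=16, shuffle=False)

model = nn.Linear(2, 1).to(device)
criterion = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)


def validate():
    """Return the average loss over the validation set."""
    model.eval()
    total_loss = 0.0
    for x, y in dv_set:
        x, y = x.to(device), y.to(device)
        with torch.no_grad():
            loss = criterion(model(x), y)
        total_loss += loss.item() * len(x)
    return total_loss / len(dv_set.dataset)


loss_before = validate()
for epoch in range(20):
    model.train()
    for x, y in tr_set:
        optimizer.zero_grad()
        x, y = x.to(device), y.to(device)
        loss = criterion(model(x), y)
        loss.backward()
        optimizer.step()
loss_after = validate()

print(f"validation loss before: {loss_before:.4f}, after: {loss_after:.6f}")
```

Because the data has no noise and the model can represent the target function exactly, the validation loss should drop close to zero; on real data it would level off at a higher value.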

Notice - model.eval(), torch.no_grad()
● model.eval()

Changes the behaviour of some model layers, such as dropout and batch normalization.

● with torch.no_grad()

Prevents calculations from being added to the gradient computation graph. Usually used to prevent accidental training on validation/testing data.
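Both effects can be observed on a tiny model. The sketch below is an illustration with an assumed two-layer model containing dropout; it checks that dropout is inactive in evaluation mode and that outputs computed under torch.no_grad() are not tracked for gradients.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Sequential(nn.Linear(4, 4), nn.Dropout(p=0.5))
x = torch.ones(1, 4)

model.eval()                       # dropout now passes values through unchanged
out_a = model(x)
out_b = model(x)
same_in_eval = torch.equal(out_a, out_b)

with torch.no_grad():              # no computation graph is recorded here
    out_c = model(x)

tracks_grad = (out_a.requires_grad, out_c.requires_grad)
print(same_in_eval, tracks_grad)
```

Note that model.eval() alone does not disable gradient tracking, which is why the two are usually used together during validation and testing.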
Save/Load Trained Models
● Save

torch.save(model.state_dict(), path)

● Load

ckpt = torch.load(path)

model.load_state_dict(ckpt)
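A round trip through save and load might look like the sketch below. The file name model.ckpt, the temporary directory and the nn.Linear model are assumptions for illustration. The key point is that the model must be constructed with the same architecture before its state_dict is loaded.

```python
import os
import tempfile

import torch
import torch.nn as nn

model = nn.Linear(10, 1)

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, 'model.ckpt')     # hypothetical file name
    torch.save(model.state_dict(), path)           # save only the parameters

    restored = nn.Linear(10, 1)                    # same architecture, new random weights
    ckpt = torch.load(path, map_location='cpu')    # load the saved parameters onto the CPU
    restored.load_state_dict(ckpt)

same_weights = torch.equal(model.weight, restored.weight)
same_bias = torch.equal(model.bias, restored.bias)
print(same_weights, same_bias)
```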
More About PyTorch
● torchaudio
○ speech/audio processing
● torchtext
○ natural language processing
● torchvision
○ computer vision
● skorch
○ scikit-learn + PyTorch
More About PyTorch
● Useful github repositories using PyTorch
○ Huggingface Transformers (transformer models: BERT, GPT, ...)
○ Fairseq (sequence modeling for NLP & speech)
○ ESPnet (speech recognition, translation, synthesis, ...)
○ Most implementations of recent deep learning papers
○ ...
References
● Machine Learning 2021 Spring Pytorch Tutorial
● Official Pytorch Tutorials
● https://numpy.org/
Any questions?
