Week 6 Unsupervised Learning
Definitions
Unsupervised Learning
● Learning patterns from data without human annotations
● e.g., clustering, density estimation, dimensionality reduction
Self-supervised Learning
● Leverage the success of supervised learning without relying on human-provided
supervision (the supervision is generated automatically)
● e.g., mask part of the input and predict the masked information
Semi-supervised Learning
● Learning from data that mostly consists of unlabeled samples
● A small amount of human-labeled data is available as well
Autoencoders
Autoencoders
Find efficient representations of input data that could be used to reconstruct the
original input using two components:
● Encoder
○ Converts the inputs to an internal representation
○ Dimensionality reduction
● Decoder
○ Converts the internal representation to the outputs
○ Generative network
Autoencoders
The number of outputs is the same as the number of inputs
The hourglass shape creates a bottleneck layer: a lower-dimensional representation
Autoencoders
It is forced to learn the most important features in the input data and drop the
unimportant ones
Applications
● Feature Extraction
● Unsupervised Pre-training
● Dimensionality Reduction
● Generate new data
● Anomaly detection → autoencoders reconstruct outliers poorly, so a high reconstruction error flags an anomaly
PyTorch Implementation
import torch.nn as nn

class Autoencoder(nn.Module):
    def __init__(self):
        super(Autoencoder, self).__init__()
        encoding_dim = 32
        # Encoder: compress the flattened 28x28 image into a 32-dimensional code
        self.encoder = nn.Linear(28 * 28, encoding_dim)
        # Decoder: reconstruct the 784-dimensional image from the code
        self.decoder = nn.Linear(encoding_dim, 28 * 28)

    def forward(self, x):
        return self.decoder(self.encoder(x))

criterion = nn.MSELoss()  # reconstruction loss between output and input
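A minimal training sketch (train_loader, the Adam optimizer, and the epoch count are assumptions, not part of the original code):

import torch

model = Autoencoder()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for epoch in range(10):
    for images, _ in train_loader:           # labels are ignored (unsupervised)
        x = images.view(images.size(0), -1)  # flatten 28x28 images to 784-dim vectors
        x_hat = model(x)
        loss = criterion(x_hat, x)           # reconstruction error
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()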
Stacked Autoencoders
● Autoencoders can have multiple hidden layers: stacked (deep) autoencoders
● Typically symmetrical with regard to the central coding layer.
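A minimal stacked autoencoder sketch (the intermediate layer size of 128 is an illustrative assumption):

import torch.nn as nn

class StackedAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        # Encoder and decoder are symmetrical around the 32-dimensional coding layer
        self.encoder = nn.Sequential(
            nn.Linear(28 * 28, 128), nn.ReLU(),
            nn.Linear(128, 32),
        )
        self.decoder = nn.Sequential(
            nn.Linear(32, 128), nn.ReLU(),
            nn.Linear(128, 28 * 28), nn.Sigmoid(),  # pixel values in [0, 1]
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))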
Visualizing Reconstructions
One way to ensure that an autoencoder is properly trained is to compare the inputs
and the outputs.
[Figure: original image, noisy input image, and reconstructed image]
Generating New Images
● Since we are drastically reducing the dimensionality of the image, there has to be
some kind of structure in the codings (i.e. embedding space).
● That is, the network should be able to save space by mapping similar images to
similar embeddings.
● Let’s see how we can exploit this to allow us to generate new types of images.
New Images with Interpolation
● First compute low-dimensional embeddings of two images.
● Then interpolate between the two embeddings and decode those as well!
● Interpolated codings result in new images that are somewhere in between the
two starting images.
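A minimal interpolation sketch, assuming a trained autoencoder as above and two flattened input images x1 and x2:

import torch

def interpolate(model, x1, x2, steps=8):
    # Decode codings linearly interpolated between the embeddings of two images
    with torch.no_grad():
        z1, z2 = model.encoder(x1), model.encoder(x2)
        images = []
        for alpha in torch.linspace(0, 1, steps):
            z = (1 - alpha) * z1 + alpha * z2   # interpolated coding
            images.append(model.decoder(z))     # decode into a new in-between image
    return images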
Plotting Interpolated Codings
We can do this for other image combinations
Plotting Interpolated Codings
What if we randomly select a coding?
The latent space in autoencoders can become disjoint and non-continuous
Variational AutoEncoders (VAE)
VAEs
They are quite different from the autoencoders we have discussed so far:
● Probabilistic → their outputs are partly determined by chance even after training
● Generative → they can generate new instances that look like they were sampled
from the training set.
They impose a distributional constraint on the latent space so that it stays smooth.
VAEs
The encoder outputs a normal distribution with mean µ and standard deviation σ
instead of a fixed embedding.
An embedding is sampled from this distribution, and the decoder decodes the sample to
reconstruct the input.
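A minimal sketch of the encoder side with the reparameterization trick (the 32-dimensional code and the use of log-variance instead of σ directly are common-practice assumptions):

import torch
import torch.nn as nn

class VAEEncoder(nn.Module):
    def __init__(self, encoding_dim=32):
        super().__init__()
        self.fc_mu = nn.Linear(28 * 28, encoding_dim)      # mean of the distribution
        self.fc_logvar = nn.Linear(28 * 28, encoding_dim)  # log-variance of the distribution

    def forward(self, x):
        mu, logvar = self.fc_mu(x), self.fc_logvar(x)
        sigma = torch.exp(0.5 * logvar)
        eps = torch.randn_like(sigma)   # noise sampled from N(0, I)
        z = mu + sigma * eps            # reparameterization trick: z ~ N(mu, sigma^2)
        return z, mu, logvar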
VAEs
We want the encoder distribution q(z|x) to be close to the prior p(z) = N(0, I).
We can use Kullback–Leibler (KL) divergence to measure the difference between two
distributions P(X) and Q(X):
D_KL(P || Q) = Σ_x P(x) log( P(x) / Q(x) )
If we plug the encoder distribution and the prior into the KL divergence of two
multivariate Gaussians, we get:
D_KL( N(µ, σ²) || N(0, I) ) = ½ Σ_i ( σ_i² + µ_i² − 1 − log σ_i² )
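In code, the closed-form KL term can be computed from mu and logvar as returned by the encoder sketch above; combining it with a reconstruction loss is shown only as an assumption:

# KL divergence between N(mu, sigma^2) and N(0, I), summed over latent dimensions
kl = 0.5 * torch.sum(logvar.exp() + mu.pow(2) - 1.0 - logvar, dim=1)
loss = reconstruction_loss + kl.mean()  # reconstruction_loss assumed defined elsewhere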
VAEs
[Diagram: during training, the embedding is sampled from the encoder's distribution; for generating, it is sampled from N(0, I)]
Generating Data
Generate images that look like handwritten digits by training a variational
autoencoder.
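A minimal generation sketch, assuming a trained VAE whose decoder maps a 32-dimensional coding back to a flattened 28×28 image (variable names are assumptions):

import torch

with torch.no_grad():
    z = torch.randn(16, 32)             # sample 16 codings from the prior N(0, I)
    samples = vae.decoder(z)            # decode them into 16 new digit-like images
    samples = samples.view(-1, 28, 28)  # reshape flat vectors back into images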
Intermission
(5 to 10 min break)
Convolutional Autoencoders
Convolutional Autoencoder
Convolutional autoencoders take advantage of spatial information.
● Encoder → Learns visual embedding using convolutional layers
● Decoder → Up-samples the learned visual embedding to match the original size
of the image.
Transposed Convolution
The opposite of a convolution is the transposed convolution (different from an
inverse convolution).
They work with filters, kernels, padding, and strides just like convolutional layers.
Instead of mapping K×K pixels to 1, they can map from 1 pixel to K×K pixels.
The kernels are learned just like normal convolutional kernels.
output dimension = (input dimension − 1) × stride − 2 × padding + kernel size + output padding (assuming dilation = 1)
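For example, an 8×8 input with stride 2, kernel size 3, padding 0, and output padding 0 gives (8 − 1) × 2 − 2 × 0 + 3 + 0 = 17, so the output is 17×17.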
Transposed Convolution
1. Take each pixel of your input image
2. Multiply each value of your kernel by the input pixel to get a weighted kernel
3. Insert it into the output, at the position given by the stride, to create an image
4. Where the outputs overlap, sum them (see the sketch below)
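A minimal sketch of these four steps for a single-channel image (no padding; the function name is an assumption):

import torch

def transposed_conv2d_naive(img, kernel, stride=1):
    H, W = img.shape
    K = kernel.shape[0]
    out = torch.zeros((H - 1) * stride + K, (W - 1) * stride + K)
    for i in range(H):                      # 1. take each input pixel
        for j in range(W):
            weighted = img[i, j] * kernel   # 2. weight the whole kernel by that pixel
            # 3. insert it into the output at the strided position
            # 4. overlapping regions are summed by "+="
            out[i * stride:i * stride + K, j * stride:j * stride + K] += weighted
    return out

This mirrors what nn.ConvTranspose2d does for a single channel with no padding and no bias.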
Padding
The effect is the opposite of what happens with the convolution layers:
1. Compute the output as normal
2. Remove rows and columns around the perimeter
Output padding
● When stride > 1, Conv2d maps multiple input shapes to the same output shape.
● E.g., inputs of size 7×7 and 8×8 both return a 3×3 output for a kernel of size
3×3 with stride=2 (see the check below).
● When applying the transposed convolution with stride=2, it is ambiguous which
output shape to return, 7×7 or 8×8.
● Output padding resolves this ambiguity by effectively increasing the calculated
output shape on one side.
● It is only used to compute the output shape; it does not actually add
zero-padding to the output.
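A quick check of the ambiguity (the layer sizes are only for illustration):

import torch
import torch.nn as nn

conv = nn.Conv2d(1, 1, kernel_size=3, stride=2)
conv(torch.randn(1, 1, 7, 7)).shape  # torch.Size([1, 1, 3, 3])
conv(torch.randn(1, 1, 8, 8)).shape  # torch.Size([1, 1, 3, 3]), same output shape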
Strides
The effect is also the opposite of what happens with the convolution layers:
increasing the stride increases the upsampling effect.
[Figure: transposed convolution with stride s=2]
PyTorch Implementation
A convolution transpose layer with the exact same specifications as the convolution
layer would have the reverse effect on the shape.
convt = nn.ConvTranspose2d(in_channels=16,
                           out_channels=8,
                           kernel_size=5,
                           padding=2)
x = torch.randn(32, 16, 64, 64)
y = convt(x)
y.shape  # torch.Size([32, 8, 64, 64]); stride 1 keeps the spatial size

# With stride=2 the spatial dimensions roughly double:
convt = nn.ConvTranspose2d(in_channels=16,
                           out_channels=8,
                           kernel_size=5,
                           stride=2,
                           padding=2)
y = convt(x)
y.shape  # torch.Size([32, 8, 127, 127])

# output_padding=1 grows the computed shape by one, giving an exact doubling:
convt = nn.ConvTranspose2d(in_channels=16,
                           out_channels=8,
                           kernel_size=5,
                           stride=2,
                           padding=2,
                           output_padding=1)
y = convt(x)
y.shape  # torch.Size([32, 8, 128, 128])
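Putting the pieces together, a minimal convolutional autoencoder sketch (the channel sizes and two-layer depth are illustrative assumptions):

import torch.nn as nn

class ConvAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        # Encoder: stride-2 convolutions halve the spatial size twice (28 -> 14 -> 7)
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
        )
        # Decoder: transposed convolutions upsample back to 28x28
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, kernel_size=3, stride=2,
                               padding=1, output_padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 1, kernel_size=3, stride=2,
                               padding=1, output_padding=1), nn.Sigmoid(),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))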
An embed method exposes the learned representation, e.g. as input to a downstream classifier:

def embed(self, x):
    return self.encoder(x)
Self-Supervised Learning
Self-supervised learning with pretext tasks
What if we could cast unsupervised learning into a supervised setting?
Define proxy (pretext) supervised tasks such that:
● The labels are generated automatically, for free
● Solving the task requires the model to "understand" the content
The challenge is devising the tasks such that they force the model to learn robust
representations.
RotNet
Idea: Rotate images randomly by 0, 90, 180, or 270 degrees and make the model
predict the rotation angle.
If someone is not aware of the concepts of the objects depicted in the images, they
cannot recognize the rotation that was applied to them.
RotNet
The task is multiclass classification with 4 classes (cross-entropy loss), with the
labels generated automatically for free.
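A minimal sketch of generating the free rotation labels (batching details and the function name are assumptions):

import torch

def make_rotation_batch(images):
    # Each image of shape (C, H, W) is rotated by a random multiple of 90 degrees;
    # the multiple (0-3) is the free classification label.
    labels = torch.randint(0, 4, (images.size(0),))
    rotated = torch.stack([
        torch.rot90(img, k=int(k), dims=(1, 2))  # rotate in the spatial plane
        for img, k in zip(images, labels)
    ])
    return rotated, labels  # train with nn.CrossEntropyLoss on 4 classes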
Contrastive Learning
Autoencoding methods:
● Reconstruct the input
● Compute the loss in output space
● Compress all the details
Contrastive methods:
● Contrast pairs of positive/negative samples
● Compute the loss in embedding space
● Compress relevant information
● Require lots of negative examples
SimCLR
[Diagram: SimCLR architecture, a CNN encoder followed by an MLP projection head]
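A minimal sketch of a SimCLR-style contrastive (NT-Xent) loss, assuming z1 and z2 are the MLP-projected embeddings of two augmented views of the same batch of images; the temperature value is an assumption:

import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.5):
    # Positive pair: the two views of the same image; negatives: everything else in the batch
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)  # 2N normalized embeddings
    sim = z @ z.t() / temperature                       # pairwise cosine similarities
    sim.fill_diagonal_(float('-inf'))                   # a sample is never its own negative
    n = z1.size(0)
    # For row i, the positive is the other view: i + n (first half) or i - n (second half)
    targets = torch.cat([torch.arange(n) + n, torch.arange(n)])
    return F.cross_entropy(sim, targets)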
Questions?