
Lecture 12 - Deep Learning


Deep Learning

Santosh GSK
Industry Expert
What is Deep Learning?

• A subfield of machine learning that learns representations of data, using algorithms that attempt to learn (multiple levels of) representation through a hierarchy of multiple neural layers
• If you provide the system with tons of information, it begins to understand it and respond in useful ways.
• DL is exceptionally effective at learning patterns.
Background - Machine Learning Example

• Suppose we want to separate two categories of data by drawing a line between them in a
scatterplot.
• In the plot on the left, we represent some data using Cartesian coordinates, and the task is impossible with a straight line.
• In the plot on the right, we represent the same data with polar coordinates, and the task becomes simple to solve with a vertical line.
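This coordinate-change idea can be reproduced in a few lines. A minimal sketch in NumPy: the inner-disc and outer-ring toy data is a hypothetical stand-in for the slide's scatterplot, and the threshold 1.5 is an illustrative choice.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: class 0 is an inner disc, class 1 an outer ring.
n = 200
theta = rng.uniform(0, 2 * np.pi, n)
r_inner = rng.uniform(0.0, 1.0, n)      # class 0: radius in [0, 1)
r_outer = rng.uniform(2.0, 3.0, n)      # class 1: radius in [2, 3)

# Cartesian coordinates: no single straight line separates the classes.
x0 = np.column_stack([r_inner * np.cos(theta), r_inner * np.sin(theta)])
x1 = np.column_stack([r_outer * np.cos(theta), r_outer * np.sin(theta)])

# Polar representation: the radius alone separates the classes,
# i.e. a "vertical line" at r = 1.5 in the (r, theta) plot.
r0 = np.hypot(x0[:, 0], x0[:, 1])
r1 = np.hypot(x1[:, 0], x1[:, 1])
print(r0.max() < 1.5 < r1.min())  # True: a threshold on r separates them
```

The representation (polar vs. Cartesian), not the classifier, is what makes the task easy.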
Solution

• Use Machine Learning to discover not only the mapping from representation to output but the
representation as well
• This is called Representation Learning
• Enable AI systems to rapidly adapt to new tasks with minimal human intervention

• Manually designing features for a complex task requires a great deal of human effort
• The quintessential example of representation learning is the autoencoder.
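The autoencoder idea can be sketched with a tiny linear model trained by gradient descent. This is a minimal sketch in plain NumPy; the toy 4-D data lying on a 2-D subspace, the sizes, and the learning rate are all illustrative assumptions, not from the slides.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy data: 4-D points that actually lie on a 2-D subspace.
basis = rng.normal(size=(2, 4))
codes = rng.normal(size=(200, 2))
x = codes @ basis

# Linear autoencoder: encoder w_e (4 -> 2), decoder w_d (2 -> 4).
w_e = rng.normal(scale=0.1, size=(4, 2))
w_d = rng.normal(scale=0.1, size=(2, 4))
lr = 0.01

def loss(w_e, w_d):
    recon = x @ w_e @ w_d
    return np.mean((recon - x) ** 2)

first = loss(w_e, w_d)
for _ in range(500):
    h = x @ w_e                        # 2-D learned representation
    recon = h @ w_d                    # reconstruction from the code
    err = recon - x                    # residual for the squared-error loss
    w_d -= lr * (h.T @ err) / len(x)   # gradient step on the decoder
    w_e -= lr * (x.T @ (err @ w_d.T)) / len(x)  # and on the encoder

# Reconstruction error should drop during training.
print(first, loss(w_e, w_d))
```

The encoder learns a compact representation and the decoder learns to reconstruct the input from it; no labels are needed.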
Drawbacks of traditional learning

• A major source of difficulty in many real-world AI applications is that factors of variation influence
every single example of data we observe
• Most applications require us to disentangle the factors of variation and discard the ones we do not care about
• Many of these factors of variation, such as speaker’s accent (in Speech Recognition), can be
identified only using sophisticated, nearly human-level understanding of data
• When it is nearly as difficult to obtain a representation as to solve the problem, representation
learning does not, at first glance, seem to help us
Deep Learning to the Rescue

• DL solves these problems in representation learning by introducing representations that are expressed in terms of other, simpler representations
• DL enables the computer to build complex concepts out of simpler concepts.
• Other challenges, such as modeling non-linearity, handling complex data types, and feature engineering, can also be addressed efficiently using Deep Learning
MLP vs Deep Learning
Illustrations of a Deep Learning Model

• Learning or evaluating such a mapping seems insurmountable if tackled directly
• DL solves this mapping by breaking a
complicated mapping into nested simple
mappings, each described at a layer
• A series of hidden layers extract abstract
features from the image. These layers are
called “hidden” because these values are
not given in the data
• The visualizations are a representation of
features at each hidden layer
• Given the input pixels, the first layer identifies edges
• The second layer can search for corners and contours, which are recognizable as collections of edges
Illustrations of a Deep Learning Model

• Given the second layer’s representation of edges, the third layer can detect entire parts of specific objects
• Finally, this description of the image in terms of the object parts it contains can be used to recognize the objects present in the image
Depth of a model

• The idea of learning the right representation for the data provides one perspective on deep
learning.
• Another perspective on deep learning is that depth allows the computer to learn a multi-step computer program.
• Each layer of the representation can be thought of as the state of the computer’s memory after
executing another set of instructions in parallel.
• Networks with greater depth can execute more instructions in sequence.
• Sequential instructions offer great power because later instructions can refer back to the results
of earlier instructions.
Depth of a model

• There are two main ways of measuring the depth of a model

1. The first view is based on the number of sequential instructions that must be executed to evaluate the architecture.
○ We can think of this as the length of the longest path through a flow chart that describes how to compute each of the model’s outputs given its inputs.
○ Just as two equivalent computer programs will have different lengths depending on which language the program is written in, the same function may be drawn as a flowchart with different depths depending on which functions we allow to be used as individual steps in the flowchart.
Depth of a model

2. Another approach, used by deep probabilistic models, regards the depth of a model as being not the depth of the computational graph but the depth of the graph describing how concepts are related to each other.
○ This is because the system’s understanding of the simpler concepts can be refined given information about the more complex concepts.
○ For example, an AI system observing an image of a face with one eye in shadow may initially only see one eye. After detecting that a face is present, it can then infer that a second eye is probably present as well.
○ The graph of concepts includes only two layers (a layer for eyes and a layer for faces), but the graph of computations includes 2n layers if we refine our estimate of each concept n times given the others.
Depth of a model

• Because it is not always clear which of these two views is most relevant, there is no single correct value for the depth of an architecture, just as there is no single correct value for the length of a computer program.
• Nor is there a consensus about how much depth a model requires to qualify as “deep.”

• However, deep learning can safely be regarded as the study of models that involve a greater amount of composition of learned functions or learned concepts than traditional machine learning does.

• Deep learning is a particular kind of machine learning that achieves great power and flexibility by
learning to represent the world as a nested hierarchy of concepts,
• with each concept defined in relation to simpler concepts, and
• more abstract representations computed in terms of less abstract ones
Deep Learning and AI
Flowchart of AI concepts
Convolutional Neural Networks

[Figure: convolving an input image with a 3x3 filter]
CNN – Filters
Convolution in 3 dimensions

= (1*0)+(1*0)+(1*1)+(1*1) +
(0*1)+(1*0)+(1*1)+(0*0) +
(1*1)+(2*1)+(3*1)+(4*1)

= 13

• Even a 3D convolution gives a 2D output
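The slide's arithmetic can be checked directly. A sketch in NumPy: which factor of each product is the patch value and which is the filter weight is an assumption here, since the slide shows only the products, but the sum is the same either way.

```python
import numpy as np

# One 2x2 patch per input channel (depth 3), dotted with a 2x2x3 filter.
patch = np.array([
    [[1, 1], [1, 1]],   # channel 1 values
    [[0, 1], [1, 0]],   # channel 2 values
    [[1, 2], [3, 4]],   # channel 3 values
])
filt = np.array([
    [[0, 0], [1, 1]],   # channel 1 weights
    [[1, 0], [1, 0]],   # channel 2 weights
    [[1, 1], [1, 1]],   # channel 3 weights
])

# Elementwise products summed over width, height, AND depth:
# the 3-D convolution at one position collapses to a single scalar.
out = np.sum(patch * filt)
print(out)  # 13
```

Sliding this filter over every spatial position produces one 2-D activation map, which is why even a 3-D convolution gives a 2-D output per filter.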
CNN Summary

To summarize, the Conv Layer:

● Accepts a volume of size W1×H1×D1
● Requires four hyperparameters:
○ Number of filters K,
○ their spatial extent F,
○ the stride S,
○ the amount of zero padding P.
● Produces a volume of size W2×H2×D2 where:
○ W2=(W1−F+2P)/S + 1
○ H2=(H1−F+2P)/S + 1
○ D2=K
● With parameter sharing, it introduces F⋅F⋅D1 weights per filter, for a total of (F⋅F⋅D1)⋅K weights and K biases.

A common setting of the hyperparameters is F=3, S=1, P=1, but this varies across problems and
architectures.
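The output-size and parameter formulas above can be wrapped in a small helper. A sketch: the example input volume 32×32×3 and K=10 filters are illustrative choices, not from the slides.

```python
def conv_output_shape(w1, h1, d1, k, f, s, p):
    """Output volume and parameter count for a conv layer (slide formulas)."""
    w2 = (w1 - f + 2 * p) // s + 1   # W2 = (W1 - F + 2P)/S + 1
    h2 = (h1 - f + 2 * p) // s + 1   # H2 = (H1 - F + 2P)/S + 1
    d2 = k                           # D2 = K, one output slice per filter
    weights = f * f * d1 * k         # parameter sharing: F*F*D1 weights per filter
    biases = k                       # one bias per filter
    return (w2, h2, d2), weights + biases

# The common setting F=3, S=1, P=1 preserves the spatial size:
shape, params = conv_output_shape(32, 32, 3, k=10, f=3, s=1, p=1)
print(shape, params)  # (32, 32, 10) 280
```

Note how cheap parameter sharing makes this: 280 parameters regardless of the 32×32 spatial extent, whereas a fully connected layer over the same volumes would need millions.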
Why do CNNs work?

• The networks can be large, and hence bias is minimized
• Many weights are effectively zero because of the sparse connectivity of convolution.
• ReLU activations and Dropout deactivate even more weights.
• Hence, variance is also minimized
When Context is important

• Sometimes, we need to have the contextual information to be able to perform Machine Learning
predictions

• E.g., Estimating the probabilities of words given the context


• The clouds are in the ___ (sky – no further context is needed)
• I fell in love with this French girl. My parents were against it initially, as they were worried about the
cultural differences. However, after they met her, they realized how wonderful a person she is and
agreed to our marriage. All that is left is convincing her parents. Here, I am booking my tickets to
______ (a lot of context is needed)
Recurrent Neural Network (RNN)
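The context-carrying mechanism of an RNN can be sketched as a single vanilla recurrent cell: the hidden state is updated at each step and so summarizes everything seen so far. A minimal sketch; the sizes, random inputs, and weight scales are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)

# Vanilla RNN cell: h_t = tanh(x_t @ W_xh + h_{t-1} @ W_hh + b)
d_in, d_hid = 3, 4
w_xh = rng.normal(scale=0.1, size=(d_in, d_hid))   # input-to-hidden weights
w_hh = rng.normal(scale=0.1, size=(d_hid, d_hid))  # hidden-to-hidden weights
b = np.zeros(d_hid)

def rnn_step(x, h):
    return np.tanh(x @ w_xh + h @ w_hh + b)

h = np.zeros(d_hid)                     # empty context to start
for x in rng.normal(size=(5, d_in)):    # a sequence of 5 input vectors
    h = rnn_step(x, h)                  # context flows through h

print(h.shape)  # (4,)
```

Because `w_hh` feeds each hidden state back into the next step, a prediction at the final step can depend on words seen many steps earlier, which is exactly what the fill-in-the-blank examples above require.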
Tasks where context is useful
Thank You!
In our next session:
Optimization Models
