Lecture 12 - Deep Learning
Lecture 12 - Deep Learning
Lecture 12 - Deep Learning
Santosh GSK
Industry Expert
What is Deep Learning?
• A machine learning subfield of learning representations of data, where we use algorithms that
attempt to learn (multiple levels of) representation by using a hierarchy of multiple neural
layers
• If you provide the system tons of information, it begins to understand it and respond in useful
ways.
• DL is exceptionally effective at learning patterns.
Background - Machine Learning Example
• Suppose we want to separate two categories of data by drawing a line between them in a
scatterplot.
• In the plot on the left, we represent some data using Cartesian coordinates, and the task is
impossible.
• In the plot on the right, we represent the data with polar coordinates and the task becomes
simple to solve with a vertical line.
Solution
• Use Machine Learning to discover not only the mapping from representation to output but the
representation as well
• This is called Representation Learning
• Enable AI systems to rapidly adapt to new tasks with minimal human intervention
• Manually designing features for complex task requires a great deal of human effort
• The quintessential example of representation learning is autoencoder.
Drawbacks of traditional learning
• A major source of difficulty in many real-world AI applications is that factors of variation influence
every single example of data we observe
• Most applications want us to disentangle the factors of variation and discard the ones we do not
care about
• Many of these factors of variation, such as speaker’s accent (in Speech Recognition), can be
identified only using sophisticated, nearly human-level understanding of data
• When it is nearly as difficult to obtain a representation as to solve the problem, representation
learning does not, at first glance, seem to help us
Deep Learning to rescue
• The idea of learning the right representation for the data provides one perspective on deep
learning.
• Another perspective on deep learning is that depth allows the computer to learn a multi-step
computer program
• Each layer of the representation can be thought of as the state of the computer’s memory after
executing another set of instructions in parallel.
• Networks with greater depth can execute more instructions in sequence.
• Sequential instructions offer great power because later instructions can refer back to the results
of earlier instructions.
Depth of a model
2. Another approach, used by deep probabilistic models, regards the depth of a model as being
not the depth of the computational graph but the depth of the graph describing how concepts
are related to each other.
1. This is because the system’s understanding of the simpler concepts can be refined given information
about the more complex concepts.
2. For example, an AI system observing an image of a face with one eye in shadow may initially only see
one eye. After detecting that a face is present, it can then infer that a second eye is probably present
as well.
3. The graph of concepts only includes two layers—a layer for eyes and a layer for faces—but the graph
of computations includes 2n layers if we refine our estimate of each concept given the other n times.
Depth of a model
• Because it is not always clear which of these two views is most relevant, and
• There is no single correct value for the depth of an architecture, just as there is no single correct
value for the length of a computer program.
• Nor is there a consensus about how much depth a model requires to qualify as “deep.”
• However, deep learning can safely be regarded as the study of models that either involve a
greater amount of composition of learned functions or learned concepts than traditional machine
learning does.
• Deep learning is a particular kind of machine learning that achieves great power and flexibility by
learning to represent the world as a nested hierarchy of concepts,
• with each concept defined in relation to simpler concepts, and
• more abstract representations computed in terms of less abstract ones
Deep Learning and AI
Flowchart of AI concepts
Convolutional Neural Networks
Convolutional
Input Image
3x3 filter
CNN – Filters
Convolution in 3 dimensions
= (1*0)+(1*0)+(1*1)+(1*1)+
(0*1)+(1*0)+(1*1)+(0*0) +
(1*1)+(2*1)+(3*1)+(4*1)
= 13
A common setting of the hyperparameters is F=3,S=1,P=1. But varies for different types of problems and
architectures.
Why do CNNs work?
• Sometimes, we need to have the contextual information to be able to perform Machine Learning
predictions