Lecture: Machine Learning
Machine learning is programming computers to optimize a performance criterion using example data or past
experience. Machine learning can automatically detect patterns in data and then use the uncovered patterns to
predict future data or other outcomes of interest.
[Figure: human learning (observations, learning) compared with machine learning (data, model/predictor)]
Start-Tech Academy
MACHINE LEARNING
Statistics-based machine learning essentially tries to find the relationship between input and output
variables.
Information extracted from data is used to make key business and political decisions.
Machine learning is closely associated with statistics, AI, and data mining.
Use cases: Banking / Telecom / Retail
• Identify:
– Prospective customers
– Dissatisfied customers
– Good customers
– Bad payers
• Obtain:
– More effective advertising
– Less credit risk
– Less fraud
– Decreased churn rate
Use cases: Biomedical / Biometrics
• Medicine:
– Screening
– Diagnosis and prognosis
– Drug discovery
• Security:
– Face recognition
– Signature / fingerprint / iris verification
– DNA fingerprinting
Use cases: Computer / Internet
• Computer interfaces:
– Troubleshooting wizards
– Handwriting and speech
– Chat bots
• Internet:
– Hit ranking
– Spam filtering
– Text categorization
– Text translation
– Recommendation
For example, a real estate agent who wants to price a particular property will
have:
The goal is a function f such that, whenever s/he supplies values of the input variables, s/he
gets the price of the property.
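As a rough illustration of what such a function looks like once it has been estimated, here is a minimal Python sketch. The input variables (floor area and number of bedrooms) and every coefficient are hypothetical values chosen for illustration; they do not come from the lecture.

```python
def estimated_price(area_sq_m, bedrooms):
    """A hypothetical, already-estimated f(x) for property prices.

    The coefficients below are invented for illustration only.
    """
    base_price = 50_000          # assumed price floor
    price_per_sq_m = 2_500       # assumed effect of floor area
    price_per_bedroom = 10_000   # assumed effect of each bedroom
    return base_price + price_per_sq_m * area_sq_m + price_per_bedroom * bedrooms


# The agent supplies input values and reads off a price.
quote = estimated_price(area_sq_m=80, bedrooms=2)
print(quote)
```

In practice the coefficients would be learned from past sales rather than written by hand.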
WHY ESTIMATE f(x)
f(x) defines the relationship between dependent and independent variables.
Highly non-linear models may predict very accurately, but the
relationship they capture may be very difficult to interpret.
HOW TO ESTIMATE f(x)
Next, we need to specify the type of learning method.
Parametric vs Non-parametric
Parametric approach
• Usually more interpretable
• May not be as accurate
• Preferable if inference is the reason for estimating f(x)
Non-parametric approach
• Less interpretable
• Potentially more accurate
• Needs a large amount of data to train
• Preferable if prediction is the priority
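The contrast can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data points are made up (they follow y = x squared), a straight-line fit stands in for the parametric approach, and a 2-nearest-neighbour average stands in for the non-parametric approach. Neither choice is prescribed by the lecture.

```python
# Hypothetical training data following a non-linear relationship (y = x**2).
xs = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
ys = [0.0, 1.0, 4.0, 9.0, 16.0, 25.0]


def fit_linear(xs, ys):
    """Parametric: assume y = w * x + w0 and estimate only the two parameters."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    w = sxy / sxx
    w0 = mean_y - w * mean_x
    return w, w0


def knn_predict(xs, ys, x_new, k=2):
    """Non-parametric: no assumed form; average the k closest training outputs."""
    nearest = sorted(zip(xs, ys), key=lambda pair: abs(pair[0] - x_new))[:k]
    return sum(y for _, y in nearest) / k


w, w0 = fit_linear(xs, ys)
x_new = 4.5
linear_pred = w * x_new + w0            # summarised by two readable numbers
knn_pred = knn_predict(xs, ys, x_new)   # needs the whole training set at predict time

print(f"linear: w={w:.2f}, w0={w0:.2f}, prediction={linear_pred:.2f}")
print(f"2-NN prediction={knn_pred:.2f}, true value={x_new ** 2:.2f}")
```

The linear model can be read directly (one slope, one intercept) but misses the curvature, while the neighbour average follows the data more closely and offers no compact description of the relationship.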
TYPES OF LEARNING
Supervised vs Unsupervised learning
Supervised Learning:
• Supervised learning is where you have input variables (X) and an output variable
(Y), and you use an algorithm to learn the mapping function from the input to the
output.
• The goal is to approximate the mapping function so well that, when you have
new input data (X), you can predict the output variable (Y) for that data.
Unsupervised Learning:
• Unsupervised learning is where you only have input data (X) and no
corresponding output variables.
• The goal of unsupervised learning is to model the underlying structure or
distribution in the data in order to learn more about the data.
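To make the unsupervised case concrete, the following Python sketch groups unlabeled numbers with a simple one-dimensional k-means loop. The weights, the starting centers, and the choice of k-means itself are assumptions made for illustration; the lecture does not name a specific algorithm at this point.

```python
def k_means_1d(points, centers, iterations=10):
    """Minimal 1-D k-means: assign each point to its nearest center, then update."""
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for p in points:
            # Index of the closest current center.
            idx = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[idx].append(p)
        # Move each center to the mean of its cluster (keep it if the cluster is empty).
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


# Hypothetical fruit weights in grams, with no category labels attached.
weights = [40.0, 60.0, 80.0, 100.0]
centers, clusters = k_means_1d(weights, centers=[40.0, 100.0])
print(centers)
print(clusters)
```

No output variable (Y) is supplied anywhere; the grouping comes only from the structure of the input data (X).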
Supervised Learning: Example
[Figure: labeled examples, each example paired with a label (label1, label3, label4, label5)]
Labeled examples (category and weight) passed to the model/predictor:
Category   Weight (g)
Apple      100
Apple      80
Banana     40
Banana     60
Supervised Learning: Classification
[Figure: model/predictor producing a predicted category]
Supervised Learning (Classification)
Classification:
• Example: Credit scoring
• Differentiating between low-risk and high-risk customers from their income
and savings
• Model - Discriminant
IF income > θ1 AND savings > θ2
THEN low-risk ELSE high-risk
Applications:
• Pattern recognition
• Face recognition
• Character recognition
• Medical diagnosis
• Web advertising
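The discriminant rule from the credit-scoring example can be written directly as code. In this sketch the threshold values for θ1 and θ2 are hypothetical and hand-picked; in a real supervised setting they would be learned from labeled customer records.

```python
# Hypothetical thresholds; in practice these are estimated from labeled data.
THETA_1 = 40_000  # assumed income threshold (θ1)
THETA_2 = 10_000  # assumed savings threshold (θ2)


def credit_risk(income, savings, theta1=THETA_1, theta2=THETA_2):
    """IF income > θ1 AND savings > θ2 THEN low-risk ELSE high-risk."""
    if income > theta1 and savings > theta2:
        return "low-risk"
    return "high-risk"


print(credit_risk(income=55_000, savings=20_000))
print(credit_risk(income=55_000, savings=5_000))
```

Both conditions must hold for a customer to be classed as low-risk, so a high income alone is not enough.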
Supervised Learning: Regression
[Figure: model/predictor producing a predicted weight]
Supervised Learning (Regression)
Regression:
• Example: Price of a used car
• x : car attributes (e.g. mileage)
• y : price
• Model – Linear Regression
y = wx + w0
Applications:
• Weather forecast
• Sales forecasting
• Advertising budget allocation
• Product pricing
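A minimal sketch of estimating w and w0 for the used-car example by ordinary least squares is shown below. The mileage and price figures are invented for illustration, and the closed-form fit is one common way to estimate the line, not necessarily the method used in the lecture.

```python
# Hypothetical used-car data: mileage in thousands of km, price in currency units.
mileage = [20.0, 40.0, 60.0, 80.0]
price = [18_000.0, 16_000.0, 14_000.0, 12_000.0]


def fit_line(xs, ys):
    """Ordinary least squares for y = w * x + w0 with a single input variable."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    w = sxy / sxx
    w0 = mean_y - w * mean_x
    return w, w0


w, w0 = fit_line(mileage, price)


def predict_price(x):
    """Predicted price for a car with mileage x (thousands of km)."""
    return w * x + w0


print(w, w0, predict_price(50.0))
```

With this made-up data the fitted slope is negative, which matches the intuition that price falls as mileage rises.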
Supervised Learning Algorithms
Unsupervised Learning: Example
Unsupervised Learning Algorithms
Machine Learning Model
2. Data Tidying
• Filter data
• Aggregate values
• Variable transformation
• Variable reduction
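The four operations listed above can be sketched on a small in-memory dataset. Everything in this example is hypothetical: the records, the column names, and the specific choices (dropping rows with a missing price, averaging by city, taking a log of price, and keeping two columns) are illustrative assumptions only.

```python
import math
from collections import defaultdict

# Hypothetical raw records; None marks a missing value.
raw = [
    {"city": "A", "area": 50.0, "price": 100_000.0, "agent_id": 7},
    {"city": "A", "area": 70.0, "price": 150_000.0, "agent_id": 3},
    {"city": "B", "area": 60.0, "price": None, "agent_id": 7},
    {"city": "B", "area": 90.0, "price": 200_000.0, "agent_id": 5},
]

# Filter data: keep only records with a known price (copied so raw stays untouched).
rows = [dict(r) for r in raw if r["price"] is not None]

# Aggregate values: mean price per city.
prices_by_city = defaultdict(list)
for r in rows:
    prices_by_city[r["city"]].append(r["price"])
mean_price = {city: sum(v) / len(v) for city, v in prices_by_city.items()}

# Variable transformation: replace the skewed price scale with its logarithm.
for r in rows:
    r["log_price"] = math.log(r["price"])

# Variable reduction: keep only the columns assumed to be useful for modelling.
keep = ("area", "log_price")
tidy = [{k: r[k] for k in keep} for r in rows]

print(mean_price)
print(tidy)
```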
5. Model Training
y = f(x)
where y is the output, f is the function, and x is the input variables
6. Performance Metrics and Validation
In-sample error
• Error that results from applying your prediction algorithm to the dataset you
built it with
Out-of-sample error
• Error that results from applying your prediction algorithm to a new data set
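The two error types can be compared with a short Python sketch. The training and new data points are made up, a straight-line fit is assumed as the prediction algorithm, and mean squared error is assumed as the metric; none of these specifics come from the lecture.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = w * x + w0."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    w = sxy / sxx
    return w, mean_y - w * mean_x


def mean_squared_error(xs, ys, w, w0):
    """Average squared gap between observed and predicted outputs."""
    return sum((y - (w * x + w0)) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Hypothetical data: the model is built on the training set only.
train_x, train_y = [1.0, 2.0, 3.0, 4.0], [1.1, 1.9, 3.2, 3.8]
new_x, new_y = [5.0, 6.0], [5.3, 5.7]

w, w0 = fit_line(train_x, train_y)
in_sample_error = mean_squared_error(train_x, train_y, w, w0)
out_of_sample_error = mean_squared_error(new_x, new_y, w, w0)

print(f"in-sample MSE:     {in_sample_error:.4f}")
print(f"out-of-sample MSE: {out_of_sample_error:.4f}")
```

With this made-up data the out-of-sample error comes out larger than the in-sample error, which is the usual pattern because the model was tuned to the data it was built with.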