1. Machine Learning - Introduction
Introduction
Disclaimer: This material is protected under copyright by AnalytixLabs ©, 2011-2016. Unauthorized use and/or duplication of this material or any part of it, including data, in any form without explicit written permission from AnalytixLabs is strictly prohibited. Any violation of this copyright will attract legal action.
Introduction to Machine Learning
Application Information:
• Age: 23 years
• Gender: Male
• Annual salary: USD 60,000
• Years in residence: 1 year
• Years in job: 0.5 year
• Current debt: USD 5,000
[Diagram: data feeds machine learning, which improves the performance measure]
Machine Learning
• Supervised Learning: statistical modeling, recommender systems
• Unsupervised Learning: clustering, association rules, dimension reduction, self-organizing maps
• Reinforced Learning
Types of Learning
• Supervised (Inductive) Learning
• Training data includes desired outputs
• Labelled input data.
• Creating classifiers to predict unseen inputs.
• Unsupervised Learning
• Training data does not include desired outputs
• Unlabelled input data.
• Creating a function that captures the relations and structure in the data
• Semi-supervised Learning
• Training data includes a few desired outputs
• Combines Supervised and Unsupervised Learning methodology
• Reinforcement Learning (e.g., Markov Decision Processes, Q-Learning, etc.)
• No label is provided; the feedback only indicates whether a chosen label/action is correct or not
• Rewards arrive from a sequence of actions
• A reward-punishment based agent (see the sketch below)
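To make the reward-punishment idea concrete, here is a minimal Q-learning sketch on a hypothetical 1-D chain world; the environment, states, and reward values are illustrative assumptions, not from the course material:

```python
import random

# A minimal Q-learning sketch on a hypothetical 1-D chain world
# (states, actions and rewards are illustrative assumptions).
n_states, n_actions = 5, 2          # states 0..4; actions: 0 = left, 1 = right
alpha, gamma, epsilon = 0.5, 0.9, 0.1
Q = [[0.0] * n_actions for _ in range(n_states)]

def step(state, action):
    """Move left/right along the chain; reward 1 only for reaching the last state."""
    nxt = max(0, state - 1) if action == 0 else min(n_states - 1, state + 1)
    return nxt, (1.0 if nxt == n_states - 1 else 0.0)

for episode in range(200):
    s = 0
    while s != n_states - 1:
        # Epsilon-greedy: mostly exploit the best known action, sometimes explore.
        a = random.randrange(n_actions) if random.random() < epsilon \
            else max(range(n_actions), key=lambda i: Q[s][i])
        s2, r = step(s, a)
        # Reward-punishment update: nudge Q(s, a) toward r + gamma * max_a' Q(s', a').
        Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
        s = s2

print(Q)  # the "go right" action should end up with the higher value in each state
```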
Supervised Learning
1. Supervised (inductive) Learning: learn a target function (a classifier) from a labeled data-set.

Training Data (labeled data-set):
Patient Age | Tumor Size | Clump Thickness | … | Malignant?
55          | 5          | 3               | … | TRUE
70          | 4          | 7               | … | TRUE
85          | 4          | 6               | … | FALSE
35          | 2          | 1               | … | FALSE
…           | …          | …               | … | …

Training Data → LEARNING → Model

Test Data:
Patient Age | Tumor Size | Clump Thickness | … | Malignant?
72          | 3          | 3               | … | ?
66          | 4          | 4               | … | ?

• Training data includes both predictors (Xi) and response (Yi)
• Labelled input data
• Creating classifiers to predict unseen inputs

Applications:
• Spam detection
• Information retrieval
• Personalisation based on ranks
• Speech recognition
Supervised Learning - Algorithms
• Linear Regression
• Logistic Regression
• Decision Trees (CHAID, CART & Random Forest)
• k-Nearest Neighbours (KNN)
• Naive Bayes (Bayesian Learning)
• Discriminant Analysis (LDA/QDA) – classification using linear (LDA) or quadratic (QDA) decision boundaries
• Neural Networks
• SVM and Kernel estimation
• Perceptron and Multi-layer Perceptrons (ANN – Deep Learning)
• Ensemble Models
• …
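As a minimal illustration of one of these algorithms, the sketch below fits a logistic regression classifier to the tumor table from the previous slide, assuming scikit-learn is available (the library choice is an assumption, not prescribed by the course):

```python
# A minimal supervised-learning sketch, assuming scikit-learn is available.
from sklearn.linear_model import LogisticRegression

# Labeled training data from the slide: age, tumor size, clump thickness -> malignant?
X_train = [[55, 5, 3], [70, 4, 7], [85, 4, 6], [35, 2, 1]]
y_train = [1, 1, 0, 0]  # TRUE, TRUE, FALSE, FALSE

clf = LogisticRegression().fit(X_train, y_train)

# Predict labels for the unseen test rows from the slide.
print(clf.predict([[72, 3, 3], [66, 4, 4]]))
```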
Un-supervised Learning
2. Un-supervised learning detects natural patterns.

Input data (no label):
Age | State | Annual Income | Marital Status
25  | CA    | $80,000       | M
45  | NY    | $150,000      | D
55  | WA    | $100,500      | M
18  | TX    | $85,000       | S
…   | …     | …             | …

→ Naturally occurring (hidden) structure

• Un-labelled input data
• In this situation only the Xi's are observed
• We need to use the Xi's to guess what Yi would have been, and build a model from there
• Finding hidden structure in data
• Creating a function that captures the relations and structure in the data

Applications:
• Market segmentation – divide potential customers into groups based on their characteristics
• Pattern recognition
• Groupings based on a distance measure
• Groups of people, objects, …
Unsupervised Learning - Algorithms
• Clustering
• k-Means, Hierarchical Clustering
• Hidden Markov Models (HMM)
• Dimension Reduction (Factor Analysis, PCA)
• Feature Extraction methods
• Self-organizing Maps (Neural Nets)
• …
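A minimal clustering sketch, assuming scikit-learn and using the numeric age/income columns from the customer table above (the cluster count of 2 is an arbitrary illustrative choice):

```python
# A minimal clustering sketch, assuming scikit-learn.
from sklearn.cluster import KMeans

# Numeric columns (age, annual income) from the customer table above; no labels.
X = [[25, 80_000], [45, 150_000], [55, 100_500], [18, 85_000]]

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(km.labels_)  # group assignments discovered from the data's structure alone
```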
Semi-Supervised Learning
• Training data includes a few desired outputs
• Combines Supervised & Unsupervised Learning methodologies
Application:
• Webpage classification
Reinforced Learning

Regression: Predicting a Quantity
Some techniques:
- Linear Regression / GLM
- Decision Trees
- Support Vector Regression
- Ensembles
- Etc…
Classification: Predicting a Category
Binary Problems:
• credit approve/disapprove
• email spam/non-spam
• patient sick/not sick
• ad profitable/not profitable
• answer correct/incorrect

Multi-Class Problems:
• written digits ⇒ 0, 1, · · · , 9
• pictures ⇒ apple, orange, strawberry
• emails ⇒ spam, primary, social, promotion, update (Google)
Some techniques:
- Naïve Bayes
- Decision Tree
- Logistic Regression/GLM
- Support Vector Machines
- Neural Network
- Ensembles
- LDA, QDA
- Etc…
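As a sketch of a multi-class problem, the code below trains one of the listed techniques (a decision tree) on scikit-learn's bundled written-digits data (0..9); the library and dataset are assumptions for illustration:

```python
# A multi-class sketch on scikit-learn's bundled written-digits data (0..9).
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_digits(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

model = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)
print(model.score(X_te, y_te))  # accuracy over all ten digit classes
```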
Product Affinity & Recommendation
A. Product-to-Product Affinity
B. Identifying frequent item sets
C. Customer-to-Product Propensity

Some techniques:
- Market Basket Analysis
- FP-Growth
- A-priori Algorithm
- Collaborative Filtering
- Etc…

Example transaction grid (Y = item present in the transaction):
      Item 1  Item 2  Item 3  Item 4  Item 5
Tx 1  Y       N       N       Y       N
Tx 2  Y       N       N       Y       N
Tx 3  Y       Y       N       Y       N
Tx 4  N       N       Y       Y       Y
Tx 5  …       …       …       …       …
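A minimal frequent-item-set sketch over the transaction grid above, counting item pairs by brute force (a simplified stand-in for the A-priori algorithm; the minimum-support threshold of 2 is an arbitrary choice):

```python
from itertools import combinations

# Frequent item-pair counting over the transaction grid from the slide
# (Y = item present in the transaction).
transactions = [
    {"Item 1", "Item 4"},            # Tx 1
    {"Item 1", "Item 4"},            # Tx 2
    {"Item 1", "Item 2", "Item 4"},  # Tx 3
    {"Item 3", "Item 4", "Item 5"},  # Tx 4
]
min_support = 2

counts = {}
for tx in transactions:
    for pair in combinations(sorted(tx), 2):
        counts[pair] = counts.get(pair, 0) + 1

print({p: c for p, c in counts.items() if c >= min_support})
# {('Item 1', 'Item 4'): 3} — the pair most often bought together
```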
B. Building a Machine Learning Application
1. Around 80% of data analysis time is spent on the process of cleaning & preparing the data.
2. Data cleaning starts with data tidying, where each variable is a column, each observation is a row, and each type of observational unit is a table.
3. Fixed variables should come first, followed by measured variables, each ordered so that related variables are contiguous. Rows can then be ordered by the first variable, breaking ties with the second and subsequent (fixed) variables.
[Diagram: messy Table-1, Table-2 and Table-3 reorganized into tidy tables]
Common Errors…
1. Column headers are values, not variable names.

Messy (time-series data with dates as columns):
Product  1-Jan-15  2-Jan-15  3-Jan-15  4-Jan-15  5-Jan-15  6-Jan-15
XAZ256   27        34        60        81        76        137
XAZ256   12        27        37        52        35        70
XAZ256   27        21        30        34        33        58
XAZ256   418       617       732       670       638       1116
XAZ256   1         9         7         9         11        34
XAZ256   20        27        24        24        21        30
XAZ256   19        19        25        25        30        95

Tidy (modeling & analysis becomes easy with this structure):
Product  DATE      Qty
XAZ256   1/1/2015  27
XAZ256   1/2/2015  34
XAZ256   1/3/2015  60
XAZ256   1/4/2015  81
XAZ256   1/5/2015  76
XAZ256   1/6/2015  137
XAZ256   1/1/2015  12
XAZ256   1/2/2015  27
…
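Assuming pandas, the wide-to-tidy reshaping above can be done with a single melt; the column names below mirror the tidy table shown:

```python
# A tidying sketch, assuming pandas: turn the date columns into rows.
import pandas as pd

wide = pd.DataFrame({
    "Product": ["XAZ256"],
    "1-Jan-15": [27], "2-Jan-15": [34], "3-Jan-15": [60],
})

tidy = wide.melt(id_vars="Product", var_name="DATE", value_name="Qty")
print(tidy)  # one (Product, DATE, Qty) row per observation
```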
Common Errors…
2. Multiple variables stored in one column.

Messy (WHO data on some disease, as extracted):
Country  Year  Candidate  Cases
AD       2000  m014       0
AD       2000  m1524      0
AD       2000  m2534      1
AD       2000  m3544      0
AD       2000  m4554      0
AD       2000  m5564      0
AD       2000  m65        0
AE       2000  m014       2
AE       2000  m1524      4
AE       2000  m2534      4
AE       2000  m3544      6
AE       2000  m4554      5
AE       2000  m5564      12
AE       2000  m65        10
AE       2000  f014       3

Tidy (modeling & analysis becomes easy with this structure):
Country  Year  Gender  Age    Cases
AD       2000  m       0-14   0
AD       2000  m       15-24  0
AD       2000  m       25-34  1
AD       2000  m       35-44  0
AD       2000  m       45-54  0
AD       2000  m       55-64  0
AD       2000  m       65+    0
AE       2000  m       0-14   2
AE       2000  m       15-24  4
AE       2000  m       25-34  4
AE       2000  m       35-44  6
AE       2000  m       45-54  5
AE       2000  m       55-64  12
AE       2000  m       65+    10
AE       2000  f       0-14   3
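Assuming pandas again, a sketch of splitting the combined gender/age-band column into separate variables (the age-band mapping is abbreviated to a few of the codes shown above):

```python
# A sketch, assuming pandas: split the combined gender/age-band column apart.
import pandas as pd

who = pd.DataFrame({"Country": ["AD", "AE"], "Year": [2000, 2000],
                    "Candidate": ["m014", "f014"], "Cases": [0, 3]})

who["Gender"] = who["Candidate"].str[0]                     # leading letter: m / f
who["Age"] = who["Candidate"].str[1:].map({"014": "0-14",   # abbreviated mapping
                                           "1524": "15-24", "65": "65+"})
print(who.drop(columns="Candidate"))
```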
Common Errors…
3. Variables are stored in both rows and columns. Modeling & analysis becomes easy with the tidy structure.

Sampling:
• A random sampling is done to split the data into training & test data
• Time-series data should not be sampled randomly; the split should be contiguous in time
• One time period might depend on all previous time periods
• Use a well-distributed, balanced sample
• Over- and under-sampling need to be carried out for imbalanced data! (See the sketch below.)
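A minimal sketch of the time-series caveat: split chronologically rather than randomly (plain Python, toy data):

```python
# A sketch of the time-series caveat: split chronologically, never randomly.
data = list(range(100))          # observations already ordered by time
cut = int(len(data) * 0.8)

train, test = data[:cut], data[cut:]  # the test chunk is strictly later in time
```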
Building a ML Model
y = f(x)
where y is the output, f the prediction function, and x the input features (e.g., image features)
• Training: given a training set of labeled examples {(x1,y1), …, (xN,yN)}, estimate the prediction function f by minimizing the prediction error on the training set
• Testing: apply f to a never-before-seen test example x and output the predicted value y = f(x)
• Parametric: Algorithms that assume a known form for the function are called parametric machine learning algorithms.
• The algorithms involve two steps:
• Select a form for the function.
• Learn the coefficients for the function from the training data.
• E.g., Linear and Logistic Regression.
• Non-Parametric: Algorithms that do not make strong assumptions about the form of the mapping
function are called nonparametric machine learning algorithms.
• By not making assumptions, they are free to learn any functional form from the training data.
• Non-parametric methods are often more flexible and can achieve better accuracy, but they require a lot more data and training time.
• E.g., Support Vector Machines, Neural Networks and Decision Trees.
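A small sketch contrasting the two families, assuming scikit-learn; the quadratic toy target is an illustrative assumption:

```python
# Parametric vs non-parametric sketch, assuming scikit-learn.
from sklearn.linear_model import LinearRegression  # parametric: assumes a linear form
from sklearn.tree import DecisionTreeRegressor     # non-parametric: form learned from data

X = [[x] for x in range(10)]
y = [x ** 2 for x in range(10)]                    # a non-linear target

print(LinearRegression().fit(X, y).score(X, y))    # limited by its assumed form
print(DecisionTreeRegressor().fit(X, y).score(X, y))  # flexible: fits training data perfectly
```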
Building a ML Model
In-sample error: the error resulting from applying your prediction algorithm to the dataset you built it with
• Also known as resubstitution error
• Often optimistic (lower than on a new sample), as the model may be tuned to the error of the sample
Out-of-sample error: the error resulting from applying your prediction algorithm to a new data set
• Also known as generalization error
• Out-of-sample error matters most, as it better evaluates how the model will perform on new data
$\mathrm{MSE} = \frac{1}{n} \sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2$

where ŷi is the prediction our method gives for the i-th observation in our training data.
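The same formula in plain Python (the numbers are made up for illustration):

```python
# Computing the training MSE in plain Python (illustrative numbers).
y = [3.0, 5.0, 7.0]        # observed responses
y_hat = [2.5, 5.5, 6.0]    # our method's predictions

mse = sum((yi - yhi) ** 2 for yi, yhi in zip(y, y_hat)) / len(y)
print(mse)  # 0.5
```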
The Problem
The training method has been designed to make MSE small on the
training data we are looking at (e.g. with linear regression we choose the
line such that MSE is minimized.)
What we really care about is how well the method works on new data.
We call this new data “Test Data”.
There is no guarantee that the method with the smallest training MSE
will have the smallest test (i.e. new data) MSE.
Training vs. Test MSE’s
In general, the more flexible a method is, the lower its training MSE will be; i.e., it will "fit" or explain the training data very well. However, the test MSE may in fact be higher for a more flexible method than for a simple approach like linear regression.
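A sketch of this effect, assuming scikit-learn and NumPy: polynomial regressions of increasing degree are fit to noisy sine data (an illustrative assumption), and the training MSE keeps falling while the test MSE typically turns back up:

```python
# Training vs test MSE as flexibility grows, assuming scikit-learn and NumPy.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = np.sort(rng.uniform(-3, 3, 60)).reshape(-1, 1)
y = np.sin(X).ravel() + rng.normal(0, 0.3, 60)      # noisy sine: an assumed toy target
X_tr, y_tr, X_te, y_te = X[::2], y[::2], X[1::2], y[1::2]  # interleaved split

for degree in (1, 3, 12):                           # increasing flexibility
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression()).fit(X_tr, y_tr)
    print(degree,
          mean_squared_error(y_tr, model.predict(X_tr)),  # training MSE: keeps falling
          mean_squared_error(y_te, model.predict(X_te)))  # test MSE: falls, then rises
```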
The Trade-off
It can be shown that, for any given X = x0, the expected test MSE for a new Y at x0 will be equal to

$E\left( y_0 - \hat{f}(x_0) \right)^2 = \mathrm{Var}\left( \hat{f}(x_0) \right) + \left[ \mathrm{Bias}\left( \hat{f}(x_0) \right) \right]^2 + \mathrm{Var}(\varepsilon)$

where Var(ε) is the irreducible error.
As a model gets more complex, the bias will decrease and the variance will increase, but the expected test MSE may go up or down!
A Fundamental Picture
Variance refers to how much your estimate for
f would change by if you had a different
training data set.
Generally, the more flexible a method is the
more variance it has.
In general training errors will always decline if
model complexity increases
However, test errors will decline at first (as
reductions in bias dominate) but will then start
to increase again (as increases in variance
dominate).
We must always keep this picture in mind when choosing a learning method.
More flexible/complicated is not always better!
Bias/ Variance Tradeoff
The previous graphs of test versus training MSEs illustrate a very important tradeoff that governs the choice of statistical learning methods.
To avoid overfitting
1) Regularization
2) Cross Validation
Image source: pingax.com
Test MSE, Bias and Variance
Bias – Variance Trade Off – Few Tips
• High number of features and few examples (observations)
• Reduce the number of features (but that loses information)
• Regularization
• If your predictions show large errors:
• Get more training data
• Try a smaller set of features
• Try getting additional features
• Add polynomial features
• Build your own, new, better features based on your knowledge of the problem
– This can be risky if you accidentally over-fit your data by creating new features which are inherently specific/relevant to your training data
Regularization
Constrain the weights: impose a penalty for complexity.

$\hat{M} = \arg\min_{M} \sum_{i} L\left( y_i, \hat{y}_i(M) \right) + \lambda R(M)$

With a combined penalty R weighted by λ1 (L1 term) and λ2 (L2 term): if λ2 is 0, the regularization is called LASSO (advantage: it does feature selection too); if λ1 is 0, it is called ridge. λ1 + λ2 is always 1, and if both are present in the objective function, it is called elastic-net regularization.
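Assuming scikit-learn, the three penalties map onto ready-made estimators; the alpha and l1_ratio values below are arbitrary illustrative choices:

```python
# Regularization sketch, assuming scikit-learn (alpha/l1_ratio are illustrative).
from sklearn.linear_model import ElasticNet, Lasso, Ridge

X = [[1, 0], [0, 1], [1, 1], [2, 1]]
y = [1, 2, 3, 4]

print(Lasso(alpha=0.1).fit(X, y).coef_)                     # L1: can zero out features
print(Ridge(alpha=0.1).fit(X, y).coef_)                     # L2: shrinks all weights
print(ElasticNet(alpha=0.1, l1_ratio=0.5).fit(X, y).coef_)  # a mix of L1 and L2
```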
Cross Validation
Procedure: split the training set into sub-training/sub-test sets; build the model on the sub-training set; evaluate it on the sub-test set; repeat and average the estimated errors.
Result:
• We are able to fit/test various different models, with different variables included, to find the best one on the cross-validated test sets
• We are able to try out different types of prediction algorithms and pick the best performing one
• We are able to choose the parameters in the prediction function and estimate their values
• Note: the original test set is left completely untouched, so when the final prediction algorithm is applied to it, the result is an unbiased measurement of the out-of-sample accuracy of the model
Approaches:
• Random subsampling/ Holdout Method
• K-fold
• Leave one out
Considerations:
• Time-series data must be used in "chunks"
– one time period might depend on all previous time periods (do not take random samples)
Cross-Validation Approach-1: Train, Test, Validate Datasets
[Diagram: data split into training, test and validation sets]
Sample design guidelines for a prediction study (see the split sketch after this list):
• For large sample sizes: 60% training, 20% test, 20% validation
• For medium sample sizes: 60% training, 40% test; no validation set to refine the model (to ensure the test set is of sufficient size)
• For small sample sizes:
• Carefully consider whether there are enough samples to build a prediction algorithm
• Report the caveat of the small sample size and highlight the fact that the prediction algorithm has never been tested for out-of-sample error
There should always be a test/validation set that is held aside and NOT looked at when building the model.
• When complete, apply the model to the held-out set only one time
Randomly sample training and test sets.
• For data collected over time, build the training set in chunks of time
Datasets must reflect the structure of the problem.
• If prediction evolves with time, split train/test sets in time chunks (known as back-testing in finance)
Subsets of data should reflect as much diversity as possible.
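A sketch of the 60/20/20 guideline using scikit-learn's train_test_split (an assumed utility; toy data):

```python
# A 60/20/20 split sketch, assuming scikit-learn (toy data).
from sklearn.model_selection import train_test_split

X, y = [[i] for i in range(100)], list(range(100))

# 60% training first, then split the remaining 40% evenly into test and validation.
X_tr, X_rest, y_tr, y_rest = train_test_split(X, y, test_size=0.4, random_state=0)
X_te, X_val, y_te, y_val = train_test_split(X_rest, y_rest, test_size=0.5, random_state=0)

print(len(X_tr), len(X_te), len(X_val))  # 60 20 20
```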
Cross-Validation Approach-2: K-fold validation
• Break the training set into K subsets
• For each subset, build the model/predictor on the remaining training data and apply it to that test subset
• Rebuild the model K times with these training and test subsets
• Average the findings
Considerations:
• Larger K = less bias, more variance
• Smaller K = more bias, less variance
source: wikipedia
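A K-fold sketch, assuming scikit-learn and its bundled iris data (K = 5 is an illustrative choice):

```python
# A K-fold sketch, assuming scikit-learn and its bundled iris data.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)  # K = 5 folds

print(scores.mean())  # average the findings across the K rebuilds
```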
Cross-Validation Approach-3: Leave One Out (LOO)
• Leave out exactly one sample and build the predictor on the rest of the training data
• Predict the value for the left-out sample
• Repeat for each sample
source: wikipedia
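The same evaluation with leave-one-out, again assuming scikit-learn; LOO is effectively K-fold with K equal to the number of samples:

```python
# A leave-one-out sketch, assuming scikit-learn and its bundled iris data.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import LeaveOneOut, cross_val_score

X, y = load_iris(return_X_y=True)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=LeaveOneOut())

print(scores.mean())  # one prediction per left-out sample, averaged
```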
Let’s Check Our Understanding…
Quiz-1
A. no pattern
B. programmable definition
C. pattern: customer behavior; definition: not easily programmable; data: history of bank operation
D. arguably no (or not enough) data yet
Answer: C
Quiz-2
Answer: C
Note: While data mining and machine learning do share a huge overlap, they are arguably not equivalent because of the difference in focus.
Quiz-3
Of the following examples, which would you address using an unsupervised learning algorithm?
Problem 1: You have a large inventory of identical items. You want to predict how many of these items will sell over the next 3 months.
Problem 2: You'd like software to examine individual customer accounts, and for each account decide if it has been hacked/compromised.
The Answer: C
Quiz-5
The entrance system of the school gym, which does automatic face recognition based on machine learning, is built to charge four different groups of users differently: staff, student, professor, other. What type of learning problem best fits the need of the system?
A. Binary Classification
B. Multi-Class Classification
C. Regression
D. None of the above
The Answer: B
Puzzle
A huntsman can hit a target with a probability of 0.2.
Join us on:
Twitter - http://twitter.com/#!/AnalytixLabs
Facebook - http://www.facebook.com/analytixlabs
LinkedIn - http://www.linkedin.com/in/analytixlabs
Blog - http://www.analytixlabs.co.in/category/blog/