What is the Naive Bayes algorithm?
It is a classification technique based on Bayes’ Theorem with an
assumption of independence among predictors. In simple terms, a
Naive Bayes classifier assumes that the presence of a particular feature
in a class is unrelated to the presence of any other feature.
For example, a fruit may be considered to be an apple if it is red, round,
and about 3 inches in diameter. Even if these features depend on each
other or on the existence of the other features, the classifier treats each of
these properties as contributing independently to the probability that this
fruit is an apple, and that is why it is known as ‘naive’.
A Naive Bayes model is easy to build and particularly useful for very large
data sets. Despite its simplicity, Naive Bayes can outperform
even highly sophisticated classification methods.
Bayes’ theorem provides a way of calculating the posterior probability P(c|x) from P(c),
P(x) and P(x|c). Look at the equation below:

P(c|x) = P(x|c) * P(c) / P(x)

Above,
• P(c|x) is the posterior probability of class (c, target) given predictor (x, attributes).
• P(c) is the prior probability of class.
• P(x|c) is the likelihood which is the probability of predictor given class.
• P(x) is the prior probability of predictor.
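As a quick illustration, the theorem is a one-line computation. The minimal Python sketch below uses made-up numbers; the function name and values are ours, not from any library or from the data set discussed later.

    def posterior(prior, likelihood, evidence):
        """Bayes' theorem: P(c|x) = P(x|c) * P(c) / P(x)."""
        return likelihood * prior / evidence

    # Made-up numbers: a class with prior P(c) = 0.3, a predictor seen
    # 60% of the time within that class and 40% of the time overall.
    print(posterior(prior=0.3, likelihood=0.6, evidence=0.4))  # 0.45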
How does the Naive Bayes algorithm work?
Let’s understand it using an example. Suppose we have a training data set of
14 weather observations and a corresponding target variable ‘Play’
(whether a game was played). We need to classify whether players will
play or not based on the weather condition. Let’s follow the steps below to
perform it.
• Step 1: Convert the data set into a frequency table.
• Step 2: Create a likelihood table by finding the probabilities.
• Step 3: Use the Naive Bayes equation to calculate the posterior
probability for each class. The class with the highest posterior
probability is the outcome of the prediction.
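To make these steps concrete, here is a minimal Python sketch. The weather counts are reconstructed from the probabilities quoted in this section (3/9, 5/14, 9/14 and the 2/9 used later) under the assumption of three weather values, so treat them as illustrative rather than as the original data set.

    # Step 1: frequency table (assumed counts, 14 observations in total,
    # reconstructed from the probabilities quoted in this section).
    freq = {
        "Sunny":    {"Yes": 3, "No": 2},
        "Overcast": {"Yes": 4, "No": 0},
        "Rainy":    {"Yes": 2, "No": 3},
    }

    n_yes = sum(c["Yes"] for c in freq.values())   # 9
    n_no = sum(c["No"] for c in freq.values())     # 5
    n_total = n_yes + n_no                         # 14

    # Step 2: likelihood table P(weather | class) and class priors.
    likelihood = {w: {"Yes": c["Yes"] / n_yes, "No": c["No"] / n_no}
                  for w, c in freq.items()}
    prior = {"Yes": n_yes / n_total, "No": n_no / n_total}

    # Step 3: posterior for each class, given weather = Sunny.
    evidence = sum(freq["Sunny"].values()) / n_total     # P(Sunny) = 5/14
    for cls in ("Yes", "No"):
        post = likelihood["Sunny"][cls] * prior[cls] / evidence
        print(cls, round(post, 2))                       # Yes 0.6, No 0.4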
Problem: Players will play if the weather is sunny. Is this statement correct?
We can solve it using the method of posterior probability discussed above.
P(Yes | Sunny) = P(Sunny | Yes) * P(Yes) / P(Sunny)
Here we have P(Sunny | Yes) = 3/9 = 0.33, P(Sunny) = 5/14 = 0.36 and
P(Yes) = 9/14 = 0.64.
Now, P(Yes | Sunny) = 0.33 * 0.64 / 0.36 = 0.60, which is higher than
P(No | Sunny) = 0.40, so we predict that players will play when the weather is sunny.
Naive Bayes uses the same method to predict the probabilities of different
classes based on various attributes.
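The same example can be run through scikit-learn’s CategoricalNB. The integer encoding and the reconstructed counts below are our own; note that scikit-learn applies Laplace smoothing by default (alpha=1.0), so its output differs slightly from the hand calculation.

    import numpy as np
    from sklearn.naive_bayes import CategoricalNB

    # Weather encoded as 0=Sunny, 1=Overcast, 2=Rainy, with the same
    # reconstructed counts as above (illustrative, not the original data).
    X = np.array([0] * 5 + [1] * 4 + [2] * 5).reshape(-1, 1)
    y = np.array(["Yes"] * 3 + ["No"] * 2     # Sunny: 3 Yes, 2 No
                 + ["Yes"] * 4                # Overcast: 4 Yes
                 + ["Yes"] * 2 + ["No"] * 3)  # Rainy: 2 Yes, 3 No

    model = CategoricalNB()                   # alpha=1.0: Laplace smoothing
    model.fit(X, y)
    print(model.classes_)                     # ['No' 'Yes']
    print(model.predict_proba([[0]]).round(2))  # ~[0.38 0.62], near 0.40/0.60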
What about multiple attributes?
• So, suppose a day with Outlook = Rain, Humidity = High and Wind = Weak. With the data, we have to predict whether we can play on that day or not.
• Likelihood of ‘Yes’ on that day = P(Outlook = Rain|Yes) * P(Humidity = High|Yes) * P(Wind = Weak|Yes) * P(Yes)
• = 2/9 * 3/9 * 6/9 * 9/14 ≈ 0.0317
• Likelihood of ‘No’ on that day = P(Outlook = Rain|No) * P(Humidity = High|No) * P(Wind = Weak|No) * P(No)
• = 2/5 * 4/5 * 2/5 * 5/14 ≈ 0.0457
• Now, when we normalize these values, we get:
• P(Yes) = 0.0317 / (0.0317 + 0.0457) ≈ 0.41
• P(No) = 0.0457 / (0.0317 + 0.0457) ≈ 0.59
• Our model therefore predicts a 59% chance that there will be no game on that day.
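The arithmetic above is easy to check in a few lines of Python, using only the fractions quoted in the bullets:

    # Conditional probabilities quoted above for a Rain/High/Weak day,
    # each multiplied by the class prior.
    score_yes = (2/9) * (3/9) * (6/9) * (9/14)   # ≈ 0.0317
    score_no = (2/5) * (4/5) * (2/5) * (5/14)    # ≈ 0.0457

    total = score_yes + score_no
    print(round(score_yes / total, 2))           # P(Yes) ≈ 0.41
    print(round(score_no / total, 2))            # P(No)  ≈ 0.59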
What are the Pros and Cons of Naive Bayes?
Pros:
• It is easy and fast to predict the class of a test data set. It also performs well
in multi-class prediction.
• When the assumption of independence holds, a Naive Bayes classifier
performs better compared to other models like logistic regression, and
you need less training data.
• It performs well with categorical input variables compared to
numerical variable(s). For numerical variables, a normal distribution is
assumed (a bell curve, which is a strong assumption); see the sketch below.
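For numerical inputs, that bell-curve assumption corresponds to scikit-learn’s GaussianNB. The tiny data set below is made up purely for illustration:

    import numpy as np
    from sklearn.naive_bayes import GaussianNB

    # Made-up numeric features (say, temperature and humidity) with a
    # binary "Play" label; GaussianNB fits one normal distribution per
    # feature per class.
    X = np.array([[30.0, 85.0], [27.0, 90.0], [21.0, 70.0], [20.0, 65.0]])
    y = np.array(["No", "No", "Yes", "Yes"])

    model = GaussianNB()
    model.fit(X, y)
    print(model.predict([[22.0, 68.0]]))   # ['Yes']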
Cons:
• If a categorical variable has a category in the test data set that was not
observed in the training data set, the model will assign it a zero probability
and will be unable to make a prediction. This is often known as the “Zero
Frequency” problem. To solve it, we can use a smoothing technique; one of the
simplest is Laplace estimation (see the sketch after this list).
• On the other side, naive Bayes is also known to be a bad estimator, so the
probability outputs from predict_proba should not be taken too seriously.
• Another limitation of Naive Bayes is the assumption of independent
predictors. In real life, it is almost impossible to get a set of predictors
that are completely independent.
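Here is a small sketch of the zero-frequency problem and Laplace smoothing, again using scikit-learn’s CategoricalNB; the toy data is our own. With alpha=1.0 (add-one smoothing), a category never seen together with a class still gets a small nonzero probability instead of vetoing that class outright.

    import numpy as np
    from sklearn.naive_bayes import CategoricalNB

    # Toy data: feature value 2 never occurs together with class "Yes",
    # so an unsmoothed model would set P(x=2 | Yes) = 0.
    X = np.array([[0], [0], [1], [1], [2]])
    y = np.array(["Yes", "Yes", "Yes", "No", "No"])

    model = CategoricalNB(alpha=1.0)          # Laplace (add-one) smoothing
    model.fit(X, y)
    print(model.classes_)                     # ['No' 'Yes']
    print(model.predict_proba([[2]]).round(3))  # [[0.615 0.385]]: "Yes" survives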