Linear and Logistic Regression
The logistic regression model uses the sigmoid function to map the linear combination of
features to a probability in the range [0, 1]. The sigmoid function keeps the predicted
probabilities bounded between 0 and 1, so they can be interpreted as the likelihood that an
input belongs to a specific class.
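As a quick illustration, the short Python sketch below (plain NumPy, with hypothetical, unfitted coefficient values) shows how the sigmoid squashes a linear combination of features into (0, 1):

import numpy as np

def sigmoid(z):
    # The sigmoid maps any real number z to a value in (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical (not fitted) coefficients and a single input with two features
b0, b1, b2 = -1.0, 0.8, 0.3
x1, x2 = 2.0, 1.5
z = b0 + b1 * x1 + b2 * x2   # linear combination of the features
p = sigmoid(z)               # predicted probability of class 1
print(p)                     # approximately 0.74 for these values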
Logistic Regression
• Logistic regression is a supervised machine learning algorithm used
for binary classification problems, where the outcome variable
(dependent variable) is categorical and has only two possible classes,
often denoted as 0 and 1. It's named "regression," but it's primarily
used for classification tasks.
• The logistic regression model predicts the probability that a given
input belongs to a particular class. Unlike linear regression, where the
output is a continuous value, logistic regression uses the logistic
function (also known as the sigmoid function) to squash the output
into the range [0, 1].
• The logistic (sigmoid) function, σ(z) = 1 / (1 + e^(-z)), maps any
real-valued number z to the range [0, 1], which can be interpreted
as a probability. Here z is the linear combination of the inputs,
z = b0 + b1x1 + … + bnxn. The output of the logistic regression
model can be interpreted as the probability that the given input
belongs to class 1.
• The logistic regression model makes predictions by comparing
the output probability to a threshold (usually 0.5). If the
predicted probability is greater than or equal to the threshold,
the input is classified as belonging to class 1; otherwise, it is
classified as belonging to class 0.
• The training process involves finding the optimal values for the
coefficients (b0, b1, …, bn) that minimize a loss function, typically
the log loss (binary cross-entropy), which measures the difference
between the predicted probabilities and the actual class labels in
the training data. This minimization is usually performed with an
optimization algorithm such as gradient descent, as in the sketch
below.
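A minimal NumPy sketch of this training loop, assuming a feature matrix X and 0/1 labels y, using batch gradient descent on the log loss (the learning rate and epoch count are illustrative defaults, not tuned values):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, y, lr=0.1, epochs=1000):
    # X: (n_samples, n_features), y: (n_samples,) with values 0 or 1
    n_samples, n_features = X.shape
    b = np.zeros(n_features + 1)                   # b[0] plays the role of the intercept b0
    Xb = np.hstack([np.ones((n_samples, 1)), X])   # prepend a column of 1s for the intercept
    for _ in range(epochs):
        p = sigmoid(Xb @ b)                        # predicted probabilities
        gradient = Xb.T @ (p - y) / n_samples      # gradient of the log loss
        b -= lr * gradient                         # gradient descent step
    return b

def predict(X, b, threshold=0.5):
    # Apply the decision rule: class 1 if the probability meets the threshold
    Xb = np.hstack([np.ones((X.shape[0], 1)), X])
    return (sigmoid(Xb @ b) >= threshold).astype(int)

The predict function mirrors the thresholding rule described above: probabilities at or above 0.5 map to class 1, everything else to class 0.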
Key steps in logistic regression:
• Data Collection: Gather a dataset with input features and corresponding binary
class labels.
• Data Preprocessing: Clean the data, handle missing values, and preprocess
features if needed.
• Feature Selection: Choose the relevant features that might influence the binary
outcome.
• Split Data: Divide the dataset into a training set and a testing set.
• Model Training: Use the training set to find the optimal coefficients through an
optimization algorithm.
• Model Evaluation: Evaluate the model's performance on the testing set using
metrics like accuracy, precision, recall, or F1 score.
• Prediction: Use the trained model to make predictions on new or unseen data.
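Putting these steps together, a minimal scikit-learn sketch of the split / train / evaluate / predict workflow (synthetic data stands in for a real, already-preprocessed dataset):

from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score
from sklearn.model_selection import train_test_split

# Synthetic binary-classification data standing in for a collected, preprocessed dataset
X, y = make_classification(n_samples=200, n_features=4, random_state=0)

# Split Data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# Model Training (scikit-learn fits the coefficients with its own optimizer)
model = LogisticRegression()
model.fit(X_train, y_train)

# Model Evaluation on the held-out test set
y_pred = model.predict(X_test)
print("accuracy :", accuracy_score(y_test, y_pred))
print("precision:", precision_score(y_test, y_pred))
print("recall   :", recall_score(y_test, y_pred))
print("f1 score :", f1_score(y_test, y_pred))

# Prediction on new, unseen data (here the first test row as a stand-in)
print("new prediction:", model.predict(X_test[:1]))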
Suppose we have a dataset with information about whether students
pass (1) or fail (0) an exam based on the number of hours they studied.
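A toy sketch of this scenario, using purely hypothetical hours-studied values for illustration (not real data):

import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical toy data: hours studied -> pass (1) / fail (0)
hours = np.array([[0.5], [1.0], [1.5], [2.0], [2.5], [3.0], [3.5], [4.0], [4.5], [5.0]])
passed = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1])

model = LogisticRegression()
model.fit(hours, passed)

# Probability of passing after 2.75 hours of study, then the 0.5-threshold decision
prob = model.predict_proba([[2.75]])[0, 1]
label = model.predict([[2.75]])[0]
print(prob, label)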