0% found this document useful (0 votes)

60 views

Python Machine Learning - Logistic Regression

1. Logistic regression is used to solve classification problems with categorical outcomes by predicting the probability of an observation belonging to a specific class. 2. The example shows logistic regression predicting whether a tumor is cancerous (binomial classification) based on its size in cm. 3. The model is trained on tumor size and cancer data, then used to predict the probability of a tumor with size 3.46mm being cancerous.

Uploaded by

ahmed salem

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

60 views

Python Machine Learning - Logistic Regression

Uploaded by

ahmed salem

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

 Tutorials  Exercises  Get Certified  Services  Bootcamps Spaces Sign Up Log in

Dark mode
Dark code
HTML CSS JAVASCRIPT SQL PYTHON JAVA PHP BOOTSTRAP HOW TO W3.CSS C C++ C# REACT R JQUERY DJANGO   
atp ot b Scatte
Matplotlib Bars
ADVERTISEMENT
Matplotlib Histograms
Matplotlib Pie Charts

Machine Learning
Getting Started
Mean Median Mode
Machine Learning - Logistic Regression
Standard Deviation
❮ Previous Next ❯
Percentile
Data Distribution
Normal Data Distribution
On this page, W3schools.com collaborates with NYC Data Science Academy, to deliver digital training content to our students.
Scatter Plot
Linear Regression

Logistic Regression
Polynomial Regression
Multiple Regression
Scale
Logistic regression aims to solve classification problems. It does this by predicting categorical outcomes, unlike linear regression
Train/Test
that predicts a continuous outcome.
Decision Tree
Confusion Matrix In the simplest case there are two outcomes, which is called binomial, an example of which is predicting if a tumor is malignant
Hierarchical Clustering or benign. Other cases have more than two outcomes to classify, in this case it is called multinomial. A common example for
Logistic Regression multinomial logistic regression would be predicting the class of an iris flower between 3 different species.

Grid Search
Here we will be using basic logistic regression to predict a binomial variable. This means it has only two possible outcomes.
Categorical Data
K-means
Bootstrap Aggregation
Cross Validation
How does it work?
In Python we have modules that will do the work for us. Start by importing the NumPy module.

import numpy

Store the independent variables in X.

Store the dependent variable in y.

Below is a sample dataset:

#X represents the size of a tumor in centimeters.

COLOR PICKER
X = numpy.array([3.78, 2.44, 2.09, 0.14, 1.72, 1.65, 4.92, 4.37, 4.96, 4.52, 3.69, 5.88]).reshape(-1,1)

#Note: X has to be reshaped into a column from a row for the LogisticRegression() function to work.
#y represents whether or not the tumor is cancerous (0 for "No", 1 for "Yes").
y = numpy.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])


We will use a method from the sklearn module, so we will have to import that module as well:

from sklearn import linear_model

From the sklearn module we will use the LogisticRegression() method to create a logistic regression object.

This object has a method called fit() that takes the independent and dependent values as parameters and fills the regression
object with data that describes the relationship:

logr = linear_model.LogisticRegression()
logr.fit(X,y)

Now we have a logistic regression object that is ready to whether a tumor is cancerous based on the tumor size:

#predict if tumor is cancerous where the size is 3.46mm:

predicted = logr.predict(numpy.array([3.46]).reshape(-1,1))

Example Get your own Python Server

See the whole example in action:

ADVERTISEMENT
import numpy
from sklearn import linear_model

#Reshaped for Logistic function.

X = numpy.array([3.78, 2.44, 2.09, 0.14, 1.72, 1.65, 4.92, 4.37, 4.96, 4.52, 3.69, 5.88]).reshape(-1,1)
y = numpy.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])

logr = linear_model.LogisticRegression()
logr.fit(X,y)

#predict if tumor is cancerous where the size is 3.46mm:

predicted = logr.predict(numpy.array([3.46]).reshape(-1,1))
print(predicted)

Result

[0]

Run example »

We have predicted that a tumor with a size of 3.46mm will not be cancerous.

Learn more about NYCDSA

Coefficient
In logistic regression the coefficient is the expected change in log-odds of having the outcome per unit change in X.

This does not have the most intuitive understanding so let's use it to create something that makes more sense, odds.

Example
See the whole example in action:

import numpy
from sklearn import linear_model

#Reshaped for Logistic function.

X = numpy.array([3.78, 2.44, 2.09, 0.14, 1.72, 1.65, 4.92, 4.37, 4.96, 4.52, 3.69, 5.88]).reshape(-1,1)
y = numpy.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])

logr = linear_model.LogisticRegression()
logr.fit(X,y)

log_odds = logr.coef_
odds = numpy.exp(log_odds)

print(odds)

Result

[4.03541657]

Run example »

This tells us that as the size of a tumor increases by 1mm the odds of it being a cancerous tumor increases by 4x.

Probability
The coefficient and intercept values can be used to find the probability that each tumor is cancerous.

Create a function that uses the model's coefficient and intercept values to return a new value. This new value represents
probability that the given observation is a tumor:

def logit2prob(logr,x):
log_odds = logr.coef_ * x + logr.intercept_
odds = numpy.exp(log_odds)
probability = odds / (1 + odds)
return(probability)

Function Explained
To find the log-odds for each observation, we must first create a formula that looks similar to the one from linear regression,
extracting the coefficient and the intercept.

log_odds = logr.coef_ * x + logr.intercept_

To then convert the log-odds to odds we must exponentiate the log-odds.

odds = numpy.exp(log_odds)

Now that we have the odds, we can convert it to probability by dividing it by 1 plus the odds.

probability = odds / (1 + odds)

Let us now use the function with what we have learned to find out the probability that each tumor is cancerous.

Example
See the whole example in action:

import numpy
from sklearn import linear_model

X = numpy.array([3.78, 2.44, 2.09, 0.14, 1.72, 1.65, 4.92, 4.37, 4.96, 4.52, 3.69, 5.88]).reshape(-1,1)
y = numpy.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])

logr = linear_model.LogisticRegression()
logr.fit(X,y)

def logit2prob(logr, X):

log_odds = logr.coef_ * X + logr.intercept_
odds = numpy.exp(log_odds)
probability = odds / (1 + odds)
return(probability)

print(logit2prob(logr, X))

Result

[[0.60749955]
[0.19268876]
[0.12775886]
[0.00955221]
[0.08038616]
[0.07345637]
[0.88362743]
[0.77901378]
[0.88924409]
[0.81293497]
[0.57719129]
[0.96664243]]

Run example »

Results Explained
3.78 0.61 The probability that a tumor with the size 3.78cm is cancerous is 61%.

2.44 0.19 The probability that a tumor with the size 2.44cm is cancerous is 19%.

2.09 0.13 The probability that a tumor with the size 2.09cm is cancerous is 13%.

❮ Previous Log in to track progress Next ❯

Spaces Upgrade Newsletter Get Certified Report Error

W3Schools is Powered by W3.CSS.

2025 Geo Grade 12 Research QP Eng
25% (4)
2025 Geo Grade 12 Research QP Eng
14 pages
Solution Manual for Probability, Statistics, and Random Processes For Electrical Engineering, 3/E 3rd Edition Alberto Leon-Garcia all chapter instant download
100% (7)
Solution Manual for Probability, Statistics, and Random Processes For Electrical Engineering, 3/E 3rd Edition Alberto Leon-Garcia all chapter instant download
47 pages
Capstone Project Manual
100% (5)
Capstone Project Manual
13 pages
Machine Learning Lab Manual 06
100% (1)
Machine Learning Lab Manual 06
8 pages
The Practically Cheating Calculus Handbook
From Everand
The Practically Cheating Calculus Handbook
S. Deviant
3.5/5 (7)
Python Machine Learning Linear Regression
No ratings yet
Python Machine Learning Linear Regression
1 page
Lab Manual 04
No ratings yet
Lab Manual 04
12 pages
Commonly Used Machine Learning Algorithms
No ratings yet
Commonly Used Machine Learning Algorithms
38 pages
Logistic Regression
No ratings yet
Logistic Regression
32 pages
Machine Learning
100% (3)
Machine Learning
46 pages
Commonly Used Machine Learning Algorithms (With Python and R Codes)
No ratings yet
Commonly Used Machine Learning Algorithms (With Python and R Codes)
19 pages
AI-ML Syllabus
100% (1)
AI-ML Syllabus
8 pages
Day 2 Presentation
No ratings yet
Day 2 Presentation
65 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
43 pages
Learn Machine Learning in One Lesson Book
No ratings yet
Learn Machine Learning in One Lesson Book
8 pages
FYMCA IDSLab A6 Submission
No ratings yet
FYMCA IDSLab A6 Submission
9 pages
ML RECORD - Merged
No ratings yet
ML RECORD - Merged
33 pages
Krishna Edx Machine Learning With Python
No ratings yet
Krishna Edx Machine Learning With Python
18 pages
DATA SCIENCE iNTERVIEW QUESTION
No ratings yet
DATA SCIENCE iNTERVIEW QUESTION
42 pages
Machine learning with pythone_syllabus
No ratings yet
Machine learning with pythone_syllabus
13 pages
Machine Learning With Python Unit 1-17-84 Final13092024
No ratings yet
Machine Learning With Python Unit 1-17-84 Final13092024
68 pages
Machine Learning The Basics
No ratings yet
Machine Learning The Basics
158 pages
Machine Learning Strategies
No ratings yet
Machine Learning Strategies
59 pages
Course Plan 21CSC307P - Machine Learning For Data Analytics
No ratings yet
Course Plan 21CSC307P - Machine Learning For Data Analytics
13 pages
NumPy Ufuncs - Logs
No ratings yet
NumPy Ufuncs - Logs
1 page
Short Details of Business Analyst Course
No ratings yet
Short Details of Business Analyst Course
4 pages
Essentials of Machine Learning Algorithms
No ratings yet
Essentials of Machine Learning Algorithms
15 pages
Logistic Distribution
No ratings yet
Logistic Distribution
1 page
LTI1
No ratings yet
LTI1
20 pages
DAC ML Tutorial Final Deck
No ratings yet
DAC ML Tutorial Final Deck
150 pages
Data Science Course in Hyderabad - Innomatics
No ratings yet
Data Science Course in Hyderabad - Innomatics
10 pages
(Ebook) Introduction to Machine Learning with Python: A Guide for Data Scientists by Andreas C. Müller, Sarah Guido ISBN 9781449369415, 1449369413 download
100% (3)
(Ebook) Introduction to Machine Learning with Python: A Guide for Data Scientists by Andreas C. Müller, Sarah Guido ISBN 9781449369415, 1449369413 download
56 pages
Introduction_to_Machine_Learning_Exercises
No ratings yet
Introduction_to_Machine_Learning_Exercises
18 pages
What Are The Differences Between Supervised and Unsupervised Learning?
No ratings yet
What Are The Differences Between Supervised and Unsupervised Learning?
21 pages
Broadly, There Are 3 Types of Machine Learning Algorithms.
No ratings yet
Broadly, There Are 3 Types of Machine Learning Algorithms.
33 pages
Document1
No ratings yet
Document1
6 pages
Datascienceusing Python Training
No ratings yet
Datascienceusing Python Training
11 pages
Machine Learning With Python
100% (2)
Machine Learning With Python
137 pages
Regression Dataset Example
No ratings yet
Regression Dataset Example
14 pages
B24 ML Exp-1
No ratings yet
B24 ML Exp-1
10 pages
Data Science Master
No ratings yet
Data Science Master
11 pages
Machine Learning Mathematics in Python -- Jamie Flux -- 2024
No ratings yet
Machine Learning Mathematics in Python -- Jamie Flux -- 2024
238 pages
Ml Record
No ratings yet
Ml Record
23 pages
Machine Learing Algorithms
No ratings yet
Machine Learing Algorithms
13 pages
20dit073 Jay Prajapati ML
No ratings yet
20dit073 Jay Prajapati ML
68 pages
ML Lab Manual
No ratings yet
ML Lab Manual
38 pages
Machine Learning (Chapter1)
No ratings yet
Machine Learning (Chapter1)
8 pages
Top 90+ Data Science Interview Questions and Answers (2024)
No ratings yet
Top 90+ Data Science Interview Questions and Answers (2024)
38 pages
2-Machine Learning Algorithms
No ratings yet
2-Machine Learning Algorithms
16 pages
_OceanofPDF.com_Hands-On_Machine_Learning_from_Scratch_-_Venelin_Valkov
No ratings yet
_OceanofPDF.com_Hands-On_Machine_Learning_from_Scratch_-_Venelin_Valkov
119 pages
305 BA PYTHON - APR 2022 ANSWER Key
No ratings yet
305 BA PYTHON - APR 2022 ANSWER Key
14 pages
Basic ML Algorithm
No ratings yet
Basic ML Algorithm
74 pages
WIP - ML-22-DEC Weekend
No ratings yet
WIP - ML-22-DEC Weekend
40 pages
Information Retrieval Important questions
No ratings yet
Information Retrieval Important questions
20 pages
Commonly Used Machine Learning Algorithms
No ratings yet
Commonly Used Machine Learning Algorithms
27 pages
4. Logistic Regression
No ratings yet
4. Logistic Regression
21 pages
What Are The Differences Between Supervised and Unsupervised Learning?
No ratings yet
What Are The Differences Between Supervised and Unsupervised Learning?
22 pages
Data Science Bootcamp (Day-01) (1) - Compressed
No ratings yet
Data Science Bootcamp (Day-01) (1) - Compressed
161 pages
AI
No ratings yet
AI
28 pages
ML Syllabus
No ratings yet
ML Syllabus
4 pages
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
From Everand
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
Fouad Sabry
No ratings yet
MCS-011: Problem Solving and Programming
From Everand
MCS-011: Problem Solving and Programming
Dr. DK Sukhani
No ratings yet
Top Numerical Methods With Matlab For Beginners!
From Everand
Top Numerical Methods With Matlab For Beginners!
Andrei Besedin
No ratings yet
Python - Join Tuples
No ratings yet
Python - Join Tuples
1 page
Matplotlib Plotting
No ratings yet
Matplotlib Plotting
1 page
Python - Copy Dictionaries
No ratings yet
Python - Copy Dictionaries
1 page
Pandas - Cleaning Empty Cells
No ratings yet
Pandas - Cleaning Empty Cells
1 page
Python Machine Learning Scatter Plot
No ratings yet
Python Machine Learning Scatter Plot
1 page
Python Lists
No ratings yet
Python Lists
1 page
NumPy Data Types
No ratings yet
NumPy Data Types
1 page
Pandas Series
No ratings yet
Pandas Series
1 page
Python Numbers
No ratings yet
Python Numbers
1 page
Pandas Tutorial
No ratings yet
Pandas Tutorial
1 page
Python Inheritance
No ratings yet
Python Inheritance
1 page
Pandas - Removing Duplicates
No ratings yet
Pandas - Removing Duplicates
1 page
Pandas - Cleaning Data of Wrong Format
No ratings yet
Pandas - Cleaning Data of Wrong Format
1 page
Pandas Read JSON
No ratings yet
Pandas Read JSON
1 page
Python - Update Tuples
No ratings yet
Python - Update Tuples
1 page
NumPy Ufuncs - Summations
No ratings yet
NumPy Ufuncs - Summations
1 page
Pareto Distribution
No ratings yet
Pareto Distribution
1 page
Python Iterators
No ratings yet
Python Iterators
1 page
NumPy Creating Arrays
No ratings yet
NumPy Creating Arrays
1 page
Python Booleans
No ratings yet
Python Booleans
1 page
Python JSON
No ratings yet
Python JSON
1 page
Python - Change List Items
No ratings yet
Python - Change List Items
1 page
Matplotlib Histograms
No ratings yet
Matplotlib Histograms
1 page
Python While Loops
No ratings yet
Python While Loops
1 page
Python Math
No ratings yet
Python Math
1 page
NumPy Array Copy Vs View
No ratings yet
NumPy Array Copy Vs View
1 page
Python Variables - Assign Multiple Values
No ratings yet
Python Variables - Assign Multiple Values
1 page
Python String Methods
No ratings yet
Python String Methods
1 page
Monthwise Syllabus Class 12 Com
No ratings yet
Monthwise Syllabus Class 12 Com
8 pages
Research 2 First Quarter Examination
No ratings yet
Research 2 First Quarter Examination
5 pages
Perceived Investment in Employee Development and Turnover Intention: A Social Exchange Perspective
No ratings yet
Perceived Investment in Employee Development and Turnover Intention: A Social Exchange Perspective
11 pages
Course Outline PDF
No ratings yet
Course Outline PDF
12 pages
Revlon Marketing Research Proposal
No ratings yet
Revlon Marketing Research Proposal
23 pages
1223-Article Text-5565-1-10-20200702
No ratings yet
1223-Article Text-5565-1-10-20200702
7 pages
The Impact of Strategic Thinking On The Performance of Industrial Companies Listed On The Amman Stock Exchange
No ratings yet
The Impact of Strategic Thinking On The Performance of Industrial Companies Listed On The Amman Stock Exchange
21 pages
Chapter 5: Correlation and Linear Regression: Phan Thi Khanh Van
No ratings yet
Chapter 5: Correlation and Linear Regression: Phan Thi Khanh Van
19 pages
Statistik Deskriptif
No ratings yet
Statistik Deskriptif
33 pages
Activity 02 - Descriptive Statistics (Describing Data Sets)
No ratings yet
Activity 02 - Descriptive Statistics (Describing Data Sets)
5 pages
A Comparative Study Between Male and Female Purchase Intention Toward Visual Merchandising at Centro by Parkson Department Store Mantos
No ratings yet
A Comparative Study Between Male and Female Purchase Intention Toward Visual Merchandising at Centro by Parkson Department Store Mantos
13 pages
NN 2
No ratings yet
NN 2
42 pages
Mean Median Mode PDF
No ratings yet
Mean Median Mode PDF
52 pages
Urban Land Management Policy Under The Ethiopian Federation, The Case of Adama City
No ratings yet
Urban Land Management Policy Under The Ethiopian Federation, The Case of Adama City
110 pages
Jawaban PSM by Blackbox
No ratings yet
Jawaban PSM by Blackbox
2 pages
Casting Factors
No ratings yet
Casting Factors
13 pages
Operational Definitions of Inventory Record Accuracy
No ratings yet
Operational Definitions of Inventory Record Accuracy
10 pages
I.I.I. 3rd and 4th Q Module
No ratings yet
I.I.I. 3rd and 4th Q Module
53 pages
Cheat Sheet Midterm
No ratings yet
Cheat Sheet Midterm
1 page
Zuur Et Al 2009 BOOK - Chap01 - Introduction
No ratings yet
Zuur Et Al 2009 BOOK - Chap01 - Introduction
10 pages
Statistical Analysis of NBA Players Performances Compared to Ages
No ratings yet
Statistical Analysis of NBA Players Performances Compared to Ages
3 pages
Key Concept in Applied Linguistic Research
No ratings yet
Key Concept in Applied Linguistic Research
13 pages
Get Business Research Method Bajpai PDF ebook with Full Chapters Now
100% (7)
Get Business Research Method Bajpai PDF ebook with Full Chapters Now
55 pages
Psychology Dissertation Results Section Example
100% (2)
Psychology Dissertation Results Section Example
9 pages
Art. Rot Vs MBT
No ratings yet
Art. Rot Vs MBT
8 pages
Introduction To Data Science
No ratings yet
Introduction To Data Science
8 pages
Bayesian_and_Kalman
No ratings yet
Bayesian_and_Kalman
3 pages

Python Machine Learning - Logistic Regression

Uploaded by

Python Machine Learning - Logistic Regression

Uploaded by

 Tutorials  Exercises  Get Certified  Services  Bootcamps Spaces Sign Up Log in

Store the independent variables in X.

Store the dependent variable in y.

Below is a sample dataset:

#X represents the size of a tumor in centimeters.

from sklearn import linear_model

#predict if tumor is cancerous where the size is 3.46mm:

Example Get your own Python Server

See the whole example in action:

#Reshaped for Logistic function.

#predict if tumor is cancerous where the size is 3.46mm:

Learn more about NYCDSA

#Reshaped for Logistic function.

log_odds = logr.coef_ * x + logr.intercept_

To then convert the log-odds to odds we must exponentiate the log-odds.

probability = odds / (1 + odds)

def logit2prob(logr, X):

❮ Previous Log in to track progress Next ❯

Spaces Upgrade Newsletter Get Certified Report Error

Top Tutorials Top References Top Examples Get Certified

Copyright 1999-2023 by Refsnes Data. All Rights Reserved.

You might also like