0% found this document useful (0 votes)

154 views30 pages

Logistic Regression

Uploaded by

Logistic regression can be used for classification problems where the target variable is categorical. The logistic regression model estimates the probability of an observation belonging to a particular class based on predictor variables. Several metrics can evaluate the classification performance of logistic regression models, including accuracy, confusion matrices, and information criteria scores. Variable selection methods may help identify the most predictive variables and reduce overfitting.

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

Logistic Regression

Uploaded by

Thành Cao Đức

0% found this document useful (0 votes)

154 views30 pages

Original Title

10. LOGISTIC REGRESSION

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

0% found this document useful (0 votes)

154 views30 pages

Logistic Regression

Uploaded by

Thành Cao Đức

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

You are on page 1/ 30

PHUONG NGUYEN

LOGISTIC REGRESSION
CONTENT
1. INTRODUCTION

2. LOGISTIC REGRESSION MODEL

3. EVALUATING CLASSIFICATION PERFORMANCE

INTRODUCTION
▪

▪
INTRODUCTION
▪
▪

▪
INTRODUCTION

5
LOGISTIC RESPONSE FUNCTION
1
𝑝=
1 + 𝑒 −𝑥

6
PROBABILITY

 

1
𝑝= −(𝛽 0 +𝛽 1 𝑥 1 +𝛽2 𝑥 2 + …𝛽 𝑞 𝑥 𝑞 )
1+ 𝑒
ODDS

𝑝
𝑂𝑑𝑑𝑠 =
1−𝑝

𝑂𝑑𝑑𝑠 1
𝑝= =
1 + 𝑂𝑑𝑑𝑠 1 + 𝑂𝑑𝑑𝑠 −1
ODDS

𝑝
𝑂𝑑𝑑𝑠 =
1−𝑝
LOGIT

𝑂𝑑𝑑𝑠 = 𝑒 𝛽0 +𝛽1𝑥1 +𝛽2𝑥2 +⋯+𝛽𝑞𝑥𝑞

ln(𝑂𝑑𝑑𝑠) = 𝛽0 + 𝛽1 𝑥1 + 𝛽2 𝑥2 + ⋯ + 𝛽𝑞 𝑥𝑞
LOGIT
𝑝
𝐿𝑜𝑔𝑖𝑡 = 𝑙𝑛
1−𝑝
LOGISTIC REGRESSION MODEL

▪
PERSONAL LOAN OFFER
UNIVERSALBANK.CSV

▪
▪

▪
SINGLE PREDICTOR MODEL

 
SINGLE PREDICTOR MODEL

▪
PYTHON FUNCTIONALITY NEEDED
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression,
LogisticRegressionCV
from sklearn.model_selection import train_test_split
import statsmodels.api as sm
from mord import LogisticIT
import matplotlib.pylab as plt
import seaborn as sns
from dmba import classificationSummary, gainsChart,
liftChart
from dmba.metric import AIC_score

https://github.com/nnbphuong/datascience4biz/blob/
master/Logistic_Regression.ipynb
DATA PREPROCESSING
bank_df = pd.read_csv('UniversalBank.csv')
bank_df.drop(columns=['ID', 'ZIP Code'], inplace=True)
bank_df.columns = [c.replace(' ', '_') for c in bank_df.columns]

# Treat education as categorical, convert to dummy variables

bank_df['Education'] = bank_df['Education'].astype('category')
new_categories = {1: 'Undergrad', 2: 'Graduate', 3:
'Advanced/Professional'}
bank_df.Education.cat.rename_categories(new_categories, inplace=True)
bank_df = pd.get_dummies(bank_df, prefix_sep='_', drop_first=True)

y = bank_df['Personal_Loan']
X = bank_df.drop(columns=['Personal_Loan’])

# partition data
train_X, valid_X, train_y, valid_y = train_test_split(X, y,
test_size=0.4, random_state=1)
FITTING THE MODEL
▪ 

# fit a logistic regression

logit_reg = LogisticRegression(penalty="l2", C=1e42,
solver='liblinear')
logit_reg.fit(train_X, train_y)
print('intercept ', logit_reg.intercept_[0])
print(pd.DataFrame({'coeff': logit_reg.coef_[0]},
index=X.columns).transpose())
print('AIC', AIC_score(valid_y, logit_reg.predict(valid_X),
df = len(train_X.columns) + 1))
FITTING THE MODEL OUTPUT
intercept -12.61895521314035

Age Experience Income Family CCAvg Mortgage

coeff -0.032549 0.03416 0.058824 0.614095 0.240534 0.001012

Securities_Account CD_Account Online CreditCard

coeff -1.026191 3.647933 -0.677862 -0.95598

Education_Graduate Education_Advanced/Professional
coeff 4.192204 4.341697

AIC -709.1524769205962
CONVERTING FROM LOGIT TO PROBABILITY
𝑙𝑜𝑔𝑖𝑡
𝑂𝑑𝑑𝑠
𝑂𝑑𝑑𝑠 = 𝑒 →𝑝=
1 + 𝑂𝑑𝑑𝑠
logit_reg_pred = logit_reg.predict(valid_X)
logit_reg_proba = logit_reg.predict_proba(valid_X)
logit_result = pd.DataFrame({'actual': valid_y,
'p(0)': [p[0] for p in logit_reg_proba],
'p(1)': [p[1] for p in logit_reg_proba],
'predicted': logit_reg_pred })

# display four different cases

interestingCases = [2764, 932, 2721, 702]
print(logit_result.loc[interestingCases])

OUTPUT
actual p(0) p(1) predicted
2764 0 0.976 0.024 0
932 0 0.335 0.665 1
2721 1 0.032 0.968 1
702 1 0.986 0.014 0
INTERPRETING PROBABILITY AND ODDS
▪

▪ 
EVALUATING CLASSIFICATION PERFORMANCE
classificationSummary(train_y, logit_reg.predict(train_X))
classificationSummary(valid_y, logit_reg.predict(valid_X))

OUTPUT
Confusion Matrix (Accuracy 0.9080)

Prediction
Actual 0 1
0 2632 81
1 195 92
Confusion Matrix (Accuracy 0.9110)

Prediction
Actual 0 1
0 1763 44
1 134 59
VARIABLE SELECTION
▪
▪
▪

▪
VARIABLE SELECTION
▪

×
VARIABLE SELECTION
▪

→
→
MODEL SELECTION
▪

▪
SUMMARY
▪

Titanic: Logistic Regression Project
No ratings yet
Titanic: Logistic Regression Project
19 pages
Machine Learning Project On Cars
92% (13)
Machine Learning Project On Cars
22 pages
Scilab Assignment
100% (1)
Scilab Assignment
9 pages
Introduction To Data Science and Analytics
100% (1)
Introduction To Data Science and Analytics
31 pages
I Feel For You Chaka Khan Lyrics and Structure
No ratings yet
I Feel For You Chaka Khan Lyrics and Structure
1 page
Regressione Logistica1
No ratings yet
Regressione Logistica1
8 pages
Approachin190808095205 PDF
No ratings yet
Approachin190808095205 PDF
112 pages
Laboratorio Regresión Logística - Colaboratory Grupo 2
No ratings yet
Laboratorio Regresión Logística - Colaboratory Grupo 2
7 pages
Logistics Regression
100% (1)
Logistics Regression
5 pages
SGD For Linear Regression
No ratings yet
SGD For Linear Regression
4 pages
Custom Single Purpose Processor Design
No ratings yet
Custom Single Purpose Processor Design
24 pages
'Whitegrid': # PLT - Style.use ("Dark - Background")
No ratings yet
'Whitegrid': # PLT - Style.use ("Dark - Background")
16 pages
Sales Forecasting
100% (1)
Sales Forecasting
10 pages
Intro LOGIT
No ratings yet
Intro LOGIT
46 pages
Complexity: Erin Keith
No ratings yet
Complexity: Erin Keith
36 pages
2D Transaformations-14-22
No ratings yet
2D Transaformations-14-22
9 pages
ML0101EN Clas Logistic Reg Churn Py v1
No ratings yet
ML0101EN Clas Logistic Reg Churn Py v1
9 pages
week_11_logistic_fitting_statsmodels
No ratings yet
week_11_logistic_fitting_statsmodels
14 pages
MLP - Week 6 - MNIST - LogitReg - Ipynb - Colaboratory
No ratings yet
MLP - Week 6 - MNIST - LogitReg - Ipynb - Colaboratory
19 pages
ML - Lab-6.ipynb - Colab
No ratings yet
ML - Lab-6.ipynb - Colab
4 pages
Mnist2.ipynb - Colaboratory
No ratings yet
Mnist2.ipynb - Colaboratory
6 pages
Note 4
No ratings yet
Note 4
18 pages
Correction
No ratings yet
Correction
3 pages
Human Face Detection Using CNN 1682855909
No ratings yet
Human Face Detection Using CNN 1682855909
131 pages
Slides Kal To Fen
No ratings yet
Slides Kal To Fen
40 pages
Matlab Talk
No ratings yet
Matlab Talk
43 pages
P3) Code Neural Networks
No ratings yet
P3) Code Neural Networks
3 pages
Machine Learning with PySpark and MLlib — Solving a Binary Classification Problem _ by Susan Li _ Towards Data Science
No ratings yet
Machine Learning with PySpark and MLlib — Solving a Binary Classification Problem _ by Susan Li _ Towards Data Science
10 pages
vertopal.com_MSML_Project_1
No ratings yet
vertopal.com_MSML_Project_1
8 pages
Supervised Learning With Scikit-Learn: Introduction To Regression
No ratings yet
Supervised Learning With Scikit-Learn: Introduction To Regression
31 pages
Group Work Assignment Supervised and Unsupervised Learning
No ratings yet
Group Work Assignment Supervised and Unsupervised Learning
10 pages
Calibration of Stochastic Interest Rate Model
No ratings yet
Calibration of Stochastic Interest Rate Model
31 pages
2D Transformation
No ratings yet
2D Transformation
23 pages
Calling Procedure in ABAP Using ADBC
No ratings yet
Calling Procedure in ABAP Using ADBC
2 pages
Matlabtalk 2
No ratings yet
Matlabtalk 2
43 pages
ZFNet For CIFAR-10 Classification
No ratings yet
ZFNet For CIFAR-10 Classification
33 pages
ML P-6 - 024
No ratings yet
ML P-6 - 024
22 pages
Feature Engineering On Banks' Private Credit Data - Ipynb - Colab
No ratings yet
Feature Engineering On Banks' Private Credit Data - Ipynb - Colab
6 pages
Example - LinearDiscriminantAnalysis - Ipynb Colaboratory
No ratings yet
Example - LinearDiscriminantAnalysis - Ipynb Colaboratory
2 pages
ch4 PDF
No ratings yet
ch4 PDF
32 pages
16798differentail Calculus
No ratings yet
16798differentail Calculus
83 pages
Logistic Regression
No ratings yet
Logistic Regression
8 pages
Point of Tangency
No ratings yet
Point of Tangency
5 pages
Lab 4
No ratings yet
Lab 4
21 pages
Ml0101En-Reg-Nonelinearregression-Py-V1: 1 Non Linear Regression Analysis
No ratings yet
Ml0101En-Reg-Nonelinearregression-Py-V1: 1 Non Linear Regression Analysis
12 pages
ML-FINANCE - NPTES_BSR
No ratings yet
ML-FINANCE - NPTES_BSR
36 pages
Practical 4
No ratings yet
Practical 4
3 pages
Chapter 3 Homework (Take 2)
No ratings yet
Chapter 3 Homework (Take 2)
7 pages
Project 5 - Cars
100% (1)
Project 5 - Cars
22 pages
Cnnbyrohanga: # Create Datasets
No ratings yet
Cnnbyrohanga: # Create Datasets
1 page
2011 Olson Pyamgtutorial
No ratings yet
2011 Olson Pyamgtutorial
30 pages
Data Preprocessing & Visualization1
No ratings yet
Data Preprocessing & Visualization1
2 pages
linear-regression
No ratings yet
linear-regression
8 pages
f9-acca-notes-oull-develop-the-knowledge-and-skills-expected-of-a-finance-manager-in-relation
No ratings yet
f9-acca-notes-oull-develop-the-knowledge-and-skills-expected-of-a-finance-manager-in-relation
95 pages
R Workshop PART 2
No ratings yet
R Workshop PART 2
36 pages
Regression Anallysis Hands0n 1
100% (1)
Regression Anallysis Hands0n 1
3 pages
Objective & Declaration
No ratings yet
Objective & Declaration
22 pages
Factor Backtest
No ratings yet
Factor Backtest
13 pages
NN LAB 13 SEP - Jupyter Notebook
No ratings yet
NN LAB 13 SEP - Jupyter Notebook
6 pages
pratham ML
No ratings yet
pratham ML
14 pages
Simple_and_Multiple_Regression
No ratings yet
Simple_and_Multiple_Regression
9 pages
Digital and Microprocessor Techniques V10
From Everand
Digital and Microprocessor Techniques V10
Clive W. Humphris
No ratings yet
Trí tuệ nhân tạo trong điều khiển: Convolution Neural Networks Mạng nơron tích chập
No ratings yet
Trí tuệ nhân tạo trong điều khiển: Convolution Neural Networks Mạng nơron tích chập
25 pages
Artificial Intelligence: Long Short Term Memory Networks
No ratings yet
Artificial Intelligence: Long Short Term Memory Networks
14 pages
Artificial Intelligence: Binary Classifiers For Multi-Class Classification Problems
No ratings yet
Artificial Intelligence: Binary Classifiers For Multi-Class Classification Problems
12 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
47 pages
Predictive Performance
No ratings yet
Predictive Performance
33 pages
Artificial Intelligence: Alexnet
No ratings yet
Artificial Intelligence: Alexnet
20 pages
K-Nearest Neighbors
No ratings yet
K-Nearest Neighbors
32 pages
The Data Science Process
100% (1)
The Data Science Process
53 pages
Tree-Based Methods
No ratings yet
Tree-Based Methods
32 pages
Data Visualization
No ratings yet
Data Visualization
55 pages
Business Analytics
No ratings yet
Business Analytics
42 pages
A Crash Course On Python
No ratings yet
A Crash Course On Python
27 pages
Phuong Nguyen: The Complete Guide To Cluster Analysis Using Python
No ratings yet
Phuong Nguyen: The Complete Guide To Cluster Analysis Using Python
68 pages
DATA SUMMARIZATION - Print
No ratings yet
DATA SUMMARIZATION - Print
28 pages
Activity No. 8 GAB
No ratings yet
Activity No. 8 GAB
15 pages
Group Project: Members: Parth Patel (08IT071) Sunil Patel (09IT256)
No ratings yet
Group Project: Members: Parth Patel (08IT071) Sunil Patel (09IT256)
24 pages
P-223.2013 (GIS Plant)
No ratings yet
P-223.2013 (GIS Plant)
58 pages
P 5808 Rev J Teleymo DTS Belt Hardware New Format
No ratings yet
P 5808 Rev J Teleymo DTS Belt Hardware New Format
56 pages
Wpip-Rod-Str-S30-Dr-Cb-501601 - (A1-C01) Piling Dets SHT2
No ratings yet
Wpip-Rod-Str-S30-Dr-Cb-501601 - (A1-C01) Piling Dets SHT2
1 page
Curitiba - Case Study of A Sustainable City
No ratings yet
Curitiba - Case Study of A Sustainable City
2 pages
2 Metasploit
No ratings yet
2 Metasploit
102 pages
12-Channel Low Quiescent Current LED Driver: Features
No ratings yet
12-Channel Low Quiescent Current LED Driver: Features
54 pages
Omololu Masturah New
No ratings yet
Omololu Masturah New
4 pages
Beyond Schein Dental
No ratings yet
Beyond Schein Dental
9 pages
Amit Kumar Sharma PDF
No ratings yet
Amit Kumar Sharma PDF
58 pages
Mini-Case: Cabana
100% (1)
Mini-Case: Cabana
2 pages
1.Mata Amritanandamayi-Life and Experiences of Devotees
No ratings yet
1.Mata Amritanandamayi-Life and Experiences of Devotees
314 pages
Multiple Roles Senior Agronomist
No ratings yet
Multiple Roles Senior Agronomist
2 pages
Created By: 1. Muhammad Ridho (061530701245) 2. Nurkhasan (061530701248) 3. Rota Pradisti (061530701251)
No ratings yet
Created By: 1. Muhammad Ridho (061530701245) 2. Nurkhasan (061530701248) 3. Rota Pradisti (061530701251)
6 pages
Steam Car Thermodynamics Project
No ratings yet
Steam Car Thermodynamics Project
12 pages
Bogota 2006 1
No ratings yet
Bogota 2006 1
1 page
Technical Integrity Management
No ratings yet
Technical Integrity Management
23 pages
Dany Grade 11 Work Energy and Power Work Booklet and Answer Key
No ratings yet
Dany Grade 11 Work Energy and Power Work Booklet and Answer Key
15 pages
B2B and B2C Strategies in Ebusiness
No ratings yet
B2B and B2C Strategies in Ebusiness
29 pages
Download Complete Nonlinear Control Systems Analysis and Design 1st Edition Horacio Márquez PDF for All Chapters
100% (5)
Download Complete Nonlinear Control Systems Analysis and Design 1st Edition Horacio Márquez PDF for All Chapters
85 pages
Model 2-Way, Direct-Acting, Solenoid-Operated Directional Blocking Poppet Valve (740 Series)
No ratings yet
Model 2-Way, Direct-Acting, Solenoid-Operated Directional Blocking Poppet Valve (740 Series)
3 pages
2020 Specimen Paper 2 Mark Scheme
No ratings yet
2020 Specimen Paper 2 Mark Scheme
10 pages
Feedback and Assessment
No ratings yet
Feedback and Assessment
30 pages
LAS 7 Mean of A Discrete Random Variable
No ratings yet
LAS 7 Mean of A Discrete Random Variable
1 page
Fluid Waves 1st Edition Manasseh 2024 Scribd Download
100% (3)
Fluid Waves 1st Edition Manasseh 2024 Scribd Download
40 pages
Proper Proportioning of Cement Sand Gravel
No ratings yet
Proper Proportioning of Cement Sand Gravel
4 pages
Effect of Probiotics As A Complement To Non Surgical Periodontal Therapy in Chronic Periodontitis A Systematic Revie
No ratings yet
Effect of Probiotics As A Complement To Non Surgical Periodontal Therapy in Chronic Periodontitis A Systematic Revie
7 pages
English 976
No ratings yet
English 976
6 pages