
Categorical Data Analysis

Lecture 8
Analyzing a Binary Response
• We have discussed how to estimate and make inferences about a single probability of success π.
• We then generalized this discussion to the situation of two probabilities of success that depend on the level of a group.
• This chapter now completes the generalization to a situation where there are many different possible probabilities of success to estimate and perform inferences upon.
• Furthermore, this chapter allows us to quantify how an explanatory variable with many possible levels (perhaps continuous rather than categorical) affects the probability of success. These generalizations are made through the use of binary regression models.
Linear models
Review of normal linear regression models

Yi = 0 + 1xi1 + 2xi2 + … + pxip + i

where i ~ independent N(0, 2) and i = 1, …, n

Note that

E(Yi) = 0 + 1xi1 + 2xi2 + … + pxip

E(Yi) is what one would expect Yi to be on average for a


set of xi1, xi2, …, xip values. Also, note that this model
implies that Yi ~ independent N(E(Yi), 2).
If k (one of the ’s above) is equal to 0, this says there is no
linear relationship between the corresponding explanatory
variable xk and the response variable. If k > 0, there is a
positive relationship, and if k < 0, there is a negative
relationship. All of these statements are with respect to the
other explanatory variables in the model remaining constant.
Regression models for a binary response

Let Yi be a binary response variable for i = 1, …, n, where a 1 denotes a success and a 0 denotes a failure. Suppose Yi has a Bernoulli distribution again as in Chapter 1, but now with probability of success parameter πi. Thus, Yi ~ independent Bernoulli(πi).
Notice that the probability of success can be different for each of the i = 1, …, n observations. Potentially, there could then be n different parameters that we need to estimate!
We can simplify the number of parameters that we need
to estimate by using a linear model of the form

E(Yi) = 0 + 1xi1

where I am using just one explanatory variable to


simplify the explanation. Because E(Yi) is i, we could
also write the model as

i = 0 + 1xi1

Therefore, instead of potentially having n different


parameters to estimate, we now only have two!!!
To estimate the parameters, we should not proceed as
we would with normal linear regression models for the
following reasons:
 Yi is binary here, but Yi had a continuous distribution in normal linear regression.
 Var(Yi) = πi(1 − πi) for a Bernoulli random variable; thus, the variance potentially changes for each Yi. With normal linear regression, Var(Yi) = Var(εi) = σ² is constant for i = 1, …, n.
We estimate the β's using maximum likelihood estimation. The likelihood function is

L(β0, β1 | y1, …, yn) = P(Y1 = y1) × ⋯ × P(Yn = yn)
                      = ∏_{i=1}^n P(Yi = yi)
                      = ∏_{i=1}^n πi^yi (1 − πi)^(1−yi)

where i = 0 + 1xi1. Maximizing the likelihood function leads


to the maximum likelihood estimates of 0 and 1.
Unfortunately, there is still a problem – i = 0 + 1xi1 is
not constrained to be within 0 and 1. For particular
values of 0, 1, and xi1, i may end be greater than 1 or
less than 0.
Logistic regression models

• There are a number of solutions to prevent πi from being outside the range of a probability. Most solutions rely on non-linear transformations to prevent these types of problems from occurring. The most commonly used transformation results in the logistic regression model:

πi = exp(β0 + β1xi1 + ⋯ + βpxip) / (1 + exp(β0 + β1xi1 + ⋯ + βpxip))

Notice that exp(β0 + β1xi1 + ⋯ + βpxip) > 0, so the numerator is always positive and less than the denominator. Thus, 0 < πi < 1.
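To see this numerically, here is a small R sketch using the built-in inverse logit function plogis(), which computes exp(q)/(1 + exp(q)); the linear predictor values below are made up for illustration:

# Any real-valued linear predictor maps into (0, 1) under the inverse logit
lin.pred <- c(-100, -5, 0, 5, 100)  # hypothetical linear predictor values
plogis(lin.pred)  # approximately 3.7e-44, 0.0067, 0.5, 0.9933, 1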

The logistic regression model can also be written as

log( πi / (1 − πi) ) = β0 + β1xi1 + ⋯ + βpxip
• Notice that the left-hand side is the log
transformation of the odds of a success!
This will be very important for us later when
interpreting the effect an explanatory
variable has on the response variable.
 The log( πi / (1 − πi) ) transformation is often referred to as the logit transformation. Thus, the most compact way that people write the model is

logit(πi) = β0 + β1xi1 + ⋯ + βpxip

 The 0  1xi1    p xip part of the model is often


referred to as the linear predictor.
 We can write the model without the i subscript when
we want to state the model in general:
π = exp(β0 + β1x1 + ⋯ + βpxp) / (1 + exp(β0 + β1x1 + ⋯ + βpxp))   and   log( π / (1 − π) ) = β0 + β1x1 + ⋯ + βpxp

Obviously, this leads to some notational ambiguity with what we had in Section 1.1 for π, but the meaning should be obvious within the context of the problem.
Example: Plot of  vs. x (PiPlot.R)

When there is only one explanatory variable, β0 = 1, and β1 = 0.5 (or −0.5), a plot of π vs. x1 looks like the following:

π = exp(1 + 0.5x1) / (1 + exp(1 + 0.5x1))   and   π = exp(1 − 0.5x1) / (1 + exp(1 − 0.5x1))
[Figure: two S-shaped curves of π vs. x1 for −15 ≤ x1 ≤ 15, with π on a 0-to-1 axis; the left panel (β1 = 0.5) increases from 0 toward 1, and the right panel (β1 = −0.5) decreases from 1 toward 0.]
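For reference, here is a minimal R sketch that draws curves like these with curve(); the plotting details are my assumptions rather than the exact contents of PiPlot.R:

par(mfrow = c(1, 2))  # two plots side-by-side
curve(expr = exp(1 + 0.5*x) / (1 + exp(1 + 0.5*x)), xlim = c(-15, 15),
  ylim = c(0, 1), xlab = expression(x[1]), ylab = expression(pi),
  main = expression(beta[1] == 0.5))
curve(expr = exp(1 - 0.5*x) / (1 + exp(1 - 0.5*x)), xlim = c(-15, 15),
  ylim = c(0, 1), xlab = expression(x[1]), ylab = expression(pi),
  main = expression(beta[1] == -0.5))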
We can make the following generalizations:
 0<<1
 When 1 > 0, there is a positive relationship between
x1 and . When 1 < 0, there is a negative relationship
between x1 and .
 The shape of the curve is somewhat similar to the
letter s.
 Above  = 0.5 is a mirror image of below  = 0.5.
 The slope of the curve is dependent on the value of x 1.
We can show this mathematically by taking the
d
derivative with respect to x1:  1(1  )
dx1
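A quick numerical check of this derivative (a sketch assuming β0 = 1, β1 = 0.5, and an arbitrary point x1 = 2):

beta0 <- 1; beta1 <- 0.5; x1 <- 2; h <- 1e-6
pi.fun <- function(x) exp(beta0 + beta1*x) / (1 + exp(beta0 + beta1*x))
(pi.fun(x1 + h) - pi.fun(x1 - h)) / (2*h)  # central-difference slope
beta1 * pi.fun(x1) * (1 - pi.fun(x1))      # analytical slope; both are about 0.0525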
Questions:

• What happens to the β1 = 0.5 plot when β1 is increased?
• What happens to the β1 = 0.5 plot when β1 is decreased to be close to 0?
• Suppose a plot of logit(π) vs. x1 was made. What would the plot look like?
Parameter estimation
Maximum likelihood estimation is used to estimate the
parameters of the model. As shown earlier, the likelihood
function is
L(β0, β1, …, βp | y1, …, yn) = ∏_{i=1}^n πi^yi (1 − πi)^(1−yi)

but now

πi = exp(β0 + β1xi1 + ⋯ + βpxip) / (1 + exp(β0 + β1xi1 + ⋯ + βpxip))
Using this, we can find the log likelihood function:

log L(β0, …, βp | y1, …, yn) = ∑_{i=1}^n [ yi log(πi) + (1 − yi) log(1 − πi) ]

= ∑_{i=1}^n [ yi log( exp(β0 + β1xi1 + ⋯ + βpxip) / (1 + exp(β0 + β1xi1 + ⋯ + βpxip)) ) + (1 − yi) log( 1 − exp(β0 + β1xi1 + ⋯ + βpxip) / (1 + exp(β0 + β1xi1 + ⋯ + βpxip)) ) ]

= ∑_{i=1}^n [ yi(β0 + β1xi1 + ⋯ + βpxip) − yi log(1 + exp(β0 + β1xi1 + ⋯ + βpxip)) − (1 − yi) log(1 + exp(β0 + β1xi1 + ⋯ + βpxip)) ]

Taking derivatives with respect to β0, …, βp, setting them equal to 0, and then solving for the parameters leads to the MLEs. These parameter estimates are denoted by β̂0, …, β̂p. Corresponding estimates of π are

π̂ = exp(β̂0 + β̂1x1 + ⋯ + β̂pxp) / (1 + exp(β̂0 + β̂1x1 + ⋯ + β̂pxp))
Unfortunately, there are no closed-form expressions for β̂0, …, β̂p except in very simple cases. The MLEs instead are found using iterative numerical procedures.
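As one illustration of such a procedure, the log likelihood can be maximized directly with optim(); this sketch anticipates the placekick data frame used later in the lecture and is not how glm() itself works (glm() uses IRLS, described next):

# Log likelihood for a logistic regression with one explanatory variable
logL <- function(beta, x, y) {
  pi.hat <- exp(beta[1] + beta[2]*x) / (1 + exp(beta[1] + beta[2]*x))
  sum(y * log(pi.hat) + (1 - y) * log(1 - pi.hat))
}
# fnscale = -1 tells optim() to maximize rather than minimize
opt <- optim(par = c(0, 0), fn = logL, x = placekick$distance,
  y = placekick$good, control = list(fnscale = -1), method = "BFGS")
opt$par  # close to the MLEs that glm() reports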

The Newton-Raphson procedure, one of these iterative numerical procedures, can be used to find the MLE of π in a homogeneous population setting.

We will use a procedure called iteratively reweighted least squares (IRLS) to find the maximum likelihood estimates.
Without going into all of the details behind IRLS, initial estimates for the parameters, say β̂0^(0), …, β̂p^(0), are found. Weighted least squares estimation (see Chapter 11 of my STAT 870 notes; weights are based on π̂i) is used to find a "better" set of parameter estimates.
If the new parameter estimates, say β̂0^(1), …, β̂p^(1), are very close to β̂0^(0), …, β̂p^(0), the iterative numerical procedure is said to "converge," and β̂0^(1), …, β̂p^(1) are used as the MLEs β̂0, …, β̂p. If the new parameter estimates β̂0^(1), …, β̂p^(1) are not very close to β̂0^(0), …, β̂p^(0), weighted least squares estimation is used again with new weights.
This iterative process continues until convergence or a pre-specified maximum number of iterations is reached.
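To make the algorithm concrete, here is a bare-bones IRLS sketch for logistic regression; it is for illustration only (glm() handles starting values, convergence checks, and numerical safeguards for you):

irls <- function(X, y, maxit = 25, tol = 1e-8) {
  beta <- rep(0, ncol(X))  # initial estimates, iteration (0)
  for (iter in 1:maxit) {
    pi.hat <- as.vector(exp(X %*% beta) / (1 + exp(X %*% beta)))
    W <- diag(pi.hat * (1 - pi.hat))  # weights based on the current pi-hat
    # Weighted least squares update of the parameter estimates
    beta.new <- beta + solve(t(X) %*% W %*% X, t(X) %*% (y - pi.hat))
    if (max(abs(beta.new - beta)) < tol) { beta <- beta.new; break }
    beta <- beta.new
  }
  as.vector(beta)
}
# e.g., irls(X = cbind(1, placekick$distance), y = placekick$good)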
The glm() function computes the parameter estimates.

Question: If the pre-specified maximum number of iterations is reached without convergence, should the last set of parameter estimates be used as β̂0, …, β̂p?
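For what it is worth, the iteration limit can be raised through glm()'s control argument, and the converged component reports whether IRLS converged; the model and data here anticipate the placekick example that follows:

mod.fit <- glm(formula = good ~ distance, family = binomial(link = logit),
  data = placekick, control = glm.control(maxit = 50))  # default maxit is 25
mod.fit$converged  # FALSE would suggest the estimates should not be trusted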
Example: Placekicking (Placekick.R, Placekick.csv)

This example is motivated by the work that I did for my MS report and Bilder and Loughin (Chance, 1998).

The purpose of this and future examples involving these data is to estimate the probability of success for a placekick.
Below are the explanatory variables to be considered:

 Week: Week of the season
 Distance: Distance of the placekick in yards
 Change: Binary variable denoting lead-change (1) vs.
non-lead-change (0) placekicks; successful lead-
change placekicks are those that change which team
is winning the game.
 Elap30: Number of minutes remaining before the end
of the half with overtime placekicks receiving a value
of 0
 PAT: Binary variable denoting the type of placekick
where a point after touchdown (PAT) is a 1 and a field
goal is a 0
 Type: Binary variable denoting dome (0) vs. outdoor
(1) placekicks
 Field: Binary variable denoting grass (1) vs. artificial
turf (0) placekicks
 Wind: Binary variable for placekicks attempted in windy conditions (1) vs. non-windy conditions (0); I define windy as a wind stronger than 15 miles per hour at kickoff in an outdoor stadium
The response variable is referred to as “Good” in the
data set. It is a 1 for successful placekicks and a 0 for
failed placekicks.

There are 1,425 placekick observations from the 1995 NFL season within this data set.
For this particular example, we are only going to use the
distance explanatory variable to estimate the probability
of a successful placekick. Thus, our logistic regression
model is

logit( )  0  1x1

where Y is the good response variable and x1 denotes the distance in yards for the placekick. Less formally, we will also write the model as

logit(π) = β0 + β1·distance
• We use R to estimate the model with the glm() function.
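A sketch of the fit (the object and variable names match the code shown later in these notes; placekick is read from Placekick.csv):

placekick <- read.csv(file = "Placekick.csv")
mod.fit <- glm(formula = good ~ distance, family = binomial(link = logit),
  data = placekick)
summary(mod.fit)  # estimates of beta0 and beta1: about 5.8121 and -0.1150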
The estimated logistic regression model is

logit( ˆ )  5.8121  0.1150distance

Note that the function gets its name from "generalized linear model". This is a general class of linear models which includes logistic regression models. At the end of this chapter, I will formally define this general class.
Question: What happens to the estimated probability of
success as the distance increases?
Now is a good time for a reminder of why R is often referred to as an "object-oriented language": every object in R has a class associated with it. The classes for mod.fit are:
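Since the original slide output is not reproduced here, a quick check in R (fitted glm objects inherit from lm):

class(mod.fit)
# [1] "glm" "lm"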
Notice all of the method functions have the class name
at their end. For example, there is a summary.glm()
function. When the generic function summary() is run,
R first finds the class of the object and then checks to
see if there is a summary.glm() function. Because the
function exists, this method function completes the main
calculations.
The purpose of generic functions is to use a familiar
language set with any object. For example, we
frequently want to summarize data or a model
(summary()), compute confidence intervals
(confint()), and find predictions (predict()), so
it is convenient to use the same language set no
matter the application.
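For instance, all of the following work on mod.fit because glm-specific methods exist behind the scenes (confint() on a glm may require the MASS package in older versions of R):

summary(mod.fit)                     # dispatches to summary.glm()
confint(mod.fit, level = 0.95)       # profile likelihood confidence intervals
predict(mod.fit, type = "response")  # estimated probabilities of success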
We can find the estimated probability of success for a
particular distance using:

π̂ = exp(5.8121 − 0.1150·distance) / (1 + exp(5.8121 − 0.1150·distance))

For example, the estimated probability of success at a distance of 20 yards is 0.97.

The estimated probability of success for a distance of 50 yards is 0.52.
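These values can be computed by hand from the estimated model or with predict() (a sketch):

lin.pred <- 5.8121 - 0.1150 * c(20, 50)
exp(lin.pred) / (1 + exp(lin.pred))  # about 0.97 and 0.52
predict(mod.fit, newdata = data.frame(distance = c(20, 50)), type = "response")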
Using this method to estimate the probability of success,
we can now plot the model with the curve() function:
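A sketch of such a plot (the 18-to-66 yard range is my assumption about the observed distances):

curve(expr = exp(mod.fit$coefficients[1] + mod.fit$coefficients[2]*x) /
    (1 + exp(mod.fit$coefficients[1] + mod.fit$coefficients[2]*x)),
  from = 18, to = 66, xlab = "Distance (yards)",
  ylab = "Estimated probability of success")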
If more than one explanatory variable is included in the
model, the variable names can be separated by “+” symbols
in the formula argument. For example, suppose we include
the change variable in addition to distance in the model

mod.fit2 <- glm(formula = good ~ change + distance,
                family = binomial(link = logit), data = placekick)
