0% found this document useful (0 votes)

44 views

Section and Solution

This document discusses different models for predicting binary dependent variables: the linear probability model (LPM), logit model, and probit model. It notes that while the LPM can be estimated with ordinary least squares regression, it has problems in that the predicted probabilities can be less than 0 or greater than 1. The logit and probit models are specifically designed for binary outcomes and always yield predicted probabilities between 0 and 1. It also discusses how to interpret the estimates from logit/probit models by calculating marginal effects and performing significance tests of coefficients.

Uploaded by

Firomsa

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views

Section and Solution

Uploaded by

Firomsa

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

EEP/IAS 118

Andrew Dustan
Section Handout 13

1. Linear Probability Model vs. Logit (or Probit)

We have often used binary ("dummy") variables as explanatory variables in regressions. What about when we
want to use binary variables as the dependent variable?

It's possible to use OLS:

= + + ⋯ + +

where y is the dummy variable. This is called the linear probability model.

Estimating the equation:

= 1| = = + + ⋯ +
is the predicted probability of having = 1 for the given values of … .

Problems with the linear probability model (LPM):

1. Heteroskedasticity: can be fixed by using the "robust" option in Stata. Not a big deal.
2. Possible to get < 0 or > 1. This makes no sense—you can't have a probability below 0 or above 1.
This is a fundamental problem with the LPM that we can't patch up.

Solution: Use the logit or probit model. These models are specifically made for binary dependent variables and
always result in 0 < < 1. Let's leave the technicalities aside and look at a graph of a case where LPM goes
wrong and the logit works:

Linear Probability Model Logit (probit looks similar)

1.5 1.5

1 1
--------

--------
= 1| = 1|
0.5 0.5

0 0

-0.5 -0.5
0 + 1 1 + ⋯ + 0 + 1 1 + ⋯ +

This is the main feature of a logit/probit that distinguishes it from the LPM – predicted probability of = 1 is
never below 0 or above 1, and the shape is always like the one on the right rather than a straight line.
2. Marginal Effects for Logit (or Probit)
We talked about how to estimate the logit using "maximum likelihood" in lecture, which is fairly complicated—
much more complicated than OLS. Moreover, the results from the estimation are not easy to interpret.

What we want are results that look like those from OLS or the LPM: the marginal effect of changing x on , the
probability of getting = 1.

"Problem": the marginal effect is different depending on what the x values are. Look again at the graph:
1

---------------
= 1| 0.5

0
0 + 1 1 + ⋯ +

How much does change as we increase + + ⋯ + (i.e. how big are marginal effects) when:

+ + ⋯ + is very low? a little______

+ + ⋯ + is neither high nor low? _a lot______

+ + ⋯ + is very high? _a little______

We compromise by finding the marginal effect for the "average" person/whatever in the data, i.e. the marginal
effect when = ̅ , … , = ̅ . This is what the Stata command "mfx" does.

Example: Probability of a male adult being arrested, as a function of income (in $100) and minority status:
. logit arrest minority inc86

Logistic regression Number of obs = 2725

LR chi2(2) = 152.22
Prob > chi2 = 0.0000
Log likelihood = -1532.0747 Pseudo R2 = 0.0473

------------------------------------------------------------------------------
arrest | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
minority | .5853512 .0886866 6.60 0.000 .4115286 .7591738
inc86 | -.0074475 .0008404 -8.86 0.000 -.0090947 -.0058003
_cons | -.8499352 .069239 -12.28 0.000 -.9856411 -.7142294
------------------------------------------------------------------------------

The signs of these coefficients tell us something: minorities are more likely to be arrested, and higher income
lowers the probability of being arrested. How big are these effects? Run "mfx" to find out:

. mfx

Marginal effects after logit

y = Pr(arrest) (predict)
= .26160966
------------------------------------------------------------------------------
variable | dy/dx Std. Err. z P>|z| [ 95% C.I. ] X
---------+--------------------------------------------------------------------
minority*| .1165188 .018 6.47 0.000 .081238 .1518 .378716
inc86 | -.0014386 .00016 -9.17 0.000 -.001746 -.001131 54.967
------------------------------------------------------------------------------
(*) dy/dx is for discrete change of dummy variable from 0 to 1

Practice:
1. For males with the average level of income in this sample ($5497 in 1986 dollars), how much more likely are
minorities to be arrested? (Notice that for dummy variables, Stata calculates the change from going from 0 to 1.)

11.7%

2. For males with the average level of income in this sample, how does a $1000 increase in income affect the
predicted probability of being arrested?

−.0014 × 10 = −0.014 = −1.4%, so 1.4% less likely to be arrested.

3. Tests for Parameters

For linear regression, we used the t-test for the significance of one parameter and the F-test for the significance
of multiple parameters. There are similar tests in the logit/probit models.

One parameter: z-test

Do this just the same way as a t-test with infinite degrees of freedom. You can read it off of the logit/probit
estimation results, or the mfx results. The formula for testing : " = 0 is, just like for a t-test:
"
#=
$%&" '
Practice:
Can we reject the null hypothesis of ()*+, = 0 at the 1% significance level?:

Yes, since # = −9.17, so we comfortably reject the null hypothesis.

Multiple parameters: likelihood ratio test

With the F-test, we estimated the restricted and unrestricted models, and then compared their goodness of fit
(/ 0 ). We don't have an / 0 for logit or probit, so we compare the "log likelihood" instead. The log likelihood
doesn't have much meaning for us, except for this test. The closer the log likelihood gets to zero (it's always
negative), the better the model fits.

To perform the likelihood ratio test, estimate the restricted (fewer variables) and unrestricted (more variables)
models and then construct the test statistic:
1/ = 2logℒ7 − logℒ8
where ℒ9 is the likelihood from the unrestricted model and ℒ: is from the restricted model. The test statistic is
distributed ; 0 < where q is the number of restrictions, just like in the F-test. If LR is higher than the critical
value, we reject the null hypothesis. This is exactly like the F-test but using the ; 0 table instead of the F table.

Practice:
We can add two variables to the arrest model: total time spent in prison in the past, and average sentence length
from previous sentences (if any):
. logit arrest minority inc86 tottime avgsen

Logistic regression Number of obs = 2725

LR chi2(4) = 154.89
Prob > chi2 = 0.0000
Log likelihood = -1530.7407 Pseudo R2 = 0.0482

------------------------------------------------------------------------------
arrest | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
minority | .5956365 .0891583 6.68 0.000 .4208894 .7703836
inc86 | -.0075452 .0008458 -8.92 0.000 -.0092029 -.0058875
tottime | -.035892 .02659 -1.35 0.177 -.0880074 .0162233
avgsen | .0332144 .0334359 0.99 0.321 -.0323187 .0987474
_cons | -.8407443 .0696835 -12.07 0.000 -.9773215 -.7041672
------------------------------------------------------------------------------

Do these new variables help to predict arrest, after controlling for minority status and income?
Step:
1: Write hypotheses : =>==(?@ = ABCD@) = 0
: EFG

2: Compute LR 1/ = 2H−1530.74 − −1532.07K = 2.66 ~ ; 0 2

3: Get critical value N.O = 5.99

4: Reject/fail to reject 2.66 < 5.99 so fail to reject the null hypothesis

5: Conclude We have no evidence that time spent in prison and average sentence length from
previous sentences help to predict future imprisonment, after controlling for minority
status and income.

From Bioeconomics To Degrowth
No ratings yet
From Bioeconomics To Degrowth
30 pages
Getting Started in Logit and Ordered Logit Regression: (Ver. 3.1 Beta)
No ratings yet
Getting Started in Logit and Ordered Logit Regression: (Ver. 3.1 Beta)
15 pages
Getting Started in Logit and Ordered Logit Regression: (Ver. 3.1 Beta)
No ratings yet
Getting Started in Logit and Ordered Logit Regression: (Ver. 3.1 Beta)
14 pages
Lecture 7 Probit
No ratings yet
Lecture 7 Probit
24 pages
Censoring & Truncation
No ratings yet
Censoring & Truncation
14 pages
Modern Regression Homework 5-1
No ratings yet
Modern Regression Homework 5-1
8 pages
Class 3 Count Models 1.0
No ratings yet
Class 3 Count Models 1.0
39 pages
Longitudinal Data Analysis Instructor: Natasha Sarkisian
No ratings yet
Longitudinal Data Analysis Instructor: Natasha Sarkisian
31 pages
125.785 Module 2.1
No ratings yet
125.785 Module 2.1
94 pages
Basic Econometrics III
No ratings yet
Basic Econometrics III
23 pages
Lecture 8: Heteroskedasticity: Causes Consequences Detection Fixes
No ratings yet
Lecture 8: Heteroskedasticity: Causes Consequences Detection Fixes
46 pages
Loss Function - Ipynb - Colaboratory
No ratings yet
Loss Function - Ipynb - Colaboratory
6 pages
1 What Is A Randomized Algorithm?: Lecture Notes CS:5360 Randomized Algorithms
No ratings yet
1 What Is A Randomized Algorithm?: Lecture Notes CS:5360 Randomized Algorithms
8 pages
RSA and power analysis
No ratings yet
RSA and power analysis
13 pages
An Introduction To Logistic Regression: Johnwhitehead Department of Economics East Carolina University
No ratings yet
An Introduction To Logistic Regression: Johnwhitehead Department of Economics East Carolina University
48 pages
Unit - 5
No ratings yet
Unit - 5
111 pages
Multiple Regression: Model and Interpretation
No ratings yet
Multiple Regression: Model and Interpretation
10 pages
NLopt Tutorial - AbInitio
No ratings yet
NLopt Tutorial - AbInitio
13 pages
Dea
No ratings yet
Dea
22 pages
Economics 210 Handout # 6 The Probit, Logit, Tobit and Linear Probability Models
No ratings yet
Economics 210 Handout # 6 The Probit, Logit, Tobit and Linear Probability Models
6 pages
Rlrtest
No ratings yet
Rlrtest
11 pages
LR Ratios Test
No ratings yet
LR Ratios Test
12 pages
Logistic Regression
No ratings yet
Logistic Regression
49 pages
Probit Logit Interpretation
No ratings yet
Probit Logit Interpretation
26 pages
6.4 Process Capability
No ratings yet
6.4 Process Capability
13 pages
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
No ratings yet
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
27 pages
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
No ratings yet
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
24 pages
D3 Logit
No ratings yet
D3 Logit
37 pages
Marginal Effects For Continuous Variables: Richard Williams, University of Notre Dame, Last Revised January 29, 2019
No ratings yet
Marginal Effects For Continuous Variables: Richard Williams, University of Notre Dame, Last Revised January 29, 2019
12 pages
Testing 4 White
No ratings yet
Testing 4 White
54 pages
Chapter 7 Dynamic Econometric Models
No ratings yet
Chapter 7 Dynamic Econometric Models
15 pages
Autocorrelation
0% (1)
Autocorrelation
49 pages
Logit R101
No ratings yet
Logit R101
27 pages
PETE 4051 Reserve Evaluation and Reservoir Management: Financial Accounting Probabilistic Reserves Assessment
No ratings yet
PETE 4051 Reserve Evaluation and Reservoir Management: Financial Accounting Probabilistic Reserves Assessment
40 pages
Econ321 2017 Tutorial 2 Lab
No ratings yet
Econ321 2017 Tutorial 2 Lab
9 pages
Seu Ds610 Mod03
No ratings yet
Seu Ds610 Mod03
45 pages
Rlrtest
No ratings yet
Rlrtest
10 pages
Logistic Regression
No ratings yet
Logistic Regression
49 pages
Numerical Methods To Solve ODE-Handout 7
No ratings yet
Numerical Methods To Solve ODE-Handout 7
14 pages
Sociology: Intermediate Quantitative Research Method
No ratings yet
Sociology: Intermediate Quantitative Research Method
35 pages
Abhijit Das Questions in C
No ratings yet
Abhijit Das Questions in C
3 pages
TP MSDC 3
No ratings yet
TP MSDC 3
6 pages
Logit
No ratings yet
Logit
48 pages
Missing Value 11
No ratings yet
Missing Value 11
14 pages
BSC (Hons) Finance Ii/ BSC (Hons) Finance With Law Ii
No ratings yet
BSC (Hons) Finance Ii/ BSC (Hons) Finance With Law Ii
49 pages
Regn_lect_5
No ratings yet
Regn_lect_5
9 pages
Spring07 OBrien T
No ratings yet
Spring07 OBrien T
40 pages
Chapter 6
No ratings yet
Chapter 6
35 pages
vol2b-scipyoptimize-20171-pdf
No ratings yet
vol2b-scipyoptimize-20171-pdf
9 pages
Sample Exam Questions Solutions
No ratings yet
Sample Exam Questions Solutions
3 pages
Logistic Regression For Machine Learning Complete TutorialUnderstand This Popular Supervised Classifi
No ratings yet
Logistic Regression For Machine Learning Complete TutorialUnderstand This Popular Supervised Classifi
10 pages
Logit Marginal Effects
No ratings yet
Logit Marginal Effects
12 pages
Calibration: Constructing A Calibration Curve
No ratings yet
Calibration: Constructing A Calibration Curve
10 pages
Amazing Java: Learn Java Quickly
From Everand
Amazing Java: Learn Java Quickly
Andrei Besedin
No ratings yet
Top Numerical Methods With Matlab For Beginners!
From Everand
Top Numerical Methods With Matlab For Beginners!
Andrei Besedin
No ratings yet
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
From Everand
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
Stuart A. Klugman
4/5 (1)
MCS-011: Problem Solving and Programming
From Everand
MCS-011: Problem Solving and Programming
Dr. DK Sukhani
No ratings yet
Fundamental Math
From Everand
Fundamental Math
Russell Pead
No ratings yet
Exercises of Logarithms and Exponentials
From Everand
Exercises of Logarithms and Exponentials
Simone Malacrida
No ratings yet
Digital and Microprocessor Techniques V10
From Everand
Digital and Microprocessor Techniques V10
Clive W. Humphris
No ratings yet
Student Solutions Manual to Accompany Economic Dynamics in Discrete Time, secondedition
From Everand
Student Solutions Manual to Accompany Economic Dynamics in Discrete Time, secondedition
Yue Jiang
4.5/5 (2)
Statistics and Probabiltiy 4TH Exam 2022 2023
No ratings yet
Statistics and Probabiltiy 4TH Exam 2022 2023
2 pages
Class VII English Honeycomb (Prose) Chapter 1. Three Questions
No ratings yet
Class VII English Honeycomb (Prose) Chapter 1. Three Questions
36 pages
Telangana: Gramin Rojgar Kaylan Sansthan
No ratings yet
Telangana: Gramin Rojgar Kaylan Sansthan
3 pages
8 SOP - Webster University
No ratings yet
8 SOP - Webster University
4 pages
Evolution Deceit Harun Yahya PDF
No ratings yet
Evolution Deceit Harun Yahya PDF
2 pages
Research Proposal Writing Guideline
No ratings yet
Research Proposal Writing Guideline
10 pages
Practical Research 1 - Week 3
No ratings yet
Practical Research 1 - Week 3
5 pages
Revised M.Phil Syllabus in Economics: Semester Course No. Name of The Course Contact Hours Credit
100% (1)
Revised M.Phil Syllabus in Economics: Semester Course No. Name of The Course Contact Hours Credit
9 pages
Multiple Choice Questions (The Answers Are Provided After The Last Question.)
No ratings yet
Multiple Choice Questions (The Answers Are Provided After The Last Question.)
92 pages
Permission Letter
No ratings yet
Permission Letter
9 pages
Pegasus Overview
No ratings yet
Pegasus Overview
5 pages
Morgenstern's Guide To Alchemy
No ratings yet
Morgenstern's Guide To Alchemy
15 pages
26 Qualitative Observation Marry France A. Sain
No ratings yet
26 Qualitative Observation Marry France A. Sain
31 pages
New Directions of STEM Research and Learning in the World Ranking Movement: A Comparative Perspective John N. Hawkins download pdf
100% (1)
New Directions of STEM Research and Learning in the World Ranking Movement: A Comparative Perspective John N. Hawkins download pdf
55 pages
RHA81 Critica de Libros
No ratings yet
RHA81 Critica de Libros
39 pages
Working With Uncertainty
No ratings yet
Working With Uncertainty
2 pages
CREATIVE PRESENTATION Rubrics 1
No ratings yet
CREATIVE PRESENTATION Rubrics 1
5 pages
Analysis On Lux Soap
No ratings yet
Analysis On Lux Soap
38 pages
1st Circular 87th Annual Session NASI
No ratings yet
1st Circular 87th Annual Session NASI
6 pages
"J"-Radiation Is Mother of Hydrogen?... (A New Theory On Supernature, Nature, Science)
No ratings yet
"J"-Radiation Is Mother of Hydrogen?... (A New Theory On Supernature, Nature, Science)
10 pages
Welcome Letter
No ratings yet
Welcome Letter
1 page
Subfiels of Linguistics
No ratings yet
Subfiels of Linguistics
1 page
De La Garza Toledo-La Epistemologia Critica y El Concepto de Configuracion
50% (2)
De La Garza Toledo-La Epistemologia Critica y El Concepto de Configuracion
20 pages
Group 3 Chapter I V Latest Send
No ratings yet
Group 3 Chapter I V Latest Send
37 pages
First Few 1 Pages
No ratings yet
First Few 1 Pages
10 pages
FALLSEM2023-24 SWE2020 ETH VL2023240103291 2023-11-22 Reference-Material-II
No ratings yet
FALLSEM2023-24 SWE2020 ETH VL2023240103291 2023-11-22 Reference-Material-II
26 pages
Pondering Plants: First Grade Science Exploration
No ratings yet
Pondering Plants: First Grade Science Exploration
24 pages
The Nature of Sociology
100% (1)
The Nature of Sociology
15 pages
How I Discovered The Science of Bhakti Yoga
No ratings yet
How I Discovered The Science of Bhakti Yoga
5 pages