Chapter 6: Limited Dependent Variable Models

6.1. The Linear Probability Model

The linear probability model belongs to the class of discrete choice (or dichotomous choice) models, in which the dependent variable takes only two values: 0 and 1. There are several methods to analyze regression models where the dependent variable is 0 or 1. The simplest method is least squares; in this case the model is called the linear probability model.

The other method assumes an underlying or latent variable y_i* which we do not observe:

y_i* = β'x_i + u_i    (6.1)

This is the idea behind the Logit and Probit models.

In this case the observed variable y_i is an indicator variable that denotes the occurrence or non-occurrence of an event. For instance, in analyzing the determinants of unemployment, we have data showing whether or not each person is employed, together with some explanatory variables that determine employment.

In regression form this is written as:

y_i = β'x_i + u_i    (6.2)

where y_i ∈ {0, 1}, and the conditional expectation E(y_i | x_i) = β'x_i, which is the probability that the event will occur given x_i.

Since y_i takes only two values, 0 and 1, the error u_i in the above equation can take only two values, 1 − β'x_i and −β'x_i.

The variance of u_i is:

var(u_i) = β'x_i (1 − β'x_i)    (6.3)

Using OLS would therefore result in a heteroskedasticity problem.

This problem can be overcome by using the following two-step estimation procedure.

1. Estimate y_i = β'x_i + u_i by OLS and obtain the fitted values ŷ_i.    (6.4)

2. Compute ŵ_i = ŷ_i(1 − ŷ_i) and use weighted least squares, i.e.,

y_i / √ŵ_i = β'(x_i / √ŵ_i) + u_i / √ŵ_i    (6.5)

Then regress y_i/√ŵ_i on x_i/√ŵ_i.
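As an illustrative sketch (not part of the chapter), the two-step procedure can be coded with NumPy; the synthetic dataset, the clipping of fitted values to keep the weights positive, and the function name lpm_wls are all assumptions for this example:

```python
import numpy as np

def lpm_wls(X, y, eps=1e-6):
    """Two-step WLS for the linear probability model (eqs. 6.4-6.5)."""
    # Step 1: OLS of y on X gives fitted probabilities y_hat.
    b_ols, *_ = np.linalg.lstsq(X, y, rcond=None)
    p_hat = X @ b_ols
    # Fitted values outside (0, 1) would make w_i = p(1-p) non-positive;
    # clip them as a pragmatic fix (this is an assumption of the sketch).
    p_hat = np.clip(p_hat, eps, 1.0 - eps)
    w = np.sqrt(p_hat * (1.0 - p_hat))
    # Step 2: divide both sides by sqrt(w_i) and re-run least squares.
    b_wls, *_ = np.linalg.lstsq(X / w[:, None], y / w, rcond=None)
    return b_ols, b_wls

# Synthetic binary data (an assumption for this sketch)
rng = np.random.default_rng(0)
n = 500
X = np.column_stack([np.ones(n), rng.normal(size=n)])
y = (0.5 + 0.2 * X[:, 1] + rng.normal(scale=0.3, size=n) > 0.5).astype(float)
b_ols, b_wls = lpm_wls(X, y)
```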

However, the problems with this procedure (least squares or weighted least squares) are:

1. The estimated weight ŵ_i = ŷ_i(1 − ŷ_i) may be negative.
2. The errors u_i are not normally distributed, so there is a problem with the application of the usual tests of significance.
3. The conditional expectation E(y_i | x_i) is interpreted as the probability that the event will occur, but in many cases the fitted value can lie outside the limits (0, 1).

6.2. The Probit and Logit Models

An alternative approach is to assume the following regression model:

y_i* = β'x_i + u_i    (6.6)

where y_i* is not observed; it is commonly called a latent variable. What we observe is a dummy variable y_i defined by:

y_i = 1 if y_i* > 0, and y_i = 0 otherwise    (6.7)

The Probit and Logit models differ in the specification of the distribution of the error term u_i.

For instance, if the observed dummy variable indicates whether or not a person is employed, y_i* would be defined as the 'propensity or ability to find employment.'

Thus,

P(y_i = 1) = P(y_i* > 0) = P(u_i > −β'x_i) = 1 − F(−β'x_i)    (6.8)

where F is the cumulative distribution function of u_i.

a) The Probit Model:

The probability based on the cumulative standard normal distribution is given by:

P(Y = 1) = ∫ from −∞ to Z of (1/√(2π)) e^(−t²/2) dt = Φ(Z)    (6.9)

where Z = β₀ + β₁X₁ + β₂X₂ + ... + β_k X_k.

b) The Logit Model:

The cumulative logistic function for the Logit model is based on the concept of an odds ratio.

Let the log odds that Y = 1 be given by:

ln(p / (1 − p)) = β₀ + β₁X₁ + β₂X₂ + ... + β_k X_k    (6.10)
Solving for the probability that Y = 1 we get:

p / (1 − p) = e^Z    (6.11)

⇒ p = (1 − p) e^Z = e^Z − p e^Z
⇒ p (1 + e^Z) = e^Z
⇒ p = e^Z / (1 + e^Z) = 1 / (1 + e^(−Z))    (6.12)

The above logistic probability is simply denoted as Λ(Z).
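The algebra above reduces to a one-line function; the following NumPy sketch (function name assumed) computes P(Y = 1) = 1/(1 + e^(−Z)):

```python
import numpy as np

def logistic_prob(Z):
    # p = e^Z / (1 + e^Z) = 1 / (1 + e^{-Z})   (eqs. 6.11-6.12)
    return 1.0 / (1.0 + np.exp(-np.asarray(Z, dtype=float)))
```

At Z = 0 the odds p/(1 − p) equal 1, so the probability is exactly 0.5.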

Both the Probit and Logit distributions are 'S'-shaped, but they differ in the relative thickness of their tails: the Logit has thicker tails than the Probit. This difference, however, tends to disappear as the sample size gets large.

The relationship between Z and P(Y = 1) can be represented through a 'latent' underlying index that determines choices. The latent index Z is determined in linear fashion by a set of independent variables X. In turn, the latent index Z determines P(Y = 1).

The Bernoulli trial of the Probit and Logit models conditional on Z is given by:

f(Y | Z) = P(Y = 1)^Y [1 − P(Y = 1)]^(1−Y)    (6.13)

Plugging either the standard normal cumulative distribution function (for Probit) or the cumulative logistic function (for Logit) into the above function gives the appropriate probability function:

f(Y_i | Z) = Φ(Z)^(Y_i) [1 − Φ(Z)]^(1−Y_i)    (6.14)

for the Probit model

f(Y_i | Z) = Λ(Z)^(Y_i) [1 − Λ(Z)]^(1−Y_i)    (6.15)

for the Logit model

The likelihood function for these models is given by:

L(β | Y_i, X_i) = ∏ from i=1 to n of Φ(z)^(Y_i) [1 − Φ(z)]^(1−Y_i)    (6.16)

for the Probit model

L(β | Y_i, X_i) = ∏ from i=1 to n of Λ(z)^(Y_i) [1 − Λ(z)]^(1−Y_i)    (6.17)

for the Logit model

The log likelihood function of these models is given as:

ln L(β | Y_i, X_i) = Σ from i=1 to n of [Y_i ln Φ(z) + (1 − Y_i) ln(1 − Φ(z))]    (6.18)

for Probit

ln L(β | Y_i, X_i) = Σ from i=1 to n of [Y_i ln Λ(z) + (1 − Y_i) ln(1 − Λ(z))]    (6.19)

for Logit

These functions can be optimized using standard methods to get the


parameter values.
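As a sketch of such an optimization (the synthetic data and function names are assumptions, not from the chapter), the Logit log likelihood in (6.19) can be maximized numerically with scipy.optimize:

```python
import numpy as np
from scipy.optimize import minimize
from scipy.special import expit

def logit_negloglik(beta, X, y):
    # Negative of eq. 6.19, rewritten as -sum_i [y_i*z_i - ln(1 + e^{z_i})]
    # and computed with logaddexp for numerical stability.
    z = X @ beta
    return -np.sum(y * z - np.logaddexp(0.0, z))

# Synthetic data (an assumption for this sketch)
rng = np.random.default_rng(1)
n = 2000
X = np.column_stack([np.ones(n), rng.normal(size=n)])
beta_true = np.array([-0.5, 1.0])
y = (rng.random(n) < expit(X @ beta_true)).astype(float)

res = minimize(logit_negloglik, x0=np.zeros(2), args=(X, y), method="BFGS")
beta_hat = res.x
```

With 2,000 draws the estimates should land close to the true coefficients used to generate the data.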

In choosing between the Probit and Logit models, there is no statistical theory for preferring one over the other. The two models give quite similar results in large samples, although their estimates can differ noticeably in small samples.

However, the choice between the two models can be made on grounds of convenience: Probit probabilities are easy to compute from the standard normal (z) table, while the Logit is mathematically simpler.

The probability model in the form of a regression is:

E(Y | X) = 0 · [1 − F(β'X)] + 1 · F(β'X) = F(β'X)    (6.20)

Whatever distribution is used, the parameters of the model, like those of any other nonlinear regression model, are not necessarily the marginal effects:

∂E(Y | X)/∂X = [dF(β'X)/d(β'X)] β = f(β'X) β    (6.21)

where f(·) is the density function that corresponds to the cumulative distribution function F(·).

a) For the normal distribution, this is:

∂E(Y | X)/∂X = φ(β'X) β    (6.22)

where φ(·) is the standard normal density.

b) For the logistic distribution:

dΛ(β'X)/d(β'X) = e^(β'X) / [1 + e^(β'X)]² = Λ(β'X)[1 − Λ(β'X)]    (6.23)

∂E(Y | X)/∂X = Λ(β'X)[1 − Λ(β'X)] β    (6.24)

In interpreting the estimated model, the marginal effects are in most cases evaluated at the means of the regressors; in other instances, pertinent values chosen by the researcher are used.

For a binary independent variable, say X_k, the marginal effect can be computed as:

Prob(Y = 1 | X̄*, X_k = 1) − Prob(Y = 1 | X̄*, X_k = 0)    (6.25)

where X̄* denotes the means of all the other variables in the model.

Therefore, the marginal effects can be evaluated at the sample means of the data, or they can be evaluated at every observation and then averaged to represent the marginal effects. More generally, the marginal effects are given as in equation (6.21).
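Both evaluation strategies can be sketched for the Probit case of (6.22); the helper name and the toy data are assumptions for this example:

```python
import numpy as np
from scipy.stats import norm

def probit_marginal_effects(X, beta):
    """Probit marginal effects, eq. 6.22: dE[Y|X]/dX = phi(beta'X) * beta."""
    # Marginal effect at the means of the regressors
    mem = norm.pdf(X.mean(axis=0) @ beta) * beta
    # Average marginal effect: evaluate phi at every observation, then average
    ame = norm.pdf(X @ beta).mean() * beta
    return mem, ame

# Toy design matrix and coefficients (assumptions for this sketch)
X = np.column_stack([np.ones(4), np.array([0.0, 1.0, 2.0, 3.0])])
beta = np.array([0.1, 0.2])
mem, ame = probit_marginal_effects(X, beta)
```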

6.3. Estimation of Binary Choice Models

The log likelihood function for the two models is:



log L = Σ_i { y_i log F(β'X_i) + (1 − y_i) log[1 − F(β'X_i)] }    (6.26)

The first-order condition with respect to the parameters of the model is given by:

∂log L/∂β = Σ_i [ y_i (f_i/F_i) − (1 − y_i) (f_i/(1 − F_i)) ] X_i = 0    (6.27)

where f_i is the density dF_i/d(β'X_i); the subscript i indicates that the function has argument β'X_i.
i) For the normal distribution (Probit), the log likelihood is:

log L = Σ over y_i=0 of log[1 − Φ(β'X_i)] + Σ over y_i=1 of log Φ(β'X_i)    (6.28)

∂log L/∂β = Σ over y_i=0 of [−φ_i/(1 − Φ_i)] X_i + Σ over y_i=1 of [φ_i/Φ_i] X_i = 0    (6.29)

ii) For the Logit model, the log likelihood is:

ln L(β | Y_i, X_i) = Σ from i=1 to n of [Y_i ln Λ(z) + (1 − Y_i) ln(1 − Λ(z))]    (6.30)

∂log L/∂β = Σ_i (y_i − Λ_i) X_i = 0


6.4. Measures of Goodness of fit

When the dependent variable is dichotomous, there is a problem with using the conventional R² as a measure of goodness of fit.

1. Measures based on likelihood ratios

Let L_UR be the maximum of the likelihood function when maximized with respect to all the parameters, and let L_R be the maximum when maximized with the restriction that all slope coefficients are zero. An R²-type measure is:

R² = 1 − (L_R / L_UR)^(2/n)    (6.31)

Cragg and Uhler (1970) suggested a pseudo-R² that lies between 0 and 1:

pseudo-R² = [L_UR^(2/n) − L_R^(2/n)] / {[1 − L_R^(2/n)] L_UR^(2/n)}    (6.32)

McFadden (1974) defined R² as:

R² = 1 − (ln L_UR / ln L_R)    (6.33)

R² can also be defined in terms of the proportion of correct predictions. After computing the fitted probability ŷ_i, we can classify an observation as belonging to group 1 if ŷ_i > 0.5 and to group 2 if ŷ_i ≤ 0.5,    (6.34)

and then count the number of correct predictions:

Count R² = number of correct predictions / total number of observations    (6.35)
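A minimal sketch of McFadden's R² and the count R² (function name and toy values assumed, not from the chapter), given fitted probabilities and the two maximized log likelihoods:

```python
import numpy as np

def fit_measures(y, p_hat, loglik_ur, loglik_r):
    """McFadden pseudo R^2 (eq. 6.33) and the count R^2 (eq. 6.35)."""
    # McFadden: 1 - ln L_UR / ln L_R (restricted = slopes set to zero)
    mcfadden = 1.0 - loglik_ur / loglik_r
    # Count R^2: classify as group 1 when the fitted probability > 0.5
    y_pred = (np.asarray(p_hat) > 0.5).astype(float)
    count_r2 = np.mean(y_pred == np.asarray(y))
    return mcfadden, count_r2

# Toy values (assumptions for this sketch)
y = np.array([1.0, 0.0, 1.0, 1.0])
p_hat = np.array([0.8, 0.3, 0.6, 0.4])
mcf, cr2 = fit_measures(y, p_hat, loglik_ur=-10.0, loglik_r=-20.0)
```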

Example: The regression results of a Probit model of house ownership on income are given below:

6.36

We want to measure the effect of a unit change in income on the probability of owning a house:

∂P(Y = 1)/∂(income) = φ(β'X) β_income    (6.37)

where φ(·) is the standard normal probability density function evaluated at β'X.

At the chosen value of income, the normal density function is equal to

φ(β'X) = 0.3066    (6.38)

Now multiplying this value by the slope coefficient of income, we get 0.01485.

Logit model of owning a house:

6.39

This means that for a unit increase in weighted income, the weighted log of the odds in favour of owning a house goes up by 0.08 units.

Converting this into an odds ratio, we take the antilog:

e^0.08 ≈ 1.083    (6.40)
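The antilog step can be checked numerically:

```python
import numpy as np

# A 0.08 rise in the log odds multiplies the odds by exp(0.08).
odds_ratio = np.exp(0.08)
# The odds of owning a house rise by roughly 8.3 percent.
```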

6.5. Maximum Likelihood Estimation

For the linear regression model, consider the MLE of a normal variable y_i conditional on X_i with mean β'X_i and variance σ². The pdf for an observation is:

f(y_i | X_i) = [1/√(2πσ²)] exp[−(y_i − β'X_i)²/(2σ²)]    (6.41)

The pdf of a normal variable with mean β'X_i and variance σ² is often expressed in terms of the pdf of a standardized normal variable with mean 0 and variance of 1:

φ(z) = [1/√(2π)] e^(−z²/2)    (6.42)

Thus, f(y_i | X_i) = (1/σ) φ((y_i − β'X_i)/σ)    (6.43)

The likelihood can be written as:

L(β, σ² | y, X) = ∏ from i=1 to n of (1/σ) φ((y_i − β'X_i)/σ)    (6.44)

6.6. Limited Dependent Variables

The density function of a normally distributed variable y with mean μ and variance σ² is given by:

f(y) = [1/(σ√(2π))] exp[−(y − μ)²/(2σ²)]    (6.45)

where y ~ N(μ, σ²).

For a standard normal distribution, z = (y − μ)/σ ~ N(0, 1).    (6.46)

The density of a standard normal variable is:

φ(z) = [1/√(2π)] e^(−z²/2)    (6.47)

The cumulative distribution function of a normal distribution is:

Φ(z) = ∫ from −∞ to z of φ(t) dt    (6.48)

Due to symmetry, Φ(−z) = 1 − Φ(z).

In limited dependent variable models we may encounter some form of truncation.

If y has density f(y), the distribution of y truncated from below at a given point c is given by:

f(y | y > c) = f(y)/P(y > c) if y > c, and 0 otherwise    (6.49)

If y is a standard normal variable, the truncated distribution has the density:

f(y | y > c) = φ(y)/[1 − Φ(c)]    (6.50)

If the distribution is truncated from above at c, the denominator is Φ(c) instead.

If y has a normal distribution with mean μ and variance σ², the truncated distribution has mean:

E(y | y > c) = μ + σ λ(α)    (6.51)

where α = (c − μ)/σ

and λ(α) = φ(α)/[1 − Φ(α)].
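The truncated mean formula (6.51) can be sketched and cross-checked against scipy.stats.truncnorm (the function name is an assumption of this sketch):

```python
import numpy as np
from scipy.stats import norm, truncnorm

def truncated_mean(mu, sigma, c):
    """E[y | y > c] for y ~ N(mu, sigma^2), eq. 6.51:
    mu + sigma * lambda(alpha), where alpha = (c - mu)/sigma and
    lambda(alpha) = phi(alpha) / (1 - Phi(alpha)) is the hazard function."""
    alpha = (c - mu) / sigma
    lam = norm.pdf(alpha) / (1.0 - norm.cdf(alpha))
    return mu + sigma * lam
```

For a standard normal truncated at zero this gives E[y | y > 0] = φ(0)/0.5 = √(2/π) ≈ 0.798.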

6.6.1. Tobit (Censored Regression) Model

In certain applications the dependent variable is continuous, but its range may be constrained. Most commonly this occurs when the dependent variable is zero for a substantial part of the population but positive for the rest of the population.

y_i = y_i* if y_i* > 0, and y_i = 0 if y_i* ≤ 0    (6.52)

where y_i* = β'x_i + u_i, with u_i ~ N(0, σ²).

In this model all negative values are mapped to zero, i.e. observations are censored (from below) at zero.

The model describes two things:

1. The probability that y_i = 0, given x_i:

P(y_i = 0) = P(u_i ≤ −β'x_i) = 1 − Φ(β'x_i/σ)    (6.53)

2. The distribution of y_i given that it is positive. This is a truncated normal distribution with expectation:

E(y_i | y_i > 0) = β'x_i + σ φ(β'x_i/σ)/Φ(β'x_i/σ)    (6.54)

The last term shows the conditional expectation of a mean-zero normal error given that it is larger than −β'x_i. The conditional expectation of y_i no longer equals β'x_i but depends nonlinearly on x_i through the ratio φ/Φ.
Marginal effects of the Tobit Model

1. The probability of a zero outcome is:

P(y_i = 0) = 1 − Φ(β'x_i/σ)    (6.55)

2. The expected value of y_i (accounting for both zero and positive outcomes) is:

E(y_i) = β'x_i Φ(β'x_i/σ) + σ φ(β'x_i/σ)    (6.56)

Thus the marginal effect on the expected value of y_i of a change in x_k is given by:

∂E(y_i)/∂x_k = β_k Φ(β'x_i/σ)    (6.57)

This means the marginal effect of a change in x_k upon the expected outcome is given by the model's coefficient multiplied by the probability of having a positive outcome.

3. The marginal effect upon the latent variable y_i* is simply β_k.

Maximum Likelihood Estimation of the Tobit Model

The contribution of an observation to the likelihood either equals the probability mass at the observed point y_i = 0, or the conditional density of y_i, given that it is positive, times the probability mass of observing y_i > 0:

L = ∏ over y_i=0 of P(y_i = 0) × ∏ over y_i>0 of f(y_i | y_i > 0) P(y_i > 0)    (6.58)

Using the appropriate expressions for the normal distribution we obtain:

L(β, σ²) = ∏ over y_i=0 of [1 − Φ(β'x_i/σ)] × ∏ over y_i>0 of (1/σ) φ((y_i − β'x_i)/σ)    (6.59)

Maximizing this function with respect to the parameters will give the maximum likelihood estimates.
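A sketch of the Tobit log likelihood in (6.59); the function name and the test values are assumptions of this example:

```python
import numpy as np
from scipy.stats import norm

def tobit_loglik(params, X, y):
    """Tobit log likelihood, eq. 6.59 (in logs): censored observations
    (y = 0) contribute ln Phi(-x'b/sigma) = ln[1 - Phi(x'b/sigma)];
    positive observations contribute ln[(1/sigma) phi((y - x'b)/sigma)]."""
    beta, sigma = params[:-1], params[-1]
    xb = X @ beta
    zero = y <= 0.0
    ll_zero = norm.logcdf(-xb[zero] / sigma).sum()
    ll_pos = (norm.logpdf((y[~zero] - xb[~zero]) / sigma) - np.log(sigma)).sum()
    return ll_zero + ll_pos

# Tiny check with assumed values: one censored and two positive observations
X = np.ones((3, 1))
y = np.array([0.0, 1.0, 2.0])
ll = tobit_loglik(np.array([0.5, 1.0]), X, y)
```

This function can be handed to a numerical optimizer (as in the binary-choice case) to obtain the ML estimates of β and σ.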

6.6.2. Sample Selection

The Tobit model imposes a structure that is often restrictive: exactly the same variables that affect the probability of a nonzero observation determine the level of a positive observation, and moreover with the same sign. This implies, for example, that those who are more likely to spend a positive amount are, on average, also those who spend more on durable goods.

For example, we might be interested in explaining wages. Obviously


wages are observed for people who are actually working, but we might be interested in (potential) wages not conditional on this selection.

For example, a change in some variable may lower someone’s wage


such that he decides to stop working. Consequently, his wage would not
be observed and the effect of this variable could be underestimated from
the available data.

Because a sample of workers may not be a random sample of the population (of potential workers), one may expect that people with lower (potential) wages are more likely to be unemployed. This problem is often referred to as sample selection.

Consider the following sample selection model of wages:

w_i = β'x_i + ε_i    (6.60)

where x_i denotes a vector of exogenous characteristics of the person and w_i denotes the person's wage.

The wage is not observable for people who are not working.

Thus, to describe whether a person is working or not, a second equation is specified, which is of the binary choice type:

h_i* = γ'z_i + η_i    (6.61)

where

h_i = 1 if h_i* > 0 (working), and h_i = 0 otherwise (not working)    (6.62)

The binary variable h_i indicates working or not working. The error terms ε_i and η_i of the two equations have means of zero, variances σ_ε² and σ_η² respectively, and covariance σ_εη.

One usually imposes the normalization restriction σ_η² = 1 for the Probit model. The conditional expected wage, given that a person is working, is given by:

E(w_i | h_i = 1) = β'x_i + σ_εη φ(γ'z_i)/Φ(γ'z_i)    (6.63)

The conditional expected wage equals β'x_i only if σ_εη = 0. So if the error terms of the two equations are not correlated, the wage equation can be consistently estimated by OLS.

A sample selection bias in OLS arises if σ_εη ≠ 0.

The term φ(γ'z_i)/Φ(γ'z_i) is known as the inverse Mills ratio, denoted λ(γ'z_i) by Heckman (1979); the resulting two-step estimator is referred to as Heckman's model.
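The inverse Mills ratio itself is a simple function of the standard normal pdf and cdf; a minimal sketch (function name assumed):

```python
import numpy as np
from scipy.stats import norm

def inverse_mills(z):
    """Heckman's inverse Mills ratio: lambda(z) = phi(z) / Phi(z),
    the selection-correction term in the conditional wage equation."""
    return norm.pdf(z) / norm.cdf(z)
```

In the two-step procedure, this term, evaluated at the first-stage Probit index γ'z_i, is added as an extra regressor in the wage equation.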
