binary
binary
ØThe key here is that the level of education is the same in both
expectations; the difference, 𝛿, is due to gender only.
ØThat is the regression lines have the same slopes but they differ in their
intercepts.
Suppose 𝛿 <0 (i.e., male earn more wage compared to female of the same
education level). Graphically,
Multiple Dummy Variables Regression Models
ØSuppose we have several dummy explanatory variables. For simplicity let
Y be monthly salaries of public school teachers in Addis Ababa, Sidama
and Oromia (three regions in Ethiopia).
ØThen the multiple linear regression model (assuming all independent
variables are dummy variables) is given by: Y = β1 + β2 D2 + β3 D3 + ε i
⎧1 if the region is Oromia
D2 = ⎨
⎩0 otherwise
1 𝑖𝑓 𝑡ℎ𝑒 𝑟𝑒𝑔𝑖𝑜𝑛 𝑖𝑠 𝑆𝑖𝑑𝑎𝑚𝑎
𝐷) = +
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
Now, Addis Ababa is the base group.
Con…
ØAssuming that the error term satisfies the usual OLS assumptions,
from the multiple linear regression model we obtain:
ØMean salary of public school teachers in the Oromia region:
E(Yi / D2 = 1, D3 = 0) = 𝜷1 + 𝜷2
ØMean salary of public school teachers in the Sidama region is:
E(Yi /D2 = 0, D3 = 1) = 𝜷1 + 𝜷3
ØThe mean salary of teachers in the Addis Ababa region is given by:
E(Y /D2 = 0, D3 = 0) = 𝜷1
Example
• Suppose the results based on our multiple regression model are as
follow:
𝑒 WN
𝑃' 1 + 𝑒 WZ
N
Y1 − 𝑃 = 1 = 𝑒 WN
'
1 + 𝑒 WN
ØTaking the natural logarithm of the above equation, we obtained
[N
𝑙𝑛 YI\[N = 𝑙𝑛 𝑒 WN => 𝛽I + 𝛽Q 𝑋' + 𝑈'
Characteristics of the log odds ratio
ØIt is linear in 𝑋I and 𝛽'
ØAs 𝑃' goes from 0 to 1 (i.e. as Z varies from −∞ to +∞), the logit
L goes from −∞ to +∞. That is, logit is not bounded but
probability is bounded.
ØL is linear in 𝑋' , but the probability of themselves are not.
ØIf L, the log of odd ratio, is positive; it means when the value of
the explanatory variable(s) increases, the odd that the dependent
equals 1 increase and vice versa.
Con…
ØWe are using maximum likelihood which is generally a large
sample method, hence instead of assuming the t-statistics to
evaluate the statistical significance of a coefficient, we use the
standard normal Z-statistic.
ØSlope (βI ) measures the change in L for a unit change in X. that is it
tells how the log odds ratio in favor of success change due to a unit
change of X.
Probit model
ØThe logit model, based on the cumulative logistic distribution
function, helps to estimate regression when the dependent
variable is qualitative.
ØBut, probit model is not only cumulative distribution function
(CDF) that one can use.
ØIn some applications the normal cumulative distribution function
has been found useful. This is called Probit model.
Example
ØSuppose the choice is whether to work or not. The discrete dependent
variable we are working with will assume only two values 0 and 1
1 𝑖𝑓 𝑡ℎ𝑒 𝑖𝑡ℎ 𝑖𝑛𝑑𝑖𝑣𝑖𝑑𝑢𝑎𝑙 𝑖𝑠 𝑤𝑜𝑟𝑘𝑖𝑛𝑔 𝑜𝑟 𝑠𝑒𝑒𝑘𝑖𝑛𝑔 𝑤𝑜𝑟𝑘
Y= +
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
ØWhere i = 1,2,……n. the independent variable that are expected to
affect an individual’s choice may be X1= age, X2= martial status,
X3= gender, X4= education, etc.
ØThe economic interpretation of discrete choice model is typically
based on the principle of utility maximization leading to choice of,
say, A over B if the utility of A exceeds that of B.
Con…
ØTo motivate the probit model, assume that in our employment status
example the decision of the ith individual to participate or not depends
on an unobservable utility index yi* (also known as a latent
variable), that is determined by one or more explanatory variables,
say income.
• In such a way that the large the value of the index yi*, the greater the
probability of an individual to be part of the labor market.
ØLatent variable: unobservable y* which can take all values in (−∞,
+ ∞).
Con…
• y* =utility(labor income) – utility(non labor income)
• Underlying latent model: ∗
1, 𝑦 > 0
yi = + ∗
0, 𝑦 ≤ 0
Where; y* = 𝑥i𝛽 + ℇ𝑖
• probit is based on a latent model
∗
P(𝑦' = 1/𝑥) = 𝑝(𝑦 > 0/𝑥)
=P(𝑥𝛽 + ℇ𝑖 > 0/𝑥)
=P(ℇ𝑖 > − 𝑥𝛽/𝑥)
1 − F(− 𝑥𝛽)
Choosing Between Logit or probit
vHolding other factors constant, the odds ratio in favor of scoring an A grade in cost one
increases by a factor of 16.879 as the entrance exam result increases by one grade point.
vCeteris paribus, the odds ratio in favor of scoring an A grade in cost one increases by a factor
of 10.79 if the student has exposure to the new innovative teaching method (PSI).
vGPA and PSI are found to be statistically significant factors at 5 percent significance level.
Application of Probit Model When the dependent Variable
takes only two values
Probit regression Number of obs = 32
LR chi2(3) = 15.55
Prob > chi2 = 0.0014
Log likelihood = -12.818803 Pseudo R2 = 0.3775