Review Final Ex
Review Final Ex
Review Final Ex
Adjusted R-squared:
Question 1
Use stata:
Log(wage) = 0.128 + 0,09-4 educ + 0,041 exper – 0,000714 exper 2
(0,106) (0,0075). (0,0052) (0,000116)
n=526; R2=0,3, adj R2= 0,296
ii. Is exper2 statistically significant at the 1% level?
find the approximate return to the fifth year of experience. What is the
approximate return to the twentieth year of experience?
iv. At what value of exper does additional experience actually lower predicted
log(wage)? How many people have more experience in this sample?
Question 2
Using the data in RDCHEM, the following equation was obtained by OLS:
i. At what point does the marginal effect of sales on rdintens become negative?
ii. Would you keep the quadratic term in the model? Explain. (sig thì giữ)
iv. For the purpose of reporting the results, which equation do you prefer?
Question 3
The following model allows the return to education to depend upon the total amount of
both parents’ education, called pareduc:
log ( wage )=β 0 + β 1 edu+ β2 educ . pareduc+ β 3 exper + β 4 tenure+ u
v. Show that, in decimal form, the return to another year of education in this model
is
∆ log ( wage ) / ∆ educ=β 1 + β 2 pareduc
What sign do you expect for β 2? Why?
HOFF , β 2 positive (more highly ecucated parents, the more children can get from
another year of education)
The coef on interaction term sig at 1%, indicating that an additionsl year of
education yields a higher increase in wage for children with more highly educated
parents
Compare 2 groups of parents (pareduc=32 vs 24)
The difference in estimated return to education = 0,00078 (32-24)= 0,0062
Chapter 7
Interaction term
• How to explain the coefficient on interaction term:
- if coefficient is insignificant, there is no difference in effect of education on wage
between men and women.
- if coefficient is negative and significant, the effect of education on wage for
women is weaker than for men.
Question 1
^
colgpa = 1.241 0.0569hsize + 0.00468 hsize2 0.0132hsperc + 0.00165 sat
(0.079) (0.0164) (0.00225) (0.0006) (0.00007)
(0.018) (0.042)
where colgpa is cumulative college grade point average; hsize is size of high
school graduating class, in hundreds; hsperc is academic percentile in graduating
class; sat is combined SAT score; female is a binary gender variable; and athlete
is a binary variable, which is one for student-athletes.
v. What is the estimated GPA differential between athletes and nonathletes? Is it
statistically significant?
vi. In the model, allow the effect of being an athlete to differ by gender and test the
null hypothesis that there is no ceteris paribus difference between women
athletes and women nonathletes (base group).
^
colgpa = 1.396 0.0568 hsize + 0.00467 hsize2 0.0132 hsperc + 0.00165 sat
(0.076) (0.0164) (0.00225) (0.0006) (0.00007)
Question 2
and
The variables are defined as in Example 4.9, but we have added a dummy
variable for whether the child is male and a dummy variable indicating whether
the child is classified as white.
i. In the first equation, interpret the coefficient on the variable cigs. In particular,
what is the effect on birth weight from smoking 10 more cigarettes per day?
ii. How much more is a white child predicted to weigh than a nonwhite child,
holding the other factors in the first equation fixed? Is the difference statistically
significant?
Tstat= 4,23 => sig at 1%
HOFF, white child is predicted to weigh about 5,5% than a nonwhite child
iv. From the given information, why are you unable to compute the F statistic for
joint significance of motheduc and fatheduc? What would you have to do to
compute the F statistic?
Different sample => run sample from second equation, reestimeta equation 1 with the
same observations.
Question 3
The variable sat is the combined SAT score; hsize is size of the student’s high
school graduating class, in hundreds; female is a gender dummy variable; and
black is a race dummy variable equal to one for blacks, and zero otherwise.
i. Is there strong evidence that hsize2 should be included in the model? From this
equation, what is the optimal high school size?
HOFF, sat score of nonblack female lower than nonblack male (45 points). Tstat = -
10,51 => very stat sig
iii. What is the estimated difference in SAT score between nonblack males and
black males? Test the null hypothesis that there is no difference between their
scores, against the alternative that there is a difference.
HOFF, sat score of nonblack males is lower than black males (169 points)
Tstat -13,4
iv. What is the estimated difference in SAT score between black females and
nonblack females? What would you need to do to test whether the difference is
statistically significant?
Chapter 8
• Estimate OLS as usual -> obtain residuals -> square the residuals ()
• Run regression using squared residuals as the dependent variable and all independent
variables.
• Obtain fitted value of (h)
• Transform the model using weight = 1/h
1. Using the data in GPA3, the following equation was estimated for the fall and second
semester students:
ii. Why does the hypothesis H0: β crsgpa =1 make sense? Test this hypothesis against
the two-sided alternative at the 5% level, using both standard errors. Describe
your conclusions.
iii. Test whether there is an in-season effect on term GPA, using both standard
errors. Does the significance level at which the null can be rejected depend on the
standard error used?
i. Discuss any important differences with the usual standard errors. Làm giống
câu 1 bài trên
Chapter 9
- For endogeneity
- independent variables: all other independent variables
- add at least 2 instrumental variables
• Predict residuals of model (1)
• Add residuals into the original model
• If the residuals are significant, the variable is endogenous.
- GMM model
Question 1
F stat=( ( Rur 2- Rr 2)/q) / (1- Rur 2)/n-k-1 = ((0,0375 -0,353)/2)/(1-0,375)/169= 2,97 > 2,3
Reject Η o at 10% => two newly added variables are jointly sig at 10%
Model misspecification
Question 2
i, lnchprg: the percentage of student eligible for federally funded school lunch program.
Elgibility for funded school program is closely linked to being in poverty, so the percentage
of student eligible for this program is similar to the percentage of students living in poverty.
ii,
iii,
iv,
v,
Question 3
i) Use CEOSAL1, define a dummy variable, rosneg, which is equal to one if ros < 0
and equal to zero otherwise. Then, estimate the model:
(ii) Apply RESET test to the estimated model. Is there evidence of functional form
misspecification in the equation?
Chapter 10
Seasonality: biến thay đổi theo từng tháng Static-model: x và y là cùng t
Question 1
Use the data in BARIUM for this exercise. Equation (10.22) is run and the result is
shown below:
(i) Add a linear time trend to equation (10.22). Are any variables, other than the
trend, statistically significant?
(ii) In the equation estimated in part (i), test for joint significance of all variables
except the time trend. What do you conclude?
(iii) Add monthly dummy variables to this equation and test for seasonality. Does
including the monthly dummies change any other estimates or their standard
errors in important ways?
Question 2
Use the data set CONSUMP for this exercise.
(i) Estimate a simple regression model relating the growth in real per capita
consumption (of nondurables and services) to the growth in real per capita
disposable income. Use the change in the logarithms in both cases. Report the
results in the usual form. Interpret the equation and discuss statistical
significance.
(ii) Add a lag of the growth in real per capita disposable income to the equation
from part (i). What do you conclude about adjustment lags in consumption
growth?
(iii) Add the real interest rate to the equation in part (i). Does it affect
consumption growth?
WRAP UP:
Question 1:
This equation allows roe to have a diminishing effect on log(salary). Is this generality
necessary?
Question 2:
HOFF, the salary from utility is lower than salary from transportation industries 28,3% point.
Question 3:
Use the data in INTDEF, Model 1 is run to investigate the impact of inflation and
budget deficit on short-term interest rate and its result is as follows:
^=
i3 1.73 + 0.606 inft + 0.513 deft
t
(0.43) (0.082) (0.118)
The variable i3 is the three-month T-bill rate, inf is the annual inflation rate based on the
consumer price index (CPI), and def is the federal budget deficit as a percentage of GDP.
HOFF, if inflation rate increases by 1% point the 3-month bill rate is expected to increase by
0,606% point
HOFF, if def increase by 1% point 3-month bill rate is expected to increase by 0,513% point
(ii) The first lag of inf and def are added to Model 1, and the result is reported.
^ =1.61
i3 +0.343 inft +0.382 inft-1 0.190 deft +0.569 deft-1
t
(0.40) (0.125) (0.134) (0.221) (0.197)
Are these two lags individually significant? Are they jointly significant?
(ii) Compare the estimated LRP (Long-Run Parameter) for the effect of inflation
with that in first equation. Are they vastly different?
They have no vastly different ( 0,725 -0,606 = 0,119). They are still sig at 1%
Question 4: