ECMT1020: Introduction To Econometrics Tutorial Questions, Week 3
ECMT1020: Introduction To Econometrics Tutorial Questions, Week 3
(a) Load the data and obtain the sample mean and standard deviation.
(b) Use these results to find intervals that 68% and 95% of the data should lie in, respectively, if we
assume a normal distribution.
(c) Find the proportion of the data that actually lies in the intervals you computed in part (b). How would
you explain the discrepancy?
(d) Create a histogram of the data. How does it relate to your findings in part (c)?
Question 2. For this question, use the data set intrate.dta, also available on Canvas. It contains
monthly data on the nominal interest rate (intrate) and inflation (infl) in the United States, for
1980–2009.
(a) We first focus on the infl variable. Find its mean, standard deviation, skewness, and kurtosis, and
plot a histogram of the sample. Show that both the summary statistics and the histogram suggest that this
variable is not normally distributed.
(b) We introduced the t statistic and the related test under the assumption that the data came from a
normal distribution. Why can we still use this test, despite what we found in part (a)?
(c) Macroeconomics textbooks often state that the average inflation rate for an industrialized country
should be 4% in the long run. Based on your results from part (a), but without using the ttest com-
mand, find the t statistic and the p-value for testing the hypothesis that 4% is the population mean
inflation rate. What do you conclude?
(d) Use the ttest command to confirm your results in part (c).
(e) Find the mean and standard deviation of the interest rates as well.
(f) Create a new variable, measuring the real interest rate: the nominal interest rate minus the inflation
rate. Also find this variable’s mean and standard deviation.
(g) The variable you created in part (f) is defined as a linear combination of two variables: it is equal
to a·intrate + b·infl, where a = 1 and b = −1. Show that the formula for the mean of a linear
combination correctly gives the result that you found in part (f).
(h) A similar formula for the variance of a linear combination was given in the lecture slides. Show that
this formula does not give the correct result in this case. Why is that?
(i) The capital gains tax system was constructed around the assumption that on average, the real interest
rate should be 2.5% per year. Test whether this is the case for the years in this data set.
Non-stata questions: Even in the weeks with Stata tutorials, I will occasionally include some pencil-
and-paper questions. Your tutor may cover these questions if time allows - solutions will be posted to
Canvas.
n
X
Question 4. This question focuses on the sum of squares (xi − x̄)2 , which pops up in the definition
i=1
of a variance, as well as in several other places, as we will see later on in the semester.
n
X
Prove that this sum of squares can be simplified to x2i − nx̄2 .
i=1