Your Answer Score Explanation
Your Answer Score Explanation
diameter measurements, as well as age, weight, height and gender, for 507 physically
active individuals. The histogram below shows the sample distribution of heights in
centimeters, and the table shows sample statistics calculated based on this sample. Which
This question refers to the following learning objective(s): Define sample statistic as a
point estimate for a population parameter, for example, the sample mean is used to
estimate the population mean, and note that point estimate and sample statistic are
synonymous.
Question 2
Which of the following is false?
Your Answer Score Explanation
Standard error measures the Referring to the formula SE=σ/n−−√, standard
variability in means of samples of the error is calculated using samples from
same size taken from different C1.00 the same population (which has population
populations. o standard deviation σ).
r
In order to reduce the standard error r
by half, sample size should be e
increased by a factor of 4. c
Standard error computed based on a t
sample standard deviation will always
be lower than the standard deviation
of that sample.
As the sample size increases, the
variability of the sampling distribution
decreases.
1.00 /
Total
1.00
Question Explanation
This question refers to the following learning objective(s): Calculate the sampling
variability of the mean, the standard error, as SE=σ/n−−√.
Question 3
The standard error measures:
Your Answer Score Explanation
the variability of the sampled observations
the variability in the population
the variability of population parameters
the variability of sample statistics C1.00
Total o1.00 / 1.00
Question Explanation r
r
e
This question refers to the following learning objective(s): Distinguish standard deviation
(σ or s) and standard error (SE): standard deviation measures the variability in the data,
c
t
while standard error measures the variability in point estimates from different samples of
the same size and from the same population, i.e. measures the sampling variability.
Question 3
Students are asked to count the number of chocolate chips in 22 cookies for a class
activity. They found that the cookies on average had 14.77 chocolate chips with a standard
deviation of 4.37 chocolate chips. After collecting the data, a student reports the standard
error of the mean to be 0.93 chocolate chips. What is the best way to interpret the
student’s result?
Your Answer Score Explanation
The student either made a calculation error or his result is meaningless,
because it does not make sense to talk about 0.93 chocolate chips.
0.93 is the standard deviation of the number of chocolate chips in a
chocolate chip cookie.
0.93 chocolate chips is a measure of the variability in the mean number of
chocolate chips across all chocolate chip cookies.
0.93 chocolate chips is a measure of the variability we’d expect in
calculations of the mean number of chocolate chips if we took repeated C1.00
random samples of 22 cookies. o
r 1.00 /
Total r 1.00
Question Explanation e
c
This question refers to the following learning objective(s): Distinguishtstandard deviation
(σ or s) and standard error (SE): standard deviation measures the variability in the data,
while standard error measures the variability in point estimates from different samples of
the same size and from the same population, i.e. measures the sampling variability.
Question 4
Suppose you took a large number of random samples of size n from a large population and
calculated the mean of each sample. Then suppose you plotted the distribution of your
sample means in a histogram. Now consider the following possible attributes of your
collected data and the population from which they were sampled. For which of the
following sets of attributes would you not expect your histogram of your sample means to
Recognize that the Central Limit Theorem (CLT) is about the distribution of point
estimates, and that given certain conditions, this distribution will be nearly normal.
• The larger the sample size (n), the less important the shape of the distribution
becomes, i.e. when n is very large the sampling distribution will be nearly normal
regardless of the shape of the population distribution.
Question 5
The General Social Survey (GSS) is a sociological survey used to collect data on
demographic characteristics and attitudes of residents of the United States. In 2010, the
survey collected responses from over a thousand US residents. The survey is conducted
the questions on the survey is “For how many days during the past 30 days was your
mental health, which includes stress, depression, and problems with emotions, not good?”
Based on responses from 1,151 US residents, the survey reported a 95% confidence
interval of 3.40 to 4.24 days in 2010. Given this information, which of the following
statements would be most appropriate to make regarding the true average number of
• Interpret a confidence interval as “We are XX% confident that the true population
parameter is in this interval”, where XX% is the desired confidence level.
• Define margin of error as the distance required to travel in either direction away from
the point estimate when constructing a confidence interval.
Question 6
An insurance company is reviewing its current policy rates. When originally setting the
rates they believed that the average claim amount was $1,800. They are concerned that
the true mean is actually higher than this, because they could potentially lose a lot of
money. They randomly select 40 claims, which yield a sample mean of $1,950. Which of
• Note that the alternative hypothesis might be one-sided (μ the null value) or two-sided
(μ ≠ the null value), and the choice depends on the research question.
Question 7
Your friend likes to show off to his coworkers using statistical terminology, but he makes
errors so much that you often have to correct him. He just completed the following
hypothesis test:
H0 : μ = 100 ; HA : μ ≠ 100
x = 105, s = 10, n = 40
p-value = 0.0016
He claims the definition of this p-value is “the probability of obtaining a sample mean of
105 from a random sample of n = 40 when the true population mean is assumed to be
100.” Which of the following is true? (You may assume his calculations are correct, only
focus on his interpretation.)
Your Answer Score Explanation
Your friend is wrong, the statement should be revised as “the probability of
obtaining a sample mean of 105 or more extreme from a random sample C1.00
of n = 40 when the true population mean is assumed to be 100." o
Your friend is wrong, the sample size is irrelevant. r
r
Your friend is wrong, the statement should be revised as “the probability of
e
obtaining a sample mean of 105 from a random sample of n = 40 when the c
true population mean is assumed to be different than 105.” t
Your friend is right.
1.00 /
Total
1.00
Question Explanation
This question refers to the following learning objective(s): Define a p-value as the
conditional probability of obtaining a sample statistic at least as extreme as the one
observed given that the null hypothesis is true.
All but one of the following confidence intervals has a margin of error of 0.7. Which is the
confidence interval with the different margin of error?
Your
Score Explanation
Answer
(1.6,4.4)
The width of a confidence interval is 2 times the margin of error, since we
add and subtract the same margin of error to the sample statistics to obtain the
bounds of the confidence interval. To solve this question we need to calculate
(−0.5,0.9) I 0.00
the margin of error using this rule for each choice:
n
o
|(−0.5−0.9)/2|=0.7
r
(−4.7,−3.3) r
(20.3,21.7) e
c
0.00 /
Total t
1.00
Question Explanation
• Recognize that when the sample size increases we would expect the sampling
variability to decrease.
• Define margin of error as the distance required to travel in either direction away from
the point estimate when constructing a confidence interval, i.e. z⋆×SE.
Question 8
Which of the following is false about confidence intervals? All else held constant,
Your Answer Score Explanation
as the sample mean
increases, the margin
of error stays
constant.
This is false: To understand why, you could think about the most
extreme case: increasing the confidence level to 100% would mean
as the confidence
your confidence intervals (calculated under repeated samples)
level increases, the C1.00 would capture the true parameter 100% of the time! So increasing
width decreases. o the confidence level should increase the width of the confidence
r
interval.
r
as the standard e
deviation of the c
sample increases, the t
width increases.
as the sample size
increases, the margin
of error decreases.
1.00 /
Total
1.00
Question Explanation
• Recognize that when the sample size increases we would expect the sampling
variability to decrease.
• Define margin of error as the distance required to travel in either direction away from
the point estimate when constructing a confidence interval, i.e. z⋆×SE.
Question 9
A researcher found a 2006 - 2010 survey showing that the average age of women at first
marriage is 23.44. Suppose a researcher believes that this value may have increased more
recently, but as a good scientist he also wants to consider the possibility that the average
age may have decreased. The researcher has set up his hypothesis test; which of the
following states the appropriate HA correctly?
Your Answer Score Explanation
HA:μ<23.44 year
s old.
HA:μ≠23.44 years Because the researcher is interested in both an increase or a
old. C1.00 decrease, HA should be two-sided.
o
HA:μ>23.44 year r
s old. r
HA:μ=23.44 year e
s old. c
t
1.00 /
Total
1.00
Question Explanation
This question refers to the following learning objective(s): Note that the alternative
hypothesis might be one-sided (μ the null value) or two-sided (μ≠ the null value), and the
choice depends on the research question.
Question 9
One-sided alternative hypotheses are phrased in terms of:
Your Answer Score Explanation
≈ or =
≠
≤ or ≥
< or > C1.00
Total o1.00 / 1.00
r
Question Explanation
r
This question erefers to the following learning objective(s): Note that the alternative
c be one-sided (μ the null value) or two-sided (μ≠ the null value), and the
hypothesis might
choice dependst on the research question.
Question 10
A Type 2 error occurs when the null hypothesis is
Your Answer Score Explanation
rejected when it is true
not rejected when it is true
not rejected when it is false C1.00
rejected when it is false o
Total r 1.00 / 1.00
r
Question Explanation e
c
This question refers to the
t following learning objective(s): Note that the conclusion of a
hypothesis test might be erroneous regardless of the decision we make.
• Define a Type 1 error as rejecting the null hypothesis when the null hypothesis is
actually true.
• Define a Type 2 error as failing to reject the null hypothesis when the alternative
hypothesis is actually true.
Question 10
You set up a two-sided hypothesis test for a population mean μ with a null hypothesis
of H0:μ=100. You chose a significance level of α=0.05. The p-value calculated from the
data is 0.12, and hence you failed to reject the null hypothesis. Suppose that after your
analysis was completed and published, an expert informed you that the true value of μ is
104. How would you describe the result of your analysis?
Your Answer Score Explanation
A Type 2 error was made because you failed Type 2 error is made when the null
to reject the null hypothesis but μ is really C1.00 hypothesis is failed to be rejected when it
not equal to 100. o was indeed false.
You did not make a Type 1 or Type 2 error. r
r
A Type 1 error was made because you failed
e
to reject the null hypothesis but μ is really c
not equal to 100. t
1.00 /
Total
1.00
Question Explanation
This question refers to the following learning objective(s): Note that the conclusion of a
hypothesis test might be erroneous regardless of the decision we make.
• Define a Type 1 error as rejecting the null hypothesis when the null hypothesis is
actually true.
• Define a Type 2 error as failing to reject the null hypothesis when the alternative
hypothesis is actually true.
Question 11
A statistician is studying blood pressure levels of Italians in the age range 75-80. The
IV. The distribution of sample observations is skewed - the skew is easy to see, although
not very extreme.
The researcher is ready to use the Central Limit Theorem (CLT) in the main part of her
analysis. Which aspect of the her study is most likely to prevent her from using the CLT?
Your Answer Score Explanation
The correct answer is that the data arose as a result of an
(I), because the sample may
email survey. This data collection would likely result in a
not be random and hence
observations may not be C1.00 sample which is not a simple random sample of Italians aged
o 75-80, which would violate the independence of observations
independent.
r condition necessary for the CLT.
(II), because she only has r
data from a small e
c
proportion of the whole t
population.
(IV), because there is some
skew in the sample
distribution.
(III), because the sample
size is too small compared
to all Italians in the age
range 75-80.
1.00 /
Total
1.00
Question Explanation
• Recognize that the Central Limit Theorem (CLT) is about the distribution of point
estimates, and that given certain conditions, this distribution will be nearly normal. In
the case of the mean the CLT tells us that if (1a) the sample size is sufficiently large (n
≥ 30 or larger if the data are considerably skewed), or (1b) the population is known to
have a normal distribution, and (2) the observations in the sample are independent,
then the distribution of the sample mean will be nearly normal, centered at the true
population mean and with a standard error of σn√:
x¯∼N(mean=μ,SE=σn−−√)
When the population distribution is unknown, condition (1a) can be checked using a
histogram or some other visualization of the distribution of the observed data in the
sample. The larger the sample size (n), the less important the shape of the distribution
becomes, i.e. when n is very large the sampling distribution will be nearly normal
regardless of the shape of the population distribution.
• If the conditions necessary for the CLT to hold are not met, note this and do not go
forward with the analysis. (We will later learn about methods to use in these
situations.)
Question 12
SAT scores are distributed with a mean of 1,500 and a standard deviation of 300. You are
interested in estimating the average SAT score of first year students at your college. If you
would like to limit the margin of error of your 98% confidence interval to 40 points, at least
This question refers to the following learning objective(s): Calculate the required sample
size to obtain a given margin of error at a given confidence level by working backwards
from the given margin of error.
Question 13
The significance level in hypothesis testing is the probability of
Your Answer Score Explanation
failing to reject a true
null hypothesis
failing to reject a false
null hypothesis
rejecting an alternative
hypothesis
rejecting a null
hypothesis
rejecting a true null Note that the probability of making a Type 1 error is equivalent
hypothesis C1.00 to the significance level when the null hypothesis is true.
o
Total r 1.00 /
r 1.00
Question Explanation e
c
This question refers tot the following learning objective(s): Note that the probability of
making a Type 1 error is equivalent to the significance level when the null hypothesis is
true, and choose a significance level depending on the risks associated with Type 1 and
Type 2 errors.
large city on a random sample of thirty-six children who were identified as gifted children
soon after they reached the age of four. The following histogram shows the distribution of
the ages (in months) at which these children first counted to 10 successfully. Also provided
are some sample statistics. Suppose you read online that children first count to 10
successfully when they are 32 months old, on average. You perform a hypothesis test
evaluating whether the average age at which gifted children first count to 10 is different
than the general average of 32 months. What is the p-value of the hypothesis test? Choose
Your
Score Explanation
Answer
0.0344
0.9656
0.7183
0.9312
H0:μ=32,HA:μ≠32
0.0688 C1.00 Zp−value=30.69−324.3136√=−1.82=P(x¯1.82)=2×0.034
o 4=0.0688
r 1.00 /
Total r 1.00
e
Question Explanation
c
t
This question refers to the following learning objective(s): Calculate a p-value as the area
under the normal curve beyond the observed sample mean (either in one tail or both,
depending on the alternative hypothesis). Note that in doing so you can use a Z score,
where
Z=sample statistic−null valueSE=x¯−μ0SE
Always sketch the normal curve when calculating the p-value, and shade the appropriate
area(s) depending on whether the alternative hypothesis is one- or two-sided.