JB Ies 109 Exercises Answers
JB Ies 109 Exercises Answers
JB Ies 109 Exercises Answers
09)
Answers to Exercises
1 Introduction 1
2 Gathering Data 3
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.2 Populations and Samples, Parameters and Statistics . . . . . . . . . . . . 3
2.3 Types of Sampling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.3.1 Simple Random Sampling . . . . . . . . . . . . . . . . . . . . . . . 3
2.3.2 Other Types of Random Sampling . . . . . . . . . . . . . . . . . . 4
2.4 Experiments and Observational Studies . . . . . . . . . . . . . . . . . . . 4
2.5 Chapter Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3 Descriptive Statistics 11
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3.2 Plots for Qualitative and Quantitative Variables . . . . . . . . . . . . . . . 11
3.2.1 Plots for Qualitative Variables . . . . . . . . . . . . . . . . . . . . 11
3.2.2 Plots for Quantitative Variables . . . . . . . . . . . . . . . . . . . . 12
3.3 Numerical Measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.3.1 Summation Notation . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.3.2 Measures of Central Tendency . . . . . . . . . . . . . . . . . . . . . 12
3.3.3 Measures of Variability . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.3.4 Measures of Relative Standing . . . . . . . . . . . . . . . . . . . . 15
3.4 Boxplots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.5 Linear Transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.6 Chapter Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
3.6.1 Basic Calculations . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
3.6.2 Concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
3.6.3 Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
4 Probability 25
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.2 Basics of Probability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.2.1 Sample Spaces and Sample Points . . . . . . . . . . . . . . . . . . 25
i
Balka ISE 1.09 CONTENTS ii
7 Sampling Distributions 91
7.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
7.2 The Sampling Distribution of the Sample Mean . . . . . . . . . . . . . . . 93
7.3 The Central Limit Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 94
7.4 Some Terminology Regarding Sampling Distributions . . . . . . . . . . . . 95
7.4.1 Standard Errors . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
7.4.2 Unbiased Estimators . . . . . . . . . . . . . . . . . . . . . . . . . . 95
7.5 Chapter Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
7.6 Basic Calculations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
7.7 Concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
7.8 Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
7.8.1 Extra Practice Questions . . . . . . . . . . . . . . . . . . . . . . . 102
Introduction
(No exercises.)
1
Balka ISE 1.09 2
Chapter 2
Gathering Data
J.B.’s strongly suggested exercises: 1, 2, 3, 4, 5, 6, 8, 9, 12, 15, 17, 18, 21, 24, 29, 31, 32
2.1 Introduction
3
Balka ISE 1.09 2.4. EXPERIMENTS AND OBSERVATIONAL STUDIES 4
4. (a) True. Each possible sample of size 10 had the same chance of being selected,
and the sample obtained is thus a simple random sample.
(b) True.
(c) True.
(d) True (since we are sampling 10 from a population of 50).
5. Statements a) and b) are true, and c) is false. Sample I is a simple random sample.
Sample II is a stratifed random sample. Both sampling designs are reasonable.
6. (a) It is an experiment.
(b) The 20 volunteers.
(c) The procedures (A and B).
(d) Time until death.
(e) Yes, since it is a randomized experiment.
7. (a) It is an experiment.
(b) The types of corn.
(c) The yield of the corn.
(d) The plots of land.
8. No, it would not be reasonable. It’s an observational study, and as such does not
provide strong evidence of a causal link. There may appear to be an association, but
this relationship could easily be caused by other variables. For example, perhaps
uninvolved parents are more likely to have children that watch lots of TV, and more
likely to have children that are arrested.
Balka ISE 1.09 2.5. CHAPTER EXERCISES 5
10. (a) All lizards of these species in this region of Inner Mongolia.
(b) No, this is not simple random sampling. Only lizards the researchers could
catch were included in the sample.
(c) The researchers would have an easier time catching certain types of lizard, and
thus certain types of lizard would tend to be overrepresented in the sample.
(Perhaps larger lizards are easier to catch, or smaller lizards are easier to
catch.) It is hard to say how much of a problem this might be, but there are
defnitely possible biases in this study. But there is no better alternative—the
researchers simply have to catch the lizards by hand.
11. (a) If patients knew they were receiving vitamin C rather than the placebo, they
may be more confdent that it is helping to reduce the duration and severity
of colds. This may show up in their reporting of the results, even if there is no
real e˙ect. If the researchers knew which treatment the patient was receiving,
they may be tempted (consciously or subconsciously) to tweak the results to
refect what they want to show.
(b) This may or may not be a problem, depending on the reasons people had for
dropping out of the study. The dropouts will not cause any bias if they are
dropping out for reasons unrelated to the study. (For example, if they move out
of town or die in an unrelated accident.) If they drop out for reasons related
to the study then this may very well cause substantial bias. (For example, if
they drop out because the vitamin C is making them too sick from colds to
report the results.) In the original study, the researchers followed up with the
dropouts, and found that most were dropping out because they were getting
bored of the study and did not want to continue.
variable.
(c) This will not cause any bias as long as the reason for the rat catching his foot
on the treadmill is unrelated to the study variables. In the absence of further
information, this is likely the case, so the results of the study are still valid.
But if the reason for the rat catching his foot was related to the study variables,
then this may result in substantial bias. (For example, if the rat caught his foot
on the treadmill because of a lack of mobility due to a massive tumour, then
omitting this rat would be problematic and may lead to misleading results.)
13. (a) Since only the numbers 1 through 7 count, the sample is made up of the
numbers 7, 1, and 3. 98797 18334. This corresponds to the countries Panama,
Belize, El Salvador.
(b) In this case, yes. But only because none of these country names share the
same frst letter.
14. The alphabetical list (c) would provide the sample that has the least bias. (But
this is open to debate.)
• Only students that showed up early to the lecture following a midterm were
included in the sample. Students that show up to the lecture after a midterm
may very well di˙er from the rest of the class in a fundamental way.
• The lecture was on a Friday morning, and students attending Friday morning
classes may di˙er from the rest of the class in a fundamental way.
• What students say their mark is on a test may very well di˙er from their mark
on the test. (Students might have a tendency say they got a higher mark than
they did.)
• The lecture was on a Friday morning, and students attending Friday morning
classes may di˙er from the rest of the class in a fundamental way.
• What students say their parents’ income is may be quite di˙erent from what
their parents’ income actually is.
While there are possible problems with sample c), they are likely not as severe as for
the other two sampling designs. There is possibly a relationship between the frst
letter of the last name and the proportion of students that are male. For example,
certain cultures may send males to university more often than females, and have
Balka ISE 1.09 2.5. CHAPTER EXERCISES 7
last names that tend to start with a letter near the beginning of the alphabet. But
these issues are likely not as severe as for the frst 2 sampling designs.
17. No, they do not represent a simple random sample. Values near the middle are more
likely than those on the ends, and thus middle sized sunfowers are more likely to
be selected than small or large sunfowers.
18. (a) False. Clearly the survey gives some information. As it relates to the opinions
of members of this organization, it would yield valuable information.
(b) False. This type of delay would not be a big problem in this situation.
(c) False. A sample of 912 is a fairly large sample, and can provide valuable
information.
(d) True. The population of interest is Americans in general, but only the opinions
of the members of this pro-gun organization are reported.
20. (a) False. The sampling design was not stated, but it is highly unlikely that the
sample represents a simple random sample.
(b) False. The wording is pretty reasonable, and would not cause much bias.
(c) False. The value 72% came from a sample, and is thus a statistic.
(d) False. A sample of size 1500 is quite a large sample for most practical purposes.
Balka ISE 1.09 2.5. CHAPTER EXERCISES 8
21. (a) It is an experiment (the researchers imposed a condition (the aspirin) on some
of the patients).
(b) Whether or not the patient had a heart attack.
(c) Whether or not the patient received aspirin.
(d) No. It is sometimes desirable, but not necessary.
(e) False. This is not what confounding means.
22. (a) No, the conclusions would not be as strong. Here we are discussing an ob-
servational study, and thus we could not reach conclusions of a causal nature.
The experiment in Question 21 could give strong evidence of a causal link.
(b) Physicians might not want to respond if they’ve had a heart attack. Or perhaps
they would be more likely to respond if they’ve had a heart attack, as they may
wish to contribute to research of this nature. (We can suspect possible biases
without knowing the direction or magnitude.) And, of course, any physician
who previously died of a heart attack would be unable to respond.
23. (a) False, there are 2 treatments groups (shock/no shock). (What we might call
a treatment group and a control group.)
(b) False, it is an experiment.
(c) False. Experiments can give strong evidence of a causal e˙ect.
(d) False. The response variable is the number of matching pairs.
25. (a) 33696 91544 76248. The frst 3 numbers selected are 3, 6, and 9. So Jerry,
Bo, and Eddie go to the control group. The second set of 3 numbers is 1, 5,
4, so Pete, Renaldo, and Alphonse go to the cheese group. The others (2, 7, 8
– Tom, Little Pete, and Huey) go to the electric shock group.
(b) This may introduce bias. You might want to save Little Pete because he is
too weak and frail, or a variety of other reasons that might introduce bias.
26. There appears to be a relationship, but we cannot say it is due to the yoga. Perhaps
women who practice yoga take care of themselves and are simply healthier overall.
Based on this type of survey, we have little evidence that yoga causes improved
Balka ISE 1.09 2.5. CHAPTER EXERCISES 9
health.
27. a: An unmeasured variable that is related to both the response and explanatory
variables.
29. Only (a) is true. This observational study does not give strong evidence of a causal
link. There may be a causal link, but this study certainly doesn’t show it. Note that
those who are more likely to get into an accident may be more likely to purchase
this service.
32. (a) The experimental surgery. The results from the experiment are more reliable
(less subject to bias).
(b) The medical condition of the dog. Owners of dogs that are in very poor shape
may be more willing to gamble on the experimental surgery.
Balka ISE 1.09 2.5. CHAPTER EXERCISES 10
Chapter 3
Descriptive Statistics
J.B.’s strongly suggested exercises: 1, 4, 7, 8, 9, 10, 11, 16, 17, 20, 21, 22, 24, 25, 36, 46,
49
3.1 Introduction
1. (a) Most victims (67%) were acquaintances of the male perpetrator. Slightly less
than one quarter (24%) were the male murderer’s intimate partner. Small
percentages of the victims were family members (4%) or strangers (4%).
(b) Compared to the victims of female murderers, victims of male murderers are
more likely to be acquaintances and less likely to be intimate partners or family
members.
(c) No, of course not. This is sample data, and for sample data there is always
the chance that what we observe is simply due to chance and not a real e˙ect.
Later on, we will conduct a χ2 test on this data, in order to assess how much
evidence there is of a real e˙ect.
(d) The proportions in each category are:
• Acquaintances: 61 91 ≈ .670
• Partners: 2291 ≈ .242
• Family: 914
≈ .044
• Stranger: 91 ≈ .044
4
11
Balka ISE 1.09 3.3. NUMERICAL MEASURES 12
These proportions make up the areas of the sections of the pie chart. The pie
chart for this data is illustrated in Figure 3.1.
Acquaintance
Stranger
Family
Partner
P 1 + 5 + 2 + (−3)
(d) The extreme value is 987. When this value is removed, nxi = =
4
1.25.
(e) The ordered values are: −3, 1, 2, 5. The median is the average of the two
middle values: 1+2
2 = 1.5.
4. (a) x̄ = 2870+2620+3120+3620
4 = 3057.5.
(b) The ordered values are: 2620, 2870, 3120, 3620. The median is the average of
2870 + 3120
the two middle values: = 2995.
2
(c) Grams (the same units as the variable).
(d) Grams (the same units as the variable).
(e) Either one is perfectly acceptable, and it’s fne to report both. (There are no
extreme values that might make the mean a misleading measure of centre.)
5. The trimmed mean is not as infuenced by extreme values as the untrimmed mean.
This can be advantageous in certain situations, as we don’t want a single value to
have undue infuence on our statistics and conclusions. A disadvantage is that the
trimmed mean omits some possibly valuable information.
6. (a) 6.5, −3.5, −8.5, 5.5. The deviation for the ith observation is xi − x̄, where
x̄ = 11.5. x1 − x̄ = 18 − 11.5 = 6.5, x2 − x̄ = 8 − 11.5 = −3.5, x3 − x̄ =
3 − 11.5 = −8.5, x4 − x̄ = 17 − 11.5 = 5.5.
(b) 0. The deviations always sum to 0.
9. (a) Any 4 values that are equal will work (e.g. 0,0,0,0, or 75, 75, 75, 75).
(b) If the 4 values are equal, s = 0.
10. (a) True. (The standard deviation will be greater than the variance if 0 < s < 1.)
(b) False. (s ≥ 0.)
(c) True. (The mean is a measure of location, but the standard deviation is a
measure of variability. There is no general relationship between them.)
(d) True. (Q3 is a measure of position, but the standard deviation is a measure
of variability. There is no general relationship between them.)
(e) False. (The standard deviation cannot be less than the average distance from
the mean.)
11. (a) Yes (the empirical rule would apply), since the distribution is roughly mound-
shaped.
(b) Approximately 68% of the observations would lie within 7.2 of 22.4 (within 1
standard deviation of the mean).
(c) Approximately 95% of the observations would lie within 14.4 of 22.4 (within
2 standard deviations of the mean).
(d) All or almost all of the observations would lie within 21.6 of 22.4 (within
3 standard deviations of the mean). (At most a very small proportion of
observations would fall outside of that range.) For this particular data set, no
observations fall outside of that range of values.
12. (a) Yes, Chebyshev’s theorem would apply. Chebyshev’s theorem applies to every
data set.
(b) Nothing. Chebyshev’s theorem is only informative if k > 1.
(c) At least 75% of the observations would lie within 14.4 of 22.4 (within 2 stan-
dard deviations of the mean).
(d) At least 88.9% of the observations would lie within 21.6 of 22.4 (within 3
standard deviations of the mean).
13. (a) The empirical rule would not oÿcially apply to this data set, since we would
not call this a mound-shaped distribution (it’s skewed to the right). However,
the empirical rule still works pretty well here. For the data given in the plot,
Balka ISE 1.09 3.3. NUMERICAL MEASURES 15
69.6% of the observations fall within 1 standard deviation of the mean, 95.2%
of the observations fall within 2 standard deviations of the mean, and 99.1%
of the observations fall within 3 standard deviations of the mean. So the
empirical rule can still be informative, even if there is some skewness.
(b) Yes, of course Chebyshev’s theorem would apply, since Chebyshev’s theorem
applies to every data set.
15. 86. Since we do not know the population mean µ, we will need to estimate it with
x̄, resulting in a loss of 1 degree of freedom. In this type of scenario, the degrees of
freedom are n − 1.
3.3.4.1 Z-scores
3.3.4.2 Percentiles
18. (a) 691. The ordered weights of the 8 boxes of cereal are:
19. 171 cm. (171 is the only plausible value. 143 cm and less are far too small, and
200 cm is far too big.) ? estimate the 90th percentile of women 20 years and over
in the United States to be 170.9 cm.
3.4 Boxplots
23. All of the measures of location or position (mean, median, Q3 ) would change. None
of the measures of variability (IQR, variance, standard deviation) would change.
Balka ISE 1.09 3.6. CHAPTER EXERCISES 17
28. (a) The values are −1.7, −1.1, −1.1, −1.0, −.7, −.6, −.6, −.3, −.2, 0.0, 0.3, 0.3, 0.3, 0.5, 1.0, 1.1, 1.3, 1.8, 1.9, 2
They result in a mean of x̄ = 0.18.
(b) Median = 0.15.
(c) s2 = 1.264842.
(d) s = 1.124652.
(e) Range = 2.4 − (−1.7) = 4.1.
29. (a) Yes, the mean is greater than the median. (The large values in the right tail
a˙ect the mean more than the median.)
(b) Yes, the distribution shows some right skewness (it is stretched out to the
right).
(c) The ffth value in the fve-number summary is the maximum. The frst value
in the fve number summary is the minimum. Max − Min = 5.8 − 0.0 = 5.8.
30. (a) The values are −4.9, −4.7, −4.5, −3.9, −3.9, −3.4, −3.1, −3, −1.6, 1.1, 2.1, 2.3, 3.2, 3.4, 3.9.
These result in a mean of x̄ = −1.133333.
(b) Median = −3.
(c) Since we would in e˙ect be multiplying every value by 10 1
, every one of the
listed statistics would change. All but the variance would change by a factor
of 10
1
. The variance would change by a factor of ( 10
1 2 1
) = 100 .
3.6.2 Concepts
32. (a) Since the deviations must sum to 0 ( (xi − x̄) = 0), the ffth deviation must
P
be −1.9.
(b) This is impossible to determine; the mean could have been any value.
(xi −x̄)2 (−2.3)2 +3.72 +1.22 +(−0.7)2 +(−1.9)2
P
(c) s = 2.475884. s2 = n−1 = 5−1 = 6.13. s =
√
6.13 = 2.475884.
Balka ISE 1.09 3.6. CHAPTER EXERCISES 19
33. The data set might be 12, 12, 12, 12, 22 or 134, 134, 134, 134, 144, or anything else of
this form. The variance of any data set of this form is the same: s2 = 20.
34. (a) True, the median is defnitely less than 4. (It would be close to 1.)
(b) False. The distribution is skewed to the right, and the mean would be greater
than the median.
(c) False. (Very few observations in this data set are greater than 5, but 25% of
the values are greater than Q3 .)
(d) True.
(e) False. The distribution is right-skewed.
(f) False. For a right-skewed distribution such as this one, Q3 would lie farther
above the median than Q1 lies below it.
(g) This is debatable. There are some large values that could be considered out-
liers.
36. (a) The mean, variance, and standard deviation would change (they would in-
crease). The median would not change, nor would Q1 , Q3 or the IQR.
(b) The mean, median, and IQR would not change. The standard deviation and
variance would increase.
37. (a) True. If the standard deviation is 0, all the values in the data set must be
equal. This implies all the values in the 5 number summary are equal.
(b) True, since all the deviations are 0.
(c) True. (The standard deviation and variance are both equal to 0.)
(d) True. (The mean and median are both equal to 0.)
(e) True. In the new data set, there would be 99 values that are equal and 1 that
is bigger. The IQR, minimum, and median would still be 0.
38. s = 0. If the standard deviation of the 4 number data set is 0, that means all values
must equal 50. Since the added value is also 50, then all numbers in the data set
are equal and s = 0.
39. (a) False. The standard deviation can be greater or less than the median—they
are measuring completely di˙erent things.
(b) False. The variance is usually greater than the standard deviation, but not if
0 ≤ s2 ≤ 1.
(c) False. The mean can be greater or less than the standard deviation—they are
measuring completely di˙erent things.
Balka ISE 1.09 3.6. CHAPTER EXERCISES 20
(d) True. The standard deviation is a little greater than the average distance from
the mean, which has a maximum of 1 in this case. The standard deviation of
the sample (0, 2) for example, is s = 1.414214.
(e) True. The median is usually less than Q3 , but they can be equal if there are re-
peated values in the data set. For example, if the data set is 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 12, 24,
then the median and Q3 will both equal 0.
40. (a) True. It would not be possible in this case for the distribution to the “stretched
out” to the left.
(b) True. The units of the variance are the square of the units of the variable.
(c) False. (By the same logic that tells us that Q1 is not in general exactly half
of the median.)
(d) False. Any of the values in the fve-number summary can be negative. (Recall
that the fve-number summary is: Minimum, Q1 , Median, Q3 , Maximum.)
(e) False. Outliers increase both the standard deviation and variance (and some-
times increase them a great deal).
41. (a) False. If the distribution is right-skewed, the mean will be greater than the
median.
(b) False. These two statistics are measuring completely di˙erent things. Sym-
metry would not make them equal.
(c) False. All or almost all observations will fall within 5 standard deviations of
the mean.
(d) False.
(e) False. Although values less than Q1 would usually have z-scores that are neg-
ative, it is not impossible for a value less than Q1 to be greater than the mean.
(e.g. consider the data set: −32847193487138947, 1, 1, 2, 3, 3, 8, 48, 52, 113.)
(f) True. If the range is 0, then all values in the data set must be equal. This
implies the variance and IQR will also equal 0.
42. (a) The median will equal 15, regardless of what those 2 values are.
(b) It is impossible to determine the value of the mean.
(c) It is impossible to determine the value of the variance.
43. (a) If we let x represent the original variable, and x∗ represent the transformed
variable, then we need to fnd a and b where x∗ = a + bx. The standard
deviation changed from 5 to 1, and thus the multiplicative constant must be
either b = 15 or b = − 15 . We can now solve for a in either case. The two possible
transformations that satisfy the conditions are x∗ = 12 + 15 x and x∗ = 18 − 15 x.
(b) Carrying out the transformation x∗ = 12 + 15 x, x∗ = 12 + 51 10 = 14.
44. The possible values are 52 and 352. Three consecutive digits have a sample standard
deviation of 1 (e.g. s = 1 for the values 5, 6, 7). Three values that are increasing by
Balka ISE 1.09 3.6. CHAPTER EXERCISES 21
100 (e.g. 500, 600, 700) will have a standard deviation of s = 100. So the missing
value could be either 52 or 352. (Verify this on your calculator or computer.)
3.6.3 Applications
(f) While the distribution shows slight left skewness, it is roughly mound-shaped,
so the empirical rule provides a bit of guidance. The standard deviation likely
falls in or close to the interval Range
6 to Range
4 . The range is somewhere between
30 and 35 kgf, so let’s estimate the range to be 33. The standard deviation is
likely in or near the interval 33 6 = 5.5 to 4 = 8.25. (Based on the raw data
33
49. (a) The distribution looks roughly symmetric, with one very large outlier at ap-
proximately 120%.
(b) The outlier looks to be approximately 85 units above the mean of the remain-
ing data. Removing the outlier would result in a decrease in the mean of
approximately 8523 . The estimated mean without the outlier is approximately
36.6 − 85
23 = 32.90. (If we return to the raw data and remove the outlier,
x̄ = 32.89.)
(c) We should be extremely reluctant to remove a real observation. When possi-
ble, we should check to make sure that an extreme observation was correctly
recorded and was not in error. If the value is correct, then we should never
simply ignore it. We should be more inclined to use measures that are not
as infuenced by extreme values (such as reporting the median instead of the
mean). Reporting the results both with and without the outlier is another
option.
(d) It depends on the reason why the machine did not record the break. If it was
simply a random occurrence that had nothing to do with the elongation of
the cord, then this is not a problem. If the reason for the malfunction was
related to the elongation, then this might be a big problem. For example, if
the machine tends to fail to record the break when the elongation percentage
is very large, then ignoring these data points might incorporate very strong
bias (our reported mean and median will be too small).
50. (a) From the boxplot, the median looks to be approximately 27 grams. The plots
do not show any major outliers or skewness, so the mean and median are
likely close in value (27 grams would also be a reasonable estimate of the
mean). (The mean and median of the raw data are both 27.2 grams.)
(b) It depends entirely on how this sample was drawn, but the sample is very
possibly strongly biased. In practice, it is impossible to get a simple random
sample of crayfsh from a lake. It is very likely that the method of capturing
the crayfsh introduced bias.
51. (a) Since this is just a linear transformation, we can carry out the transformation
on the mean. The new mean is x̄∗ = 47.5975−40
2.54 = 2.991142.
(b) Similarly for the median: 2.54 = 2.803150.
47.12−40
(c) Subtracting 40 will not change the measures of variability, and thus only the
division by 2.54 matters here. sx∗ = 2.541
× 1.212878 = 0.477511.
Balka ISE 1.09 3.6. CHAPTER EXERCISES 23
52. A linear transformation is required: x̄∗ = 2.20(x + 2) = 4.4 + 2.2x. The mean in
pounds is x̄∗ = 4.4 + 2.2 × 15.10 = 37.62. The additive constant does not a˙ect the
standard deviation, and the new standard deviation is sx∗ = 2.2 × 1.70 = 3.74.
Balka ISE 1.09 3.6. CHAPTER EXERCISES 24
Chapter 4
Probability
J.B.’s strongly suggested exercises: 2, 4, 5, 6, 7, 8, 14, 16, 17, 21, 30, 35, 39, 43, 49, 57,
61
4.1 Introduction
1. (a) Depending on our needs, we might defne the sample space in di˙erent ways.
One possibility is the set of all 52 × 51 = 2652 orderings of two cards: S =
{(2c, 3c), (2c, 4c), . . . , (As, Ks)}.
Another possibility is the set of all pairs of cards, without regard to order.
(There are 1326 possible pairs of cards, if order does not matter.)
(b) Under the frst defnition of the sample space given in 1a there are 52 × 51 =
2652 sample points: (2c, 3c), (2c, 4c), . . . , (As, Ks).
(c) Yes, any two card ordering has the same chance occurring.
(d) 4 × 3 = 12 of the sample points are a pair of twos. P (Pair of twos) = 2652
12
=
1
221 .
25
Balka ISE 1.09 4.3. RULES OF PROBABILITY 26
Complementary Events
A B
0.30
(c) The ABO blood groups are mutually exclusive (a person is classifed into one
and only one of A, B, AB, or O), so the following events are mutually exclusive:
(A, B), (A, C), (A, D), (B, C), (B, D), (C, D).
(d) (A∩E c )∪(D∩E c ), or (A∪D)∩E c . P ((A∩E c )∪(D∩E c )) = 0.06+0.07 = 0.13.
Conditional Probability
Independent Events
7. (a) P (B|C) = 1. If we are given that the number on the top face is a 6 (event
C), we know with certainty it is an even number (event B). Alternatively, we
could use the conditional probability formula:
P (B ∩ C) 1/6
P (B|C) = = =1
P (C) 1/6
Balka ISE 1.09 4.3. RULES OF PROBABILITY 28
P (B ∩ C) 1/6 1
P (C|B) = = =
P (B) 3/6 3
P (B c ∩ C) 0
P (C|B c ) = c
= =0
P (B ) 3/6
8. (a)
P (Both cards are hearts) = P (First card is a heart) × P (Second card is a heart | First card is a heart)
13 12
= ×
52 51
156 1
= =
2652 17
Balka ISE 1.09 4.4. EXAMPLES 29
(b)
P (Both cards are fves) = P (First card is a 5) × P (Second card is a 5 | First card is a 5)
4 3
= ×
52 51
12 1
= =
2652 221
(c) 13
52 × 12
51 × 11
50 × 10
49 = 17160
6497400 ≈ 0.00264.
9. (a)
P (Both dice are fours) = P (First die is a four) × P (Second die is a four | First die is a four)
1 1
= × (The rolls are independent.)
6 6
1
=
36
(b) Since the rolls are independent, these two events are independent, and we can
simply multiply the two probabilities together:
3 2 6 1
P (Roll an even number, then a number greater than 4) = × = =
6 6 36 6
4.4 Examples
A B
0.12
0.10 0.20
0.04
0.16 0.08
0.12
0.18 C
12. (a) Since any one of these 1000 individuals is equally likely to be picked, this
probability is simply the number of Republicans over the total number of
individuals:
60 + 270
P (Republican) = = 0.330
1000
(b) This is the number of individuals that said they approve, over the total number
of individuals:
60 + 361 + 139
P (Approve) = = 0.560
1000
(c) The conditional probability formula can be used here, but we can also an-
swer this question quickly using a little bit of logic. We are given that the
individual is a Republican, which means we are dealing with only the 330
Republicans in the sample. Of the 330 Republicans, 60 approve, and so
P (Approve|Republican) = 330 60
. We could have used the conditional proba-
bility formula had we so desired:
13. (a) A and B are independent in Plot B. (We can’t determine independence with
certainty by just looking at the plot, without the information that A and B
are independent for one of the plots. It can only be plot B. This plot was
created such that P (A) = 0.1, P (B) = 0.2, and P (A ∩ B) = 0.02.)
(b) Plot A: P (A|B) < P (A).
Plot B: P (A|B) = P (A).
Plot C: P (A|B) > P (A).
Plot D: P (A|B) > P (A).
Balka ISE 1.09 4.5. BAYES’ THEOREM 31
14. The tree diagram for this scenario is illustrated in Figure 4.4.
Figure 4.4: The tree diagram for Question 14. E represents the event that the person is
carrying explosives and A represents the event that the alarm sounds.
(a) This is the sum of the probabilities of the two branches of the tree diagram
that lead to the alarm sounding. More formally:
P (A) = P (A ∩ E) + P (A ∩ E c )
= P (E)P (A|E) + P (E c )P (A|E c )
= 0.0001 · 0.92 + 0.9999 · 0.02
= 0.02009
(b) We are given that the alarm has sounded, so either the frst branch or third
branch of the tree diagram has occurred. What is the conditional probability
the frst branch has occurred? This is the probability of the frst branch,
divided by the sum of the probabilities of the frst and third branches. More
formally:
P (E ∩ A)
P (E|A) =
P (A)
P (E)P (A|E)
=
P (E)P (A|E) + P (E c )P (A|E c )
0.0001 · 0.92
=
0.0001 · 0.92 + 0.9999 · 0.02
= 0.00458
Balka ISE 1.09 4.5. BAYES’ THEOREM 32
Figure 4.5: The tree diagram for Question 15. C represents the event that the fight is a
Caribou Air fight, and L represents the event that the fight is late.
15. The tree diagram for this scenario is illustrated in Figure 4.5.
(a) This is the sum of the probabilities of the two branches of the tree diagram
that lead to a late fight. More formally:
P (L) = P (L ∩ C) + P (L ∩ C c )
= P (C)P (L|C) + P (C c )P (L|C c )
= 0.65 · 0.22 + 0.35 · 0.42
= 0.29
(b) We are given that the fight is late, so either the frst branch or third branch
of the tree diagram has occurred. What is the conditional probability that the
frst branch has occurred? This is the probability of the frst branch, divided
by the sum of the probabilities of the frst and third branches. More formally:
P (C ∩ L)
P (C|L) =
P (L)
P (C)P (L|C)
=
P (C)P (L|C) + P (C c )P (L|C c )
0.65 · 0.22
=
0.65 · 0.22 + 0.35 · 0.42
= 0.493
Balka ISE 1.09 4.6. COUNTING RULES: PERMUTATIONS AND COMBINATIONS 33
16. (a)
�52
2 = 1326. (The order in which the cards are received is unimportant, so
this is a combinations problem.)
(b) 525 = 2, 598, 960. (The order in which the cards are received is unimportant,
�
so there are 4 × 135 = 5148 possible fush hands. Since each of the 2598960
�
fve card hands are equally likely, the probability of getting dealt a fush is
2598960 ≈ 0.00198. (About once every 505 deals.)
5148
17. (a)
�200
4 = 64, 684, 950. In a simple random sample, the order of selection is not
important (we care only about which people have been selected).
(b) In a simple random sample every individual has the same chance of being
selected, so the probability any individual students gets chosen is 200
4
= 501
.
(We could also have used an argument based on combinations. There are 4
�200
18. (a) 25 = 53, 130. Here the order of selection is not important (we care only
�
5
which 5 people have been picked for the committee).
(b) P730 = 10, 260, 432, 000. Here the order is important (the drivers are being
assigned to 7 di˙erent routes, so among each group of 7 drivers, there are 7!
ways of assigning them to the routes).
A B
0.025
(a) Ac (b) B
(c) Ac ∪ B (d) Ac ∩ B
21. (a) P (A ∩ B) = 0.02. By the addition rule, 0.68 = 0.5 + 0.2 − P (A ∩ B), and this
implies P (A ∩ B) = 0.02. This scenario is illustrated in Figure 4.8.
Balka ISE 1.09 4.8. CHAPTER EXERCISES 35
A B
0.32
(b)
P ((A ∩ B) ∩ (A ∪ B))
P (A ∩ B|A ∪ B) =
P (A ∪ B)
P (A ∩ B)
= (See Figure 4.10)
P (A ∪ B)
0.2
=
0.4 + 0.7 − 0.2
2
=
9
Figure 4.9: Venn diagrams illustrating that A shares no outcomes with (B ∪ A)c .
25. (a) 36 = 12 .
(b) 61 .
(c) 0. If we are given the number is even, it cannot possibly be a 3.
(d) 23 . If we are given the number is even, we know it is a 2, 4, or 6. Two of these 3
equally likely possibilities are at least 4 (4 and 6), and so P (The number is at least 4|The number is a 2
3.
2
4.8.2 Concepts
26. (a) The odds in favour of rolling a 5 or a 6 are 1:2. (Reduced from 2:4 – odds are
usually expressed using the smallest whole numbers that have the appropriate
ratio.)
The odds against rolling a 5 or a 6 are 2:1.
(b) The odds in favour of drawing a jack are 1:12. The odds against drawing a
jack are 12:1.
(c) 5+3
5
= 85 = 0.625.
(d) 19+1
1
= 201
= 0.05.
28.
29. C (independent events are never mutually exclusive). If two events A and B are
mutually exclusive, then P (A ∩ B) = 0. If A and B are independent, P (A ∩ B) =
P (A)P (B). P (A)P (B) cannot be 0 unless one or more of the events have probability
0, so for mutually exclusive events P (A ∩ B) 6= P (A)P (B).
30. (a) If P (A) = 0.30 and P (B) = 0.30, then P (A ∩ B) = 0.09. False. P (A ∩ B)
would equal 0.09 if A and B are independent, but this statement is not true in
general.
Balka ISE 1.09 4.8. CHAPTER EXERCISES 38
(b) If P (A) = 0.30 and P (B) = 0.30, and P (A ∪ B) = 0.60, then A and B are
independent. False. The given probabilities imply P (A ∩ B) = 0, which does
not equal P (A) × P (B), and thus A and B are not independent.
(c) If P (A) = 0.50, and the probability that B occurs given A occurs is 0.50, then
P (A ∩ B) = 0.25. True. P (B|A) = P P(A∩B)(A) =
P (A∩B)
0.5 = 0.5. This implies
P (A ∩ B) = 0.5 × 0.5 = 0.25.
(d) If P (A) = 0, then A is independent of any event. True. If P (A) = 0, then
P (A|anything) = 0.
(e) If P (B|A) = 0.80, then P (B) > 0.80. False. The information P (B|A) = 0.80
implies only that 0 < P (A ∩ B) < 1 and 0 < P (B) < 1.
31. (a) If P (A) = 0.6, P (B) = 0.7, then 0.3 ≤ P (A ∩ B) ≤ 0.6. True. P (A ∩ B)
cannot be greater than the lesser of P (A) and P (B) (0.6 in this case). Also,
P (A ∪ B) ≤ 1 =⇒ P (A) + P (B) − P (A ∩ B) ≤ 1 =⇒ P (A ∩ B) ≥
P (A) + P (B) − 1. Here, P (A ∩ B) ≥ 0.3. Putting these inequalities together,
0.3 ≤ P (A ∩ B) ≤ 0.6. (Drawing a Venn diagram can help to visualize this.)
(b) If P (A) = 0.5, then for any event B: 0 ≤ P (A ∩ B) ≤ 0.5. True. With no
information about B we can say only that P (A∩B) ≥ 0 (since it’s a probability)
and P (A ∩ B) ≤ 0.5 (since the probability of the intersection cannot possibly
be greater than P (A).
(c) If P (A|B) = 0, then A and B are mutually exclusive. True. P (A|B) = 0 =⇒
P (A ∩ B) = 0 =⇒ A and B are mutually exclusive.
(d) Suppose that P (A) = 0.99. Event A and its complement are mutually exclu-
sive and independent. False. A and Ac are mutually exclusive, but are not
independent. P (A|Ac ) = 0 6= P (A).
(e) If P (A|B) 6= P (B) then A and B are not independent. False. Knowing
Balka ISE 1.09 4.8. CHAPTER EXERCISES 39
P (A|B) 6= P (B) gives us very little information about A and B or their prob-
abilities of occurring. Given only the information P (A|B) 6= P (B), we cannot
possibly tell if A and B are independent, .
32. No, they are not independent. They are mutually exclusive, so P (A ∩ B) = 0, but
P (A) × P (B) = 0.5 × 0.3 = 0.15 6= 0.
33. (a) A and B are independent. (Knowing the outcome of the frst toss gives us no
information about the outcome of the second toss.)
(b) A and B c are independent. (Knowing the outcome of the frst toss gives us no
information about the outcome of the second toss.)
(c) A and Ac are not independent. P (A|Ac ) = 0 = 6 P (A). If we know the coin
came up tails (A ), we know with certainty it did not come up heads (A) on
c
35. (a) P (A|B) < P (A). False. Knowing the person weighs more than 100 kg makes
if more likely they are male: P (A|B) > P (A). (The proportion of males that
weigh more than 100 kg is greater than that of females.)
(b) P (B|A) < P (B). False. Knowing the person is a male makes it more likely
they weigh more than 100 kg: P (B|A) > P (B). (The proportion of males that
weigh more than 100 kg is greater than that of females.)
(c) A and B are mutually exclusive. False. It is possible for a Canadian adult to
be both male and weight more than 100 kg.
(d) A and B are not independent. True, these are dependent events. (P (B|A) >
P (B)). For them to be independent events, the proportion of men that weigh
more than 100 kg would have to be exactly the same as the proportion of women
that weigh more than 100 kg.
36. (a) P (B|A) > P (B). A and B are not independent. Weather systems can be
large, and take several days to pass through. If it rains next Tuesday, it is
more likely to rain on the next the day than if there were bright sunshine on
Tuesday.
(b) P (B|A) = P (B). These events are independent. Knowing the weather next
Tuesday will tell us nothing about the weather 10 years from now.
(c) P (B|A) > P (B). These events are not independent. People who do not fnish
high school are more likely to go to prison.
37. (a) No, they are not mutually exclusive. If the frst coin comes up heads and the
second tails, they will both pass the part. P (A ∩ B) = 14 .
(b) Yes, they are independent. P (A) = 12 , P (B) = 12 , and P (A ∩ B) = 14 .
38. (a) The probability that Daniel passes the assignment is a tiny bit greater than
Balka ISE 1.09 4.8. CHAPTER EXERCISES 40
39. (a) Probabilities can be negative. False. Probabilities must lie between 0 and 1.
(b) For any two events A and B, P (A|B) > P (A). False. P (A|B) can be greater
than, less than, or equal to P (A).
(c) If A and B are independent, then P (A|B) = 0. False.
(d) If two cards are drawn from a standard deck, the probability both cards are
red is greater if the cards are drawn with replacement than if they were drawn
without replacement. True. 26 26 26 25
52 × 52 > 52 × 51 .
(e) If two events A and B are independent, then their complements (Ac and B c )
are also independent. True. This isn’t terribly diÿcult to prove, but it requires
a step that might more confusing than it’s worth. It may help to think of it
in these terms: Two events A and B are independent if knowing whether A
has occurred or A has not occurred does not change the probability of B. The
statement “A has occurred or A has not occurred” is equivalent to the statement
“Ac has not occurred or Ac has occurred.” If A and B are independent, so are
their complements.
(f) If A and B are mutually exclusive, then P (A|B) = 0. True. If A and B are
mutually exclusive, P (A ∩ B) = 0, which implies P (A|B) = P P(A∩B) 0
(B) = P (B) =
0.
40. (a) If P (A) = P (B), then A and B are independent. False. We cannot possibly tell
if A and B are independent based solely on the information that P (A) = P (B).
(b) If P (A) > P (B), then P (A|B) ≥ P (B|A). True. P (A|B) = P P(A∩B) (B) and
P (A∩B )
P (B|A) = P P(A∩B) P (A∩B)
(A) . If P (A) > P (B), then P (B ) > P (A) .
(c) If A and B are independent, P (A ∪ B) = P (A) + P (B). False.
(d) If A and B are mutually exclusive, P (A ∪ B) = P (A) + P (B). True. If A and
B are mutually exclusive, P (A ∩ B) = 0 and so P (A ∪ B) = P (A) + P (B) −
P (A ∩ B) = P(A) + P(B).
(e) If P (A|B) = 1, then P (B|A) = 1. False. Consider this counterexample: Roll a
Balka ISE 1.09 4.8. CHAPTER EXERCISES 41
single die once and let A represent rolling an even number, and let B represent
rolling a 6. Here, P (A|B) = 1 but P (B|A) = 13 . (If B is a subset of A, this
does not imply A is a subset of B.)
4.8.3 Applications
41. (a) P (At least 1 is mislabelled) = 1−P (None are mislabelled) = 1−0.754 ≈ 0.68.
(b) No, it’s not a reasonable assumption (for a variety of reasons). For example,
some restaurants would be more likely than others to serve mislabelled sushi,
and thus mislabelled sushi would likely be clumped together. If, say, the frst
3 pieces you get are mislabelled, there is a very good chance the fourth one
will also be mislabelled.
42. Let M represent the event the student is male, and J represent the event the
student has a job. A Venn diagram illustrating the counts in this situation is given
in Figure 4.13.
M J
15 5 20
10
(a) P (J c ) = 25
50 = 0.5.
(b) c
P (M ∩ J ) = 50 15
= 0.3.
(c) 5
P (M |J) = 25 = 0.2.
(d) P (M |M c ∩ J) = 0.
(e) No, in this question job status and gender are not independent. P (J) = 0.5
but P (J|M ) = 20 5
. Since 20
5
6 0.5, job status is not independent of gender (if
=
a student is randomly selected from this group of students).
45. Let G represent the event that the randomly selected student attended the Uni-
versity of Guelph, and F represent the event that the randomly selected student is
female.
16, 783
(a) P (G) = = 0.3214.
52, 217
27, 278
(b) P (F ) = = 0.5224.
52, 217
(c) P (G|F ) = 10,417
27,278 = 0.3819. (This is the number of female U of G students,
over the total number of female students.)
(d) P (F |G) = 10,417
16,783 = 0.6207. (This is the number of female University of Guelph
students, over the total number of U of G students.)
P (F ∩ (G ∪ Lc ))
(e) We require: P (F |G ∪ Lc ) =
P (G ∪ Lc )
P (G ∪ L ) is the probability that a randomly selected student attended the
c
University of Guelph and not Laurier (which means they attended Guelph or
Waterloo):
16, 783 + 24, 377 41, 160
P (G ∪ Lc ) = =
52, 217 52, 217
P (F ∩ (G ∪ Lc )) is the probability that a randomly selected student is a female
student at Guelph or Waterloo (not Laurier):
10, 417 + 10, 607 21, 024
P (F ∩ (G ∪ Lc )) = =
52, 217 52, 217
We can now calculate the fnal answer:
P (F ∩ (G ∪ Lc )) 21, 024/52, 217 21, 024
P (F |(G ∪ Lc )) = = = = 0.5108
P (G ∪ Lc ) 41, 160/52, 217 41, 160
(b) P (Man|Woman) = 0.
(c) No, these events are not independent. P (Good) = 40
65 ≈ 0.615, but P (Good|Man) =
30 ≈ 0.767. (Males were more likely than females to call the dining experience
23
good.)
(d) These events are mutually exclusive and not independent.
48. (a) 4
52 × 4
52 = 1
169 ≈ 0.006.
(b)
(c)
P (Exactly one is red) = P (First card is red and second is black) + P (First card is black and second i
26 26 26 26
= × + ×
52 52 52 52
= 0.5
(d)
(e) Drawing a king is independent of drawing a red card (there are 2 red kings and
2 black kings in the deck). The two draws are independent, since the cards
Balka ISE 1.09 4.8. CHAPTER EXERCISES 44
P (At least one king and at least one red card) = P (At least one king) × P (At least one red card)
= (1 − P (no kings)) × (1 − P (no red cards))
48 26
= (1 − ( )2 ) × (1 − ( )2 )
52 52
≈ 0.1109
49. (a) 4
52 × 3
51 = 1
221 ≈ 0.0045.
(b)
(c)
P (Exactly one is red) = P (First card is red and second is black) + P (First card is black and second i
26 26 26 26
= × + ×
52 51 52 51
≈ 0.5098
(d)
(e) (One way to approach this is to draw a tree diagram, with 4 branches for each
card: red king, black king, red non-king, black non-king.) The probability of
getting at least one king and at least one red card is: 52
2
× 51 2 26 24
51 + 52 × 51 + 52 ×
4 24 2 298
51 + 52 × 51 = 52×51 ≈ 0.1123680.
50. Let F represent the event that Mateo takes his Ferrari, and A represent the event
that he gets into an accident. P (F ) = P (2 heads) = 12 · 12 = 41 . The tree diagram
for this scenario is illustrated in Figure 4.14.
Balka ISE 1.09 4.8. CHAPTER EXERCISES 45
Figure 4.14: The tree diagram for Question 50. F represents the event that Mateo takes
his Ferrari and A represents the event that he gets into an accident.
(a) This is the sum of the probabilities of the two branches of the tree diagram
that lead to an accident. More formally:
P (A) = P (A ∩ F ) + P (A ∩ F c )
= P (F )P (A|F ) + P (F c )P (A|F c )
= 0.25 · 0.08 + 0.75 · 0.04
= 0.05
P (F ∩ A)
P (F |A) =
P (A)
P (F )P (A|F )
=
P (F )P (A|F ) + P (F c )P (A|F c )
0.25 · 0.08
=
0.25 · 0.08 + 0.75 · 0.04
= 0.40
53. (a) It often helps to illustrate this type of problem with a tree diagram, as in
Figure 4.15.
P (Respond “Yes”) = P (Heads) + P (Tails ∩ Stolen)
= 0.5 + 0.5 · 0.2
= 0.6
Y Respond “Yes”
0.2
Stolen?
H N
0.5 0.8 Respond “Yes”
Heads
or Tails?
T Y Respond “Yes”
0.5 0.2
Stolen?
N
0.8 Respond “No”
(b) There are 3 branches of the tree that lead to a “Yes” response. Combined,
these 3 branches yield us a probability of “Yes” of 0.60. In two of these 3
branches, the person has actually stolen from their employer.
P (Stolen ∩ Yes) 0.5 · 0.2 + 0.5 · 0.2 1
P (Stolen|Yes) = = =
P (Yes) 0.60 3
This is the ratio of the sum of the probabilities of the branches in which the
person has both stolen and responded “Yes”, to the probability that a person
gives a “Yes” response.
(c) This question is a little bit trickier. We know that:
P (Yes) = P (Heads) + P (Tails ∩ Stolen) = P (Heads) + P (Tails)P (Stolen)
(P (Tails ∩ Stolen) = P (Tails) × P (Stolen), since getting tails on the coin
toss is independent of whether the respondent has actually stolen from their
employer.) The probability of getting heads is assumed to be 0.50 (we’re
assuming a fair coin), and thus:
P (Yes) = 0.50 + 0.50 × P (Stolen)
Balka ISE 1.09 4.8. CHAPTER EXERCISES 47
Since the best estimate of P (Yes) is 0.67 (67% of respondents said yes), the
best estimate of P (Stolen) is 0.67−0.5
0.5 = 0.34.
Keep in mind that most people fnd it easier to use the tree diagram approach
for these types of questions.
Note that this probability is quite a bit bigger than 50%. Surprising?
(c) If we sample k mice, the probability that at least one has the disorder is:
P (At least one has the disorder) = 1−P (None have the disorder) = 1−0.999k
We need to fnd the minimum value of k such that 1 − 0.999k ≥ 0.50. This
implies we need the value of k such that 0.999k ≤ 0.50. We could use trial-
and-error, or we could fnd k a little more eÿciently using logarithms:
log(0.50)
0.999k ≤ 0.50 =⇒ k·log(0.999) ≤ log(0.50) =⇒ k ≥ =⇒ k ≥ 692.8.
log(0.999)
We would need to sample at least 693 mice to have at least a 50% chance of
fnding a mouse with the defect. Would you have guessed a value this large?
60. Let J represent the event that John goes to the party, and S represent the event
that Stephanie goes to the party. In the question we are given the conditional
probabilities P (J|S) = 0.94 and P (J|S c ) = 0.03.
P (J) = 0.38
=⇒ P (S)P (J|S) + P (S )P (J|S c ) = 0.38
c
(c) They are not mutually exclusive since they can both attend the party. They
are not independent; John is much more likely to go to the party if Stephanie
goes.
61. Most people fnd it helpful to draw a tree diagram for this type of problem (see
Figure 4.18).
Balka ISE 1.09 4.8. CHAPTER EXERCISES 49
Positive 0.99
test? -ve
D 0.01 P (D ∩ −ve) = 0.00001 · 0.01
0.00001
Defect?
Dc
+ve P (Dc ∩ +ve) = 0.99999 · 0.005
0.99999
Positive 0.005
test? -ve
0.995 P (Dc ∩ −ve) = 0.99999 · 0.995
(a)
(b)
Note that there is very little chance the person has the defect, even though
they tested positive.
J.B.’s strongly suggested exercises: 2, 5, 6, 7, 8, 10, 13, 14, 15, 19, 20, 22, 23, 25, 26, 28,
30, 48, 51, 53, 54, 57, 58, 62, 64, 66 (a-d).
5.1 Introduction
1. (a) Discrete.
(b) Continuous.
(c) Continuous.
(d) Discrete.
(e) Continuous.
(f) Discrete.
(g) Discrete.
51
Balka ISE 1.09 5.3. DISCRETE PROBABILITY DISTRIBUTIONS 52
13 39 39 13 1014 13
P (X = 1) = 52 × 51 + 52 × 51 = 2652 = 34
2652 = 17 .
13 12 156 1
P (X = 2) = × =
52 51
In summary, the probability distribution of X is:
x 0 1 2
19 13 1
p(x) 34 34 17
3. (a) Discrete.
(b) 0.1 (since the probabilities must sum to 1).
(c) The most likely value is 30.
(d) P (X < 32) = P (X = 10) + P (X = 20) + P (X = 30) = 0.2 + 0.2 + 0.5 = 0.9.
(e) P (X < 25|X < 35) = 0.40.9 ≈ 0.44.
4. (a) and (c) are valid discrete probability distributions, since for these distributions
the probabilities all lie between 0 and 1 and they sum to 1. (Note that some discrete
random variables can take on negative values and non-integer values.) (b) is not a
valid discrete probability distribution, since the probabilities do not sum to 1. (d)
is not a valid discrete probability distribution, since one of the probabilities listed
in the table is negative.
(f) σ = 3.172.
σ 2 = E[(X − µ)2 ] = (x − µ)2 p(x) = (−2.5 − 4.25)2 · 0.10 + (0 − 4.25)2 · 0.10 +
P
(2.5 − 2 2 2
√ 4.25) · 0.10 + (5.0 − 4.25) · 0.40 + (7.5 − 4.25) · 0.30 = 10.0625.
σ = 10.0625 = 3.172.
(a) µ =
P
xp(x) = 1 · 0.988 + 2 · 0.01187 + 3 · 0.000128 + 4 · 0.000002 = 1.012132.
(b) σ = 0.11069.
E(X 2 ) = x2 p(x) = 12 · 0.988 + 22 · 0.01187 + 32 · 0.000128 + 42 · 0.000002 =
P
1.036664.
√ σ 2 = E(X 2 ) − (E(X))2 = 1.036664 − 1.0121322 = 0.01225281.
σ = 0.01225281 = 0.11069.
(c) P (X ≥ 2) = 0.01187 + 0.000128 + 0.000002 = 0.012. Equivalently, P (X ≥
2) = 1 − P (X = 1) = 1 − 0.988 = 0.012.
0.01187
(d) P (X = 2|X ≥ 2) = = 0.989.
0.01187 + 0.000128 + 0.000002
0.000128
(e) P (X = 3|X ≥ 3) = = 0.985.
0.000128 + 0.000002
7. (a) Probabilities of impossible events can be negative. False. (Probabilities must
lie between 0 and 1.)
(b) If a random variable can take on an infnite number of possible values, then it
cannot be a discrete random variable. False. (Some discrete random variables
take on a countably infnite number of possible values.)
(c) The mean of a discrete random variable cannot be negative. False. (Some
discrete random variables take on negative values and have negative means.)
(d) The standard deviation of a discrete random variable cannot be negative. True.
(Standard deviations cannot be negative.)
(e) The mean of a discrete random variable cannot be greater than its standard
deviation. False. (The mean may be greater than, less than, or equal to the
standard deviation.)
9. Yes, they are independent. Since the Professor of Course B is simply assigning
grades at random, knowing a student’s grade in that course yields no information
about their grade in course A.
Balka ISE 1.09 5.4. THE BERNOULLI DISTRIBUTION 54
11. (a) Let X represent the number of eights that are drawn. Then X has the Bernoulli
distribution with p = 13
1
. That is:
1 x 12 1−x
P (X = x) = ( ) ( ) for x = 0, 1.
13 13
(b) µ = p = 13
1
.
q q
(c) σ = p(1 − p) = 13
1 12 12
p
· 13 = 169 ≈ 0.266.
12. (a) P (X = 3) = 15
� 3 12 = 0.250.
3 0.2 (1 − 0.2)
(b) Using the binomial probability mass function with n = 15 and p = 0.2:
P (X ≤ 3) = P (X = 0) + P (X = 1) + P (X = 2) + P (X = 3)
= 0.0352 + 0.1319 + 0.2309 + 0.2501
= 0.648
P (X = 3 ∩ X ≤ 4) P (X = 3)
(e) P (X = 3|X ≤ 4) = =
P (X ≤ 4) P (X ≤ 4)
P (X = 3) 0.2501
= = =
P (X = 0) + P (X = 1) + P (X = 2) + P (X = 3) + P (X = 4) 0.8358
0.2993.
(f) µX =pE(X) = np = 15 · 0.2 = 3.
√
(g) σ = np(1 − p) = 15 · 0.2 · 0.8 = 1.549.
13. Let X represent the number of newborns with major structural or genetic birth
defects. Then X has a binomial distribution with parameters n = 50 and p = 0.03.
(a) P (X = 2) = 50 2 48 = 0.2555.
�
2 0.03 (1 − 0.03)
(b)
P (X ≤ 2) = P (X = 0) + P (X = 1) + P (X = 2)
50 0 50 50 1 49 50
= 0.03 (1 − 0.03) + 0.03 (1 − 0.03) + 0.032 (1 − 0.03)48
0 1 2
= 0.2181 + 0.3372 + 0.2555
= 0.8108
(a) The variance of a binomial distribution is always greater than the mean. False.
(The variance (np(1 − p)) must be less than or equal to the mean (np).)
(b) A binomial random variable can be negative. False. (A binomial random
variable represents a count, and its value must be at least 0.)
(c) The mean of a binomial random variable can be negative. False.
(d) The variance of a binomial random variable can be negative. False.
(e) A binomial random variable is a discrete random variable. True.
(f) Every discrete random variable is a binomial random variable. False. (There
are many other discrete probability distributions.)
(g) A binomial random variable represents a count. True.
(h) The mean of a binomial random variable is np. True.
(i) The standard deviation of a binomial random variable is np(1 − p). True.
p
(j) For any given n, the variance of a binomial random variable is greatest when
p = 0.5. True, since p(1 − p) is greatest when p = 0.5.
Balka ISE 1.09 5.5. THE BINOMIAL DISTRIBUTION 56
15. (a) Not binomial. The number of hearts would not have a binomial distribution,
since the trials are not independent (the probability of getting a heart will
change as cards are removed from the deck).
(b) Binomial. The number of times heads comes up will have a binomial distri-
bution with n = 10 and p equal to the probability of heads on any individual
toss.
(c) Not binomial. The number of putts that are holed will not have a binomial
distribution. The number of trials is fxed, each trial is either a success or a
failure, and we are counting the number of successes, but the probability of
success is changing and the trials are not independent.
(d) Not binomial. The number of newborn babies would not have a binomial
distribution. The conditions of the binomial distribution are defnitely not
satisfed here (there is not even a fxed number of trials).
(e) Not binomial. The amount of money withdrawn would defnitely not have a
binomial distribution. (This random variable is not even a count of a number
of successes.)
16. (a) Binomial. All of the conditions of a binomial distribution are satisfed here.
(b) Not binomial. While this random variable is a count (that almost always
takes on the value 10), it does not represent the number of successes in a fxed
number of trials.
(c) Binomial. The probability of rolling a sum of 7 on a pair of dice is 16 , so here
the number of sevens would have a binomial distribution with n = 14 and
p = 16 .
(d) Not binomial. While the number of children the woman has is a count, it does
not represent the number of successes in a fxed number of trials.
(e) Not binomial. While the number of times the person uses the washroom is
a count, it does not represent the number of successes in a fxed number of
trials.
(f) Not binomial. The weight is not a count.
17. (a) 0.2499. If they are randomly pulling cards, the number of times the king
of spades is drawn has a binomial distribution with n = 50 and p = 52 1
.
P (X ≥ 2) = 1 − P (X = 0) − P (X = 1) = 0.2499.
(b) No. This experiment yields no evidence that they have ESP, but we cannot
say there is strong evidence they do not. There is, however, strong evidence
against their claimed probability of drawing the king of spades. If their claimed
Balka ISE 1.09 5.6. THE HYPERGEOMETRIC DISTRIBUTION 57
value of p = 0.2 is true, the probability they draw the king of spades 2 or fewer
times in 50 attempts is only 0.00088.
18. Let X represent the number of white balls drawn. Then X has a hypergeometric
distribution with a = 22, N = 40, and n = 6.
�22�18
(a) P (X = 4) = 4
�402 = 0.2916.
6 �22�18 �22�18
(b) P (X ≤ 1) = P (X = 0) + P (X = 1) = 0
�40 6
+ 1
�405 = 0.00484 +
6 6
0.04911 = 0.05395.
22
(c) The mean number of white balls is 6 · = 3.3. The mean number of red balls
40
18
is 6 · = 2.7.
40
19. Here we are sampling without replacement (each person can be hired only once).
Let X represent the number of hires that have a criminal record. Then X has a
hypergeometric distribution with a = 7, N = 20, and n = 5.
(a) Here we must draw 3 criminals (from 7 total) and 2 non-criminals (from 13
total): �7�13
3
P (X = 3) = �202 = 0.1761.
5
(b) Here we must draw 2 criminals (from 7 total) and 3 non-criminals (from 13
total): �7�13
2
P (X = 2) = �203 = 0.3874.
5 �7�13
(c) P (X ≥ 1) = 1 − P (X = 0) = 1 − 0�205 = 1 − 0.0830 = 0.9170.
5
(d) µ = n · a
N =5· 7
20 = 1.75.
20. (a) The number of males in the sample has a hypergeometric distribution. The
binomial distribution can be used to approximate the hypergeometric, but
only if the sample size is a small proportion of the population size (less than
5% or so). Here the sample size is too large (we’re drawing 8 from 20), so the
binomial distribution would not provide a reasonable approximation. (Using
the hypergeometric distribution: P (X = 4) = 0.3501. Using the binomial
approximation: P (X = 4) ≈ 0.2734.)
Balka ISE 1.09 5.7. THE POISSON DISTRIBUTION 58
(b) The number of white balls follows a hypergeometric distribution, but since
we are drawing only a small proportion of the population (12 from 300), the
binomial distribution would provide a reasonable approximation. (Using the
hypergeometric distribution: P (X = 3) = 0.2141. Using the binomial approx-
imation: P (X = 4) ≈ 0.2120.)
(c) Here the number of kings has a binomial distribution with n = 60 and p = 13
1
,
so no approximation is required. P (X = 7) = 0.088.
5.7.1 Introduction
52 e−5
21. (a) P (X = 2) = 2! = 0.0842.
(b) 0.1247.
P (X ≤ 2) = P (X = 0) + P (X = 1) + P (X = 2)
50 e−5 51 e−5 52 e−5
= + +
0! 1! 2!
= 0.0067 + 0.0337 + 0.0842
= 0.1247
41 e−4
22. (a) P (X = 1) = = 0.07326. (The number of shark attacks has a Poisson
1!
distribution with λ = 4.)
(b) P (X ≤ 2) = P (X = 0) + P (X = 1) + P (X = 2) = 0.2381. (Calculated with
the Poisson probability mass function with λ = 4.)
84 e−8
(c) P (X = 4) = = 0.05725. (The number of shark attacks in a two-year
4!
period has a Poisson distribution with λ = 2 × 4 = 8.)
(a) A Poisson random variable can take on a countably infnite number of possible
values. True. (A Poisson random variable takes on the values 0, 1, 2, ...)
Balka ISE 1.09 5.7. THE POISSON DISTRIBUTION 59
24. Situation (d), where n = 500 and p = 0.01. The Poisson approximation to the
binomial works best when n is large and p is small.
P (X ≤ 2) = P (X = 0) + P (X = 1) + P (X = 2)
100 0 100 100 1 99 100
= 0.01 · 0.99 + 0.01 · 0.99 + 0.012 · 0.9998
0 1 2
= 0.36603 + 0.36973 + 0.18486
= 0.92063
P (X ≤ 2) = P (X = 0) + P (X = 1) + P (X = 2)
10 e−1 11 e−1 12 e−1
= + +
0! 1! 2!
= 0.36788 + 0.36788 + 0.18394
= 0.91970
26. The number of students would not follow a Poisson distribution, since students do
not arrive randomly and independently. Students are likely to travel in groups, and
thus there would be "clumping" of students. It is likely that there would also be
time e˙ects (for example, the theoretical rate at which students are arriving might
be very di˙erent at 12:01 and 12:29).
27. The number of cows in a randomly selected 100 m2 area of the pasture would not
follow a Poisson distribution, since the cows would not be distributed randomly and
independently (cows tend to stick together).
28. Let X be the number of donors required to get the frst one with type AB blood.
P (X ≤ 3) = P (X = 1) + P (X = 2) + P (X = 3)
= 0.970 · 0.03 + 0.971 · 0.03 + 0.972 · 0.03
= 0.0873
P (X ≤ x) = 1 − (1 − p)x
=⇒ P (X ≤ 3) = 1 − (1 − 0.03)3
= 0.0873
(c) 0.401. This could be answered using the brute force method:
P (X > 30) = 1 − P (X ≤ 30) = 1 − [P (X = 1) + P (X = 2) + . . . + P (X = 30)].
However, this would require calculating 30 geometric probabilities which would
be very cumbersome without using software. It is easier to answer this question
if we recognize that the frst success will come after the 30th person if the frst
30 people do not have blood type AB. So P (X > 30) = 0.9730 = 0.401.
(d) µ = p1 = 0.031
= 33.3̇.
q q
(e) σ = 1−p p 2 = 1−0.03
0.032
= 32.83.
29. Let X represent the required number of cups to get the frst winner.
Balka ISE 1.09 5.9. THE NEGATIVE BINOMIAL DISTRIBUTION 61
P (X ≤ 4) = P (X = 1) + P (X = 2) + P (X = 3) + P (X = 4)
5 1 5 1 5 1 5 1
= ( )0 ( ) + ( )1 ( ) + ( )2 ( ) + ( )3 ( )
6 6 6 6 6 6 6 6
= 0.5177
P (X ≤ x) = 1 − (1 − p)x
5
=⇒ P (X ≤ 4) = 1 − ( )4
6
= 0.5177.
(c) 0.0260. This could be answered using the brute force method:
P (X > 20) = 1 − P (X ≤ 20) = 1 − [P (X = 1) + P (X = 2) + . . . + P (X = 20)].
However, this would require calculating 20 geometric probabilities which would
be very cumbersome without using software. It is easier to answer this question
if we recognize that the frst winner will come after the 20th cup if the frst 20
cups have no winners. So P (X > 20) = ( 56 )20 = 0.0260.
(d) µ = p1 = 1/6
1
= 6.
31. Let X represent the number of attempts required to chip-in 4 times. X has the
negative binomial distribution with parameters r = 4 and p = 0.05. The�probability
mass function of the negative binomial distribution is: P (X = x) = x−1
r
r−1 p (1 −
p)x−r .
Balka ISE 1.09 5.10. THE MULTINOMIAL DISTRIBUTION 62
(a) P (X = 20) =
�20−1
4−1 0.054 (1 − 0.05)16 = 0.00267.
(b) P (X = 20) + P (X = 21) = 20−1
� 4 16
�21−1 4 17 =
4−1 0.05 (1 − 0.05) + 4−1 0.05 (1 − 0.05)
0.00267 + 0.00298 = 0.00564.
(c) µ = pr = 0.05
4
= 80.
q q
(d) σ = r(1−p)
p2
= 4·0.95
0.052
= 38.99.
(e) It will take more than 100 attempts to chip-in 4 times if the frst 100 attempts
have fewer than 4 chip-ins. Let Y represent the number of chip-ins in the frst
100 attempts. Then we need to fnd P (Y < 4), where Y has the binomial
distribution with n = 100 and p = 0.05.
P (Y < 4) = P (Y = 0) + P (Y = 1) + P (Y = 2) + P (Y = 3)
= 0.0059 + 0.0312 + 0.0812 + 0.1396
= 0.2578
This implies that P (X > 100) = 0.2578.
32. Let X represent the number of doors required to make 3 sales. Then X has
the negative binomial distribution with parameters r = 3 and p = 0.015. The
probability
�x−1 r mass function of the negative binomial distribution is: P (X = x) =
x−r .
r−1 p (1 − p)
(a) P (X = 50) =
�50−1
0.0153 (1 − 0.015)47 = 0.00195.
�3−1
(b) P (X = 200) = 200−1 3 197 = 0.00339.
3−1 0.015 (1 − 0.015)
(c) µ = pr = 0.015
3
= 200.
q q
(d) σ = r(1−p)
p2
= 3·0.985
0.0152
= 114.60.
(e) It will take more than 500 attempts to make 3 sales if the frst 500 attempts
have fewer than 3 sales. Let Y represent the number of sales in the frst
500 attempts. Then we need to fnd P (Y < 3), where Y has the binomial
distribution with n = 500 and p = 0.015.
P (Y < 3) = P (Y = 0) + P (Y = 1) + P (Y = 2)
= 0.00052 + 0.00398 + 0.01512
= 0.0196
This implies that P (X > 100) = 0.0196.
33. Let X1 , X2 , and X3 be the number of people with arches, loops, and whorls, respec-
tively. Then X1 , X2 , and X3 have the multinomial distribution with parameters
n = 10 and p1 = 0.12, p2 = 0.49, and p3 = 0.39.
Balka ISE 1.09 5.11. CHAPTER EXERCISES 63
10!
(a) P (X1 = 2, X2 = 3, X3 = 5) = 0.122 0.493 0.395 = 0.0385.
2!3!5!
10!
(b) P (X1 = 4, X2 = 4, X3 = 2) = 0.124 0.494 0.392 = 0.0057.
4!4!2!
(c) Since we are counting the number with arches, we will label arches a success
and non-arches (loops or whorls) a failure. The number with arches (X1 ) will
have a binomial
�10distribution with n = 10 and p = 0.12.
P (X1 = 1) = 1 0.12 (1 − 0.12)9 = 0.3798.
1
(d) The number with arches has a binomial distribution with parameters n and p,
so E(X1 ) = np = 10 × 0.12 = 1.2.
34. Let X1 , X2 , and X3 be the number of times the ball lands in red, black, and green
slots, respectively. Then X1 , X2 , and X3 have the multinomial distribution with
parameters n = 20 and p1 = 18 37 , p2 = 37 , and p3 = 37 .
18 1
20! 18 8 18 10 1 2
(a) P (X1 = 8, X2 = 10, X3 = 2) = ( ) ( ) ( 37 ) = 0.0141.
8!10!2! 37 37
20! 18 14 18 3 1 3
(b) P (X1 = 14, X2 = 3, X3 = 3) = ( ) ( 37 ) ( 37 ) = 0.000073.
14!3!3! 37
(c) Since we are counting the number of times the ball lands in a red slot, we
will label the red slots a success and non-red slots (green or black) a failure.
The number of times the ball lands in a red slot (X1 ) will have a binomial
distribution with n = 20 and p = 18 37 .
�20 18 1 18 9
P (X1 = 11) = 11 ( 37 ) (1 − 37 ) = 0.1506.
(d) The number of times the ball lands in a red slot has binomial distribution with
parameters n = 20 and p = 18 37 , so E(X1 ) = np = 20 × 37 = 37 ≈ 9.73.
18 360
36. (a) c = 0.05, since the probabilities must sum to 1. So the distribution is:
x −5 10 50 100
p(x) 0.05 0.05 0.40 0.50
(b) P (X > 21) = P (X = 50) + P (X = 100) = 0.40 + 0.50 = 0.90.
P (X > 21 ∩ X < 60) P (X = 50) 0.40
(c) P (X > 21|X < 60) = = = =
P (X < 60) P (X < 60) 0.05 + 0.05 + 0.40
0.80.
37. (a) 115. The missing probability must be 0.4. E(X) = 10 · 0.1 + 20 · 0.2 + 30 ·
0.3 + x · 0.4 = 60 =⇒ x = 115.
Balka ISE 1.09 5.11. CHAPTER EXERCISES 64
39. (a) 0.2304. Let X represent the number of 10s selected. �5 Then X has a binomial
distribution with n = 5 and p = 0.4. P (X = 3) = 3 0.4 0.62 = 0.2304.
3
40. Let X be the number of 10s selected. Then X has a binomial distribution with
n = 20 and p = 0.4.
41. Both X and Y are binomial random variables. Binomial random variables have a
mean of np and a variance of np(1 − p).
(d) σ 2 = (x − µ)2 p(x) = (0 − 0.5)2 ·√0.689 + (1 − 0.5)2 · 0.30 + (10 − 0.5)2 · 0.010 +
P
2
(100 − 0.5) · 0.001 = 11.05. σ = 11.05 = 3.324.
(e) P (X = 100|X > 0) = P (X=1)+PP(X=10)+P
(X=100) 0.001
(X=100) = 0.3+0.01+0.001 = 0.003215.
(f) 1000 · 0.5 = $500.
Balka ISE 1.09 5.11. CHAPTER EXERCISES 65
5.11.2 Concepts
43.
47. (e) (None of the above). The number of children in the selected area will be 0 or
1 (since the children spread themselves out). The number of children in the area
would have a Bernoulli distribution, but the precise value of p is a little tricky to
determine.
48. (c). If the trials were independent, then X would have a binomial distribution.
However, in this scenario the trials are not independent, since if Tom makes a
mistake then he is more likely to make more mistakes. X is a count variable with
a minimum of 0 and a maximum of 20, and it is thus a discrete random variable,
but its exact distribution is impossible to determine without further information.
49. (a) A discrete random variable can take on a countable number of possible values.
True.
(b) A discrete random variable always represents a count. False.
(c) The expected value of a discrete random variable must equal one of the vari-
able’s possible values. False.
Balka ISE 1.09 5.11. CHAPTER EXERCISES 66
50. (a) The variance and standard deviation of a discrete random variable cannot be
equal. False. (They may both equal 1.)
(b) The mean of a random variable cannot be negative. False.
(c) The standard deviation of a random variable cannot be negative. True.
(d) If X is a discrete random variable, then Y = 2X + 3 is a discrete random
variable. True.
51. (a) In a binomial setting, the probability of success usually changes from trial to
trial. False. (In a binomial setting, the probability of success stays constant
from trial to trial.)
(b) The mean of a Poisson random variable can be negative. False. (µ = λ must
be positive.)
(c) If X represents the number of students in a randomly selected 100 m2 area of
a university campus at 3:00 pm on a Tuesday afternoon, then X has a Poisson
distribution. False. (Students are not distributed randomly—they tend to group
together.)
(d) If p > 0, then the variance of a binomial random variable is less than its mean.
True.
(e) If X is a binomial random variable with n = 10 and p = 0.1, and Y is a
Poisson random variable with λ = 2, then X − Y has a binomial distribution.
False. (X −Y can take on negative values so it cannot possibly have a binomial
distribution.)
5.11.3 Applications
53. (a) Let Y represent the number of randomly selected numbers that match the
ticket. Then Y has a hypergeometric distribution with a = 6, N = 70, and
n = 20.
6 64 6 64
� � � �
5
P (Y = 5) = �7015
= 0.0059123. P (Y = 6) = 6
�7014
= 0.0002956.
20 20
P (Y ≤ 4) = 1 − [P (Y = 5) + P (Y = 6)] = 1 − (0.0059123 + 0.0002956) =
0.9937921.
Balka ISE 1.09 5.11. CHAPTER EXERCISES 67
299.0986 = 17.29.
(d) A player is paying $1 for the ticket and getting back $0.4434 on average, so
the government is keeping about 56 cents on average.
54. (a) Let X represent the number of tickets that win $25. Then X has a binomial
distribution with
�50 parameters n = 50 and p = 0.0059123.
P (X = 1) = 1 0.0059123 (1 − 0.0059123)49 = 0.2211.
1
(b) P (X ≥ 1) = 1 − P (X = 0) = 1 − 50 0 50 = 1 −
�
0 0.0059123 (1 − 0.0059123)
0.7434 = 0.2566.
(c) Let W represent the number of tickets that win a prize (either $25 or $1000).
Then W has a binomial distribution with parameters n = 50 and� p= 0.0059123+
0.0002956 = 0.0062079. P (W ≥ 1) = 1 − P (W = 0) = 1 − 50 0
0 0.0062079 ·
0.993792150 = 1 − 0.993792150 = 1 − 0.7324 = 0.2676.
(d) E(W ) p= np = 50 · 0.0062079 = 0.3104.
(e) σW = np(1 − p) = 50 · 0.0062079(1 − 0.0062079) = 0.5554.
p
x 0 25000
p(x) 0.99 0.01
57. (a) The number of siblings that are an HLA-identical match has a binomial dis-
tribution with n = 6 and p = 14 .
Balka ISE 1.09 5.11. CHAPTER EXERCISES 68
(b) Let X represent the number of siblings that are an HLA-identical match.
P (X ≥ 1) = 1 − P (X = 0) = 1 − ( 34 )6 = 0.822.
(c) P (X ≥ 2) = 1 − [P (X = 0) + P (X = 1)] = 1 − (0.1780 + 0.3560) = 0.466.
(d) µ = np = 6 × 14 = 1.5.
58. Let X represent the number of completed surveys. Then X has a binomial distri-
bution with parameters n = 100 and p = 0.09.
59. Let X represent the number of calls needed to get 4 completed surveys. Then X
has the negative binomial distribution with parameters r = 4 and p = 0.09. The
probability
�x−1 r mass function of the negative binomial distribution is: P (X = x) =
r−1 p (1 − p)x−r .
(a) P (X = 50) =
�50−1
4−1 0.094 (1 − 0.09)46 = 0.0158.
(b) P (X = 50) + P (X = 51) = 50−1
� 4 46
�51−1 4 47 =
4−1 0.09 (1 − 0.09) + 4−1 0.09 (1 − 0.09)
0.0158 + 0.0153 = 0.0311.
(c) µ = pr = 0.09
4
= 44.4.
q q
(d) σ = r(1−p)
p 2 = 4·0.91
0.092
= 21.20.
(e) It will take more than 100 calls to get 4 completed surveys if the frst 100
calls result in fewer than 4 completed surveys. Let Y represent the number
of completed surveys in the frst 100 calls. Then we need to fnd P (Y < 4),
where Y has the binomial distribution with n = 100 and p = 0.09.
P (Y < 4) = P (Y = 0) + P (Y = 1) + P (Y = 2) + P (Y = 3)
= 0.00008 + 0.00079 + 0.00388 + 0.01254
= 0.0173
60. Let X represent the number of chickens that have detectable levels of salmonella.
Then, assuming we have a random and independent sample of chickens, X has a
binomial distribution with parameters n = 12 and p = 0.14.
(a) P (X = 2) = 12 2 10 = 0.286.
�
2 0.14 0.86
Balka ISE 1.09 5.11. CHAPTER EXERCISES 69
(b) 0.7697.
P (X ≤ 2) = P (X = 0) + P (X = 1) + P (X = 2)
12 0 12 12 1 11 12
= 0.14 0.86 + 0.14 0.86 + 0.142 0.8610
0 1 2
= 0.1637 + 0.3197 + 0.2863
= 0.7697
(c) Let Y represent the number of chickens with detectable levels of salmonella.
Assuming this retailer’s claim is true, Y has a binomial distribution with
n = 20 and p = 0.02. P (Y ≥ 3) = 1 − P (X ≤ 2) = 1 − [P (Y = 0) + P (Y =
1) + P (Y = 2)] = 0.007. This yields strong evidence against the company’s
claim.
61. Let X represent the number of people in the sample that have given blood at some
point.
62. (a) The number of tickets that win a cash prize follows a binomial distribution
with n = 156 and p = 0.03.
(b) E(X) = np = 156 · 0.03 = 4.68.
(c) P (X = 0) = 0.97156 = 0.0086.
(d) This is a complicated question, without an easy answer, and there are many
factors to take into account (for example, how trustworthy you believe your
friend is). But if your friend was indeed purchasing the tickets and (theoreti-
cally) sharing the cash prizes, there is a less than 1% chance of not winning a
prize. You may not want to end a friendship over this, but it is a little fshy.
63. Let X represent the number of crashes in the next year. Since the fatal crashes can
be thought of as occurring randomly and independently with a mean of 1.1, X has
(approximately) a Poisson distribution with λ = 1.1.
1.10 e−1.1
(a) P (X = 0) = 0! = 0.3329.
2 −1.1 3 −1.1
(b) P (X = 2) + P (X = 3) = 1.1 2!
e
+ 1.1 3!
e
= 0.2014 + 0.0738 = 0.2752.
(c) P (X√> 4) = 1 − P (X ≤ 4) = 0.0054.
√
(d) σ = λ = 1.1 = 1.049.
64. Let X represent the number of crashes in the next 2 year period. Since the fatal
Balka ISE 1.09 5.11. CHAPTER EXERCISES 70
65. (a) If the mean number of impacts per million years is 2, then the mean number
of impacts in a 100,000 year period is µ = λ = 10 2
= 0.20.
0
0.2 e −0.2
P (X ≥ 1) = 1 − P (X = 0) = 1 − 0! = 1 − 0.8187 = 0.1813.
(b) If the mean number of impacts per million years is 2, then the mean number
of impacts in a 10,000 year period is µ = λ = 100 2
= 0.02.
0
0.02 e −0.02
P (X ≥ 1) = 1 − P (X = 0) = 1 − 0! = 1 − 0.9802 = 0.0198.
66. (a) The number of round/yellow plants has a binomial distribution with n = 50
and p = 16
9
.
(b) µ = np = 50 · 16
9
= 28.125.
q
(c) σ = np(1 − p) = 50( 16 9 9
p
)(1 − 16 ) = 3.5078.
(d) The number of wrinkled/green � 1plants has a binomial distribution with n = 50
and p = 16 1
. P (X = 4) = 50 4 ( 16 ) 4 ( 15 )46 = 0.1805.
16
(e) 28!12!6!4!
50!
) ( 16 ) ( 16 ) ( 16 ) = 0.00152. (The distribution of the number of
9 28 3 12 3 6 1 4
( 16
plants with the 4 phenotypes is the multinomial distribution with n = 50,
p1 = 169
, p2 = 16 3
, p3 = 163
, and p4 = 16 1
.)
(a) µ = np = 2 · 23 = 43 .
q
(b) σ = np(1 − p) = 2 · 2 1
= 23 .
p
3 · 3
69. (a) The number of people in the sample that are over 60 years old will follow
a binomial distribution with n = 20 and p = 0.178. Let X represent this
number.
(b) P (X ≥ 2) = 0.89426.
Balka ISE 1.09 5.11. CHAPTER EXERCISES 71
(c) P (X = 2) = 0.1767.
(d) P (X = 0) = (1 − 0.178)20 = 0.01984.
(e) E(X) = np = 20 · 0.178 = 3.56.
70. (a) The number of people in the sample that voted in the last federal election
has a binomial distribution with n = 5 and p = 0.60. Let X represent this
number.
(b) P (X = 4) = 0.2592.
(c) P (X ≥ 4) = P (X = 4) + P (X = 5) = 0.3370.
(d) P (X = 5|X ≥ 4) = P (X=5∩X≥4)
P (X≥4) = PP (X=5) 0.07776
(X≥4) = 0.33696 = 0.2308.
(e) E(X) = np = 5 · 0.60 = 3.0.
71. Let X be the number of people that are out of work. Then X has a binomial
distribution with n = 14 and p = 0.076.
(a) P (X = 3) = 0.0670.
(b) P (X > 2) = 1 − P (X ≤ 2) = 1 − [P (X = 0) + P (X = 1) + P (X = 2)] =
0.08496.
(c) 1. (If there are at least 2 out of work, there must be at least 1 out of work.)
(d) 0. (If there are at least 3 out of work, then there is more than 1 out of work.)
72. The number of cars that fail the test will have a binomial distribution with n = 10
and p = 0.35.
(a) P (X = 2) = 0.17565.
(b) P (X ≤ 2) = P (X = 0) + P (X = 1) + P (X = 2) = 0.2616.
(c) P (X > 2) = 1 − P (X ≤ 2) = 1 − 0.2616 = 0.7384.
(d) E(X) p= np = 10 · 0.35 = 3.5.
√
(e) σ = np(1 − p) = 10 · 0.35 · 0.65 = 1.508.
(f) Not in and of itself, but something is going on. If this was a random selection
of 10 cars that are at least 13 years old, then the probability of failing all of
them should be approximately 0.3510 = 0.000028. This is quite unlikely, so
there is a concern that the garage might be inappropriately failing cars for
some reason. But there are other possible explanations. For example, perhaps
this garage has an excellent reputation for fxing cars that are almost ready
for the junk heap, so they might tend to test cars that are in worse shape.
73. (a) The number of decays in a one-second period is assumed to follow a Poisson
0.82 e−0.8
distribution with λ = 0.8, so: P (X = 2) = = 0.1438.
2!
(b) The number of events in a three-second period is assumed to follow a Poisson
2.42 e−2.4
distribution with λ = 3 × 0.8 = 2.4, so: P (X = 2) = = 0.2613.
2!
Balka ISE 1.09 5.11. CHAPTER EXERCISES 72
75. (a) 0.999955. Let X represent the number of planes that arrive in the two hour
period. P (X ≥ 1) = 1 − P (X = 0) = 1 − 0.000045 = 0.999955 (calculated
using the Poisson pmf with λ = 2 × 5 = 10).
(b) 0.00017. Let X represent the number of planes that arrive in a 3 hour period.
P (X = 3) = 0.00017 (calculated using the Poisson pmf with λ = 3 × 5 = 15).
(c) 0.0821. Let X represent the number of planes that arrive in a half-hour period.
P (X = 0) = 0.0821 (calculated using the Poisson pmf with√ λ = 0.5 × 5 = 2.5).
√
(d) 8 × 5 = 6.325 (since for a Poisson random variable, σ = λ).
76. (a) Under the given assumptions, the number that crash has a binomial distribu-
tion with n = 10 and p = 0.01. For the remaining questions, let X represent
the number of planes that crash.
(b) P (X ≥ 1) = 1 − P (X = 0) = 1 − 0.90438 = 0.0956 (calculated using the
binomial pmf with n = 10 and p = 0.01).
(c) P (X = 1) = 0.09135 (calculated using the binomial pmf with n = 10 and
p = 0.01).
(d) P (Xp= 10) = 10−20p= 0.00000000000000000001.
(e) σ = np(1 − p) = 10 · 0.01 · (1 − 0.01) = 0.3146.
77. (a) The number that need to be selected in order to get the frst doctor has a
geometric distribution with p = 0.002. Let X represent this number. P (X =
4) = (1 − 0.002)3 × 0.002 = 0.001988.
(b) If the frst 1000 people are not doctors, then the frst doctor will occur after
the frst 1000 people sampled. P (X > 1000) = (1 − 0.002)1000 = 0.1351.
Balka ISE 1.09 5.11. CHAPTER EXERCISES 73
78. (a) The number of welds needed to get the frst unsuccessful weld has a geometric
distribution with p = 0.10. For the following questions, let X represent this
random variable.
(b) P (X = 3) = 0.92 · 0.1 = 0.081.
(c)
P (X ≤ 3) = P (X = 1) + P (X = 2) + P (X = 3)
= 0.90 · 0.1 + 0.91 · 0.1 + 0.92 · 0.1
= 0.271
(d) The frst unsuccessful weld will occur after the 13th trial if the frst 13 trials
are successful. So P (X > 13) = 0.913 = 0.254.
79. (a) The number of welds needed to get the third unsuccessful weld has a negative
binomial distribution with r = 3 and p = 0.10. Let X represent this random
variable.
P (X = 22) = 21
� 3 19
2 0.1 0.9 = 0.02837
(b) Let Y represent the number of unsuccessful welds. Then Y has a binomial
distribution with n = 20 and p = 0.10. P (X = 3) = 0.1901.
(c) P (Y ≥ 3) = 1 − P (Y ≤ 2) = 1 − [P (Y = 0) + P (Y = 1) + P (Y = 2)] =
1 − 0.6769 = 0.3231.
(d) µ = np = 20 × 0.1 = 2.
80. (a) Let X represent the number of shipments needed to get the frst rejected
shipment. Then X has a geometric distribution with p = 0.1. P (X = 10) =
0.969 · 0.04 = 0.0277.
(b) The frst rejected shipment will occur after the 10th shipment if the frst 10
are not rejected. P (X > 10) = 0.9610 = 0.6648.
(c) Let Y represent the number of rejected shipments. Then Y has a binomial
distribution with n = 60 and p = 0.04. P (Y ≤ 1) = P (Y = 0) + P (Y = 1) =
0.0864 + 0.2159 = 0.3022.
81. (a) The number of men in the sample follows a hypergeometric distribution. To
get exactly 4�men,
� we must choose 4 men from 55, and 1 woman from 30:
55 30
4 1
P (X = 4) = �85 = 0.3119.
5
Balka ISE 1.09 5.11. CHAPTER EXERCISES 74
82. Let X represent the number of dandelions in the randomly selected area. Then X
has a Poisson distribution with λ = 4.
(a) P (X = 2) = 0.1465.
(b) P (X > 2) = 1 − P (X ≤ 2) = 1 − [P (X = 0) + P (X = 1) + P (X = 2)] =
1 − 0.2381 = 0.7619.
(c) P (X = 3|X ≥ 1) = P (X=3∩X≥1)
P (X≥1) = PP (X=3) 0.1953668
(X≥1) = 0.9816844 = 0.1990.
√ √
(d) σ = λ = 4 = 2.
83. (a) Let X represent the number of implants needed to get the frst unsuccessful
one. Then X has a geometric distribution with p = 0.05. P (X = 4) =
0.953 · 0.05 = 0.0429.
(b) In order for the frst unsuccessful implant to occur after the 11th trial, we need
the frst 11 trials to be successful. P (X > 11) = 0.9511 = 0.5688.
(c) Let Y represent the number of trials needed to get the third unsuccessful
implant. Then Y has �the negative binomial distribution with r = 3 and
p = 0.05. P (Y = 28) = 27 3
2 0.05 0.95
25 = 0.0122.
84. (a) Here we need to pick 2 footballs and 4 balls that are either soccer balls or
volleyballs: �15�28
P (2 footballs) = 2
�434 = 0.3526.
6
(b) µ = n · a
N =6· 15
43 = 2.093.
�20�23 �20�23
(c) P (0 soccer balls)+P (1 soccer ball) = 0
�436 + 1�435 = 0.01656+0.11039 =
6 6
0.1269.
�15�20�8
(d) 3
�432 1
= 0.1134.
6
85. (a) Since the sampling is done without replacement, the number of computer en-
gineers in the group of 6 will follow a hypergeometric distribution. We must
select 3 computer engineers (from 7 total)
� � and 3 non-computer engineers (from
7 13
5 + 5 + 3 = 13 total). P (X = 3) = 3
�203 = 0.2583.
6
(b) Here we must select 2 biomedical engineers (from 5), 2 computer engineers
(from 7), 1 mechanical engineer (from 5) and 1 water resource engineer (from
Balka ISE 1.09 5.11. CHAPTER EXERCISES 75
�5�7�5�3
3). The probability of this is: 2 2
�201 1
= 0.0812.
6
86. Here we need to pick 1 Honda vehicle (from 4), 1 Ford (from 6), 1 Toyota (from 3),
and 2 Chrysler (from 7). We can calculate the�probability of this event using the
4 6 3 7
� � �
multivariate hypergeometric distribution: 1 1
�20 1 2
= 0.09752.
5
87. 5! 3 1 1
3!1!1!0! 0.4 0.3 0.2 0.1
0 = 0.0768.
Balka ISE 1.09 5.11. CHAPTER EXERCISES 76
Chapter 6
J.B.’s strongly suggested exercises: 1, 5, 6, 8, 9, 10, 11, 13, 14, 15, 21, 23, 24, 27
6.1 Introduction
1. (a) For continuous random variables, probabilities correspond to areas under the
density curve. True.
(b) The area under any continuous probability distribution is 1. True.
(c) A probability density function can take on negative values. False. f (x) ≥ 0
everywhere.
(d) A probability density function can take on values greater than 1. True. The
area under the entire curve is 1, but the height of the curve can be greater than
1.
(e) f (x) = P (X = x) for all x. False. The value of the pdf (the height of the curve
at point x) is not a probability.
(f) f (x) = µ whenever the distribution is symmetric. False.
77
Balka ISE 1.09 6.3. THE CONTINUOUS UNIFORM DISTRIBUTION 78
0.111.
R∞
(b) P (X > 0.02) = 0.02 10e−10x dx = −e−10x |0.02 ∞ = 0 − (−e−10×0.02 ) = e−0.2 ≈
0.819.
R 0.12
(c) P (0.08 < X < 0.12) = 0.08 10e−10x dx = −e−10x |0.08 0.12 = (−e−10×0.12 ) −
5. (a) Since the base is (25 − 5 = 20), the height of the rectangle must be 1
25−5 = 20 .
1
So,
20 for 5 ≤ x ≤ 25
( 1
f (x) =
0 elsewhere
Balka ISE 1.09 6.4. THE NORMAL DISTRIBUTION 79
7. µ and σ 2 are the parameters of the distribution (or µ and σ). µ can be any value
(−∞ < µ < ∞), but σ 2 must be positive (σ 2 > 0).
(d) The standard deviation of a normal distribution cannot be greater than the
mean. False.
(e) The mean and median of a normal distribution are always equal. True. The
normal distribution is symmetric, and the mean and median are equal in sym-
metric distributions.
9. (a) 0.6443.
(b) 1 − 0.6443 = 0.3557.
(c) 0.1443.
(d) 0.1003.
(e) 1 − 0.1003 = 0.8997.
(f) 0.
(g) 0.3997.
P (Z>2.41∩Z>0)
(h) P (Z > 2.41|Z > 0) = P (Z>0) = PP(Z>2.41) 0.008
(Z>0) = 0.5 = 0.016.
(i) P (−2.41 < Z < 2.41|Z > 0) = P (−2.41<Z<2.41∩Z>0)
P (Z>0) = P (0<Z<2.41)
P (Z>0) = 0.4920
0.5 =
0.984.
(j) ≈ 0.95 (by the table), 0.9541653 (using software).
(k) By the table: Q1 ≈ −.67, Q3 ≈ .67, IQR ≈ 1.34. Using software: Q1 ≈
−0.6744898, Q3 ≈ 0.6744898, IQR ≈ 1.348980
(l) P (Z < 2.13) = 0.9834. If two values are randomly and independently selected,
the probability they are both less than 2.13 is 0.98342 = 0.9671.
10. (a) 0.9968 (the area to the right of −2.73 under the standard normal curve).
(b) 0.0032 (the area to the right of 2.73 under the standard normal curve).
(c) 1.17.
(d) −1.17.
(e) 1.96.
(f) −0.71.
(g) 0.00052. The probability a single value is greater than 2.0 is 0.0228. The
probability two values, independently sampled, are both greater than 2.0 is
0.02282 = 0.00052.
11. 0 (the area to the right of 31788905573847358927509487 under the standard normal
curve is minuscule).
Balka ISE 1.09 6.4. THE NORMAL DISTRIBUTION 81
13. Let X represent the weight (in kg) of a randomly selected newborn female African
elephant that is born in captivity. Then X ∼ N (95.1, 13.72 ).
14. (a) If X has a normal distribution with mean µ and standard deviation σ, then
X −µ
σ has the standard normal distribution. True.
(b) If Z has a standard normal distribution, then the mean and standard deviation
of Z are equal. False. µ = 0, σ = 1.
Balka ISE 1.09 6.5. NORMAL QUANTILE-QUANTILE PLOTS 82
(c) The frst quartile of the standard normal distribution is greater than 0. False.
The frst quartile is approximately −0.67.
(d) The frst quartile of any normal distribution is greater than 0. False. Any of
the percentiles can be negative, depending on the parameters of the distribution.
(e) If X has a normal distribution, then P (X > µ) = P (X < µ). True. Since the
normal distribution is symmetric about µ, P (X > µ) = P (X < µ) = 0.5.
15. (a) Since the points on the plot fall close to a straight line, the population from
which the sample is drawn is likely approximately normal.
(b) Since the points on the plot fall close to a straight line, the population from
which the sample is drawn is likely approximately normal.
(c) This type of plot is typical of a distribution that is symmetric, but has shorter
tails than the normal distribution (perhaps the uniform distribution, or some-
thing similar).
(d) This type of plot is typical of a distribution that is skewed to the right.
16. The normal quantile-quantile plot would look like the one in Figure 6.1.
Sample Quantiles
-2 -1 0 1 2
Theoretical Quantiles
Figure 6.1: A normal quantile-quantile plot showing left skewness and one large outlier.
Balka ISE 1.09 6.6. OTHER IMPORTANT CONTINUOUS PROBABILITY DISTRIBUTIONS 83
18. (a) 2. The area of a triangle is 12 × base × height. Since the area under the entire
curve must equal 1, 12 × base × height = 1 =⇒ 12 × (1 − 0) × height = 1 =⇒
height = 2. So the highest point of the triangle is 2, which occurs at x = 1.
(b) f (0) = 0 and f (1) = 2, which implies that f (0.5) = 1. (Since f (x) is a straight
line in this question.)
(c) f (x) = 2x for 0 < x < 1 and 0 otherwise. (The curve is a line with slope =
2.)
(d) 0.
(e) 1. P (X < 1.5) is the area to the left of 1.5. Since all of the area lies between
0 and 1, P (X < 1.5) = 1.
(f) 0.25. P (X < 0.5) is the area to the left of 0.5. At x = 0.5, f (0.5) = 1. The
area to the left of 0.5 is a triangle with a base of (0.5 − 0) = 0.5 and a height
of 1, so P (X < 0.5) = 12 × 0.5 × 1 = 0.25.
(g) P (X > 0.5) = 1 − P (X < 0.5) = 1 − 0.25 = 0.75.
√
(h) 0.5. We need to fnd the value a such that the area to the left of a is 0.5.
The area to the left of a point a is 12 × (a − 0) × 2a. Setting this equal to 0.5,
Balka ISE 1.09 6.7. CHAPTER EXERCISES 84
√
a2 = 0.5 =⇒ a = 0.5.
6.7.2 Concepts
19. 1, since the area to the left of −458958643961346 under the standard normal curve
is minuscule.
20. (a) The 95th percentile of a continuous random variable cannot be less than 0.
False.
(b) The mean of a continuous random variable cannot be negative. False. Con-
tinuous random variables often have negative means.
(c) The mean of a continuous random variable can be greater than the third
quartile. True, but it would be unusual.
(d) The normal distribution is symmetric. True.
(e) If X has a normal distribution, then the mean and median of X are equal.
True, since for symmetric distributions the mean and median are equal.
6.7.3 Applications
21. Let X represent the total serum cholesterol (mmol/L) of a randomly selected Cana-
dian male between 60 and 79 years of age. Then X ∼ N (4.7, 1.22 ).
22. Let X represent the height (in cm) of a randomly selected Canadian female between
20 and 39 years of age. Then X ∼ N (163.3, 6.42 ).
Balka ISE 1.09 6.7. CHAPTER EXERCISES 85
23. Let X represent the height (in cm) of a randomly selected Canadian male between
20 and 39 years of age. Then X ∼ N (177.7, 6.52 ).
24. Let X represent the test diameter (in mm) of a randomly selected mature male
green sea urchin in this area. Then X ∼ N (39.7, 3.82 ).
25. Let X represent the weight (in grams) of the cereal in a randomly selected bag of
cereal of this type. Then X ∼ N (374.5, 2.82 ).
26. Let X represent the amount of protein (in grams) of a randomly selected cup of
this type of 2% milk. Then X ∼ N (8.0, 0.302 ).
using the table. Using software, we can fnd that the 99th percentile of the
standard normal distribution is 2.326348.)
(e) 8.20 grams. The 75th percentile of the standard normal distribution is ap-
proximately 0.67. Thus the 75th percentile of protein in a cup of 2% milk
is approximately µ + σz = 8.0 + 0.3 · 0.67 = 8.20 grams. (There is a little
rounding error when fnding a percentile of the standard normal distribution
using the table. Using software, we can fnd that the 75th percentile of the
standard normal distribution is 0.6744898.)
27. Let X represent the total amount of fat (in grams) in a randomly selected serving
(85 grams) of this type of cooked ground beef. Then X ∼ N (10.2, 1.82 ).
28. Let X represent the length (in mm) of the head of a randomly selected lizard of
the species P. versicolor. Then X ∼ N (11.90, 0.412 ).
29. Let X represent the weight (in grams) of the heart of a randomly selected adult
Caucasian male. Then X ∼ N (365, 712 ).
(a) P (X > 400) = P (Z > 400−36571 ) = P (Z > 0.493) = 0.311. (There is a little
rounding error if you use the table: P (Z > 0.49) = 0.3121.)
(b) P (X > 453.6) = P (Z > 453.6−365
71 ) = P (Z > 1.248) = 0.106.
(c) P (X < 300) = P (Z < 300−365
71 ) = P (Z < −0.915) = 0.180. (There is a little
rounding error if you use the table: P (Z < −0.92) = 0.1788.)
(d) P (300 < X < 400) = P ( 300−365
71 < Z < 400−365
71 ) = P (−0.915 < Z < 0.493) =
0.689 − 0.180 = 0.509.
(e) The 90th percentile of the standard normal distribution is approximately 1.28.
The 90th percentile of heart weight is approximately 365 + 71 × 1.28 = 455.9
grams. (This answer has a little rounding error. If we use software, we can
fnd that the correct value to 2 decimal places is 455.99.)
30. Let X represent the weight (in kg) of a randomly selected newborn baby in the
United States. Then X ∼ N (3.3, 0.52 ).
31. Let X represent the lifetime (in days) of a randomly selected insect of this type.
Then X ∼ N (60.2, 18.02 ).
is 4.0 × 10−15 .)
(c) P (Z < 50−60.2
18.0 ) = P (Z < −0.567) = 0.285. (If you use the table, there will be
a little rounding error: P (Z < −0.57) = 0.2843.)
(d) P ( 50−60.2
18.0 < Z < 70−60.2
18.0 ) = P (−0.567 < Z < −0.544) = 0.707 − 0.285 =
0.421.
(e) The 90th percentile of the standard normal distribution is approximately 1.28,
and thus the 90th percentile of life lengths is approximately µ + σz = 60.2 +
18 × 1.28 = 83.2 days. (This answer has a little rounding error. If we use
software, we can fnd that the correct value is 83.27.)
32. Let X represent the purity level in a randomly selected oxygen container of this
type. Then X ∼ N (99.50, 0.0482 ).
33. Let X represent the weight (in grams) of a randomly selected adult pigeon in this
city. Then X ∼ N (370, 282 ).
(a) P (X < 350) = P (Z < 350−370 28 ) = P (Z < −0.714) = 0.238. (If you use a
table, there will be a little rounding error: P (Z < −0.71) = 0.2389.)
(b) P (X > 453.6) = P (Z > 453.6−370
28 ) = P (Z > 2.986) = 0.0014.
(c) P (350 < X < 400) = P ( 28 < Z < 400−370
350−370
28 ) = P (−0.714 < Z < 1.071) =
0.858 − 0.238 = 0.620.
(d) P (X > 6000) = P (Z > 6000−37028 ) = P (Z > 201.07) ≈ 0. (Under the assumed
Balka ISE 1.09 6.7. CHAPTER EXERCISES 90
model, this probability is non-zero, but very very close to 0. For all intents
and purposes there is no chance of observing this event.)
(e) The 5th percentile of the standard normal distribution is approximately −1.645.
The 5th percentile of pigeon weights is approximately µ+σz = 370+28(−1.645) =
323.94 grams.
34. Let X represent the cheese demand (in kg) on a randomly selected day. Then
X ∼ N (50, 36). Note that √ the variance is given (σ = 36), and the standard
2
35. Let X represent the diameter (in cm) of a randomly selected shaft. Then X ∼
N (4.10, 0.052 ).
Sampling Distributions
J.B.’s strongly suggested exercises: 1, 6, 7, 8, 9, 10, 20, 21, 22, 23, 24, 25
7.1 Introduction
2. (a) µ = 14.
(b) 42 = 6.
�
91
Balka ISE 1.09 7.1. INTRODUCTION 92
(e)
¯) =
X 1 1 2 1 1
µX̄ = E(X ¯ x) = 12.5 · + 13.5 · + 14.0 · + 14.5 · + 15.5 · = 14
xp(¯
6 6 6 6 6
Note that E(X
¯ ) = µ (on average, the sample mean equals the population
mean).
(c) 2
6 = 13 .
(d) The following table gives the 4 possible outcomes, the corresponding values of
X¯ , and their probabilities of occurring.
The probability distribution of the sample mean X ¯ is thus:
x̄ 0 0.5 1
p(x̄) 0.25 0.5 0.25
Balka ISE 1.09 7.2. THE SAMPLING DISTRIBUTION OF THE SAMPLE MEAN 93
Outcome TT TH HT HH
Value of X¯ 0 0.5 0.5 1
Probability 0.25 0.25 0.25 0.25
This is the sampling distribution of the average number of heads per toss (X ¯)
when a fair coin is tossed twice. It was not necessary to actually toss the coin
to derive the sampling distribution.
(e) The mean of the sampling distribution of X ¯ is represented by µ ¯ . We can use
X
the formula for the mean of a discrete random variable to fnd the mean of X: ¯
X
¯) =
µX̄ = E(X ¯ x) = 0 · 0.25 + 0.5 · 0.5 + 1 · 0.25 = 0.50
xp(¯
Note that the mean of the probability distribution of the sample mean is equal
to mean of the probability distribution for a single toss: µX¯ = µX = 21 .
(f) The variance of the sampling distribution of X ¯ is represented by σ 2 . We
X̄
can use the formula for the variance of a discrete random variable to fnd the
variance of X
¯:
X
2
σX̄ = (x̄−µX̄ )x p(x̄) = (0 = 0.5)2 ·0.25+(0.5−0.5)2 ·0.5+(1−0.5)2 ·0.25 = 0.125
5. (a) Let X represent the number of winning cups. Then X has a binomial distri-
bution with n = 2 and p = 61 . The distribution of X in table form is:
x 0 1 2
25 10 1
p(x) 36 36 36
(b) The proportion of winning cups is the number of winning cups divided by the
number of cups purchased (p̂ = X n ). So the sample proportion takes on 3
possible values: p̂ = 02 = 0, or p̂ = 12 = 0.5, or p̂ = 22 = 1. The sampling
distribution of the sample proportion is:
p̂ 0 0.5 1
Probability 25 36
10
36
1
36
30−µ
(d) P (X¯ > 30) = P (Z > √ )
σ/ n
= P (Z > 30−10
√ ) = P (Z > 3.2) = 0.0007.
25/ 16
(e) P (0 < X¯ < 20) = P ( 0−10
√
25/ 16
<Z< 20−10
√ ) = P (−1.6 < Z < 1.6) = 0.8904.
25/ 16
7. Plot B is the sampling distribution of the sample mean. In this situation the
sampling distribution of X ¯ will be approximately normal with a mean of 20 and a
standard deviation of σX̄ = √σn = √20 400
= 1. Although A, B, and D all have means
of 20, plot B is the only one with a standard deviation that appears to be close to
1.
8. The central limit theorem tells us that sums and means of random variables have
distributions that tend toward the normal distribution as the number of observa-
tions increases, regardless of the distribution from which we are sampling. This
implies that for large sample sizes, the distribution of the sample mean will be
approximately normal, even when we are sampling from a non-normal population.
The central limit theorem is very important in the world of statistics. Because of
the CLT, we can often make probability calculations and use statistical inference
procedures based on the normal distribution, even when the underlying population
is not normally distributed.
9. False. µ is a parameter, and as such it does not have a sampling distribution. (We
view µ as a fxed, unchanging value.)
10. (a) The standard deviation of the sampling distribution of the sample mean de-
pends on the value of µ. False. σX¯ = √σn depends only on σ and n.
(b) We cannot possibly determine any characteristics of a statistic’s sampling dis-
tribution without repeatedly sampling from the population. False. We can
often mathematically determine characteristics of the sampling distribution.
For example, we have shown that E(X ¯ ) = µ.
(c) The sampling distribution of X is always at least approximately normal for
¯
large sample sizes, and is sometimes approximately normal for small sample
sizes. True. The distribution of X ¯ is normal for any sample size if we are
sampling from a normally distributed population. If we are sampling from a
distribution that is approximately normal, then the distribution of X¯ will be
approximately normal for any sample size.
(d) If the sample size is quadrupled, then standard deviation of the sampling
distribution of the sample mean would decrease by a factor of 2. True. √σn
√
decreases by a factor of 4 = 2 if n = 4.
Balka ISE 1.09 7.4. SOME TERMINOLOGY REGARDING SAMPLING DISTRIBUTIONS 95
11. The standard error of a statistic is the estimate of the standard deviation of that
statistic’s sampling distribution. (It is a measure of the variability of the statistic
in repeated sampling.)
(Note: Some sources defne standard error as the true standard deviation of the
statistic’s sampling distribution, and the estimated standard error as the estimate.
But here we reserve the term standard error for the estimated standard deviation
of a statistic’s sampling distribution.)
12. SE(X
¯) = √s
n
= 13
√.7
6
= 5.593.
7.7 Concepts
16. (a) The letter n represents the number of samples that are drawn in repeated sam-
pling, and not the number of observations in each sample. False. n represents
the number of observations in the sample.
(b) In practice, we usually take a large number of samples so that we can accurately
estimate the sampling distribution of X. ¯ False. In practice, we draw only a
single sample. (We may draw a sample with a large number of observations,
but we draw only a single sample.) We use mathematical techniques to derive
¯
characteristics of the sampling distribution of X.
(c) The standard deviation of the population from which we are sampling de-
creases as the sample size increases. False. The standard deviation of the
population is what it is, and drawing more observations won’t change that.
(d) The standard deviation of the sampling distribution of X ¯ decreases as the
sample size increases. True. √n decreases as n increases.
σ
6 + (5 − 3.5) · 6 + (6 − 3.5) · 6 = 12 .
1 2 1 2 1 35
(d) To fnd the sampling distribution of X,¯ we can list all 36 possible outcomes
and calculate X
¯ for each possibility. The results are given in the following
table.
Roll 1 / Roll 2 1 2 3 4 5 6
1 1.0 1.5 2.0 2.5 3.0 3.5
2 1.5 2.0 2.5 3.0 3.5 4.0
3 2.0 2.5 3.0 3.5 4.0 4.5
4 2.5 3.0 3.5 4.0 4.5 5.0
5 3.0 3.5 4.0 4.5 5.0 5.5
6 3.5 4.0 4.5 5.0 5.5 6.0
x̄ 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0
p(x̄) 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36
(e) The quick way is to realize that µX¯ = µ = 3.5 (found in 17b). The slower way
would bePto use the formula for the expectation of a discrete random variable:
¯
E(X ) = xp(¯
¯ x).
√
35/12
(f) The quick way is to realize that σX̄ = n = √2 ≈ 1.2076. (In 17c we
√σ
found that σ 2 = 35
12 .) The slower way
P is to use the formula for the variance of
a discrete random variable: σX̄
2 = (x̄ − µ)2 p(x̄).
19. (a) If we are sampling from a normally distributed population, then the sampling
distribution of X¯ is normal for any sample size. True
(b) The standard deviation of the sampling distribution of the sample mean in-
creases as the sample size increases. False. σX̄ = √σn decreases as the sample
size increases.
(c) The central limit theorem states that the sample mean equals the population
mean, as long as the sample size is large. False. E(X ¯ ) = µ (for any sample
¯ will not be exactly
size) but there will always be some variability (the value of X
equal to µ).
(d) If the sampling distribution of X¯ is approximately normal, then the population
from which we are sampling must be approximately normal. False. For large
sample sizes the sampling distribution of X ¯ is always approximately normal,
regardless of the shape of the population from which we are sampling, and thus
an approximately normal sampling distribution of X ¯ does not say much about
the sampled population.
(e) All else being equal, the mean of the sampling distribution of X ¯ decreases as
the sample size increases. False. The mean of the sampling distribution of X ¯
is µ, regardless of the sample size.
20. (a) If we draw a very large sample from any population and plot a histogram of the
observations, the shape of the histogram will be approximately normal. False.
A histogram of a large number of observations sampled from a distribution will
Balka ISE 1.09 7.8. APPLICATIONS 98
(c) In practice, we usually know the true value of µ. False. In practical problems,
the parameter µ is almost always unknown.
(d) The sample mean is an unbiased estimator of the population mean. True.
E(X ¯ ) = µ.
(e) If we were to repeatedly sample from a population, then the distribution of the
sample mean would become approximately normal as the number of samples
increases, as long as the sample size of each sample stays constant. False. The
distribution of X¯ becomes approximately normal as the sample size n increases
(in other words, as the number of values used to calculate the mean increases).
Repeatedly sampling from a population doesn’t change the sampling distribution
of X¯.
7.8 Applications
22. Let X represent the amount of ribofavin (in mg) in a randomly selected large white
egg. Then X ∼ N (0.25, 0.022 ).
(e) 0.2469 mg. The 20th percentile of the standard normal distribution is ap-
proximately −0.84. The 20th percentile of X
¯ is approximately µ + √σ z =
n
0.25 + 0.02
√
30
(−0.84) = 0.2469 mg.
23. Let X represent the amount of protein (in grams) of a randomly selected breakfast
sandwich of this type. Then X ∼ N (17.0, 0.82 ).
24. Let X represent the length (in mm) of the right ear of a randomly selected Caucasian
Italian man in this age group. Then X ∼ N (62, 42 ).
25. Let X represent the length (in mm) of the carapace of a randomly selected adult
male crayfsh in this lake. Then X ∼ N (48.0, 2.52 ).
26. Let X represent the weight (in kg) of a randomly selected newborn female African
elephant that is born in captivity. Then X ∼ N (95.1, 13.72 ).
(a) 0.360. P (X > 100.0) = P (Z > 100.0−95.1 13.7 ) = P (Z > 0.358) = 0.360. (There
will be a little rounding error if you use a table: P (Z > 0.36) = 0.3594.)
(b) X¯ ∼ N (95.1, 13.72 ). (The sampling distribution of the mean weight of 3 ran-
3
domly selected newborn female elephants is approximately normal with a mean
2
3 ≈ 62.6 kg .
of 95.1 kg and a variance of 13.7 2
(c) 0.268. P (X¯ > 100.0) = P (Z > 100.0−95.1 √ ) = P (Z > 0.619) = 0.268.
13.7/ 3
(d) 0.196. P (98.0 < X ¯ < 102.0) = P ( 98.0−95.1√
13.7/ 10
< Z < 102.0−95.1
√ ) = P (0.669 <
13.7/ 10
Z < 1.593) = 0.944 − 0.748 = 0.196.
(e) 87.9 kg. Using a table, we would fnd that the 30th percentile of the stan-
dard normal distribution is approximately −0.52. The 30th percentile of birth
weight is approximately µ + σz = 95.1 + 13.7(−0.52) = 87.98 kg. (But there
is a fair bit of rounding error if you use a table. Using software, we can fnd
that the 30th percentile of the standard normal distribution is −0.5244 and
the 30th percentile of birth weights is 87.915 kg.)
(f) 92.83 kg. Using a table, we would fnd that the 30th percentile of the stan-
dard normal distribution is approximately −0.52. The 30th percentile of X ¯ is
approximately µ + √n z = 95.1 + √10 (−0.52) = 92.847 kg. (But there is a fair
σ 13.7
bit of rounding error if you use a table. Using software, we can fnd that the
30th percentile of the standard normal distribution is −0.5244 and the 30th
percentile of the sampling distribution of mean birth weight is 92.828 kg.)
27. Let X represent the weight (in grams) of the heart of a randomly selected adult
Caucasian male. Then X ∼ N (365, 712 ).
28. Let X represent the height (in cm) of a randomly selected Canadian female between
20 and 39 years of age. Then X ∼ N (163.3, 6.42 ).
(e) P (X¯ > 75) = P (Z > 75−71√ ) = P (Z > 0.894) = 0.186. (There will be a little
10/ 5
rounding error if you use a table: P (Z > 0.89) = 0.1867).
(f) From 29d, the probability a randomly selected student scored higher than 75
is 0.345. The probability all 5 randomly selected students scored higher than
75 is approximately 0.3455 = 0.005. (Note that for this calculation to be
reasonable, we require a very large class size.)
30. Let X represent the diameter of a randomly selected shaft produced by this process.
Then X ∼ N (4.10, 0.052 ).
31. Let X represent the oxygen purity level of a randomly selected container produced
by this company. If the company’s claim is true, X ∼ N (99.50, 0.0482 ).
Confdence Intervals
J.B.’s strongly suggested exercises: 1, 2, 4, 5, 7, 8, 9, 10, 11, 13, 14, 24, 26, 27, 29, 31, 32
8.1 Introduction
1. (a) z.025 = 1.96 (see Figure 8.1), and the margin of error is 1.96 × √5
22
= 2.089.
(b) z.05 = 1.645, and the margin of error is 1.645 × √1010 = 5.202.
(c) z.005 = 2.576, and the margin of error is 2.576 × √1518 = 9.108.
(d) α = 1 − 0.632 = 0.368. α2 = 0.184. z.184 = 0.90, and the margin of error is
0.90 × √1518 = 3.182.
(e) α = 1 − 0.212 = 0.788. α2 = 0.394. z.394 = 0.27, and the margin of error is
0.27 × √3550 = 1.336.
2. (a) 77.4% (since the area between −1.21 and 1.21 under the standard normal curve
is 0.774 (see Figure 8.2).
(b) 82.3% (since the area between −1.35 and 1.35 under the standard normal curve
is 0.823).
(c) 88.4% (since the area between −1.57 and 1.57 under the standard normal curve
is 0.884).
(d) 98.6% (since the area between −2.45 and 2.45 under the standard normal curve
is 0.986).
(e) 99.8% (since the area between −3.15 and 3.15 under the standard normal curve
is 0.998).
105
Balka ISE 1.09 8.2. INTERVAL ESTIMATION OF µ WHEN σ IS KNOWN 106
0.05 0.05
= 0.025 = 0.025
2 0.95 2
− z 0.025 0 z 0.025
The area to the right of this value is 0.025
The area to the left of this value is 0.975
(c) The standard normal table tells us that (d) The standard normal table tells us that
z0.025 = 1.96 (rounded to 2 decimal places). −z0.025 = −1.96 (rounded to 2 decimal
places).
0.774
0.113 0.113
-4 -3 -2 -1 0 1 2 3 4
-1.21 1.21
5. We can be 95% confdent that the mean right ear length of all Caucasian Italian
females between the ages of 18 and 30 lies between 60.91 mm and 62.89 mm. Or:
We can be 95% confdent that the true mean right ear length of Caucasian Italian
females between the ages of 18 and 30 lies between 60.91 mm and 62.89 mm.
(It can also be said that in repeated sampling, 95% of the 95% confdence intervals
calculated in this manner would capture the true mean right ear length of Caucasian
Italian females between the ages of 18 and 30.)
6. Statement (c) is a correct interpretation of the interval, and the rest of the state-
ments are false.
8.2.3 An Example
8.3.1 Introduction
8. (a) With 13 degrees of freedom, t.10 = 1.350 (see Figure 8.3). SE(X
¯) = √10
14
=
2.673. M OE = 1.350 × 2.673 = 3.609.
Balka ISE 1.09 8.3. CONFIDENCE INTERVALS FOR µ WHEN σ IS UNKNOWN 108
t distribution with n − 1 = 13
degrees of freedom
0.20 0.20
= 0.10 = 0.10
2 0.80 2
− t 0.10 0 t 0.10
The area to the right of this value is 0.10
(b) R tells us that (with 13 DF) (c) The t table tells us that (with 13 DF) t0.10 = 1.350
t0.10 = 1.350171 (rounded to 6 (rounded to 3 decimal places).
decimal places).
9. (a) The variance of the t distribution is greater than the variance of the standard
normal distribution. True.
(b) The mean of the t distribution is equal to the mean of the standard normal
distribution. True. (Both distributions have a mean of 0.)
(c) The t distribution has more area in the tails and a lower peak than the standard
normal distribution. True.
(d) As the degrees of freedom increase, the t distribution tends toward the standard
normal distribution. True.
(e) The t distribution is equivalent to the standard normal distribution if the
degrees of freedom are at least 30. False. Although the t distribution tends
toward the standard normal distribution as the degrees of freedom increase,
Balka ISE 1.09 8.3. CONFIDENCE INTERVALS FOR µ WHEN σ IS UNKNOWN 109
10. The interval found using zα/2 would be narrower. Although narrower intervals are
preferred, had we made this mistake we would be reporting incorrect results. (But
if the sample size is large the mistake would have only a very small e˙ect.)
8.3.2 Examples
11. (a) No, there is no indication of a violation of the normality assumption. The
boxplot looks roughly symmetric, and there is no systematic curvature or
outliers evident in the normal QQ plot.
(b) The sample mean is the point estimate of the population mean: X ¯ = 50.91
kgf. SE(X ¯ ) = √ = √ = 1.3683.
s
n
7.74
32
(c) Using software or a t table, we can fnd that with n − 1 = 32 − 1 = 31 degrees
of freedom, t0.025 = 2.0395. The 95% interval is given by:
¯ ± t0.025 SE(X
X ¯)
50.91 ± 2.0395 × 1.3683
50.91 ± 2.791
12. If the population is not normally distributed, then the true coverage probability
of the confdence interval method can be very di˙erent from the nominal (stated)
confdence level. In short, the reported results might be very misleading.
Balka ISE 1.09 8.4. DETERMINING THE MINIMUM SAMPLE SIZE N 110
13. Strong skewness and outliers pose the biggest problems for the t procedure (and
they are especially problematic if the sample size is small). If a distribution is
roughly symmetric and unimodal (the distribution is “mound shaped”), then this
type of violation of the normality assumption is not a big problem, even for small
sample sizes. The e˙ect of any violation of the normality assumption becomes less
of a problem as the sample size increases.
1.645 × 10 2
14. (a) n ≥ ( ) = 10.8. We need a sample size of at least 11.
5
1.96 × 10 2
(b) n≥( ) = 15.4. We need a sample size of at least 16. (Note that
5
in minimum sample size calculations, we round up to the next largest whole
number.)
1.96 × 30 2
(c) n≥( ) = 138.3. We need a sample size of at least 139.
5
2.576 × 10 2
(d) n≥( ) = 66357.8. We need a sample size of at least 66,358. (Note
0.10
that the required sample size might be far larger than what is possible in
practice. We may need to adjust our plans or abandon them if the calculation
tells us that our wants are not practical.)
1.96 × 4 2
(e) n≥( ) = 61.47. We need a sample size of at least 62.
1
15. σX̄ = √σ
n
= √10
16
= 2.5.
16. SE(X
¯) = √s
n
= 2.0
√
16
= 0.5.
17. The population is assumed to be normally distributed, and the standard deviation
is based on sample data, so to calculate the intervals we will use: X
¯ ± tα/2 √s .
n
(a) For the sample 3, 5, 12, X¯ = 6.667 and s = 4.725816. There are n − 1 =
3 − 1 = 2 degrees of freedom, and with 2 degrees of freedom, t.025 = 4.303.
The 95% interval is given by:
√ , which works out to 6.667 ± 11.74.
6.667 ± 4.303 × 4.7258
3
Balka ISE 1.09 8.5. CHAPTER EXERCISES 111
8.5.2 Concepts
18. D is the best interpretation. (The rest of the statements are false.)
19. (a) The number of intervals that contain µ will have a binomial distribution with
n = 100 and p = 0.95.
(b) P (X ≥ 1) = 1 − P (X = 0) = 1 − 0.95100 = 0.994.
20. (a) The central limit theorem tells us that the sampling distribution of X
¯ is ap-
proximately normal for large sample sizes, so in most cases when n = 100,
methods based on the normal distribution are reasonable (even when there is
some skewness). But there may be better methods of analysis, such as carrying
out an inference procedure on a transformation of the data.
(b) σX̄ = √1.0
100
= 0.10.
(c) We will use X¯ ± zα/2 √σ to calculate the interval. Using a standard normal
n
table or software, we can fnd the appropriate z value: z0.04 = 1.75. (The area
to the right of 1.75 under the standard normal curve is 0.04, so using 1.75
in the interval formula will result in a 92% confdence level.) The interval is:
1.4 ± 1.75 × 0.10, which works out to 1.4 ± 0.175.
(d) The true value of µ is unknown. (We are constructing a confdence interval in
order to estimate it.)
(e) The parameter µ represents the true mean time to death for this type of rat
after being infected with this virus (under the conditions of this experiment).
21. True. (This can be seen in the t table. For example, t0.025 → 1.96 as the degrees of
freedom increase.)
¯ 1 − 1.96 √σ < X
2, then we need to fnd P (X ¯ 1 + 1.96 √σ ).
¯2 < X
n n
¯ 1 − 1.96 √σ < X
P (X ¯ 1 + 1.96 √σ ) = P (−1.96 √σ < X
¯2 < X ¯ 1 < 1.96 √σ )
¯2 − X
n n n n
r
σ2 σ2 √ σ
Note that X ¯ 1 has a mean of µ − µ = 0 and a standard deviation of
¯2 − X + = 2√ .
n n n
Standardizing:
σ σ
−1.96 √ − 0 1.96 √ − 0
¯ 1 − 1.96 √σ < X
P (X ¯ 1 + 1.96 √σ ) = P (
¯2 < X √ σ
n
<Z< √ σ
n
)
n n 2√ 2√
n n
1.96 1.96
= P (− √ < Z < √ )
2 2
= 0.834
The probability that the sample mean of one sample falls within the 95% confdence
interval for µ found from another sample of the same size is only 0.834.
23. We are given that θ̂ = 2.00 and σθˆ = 3.00 (the standard deviation of the sampling
distribution of θ̂ is 3.00). Since the estimator θ̂ is normally distributed, and its
standard deviation is known, the margin of error is zα/2 × σθ̂ = 1.96 × 3 = 5.88,
and the 95% interval for θ is 2.00 ± 5.88.
24. Test your conceptual understanding: Which of the following statements are true?
You should be able to explain why a statement is true or why a statement is false.
(a) The standard error of a statistic decreases as the sample size decreases. False.
The standard error increases as the sample size decreases.
(b) The one-sample t procedures are robust to violations of the normality as-
sumption. True. The t procedures work quite well in a variety of non-normal
situations.
(c) The t procedures do not perform well when there is strong skewness or outliers
in the data, especially for small sample sizes. True.
(d) The width of a confdence interval for µ decreases as the sample size decreases.
False. The width of a confdence interval increases as the sample size decreases.
(e) The width of a confdence interval for µ decreases as the variance increases.
False. The width of a confdence interval increases as the variance increases.
25. Test your conceptual understanding: Which of the following statements are true?
You should be able to explain why a statement is true or why a statement is false.
Balka ISE 1.09 8.5. CHAPTER EXERCISES 113
(a) In practical problems, σ is usually known and thus we usually use z procedures
instead of t procedures. False. The vast majority of the time σ is an unknown
quantity.
(b) Standard errors are the estimated standard deviations of parameters in re-
peated sampling. False. A standard error is the estimated standard deviation
of a statistic in repeated sampling.
(c) If the assumptions of a procedure used to calculate a 90% confdence interval
are violated, the true confdence level of the interval may be very di˙erent
from 90%. True. If the assumptions are false the true confdence level may be
very di˙erent from what is stated.
(d) All else being equal, the width of a confdence interval decreases as the con-
fdence level increases. False. If we want greater confdence that the interval
captures the parameter, we must cover more ground and have a wider interval.
(e) All else being equal, the margin of error decreases as the confdence level in-
creases. False. The margin of error increases as the confdence level increases.
(f) All else being equal, the standard error of the sample mean decreases as the
confdence level of the interval increases. False. SE(X ¯ ) = √s , which does not
n
involve the confdence level.
(g) All else being equal, the confdence level of an interval increases as the sample
size increases. False. The confdence level (1 − α) and sample size (n) are not
related.
8.5.3 Applications
26. (a) Both the boxplot and the normal QQ plot show indicate right-skewness (which
is typical of BMI data). This skewness is not a serious problem in this scenario,
as the skewness is not very strong, and the sample size is fairly large (94). We
can use the t procedures. (We will use inference procedures on the mean in
this question, which is perfectly valid. But because of the skewness, in real
world scenarios the median BMI is often reported instead of the mean BMI.)
(b) The sample mean is the point estimate of the population mean: X ¯ = 23.3.
SE(X ¯ ) = √ = √ = 0.5570.
s 5.4
n 94
(c) Using software, we can fnd that with 93 degrees of freedom, t0.025 = 1.9858.
(If we must use a table, we may have to look up 90 df and fnd the t value of
1.987.)
¯ ± t0.025 SE(X
X ¯)
23.3 ± 1.9858 × 0.5570
23.3 ± 1.106
Balka ISE 1.09 8.5. CHAPTER EXERCISES 114
which works out to (22.2, 24.4). (Had we used the t value of 1.987 from the
table, we would have gotten the same answer to 1 decimal place.)
(d) The interpretation of the interval is a little tricky here, due to the complexities
of the sampling design. The sample is not truly a simple random sample from
any population, so one could argue that we don’t really have a confdence
interval for anything. But sampling designs like this are very common in
practice (when human consent is required, there are always going to be biases
present). So I’ll give a not-quite-true-but-somewhat-reasonable interpretation,
then discuss the biases:
We can be 95% confdent that µ, the true mean BMI for 14-year old boys in
this area of rural Texas, lies between 22.2 and 24.4 kg/m2 .
The sample is drawn from a school district in north central Texas, so, loosely,
the population is all 14 year-old boys in this area of north central Texas.
However, there are some sampling biases that make this interpretation not
quite true:
• The sample was drawn from only 6 schools in that area. Students in these
schools may di˙er in a meaningful way from students at other schools in
the area (and from children that are not enrolled at a school).
• Parents needed to provide informed consent in order for students to par-
ticipate. Students whose parents provide informed consent may di˙er in
a meaningful way from those whose parents do not.
• Students needed to provide self-assent (the student had to agree to be
part of the study). Students who agree to participate may di˙er in a
meaningful way from those who do not.
• Students who participated were given a t-shirt. This could introduce bias,
as it is conceivable that there is a relationship between desire for the t-shirt
and BMI.
27. (a) No, there is no indication of a violation of the normality assumption. The box-
plot looks symmetric, and there is no systematic curvature or outliers evident
in the normal QQ plot. A minor violation of the normality assumption would
not be a big deal in this scenario, as the sample size is not small (n = 51). We
can use the t procedures.
(b) The sample mean is the point estimate of the population mean: X ¯ = 1.65
mm/s. SE(X ¯ ) = √ = √ = 0.0868.
s
n
0.62
51
(c) Using software or a t table, we can fnd that with n − 1 = 51 − 1 = 50 degrees
of freedom, t0.025 = 2.009. (If we must use a table, we may have to look up 90
Balka ISE 1.09 8.5. CHAPTER EXERCISES 115
28. (a) We need a simple random sample from a normally distributed population. The
skeletons in the Athens Collection do not truly represent a simple random from
any population, and to come up with an informed opinion we would need to
look in greater detail at how this sample was gathered. However, in a loose
way, we can think of this sample of skeletons as a random sample of skeletons
from men who lived near Athens near the end of the 20th century.
As far as normality is concerned, we don’t have the actual data to investigate
that, but it is likely that the diameters are approximately normally distributed.
The sample size (n = 24) is not very large, so normality is a concern. If we
had access to the data, we should investigate the normality assumption with
a normal QQ plot.
(b) The sample mean is the point estimate of the population mean: X ¯ = 10.38
mm/s. SE(X ) = √n = √24 = 0.1286.
¯ s 0.63
29. (a) We would need the sample to be a simple random sample from a normally dis-
tributed population. The sample of female blowfies is not truly a simple ran-
dom sample of blowfies from the area (they were captured in traps, and there
could conceivably be factors infuencing the types of fy that get trapped). We
also need the number of larvae in fies of this type to be normally distributed.
(But since the sample size is not small (n = 49), the normality assumption is
not very important.)
(b) The largest observation is roughly 1.5 standard deviations greater than the
mean, and the smallest is roughly 2 standard deviations below the mean. So
there are not any extreme values (outliers) that would a˙ect the t procedures.
This information makes the use of the t procedures more reasonable.
(c) The sample mean is the point estimate of the population mean: X ¯ = 33.4
mm/s. SE(X ) = √n = √49 = 1.0.
¯ s 7.0
(d) Using software, we can fnd that with 48 degrees of freedom, t0.025 = 2.0106.
(If we must use a table, we may have to look up 45 df and fnd the t value of
2.014.)
¯ ± t0.025 SE(X
X ¯)
33.4 ± 2.0106 × 1.0
33.4 ± 2.01
which works out to (31.4, 35.4). (Had we used the t value of 2.014 from the
table, we would have gotten the same answer to 1 decimal place.)
(e) We can be 95% confdent that the true mean number of larvae carried by
female C. varifrons blowfies in this feld lies between 31.4 and 35.4.
There are possible biases introduced by the sampling mechanism. Flies with
certain characteristics (old or large fies, for example) may be more or less likely
to be captured. These characteristics may also be related to the number of
larvae that they carry. (For example, it is conceivable that older fies would be
less likely to be captured. If older fies also tend to carry fewer larvae, then the
interval would tend to be too large (since older fies would be underrepresented
in the sample).)
Also, the interval only applies to this type of fy under the conditions of the
study. If we were to use this interval to estimate the mean in other situations
(di˙erent locations, temperatures, humidity, or for a di˙erent type of fy), then
the interval may be very far o˙ and very misleading.
30. (a) We would need the sample of fies to behave like a simple random sample
of C. varifrons blowfies. We would also need the rate of deposition to be
approximately normally distributed. We would need to investigate the study
in greater detail to form an opinion about the sample of fies that they used.
Balka ISE 1.09 8.5. CHAPTER EXERCISES 117
Since the sample size is not small (n = 42), it would be reasonable to use the
t procedures, unless the data showed strong skewness or outliers.
(b) The sample mean is the point estimate of the population mean: X ¯ = 0.46
mm/s. SE(X ¯ ) = √ = √ = 0.049.
s
n
0.32
42
(c) Using software, we can fnd that with 41 degrees of freedom, t0.025 = 2.0195.
(If we must use a table, we may have to look up 40 df and fnd the t value of
2.021.)
¯ ± t0.025 SE(X
X ¯)
0.46 ± 2.0195 × 0.049
0.46 ± 0.100
which works out to (0.36, 0.56). (Had we used the t value of 2.021 from the
table, we would have gotten the same answer to 2 decimal places.)
(d) We can be 95% confdent that the population mean rate of larviposition lies
between 0.36 and 0.46 larvae per second.
This interval applies to this type of blowfy under the conditions of this ex-
periment. If the conditions of the experiment were to change, then the true
mean rate of larviposition may change. Biases may have been introduced by
the choice of C. varifrons blowfy used in this experiment. (It is not clear how
the C. varifrons blowfies used in the experiment were chosen.)
31. (a) To draw conclusions about all hash browns made in U.S. locations of this fast-
food chain, we need the hash browns in the sample to be a random sample
from the population of hash browns in these U.S. locations. Little is said in the
U.S. Nutrient Database about how the sample was collected, but the people
that compile this database are professionals, and it is unlikely that this sample
has obvious biases. (Discussion of possible bias will be discussed below in the
answer to Question 31d.)
For the t procedure to be reasonable, we need the amount of sodium in hash
browns of this type to be normally distributed. This is especially true here,
since the sample size is so small (n = 6). It is very likely that the amount
of sodium in hash browns of this type is approximately normally distributed,
but we cannot tell from the given information.
(b) The sample mean is the point estimate of the population mean: X ¯ = 324.8mg.
SE(X ¯ ) = √ = √ = 16.330.
s 40.0
n 6
(c) Using software or a t table, we can fnd that with 5 degrees of freedom, t0.025 =
2.571.
¯ ± t0.025 SE(X
X ¯)
324.8 ± 2.571 × 16.330
324.8 ± 41.98
Balka ISE 1.09 8.5. CHAPTER EXERCISES 118
There are many sources of possible bias, depending on how the sample was
drawn. The distribution of sodium may di˙er from location to location, as
preparation methods might di˙er slightly between locations. (The fast-food
chain in question is a very large chain with common preparation methods, so
this is unlikely to be major infuence, but there still could be an e˙ect.) If the
sample was drawn from an individual location, and we were trying to draw
inferences about all U.S. restaurants, this could easily result in bias. If the
sample of 6 hash browns was drawn from a one batch of hash browns, then it
would be very unwise to try to draw conclusions about all hash browns from
this type of sample.
(e) Yes, the company’s claimed value of 310 mg is a plausible value of the popu-
lation mean, since it falls within the 95% confdence interval.
32. (a) To draw conclusions about black women in the Manchester area, we need the
women in the sample to be a random sample from the population of black
women in the Manchester area. The sample was drawn from women attending
a sexual health clinic, and these women might di˙er in a meaningful way from
the general population of black women in the area. (Sources of possible bias
will be discussed below in the answer to Question 32d.)
For the t procedure to be reasonable, we also need the 2D:4D ratios to be
normally distributed. The article stated that the distribution was close to
normal, so the normality assumption is reasonable. The sample size is 46,
which is not small, so the t procedures would still be reasonable even if there
were a little skewness or other issues with mild non-normality.
(b) The sample mean is the point estimate of the population mean: X ¯ = 0.963.
SE(X ¯ ) = √ = √ = 0.0050.
s 0.034
n 46
(c) Using software or a t table, we can fnd that with 45 degrees of freedom,
t0.025 = 2.014.
¯ ± t0.025 SE(X
X ¯)
0.963 ± 2.014 × 0.0050
0.963 ± 0.010
(d) We can be 95% confdent that the true mean 2D:4D ratio (of the left hand)
of black women attending this sexual health clinic in Manchester lies between
0.953 and 0.973.
There are many sources of possible bias, depending on the conclusion one
wishes to make. Some things to consider:
• The sample of women was drawn from black women attending a sexual
health clinic in Manchester, UK. It is reasonable to think that the conclu-
sions apply to black women who attend this clinic.
• It is likely reasonable to think that these results apply to all black women
in the Manchester, UK area, but we need to be cautious with a conclusion
like this, since black women attending this sexual health clinic might di˙er
in a meaningful way from the general population of black women in the
area.
• Although these results give some indication of the 2D:4D ratio for black
women in general, generalizing beyond the area from which the sample
was drawn is always dubious and may lead to misleading conclusions.
• As is often the case involving studies with human participants, individuals
were included in this study only if they provided written consent. Women
who consent to studies like this might di˙er in a meaningful way from
those who do not.
33. (a) We would need these intimate partner homicides to be a simple random sam-
ple from a (theoretical) population of intimate partner homicides. We also
need the ages of the perpetrators of intimate partner homicide to be normally
distributed. These assumptions will not be perfectly justifed in this case.
There are some sources of possible bias (discussed below in the answer to
Question 33d). The ages will not be perfectly normally distributed, but the
sample size is not too small (n = 36), so unless there is strong skewness or
outliers, the t procedure should work reasonably well.
(b) The sample mean is the point estimate of the population mean: X ¯ = 43.8
years. SE(X ¯ ) = √ = √ = 2.583.
s
n
15.5
36
(c) Using software or a t table, we can fnd that with 41 degrees of freedom,
t0.025 = 2.030.
¯ ± t0.025 SE(X
X ¯)
43.8 ± 2.030 × 2.583
43.8 ± 5.24
One source of bias is that the sample of homicides included only those scenarios
that were determined to be intimate partner homicides. (In reality, not all
homicides are correctly labelled as such, and not all intimate partner homicide
perpetrators are properly labelled as such.) It is conceivable that the age of
perpetrators in which the homicide is not classifed properly tends to be older
or younger than the age of perpetrators in homicides classifed as intimate
partner homicides. So the interval applies only to the population of scenarios in
which the death was labelled as an intimate partner homicide. This population
would di˙er (in a small but meaningful way) from the population of scenarios
that were truly intimate partner homicides.
34. (a) There are no obvious problems here. The points fall close to a straight line,
with no obvious curvature or other problems (indicating that the data is ap-
proximately normally distributed). It is reasonable to assume normality and
use the t procedure.
(b) The standard deviation is based on sample data, so the interval is found using
the formula X¯ ± tα/2 √s . With 15 − 1 = 14 degrees of freedom, t.025 = 2.145.
n
The endpoints of the interval are: 295.6 ± 2.145 × √ 10.2
15
, which works out to
295.6 ± 5.65 or (289.95, 301.25).
(c) The second statement (ii) is correct and the rest of the statements are false.
35. Of the given options, D is the best one. (The rest of the statements are false.)
36. We can be 95% confdent that the true mean wait time for all calls to this call
centre lies between 2.88 and 3.12 minutes.
37. (a) Plotting a normal quantile-quantile plot can give a good indication of whether
or not the data is approximately normal.
(b) The interval is 6.8±2.365× 0√.32
8
, which works out to 6.8±0.2676, or (6.53, 7.07).
(c) We can be 95% confdent that the true mean fuel consumption of this type of
car under these conditions lies between 6.53 and 7.07 l/100k.
(c) 29.3 ± 2.060 × 2.3338, which works out to 29.3 ± 4.81 or (24.49, 34.11).
(d) 1.708 × √
11.9
26
= 3.986.
(e) 29.3 ± 3.986 or (25.31, 33.29).
(f) We can be 90% confdent that the true mean number of facial tissues used per
day by this type of cold su˙erer under these conditions lies between 25.31 and
Balka ISE 1.09 8.5. CHAPTER EXERCISES 121
33.29.
39. We can be 95% confdent that the true mean melting temperature of this new type
of oil lies between 34.34 and 34.71 degrees Celsius.
Hypothesis Testing
J.B.’s strongly suggested exercises: 1, 5, 6, 10, 12, 13, 15, 19, 21, 33, 37, 38, 41, 48, 59,
61, 62, 63, 64, 65, 66
9.1 Introduction
1. (a) H0 : µM = µF (the true mean resting heart rate is the same for male and female Olympic athletes).
Ha : µM = 6 µF (the true mean resting heart rate is di˙erent between male and female Olympic athlet
(b) H0 : p1 = p2 = p3 = p4 (the six month survival rate is the same for all four treatments).
Ha : pi 6= pj for at least one i, j combination (the six month survival rates are not all equal).
(c) H0 : µ = 4.0 (the true mean saturated fat content is 4.0 grams).
Ha : µ > 4.0 (the true mean saturated fat content is greater than 4.0 grams).
It would also be acceptable to express the hypotheses this way:
H0 : µ ≤ 4.0 (the true mean saturated fat content is no more than 4.0 grams).
Ha : µ > 4.0 (the true mean saturated fat content is greater than 4.0 grams).
But when carrying out the test we will need to test a specifc value, so in this
text we will always express the null hypothesis as an equality.
(d) H0 : p = 0.98 (the true proportion of satisfed customers is 0.98).
Ha : p < 0.98 (the true proportion of satisfed customers is less than 0.98).
(e) H0 : σ12 = σ22 = σ32 (the true variances are all equal).
Ha : σi2 6= σj2 for at least one i, j combination (the true variances are not all equal).
123
Balka ISE 1.09 9.2. THE LOGIC OF HYPOTHESIS TESTING 124
(a) Z = 15.5−20
√
5/ 10
= −2.85.
(b) Z = 15.5−20
√
5/ 40
= −5.69.
(c) Z = 35.5−30
√
5/ 20
= 4.91.
(d) Z = 65.5−30
√
5/ 20
= 31.75.
¯ 12 X̄ −µ
4. X−√
6/ 75
is the Z statistic: Z = σ/ √ . When we are sampling from a normally dis-
n
tributed population and the null hypothesis is true, the Z statistic has the standard
normal distribution.
6. (a) The signifcance level of a test is the probability of rejecting the null hypothesis,
given the null hypothesis is true. True.
Balka ISE 1.09 9.3. HYPOTHESIS TESTS FOR µ WHEN σ IS KNOWN 125
(b) If we reject the null hypothesis, then we know that the null hypothesis is false.
False. There is an important di˙erence between the result of the test and the
(unknown) underlying reality. Whatever the result of the hypothesis test, we
still do not know the underlying reality.
(c) If we reject the null hypothesis, then we know that the alternative hypothesis
is false. False.
(d) If we do not reject the null hypothesis, then we know that the null hypothesis
is true. False.
(e) If we do not reject the null hypothesis, then we know that the alternative
hypothesis is true. False.
9.3.4 P-values
7. (a) p-value = 0.0058 (twice the area to the right of 2.76 under the standard normal
curve).
(b) p-value = 0.9971 (the area to the left of 2.76 under the standard normal curve).
(c) p-value = 0.0029 (the area to the right of 2.76 under the standard normal
curve).
(d) p-value = 0.0818 (twice the area to the left of −1.74 under the standard normal
curve).
(e) p-value = 0.0409 (the area to the left of −1.74 under the standard normal
curve).
(f) p-value = 0.9591 (the area to the right of −1.74 under the standard normal
curve).
9. A is correct, and the rest of the statements are false. The p-value is the probability,
given the null hypothesis is true, of obtaining a test statistic with at least as much
evidence against the null hypothesis as what was observed in the sample.
(d) The p-value of a test depends on sample data and on the null and alternative
hypotheses. True.
(e) The p-value of a two-sided test will be greater than 1 if the null hypothesis is
true. False. A p-value is a probability and as such cannot be greater than 1.
11. (a) The p-value of a test will be less than 0.05 if the null hypothesis is false.
False. The p-value can be any value between 0 and 1, whether or not the null
hypothesis is true. Keep in mind that there is a fundamental di˙erence between
the conclusion from a test and the (unknown) underlying reality.
(b) The p-value of a test is the probability that the null hypothesis is true. False.
(c) If the p-value of a hypothesis test with a two-sided alternative is equal to
exactly 1, then we can be certain the null hypothesis is true. False.
(d) If the p-value of a hypothesis test with a two-sided alternative is equal to
exactly 1, then we can be certain the null hypothesis is false. False.
(e) If the p-value of a test is less than 0 that means there is extremely strong
evidence against the null hypothesis. False. A p-value is a probability and as
such it cannot be less than 0. If we fnd a negative p-value, that means we
made a mistake.
9.4 Examples
12. (a) H0 : µ = 3.0 mg/kg dw. (The true mean cadmium concentration in caps of
Boletus edulis in this area is 3.0 mg/kg of dry weight. (The true mean cadmium
concentration is at the tolerable value.).)
Ha : µ > 3.0 mg/kg dw. (The true mean cadmium concentration in caps of
Boletus edulis in this area is greater than 3.0 mg/kg of dry weight. (The true
mean cadmium concentration exceeds the tolerable value.).)
It would also be acceptable to express the hypotheses this way:
H0 : µ ≤ 3.0. (The true mean cadmium concentration in caps of Boletus edulis
in this area is no more than 3.0 mg/kg of dry weight. (The true mean cadmium
concentration is no more than the tolerable value.).)
Ha : µ > 3.0 (The true mean cadmium concentration in caps of Boletus edulis
in this area is greater than 3.0 mg/kg of dry weight. (The true mean cadmium
concentration exceeds the tolerable value.).)
In some ways this set of hypotheses makes more sense, and is more ftting with
the spirit of the test. But when carrying out the test we will need to test a
specifc value, so in this text we will always express the null hypothesis as an
equality.
¯ ¯
(b) Z = X−µσ
0
= X−µ √ 0 = 9.26−3
σ/ n
√
5.00/ 29
= 6.74.
X̄
(c) Since the alternative hypothesis is Ha : µ > 3.0, the p-value is the area to the
Balka ISE 1.09 9.4. EXAMPLES 127
right of 6.74 under the standard normal curve. This area is very, very small
(see Figure 9.1).
-6 -4 -2 0 2 4 6
6.74
Figure 9.1: The p-value is the area to the right of 6.74 under the standard normal curve.
Using software: p-value = 7.8 × 10−8 . Using a standard normal table, we could
say only that the p-value is very close to 0 (or we may give a range, such as
p-value < 0.0000003).
(d) The p-value is less than the given signifcance level of α = 0.05, so we can
reject the null hypothesis at α = 0.05.
(e) There is statistically signifcant evidence (at α = 0.05) against the null hy-
pothesis that µ = 3.0 mg/kg dw. So there is very strong evidence that the
population mean µ is greater than 3.0 mg/kg dw. This means that there is
statistically signifcant evidence that the true mean concentration of cadmium
in caps of Boletus edulis mushrooms in this area is greater than the Slovenian
tolerable value of 3.0 mg/kg dw.
13. (a) H0 : µ = 3.35 kg. (The true mean weight of full-term Hispanic newborns in
the U.S. in 2013 is 3.35 kg – the same mean as for full-term newborns of the
general population.)
Ha : µ 6= 3.35 kg. (The true mean weight of full-term Hispanic newborns in
the U.S. in 2013 di˙ers from 3.35 kg. That is, the true mean weight di˙ers
from that of full-term newborns of the general population.)
¯ ¯
(b) Z = X−µσ
0
= X−µ√ 0 = 3.318−3.35
σ/ n
√
0.47/ 100
= −0.681.
X̄
(c) 0.496. The alternative hypothesis is two-sided, so the p-value is double the
area in the tail, beyond the observed value of the test statistic. In this case,
the p-value is double the area to the left of −0.681 under the standard normal
curve (see Figure 9.2).
The area to the left of −0.681 under the standard normal curve is 0.248, and
so the p-value is 0.496.
(d) The p-value is greater than the given signifcance level of α = 0.05, so we
do not reject the null hypothesis at this level of signifcance. (The p-value is
in fact much larger than 0.05, so we would not reject the null hypothesis at
Balka ISE 1.09 9.5. INTERPRETING THE P -VALUE 128
-6 -4 -2 0 2 4 6
-0.681
Figure 9.2: The p-value is double the area to the left of −0.681 under the standard normal
curve.
any reasonable signifcance level, but we used α = 0.05 since it was the given
signifcance level for this problem.)
(e) Based on this sample, there is no evidence that the true mean weight of full-
term Hispanic newborns in 2013 di˙ers from 3.35 kg (the mean weight of all
full-term U.S. newborns that year). We can say that the data is consistent
with full-term Hispanic newborns having the same mean weight as that of
full-term newborns in the general population. (N.B. We cannot say that they
are equal, just that we do not have any evidence of a di˙erence.)
14. (a) The p-value will have a uniform distribution on the interval (0,1).
(b) If the null hypothesis is true, on average the p-value will equal 0.5.
15. (a) A p-value of 2.9 × 10−48 gives extremely strong evidence against the null hy-
pothesis. There is almost no chance of observing what was observed if the null
hypothesis were true.
(b) A p-value of 0.0008 gives very strong evidence against the null hypothesis.
There is less than a one-in-a-thousand chance of observing what was observed
if the null hypothesis were true.
(c) A p-value of 0.02 gives strong, but not overwhelming, evidence against the null
hypothesis.
Balka ISE 1.09 9.6. TYPE I ERRORS, TYPE II ERRORS, AND THE POWER OF A TEST 129
(d) A p-value of 0.07 gives a little evidence against the null hypothesis, but it is
not strong (and not signifcant at the commonly used α level of 0.05).
(e) A p-value of 0.12 gives possibly a hint of evidence against the null hypothesis,
but nothing that is signifcant at any of the usual signifcance levels.
(f) A p-value of 0.45 gives no evidence against the null hypothesis.
(g) A p-value of 0.88 gives no evidence against the null hypothesis.
16. (a) Here the null hypothesis is false and we will reject it, so we will make neither
error. (Our decision is consistent with the underlying reality.)
(b) Here the null hypothesis is false and we will not reject it, so we will make a
Type II error.
(c) Here the null hypothesis is true and we will reject it, so we will make a Type
I error.
(d) Here the null hypothesis is true and we will not reject it, so we will make
neither error. (Our decision is consistent with the underlying reality.)
17. It all boils down to the balance between Type I and Type II errors. The signifcance
level α is the probability of a Type I error (given the null hypothesis is true). If a
Type I error has major repercussions (thousands of people will die, for example),
but a Type II error is not as costly, then we should choose a very small value for α.
If a Type I error is not very costly relative to a Type II error, then a larger choice
of α is reasonable.
18. (a) Reject H0 if X¯ ≤ 26.710. With the alternative hypothesis of Ha : µ < 30 and
α = 0.05, we reject the null hypothesis if Z ≤ −1.645. Expressing the rejection
region in terms of X:
¯
Z ≤ −1.645
¯
X − 30
√ ≤ −1.645
20/ 100
X¯ ≤ 30 + √20 (−1.645)
100
¯
X ≤ 26.710
Balka ISE 1.09 9.6. TYPE I ERRORS, TYPE II ERRORS, AND THE POWER OF A TEST 130
(b) 0.259. The power of the test is the probability of rejecting the null hypothesis,
given µ = 28:
26.710 − 28
Power = P (X
¯ ≤ 26.710|µ = 28) = P (Z ≤ √ )
20/ 100
= P (Z ≤ −0.645)
= 0.259
(c) 0.639. The power of the test is the probability of rejecting the null hypothesis,
given µ = 26:
Z ≥ 2.326
¯
X − 800
√ ≥ 2.326
160/ 400
¯ ≥ 800 + √160 (2.326)
X
400
¯ ≥ 818.61
X
(b) 0.140. The power of the test is the probability of rejecting the null hypothesis,
given µ = 810:
Z ≤ −1.96 or Z ≥ 1.96
¯ − µ0
X ¯ − µ0
X
√ ≤ −1.96 √ ≥ 1.96
σ/ n σ/ n
X¯ − 15 X¯ − 15
√ ≤ −1.96 √ ≥ 1.96
12/ 50 12/ 50
¯ ≤ 11.674
X ¯ ≥ 18.326
X
(b) The power of the test if µ = 16 is the total area in these two regions under
the sampling distribution of X.
¯
Power = P (X
¯ ≤ 11.674|µ = 16) + P (X
¯ ≥ 18.326|µ = 16)
11.674 − 16 18.326 − 16
= P (Z ≤ √ ) + P (Z ≥ √ )
12/ 50 12/ 50
= P (Z ≤ −2.549) + P (Z ≥ 1.371)
= 0.0054 + 0.0852
= 0.0906
(c) If µ = 17:
Power = P (X
¯ ≤ 11.674|µ = 17) + P (X
¯ ≥ 18.326|µ = 17)
11.674 − 17 18.326 − 17
= P (Z ≤ √ ) + P (Z ≥ √ )
12/ 50 12/ 50
= P (Z ≤ −3.138) + P (Z ≥ 0.781)
= 0.0008 + 0.2173
= 0.2181
Balka ISE 1.09 9.7. ONE-SIDED TEST OR TWO-SIDED TEST? 132
(d) If µ = 13:
Power = P (X
¯ ≤ 11.674|µ = 13) + P (X
¯ ≥ 18.326|µ = 13)
11.674 − 13 18.326 − 13
= P (Z ≤ √ ) + P (Z ≥ √ )
12/ 50 12/ 50
= P (Z ≤ −0.781) + P (Z ≥ 3.138)
= 0.2173 + 0.0008
= 0.2181
21. (a) P (X
¯ > 10|µ = 0) = P (Z > 10−0
√ )
100/ 50
= P (Z > 0.707) = 0.24.
(b) 0. A Type II error is possible only when the null hypothesis is false.
(c) 0. A Type I error is possible only when the null hypothesis is true.
(d) P (X
¯ > 10|µ = 5) = P (Z > 10−5 √ ) = P (Z > 0.353) = 0.36.
100/ 50
(e) P (X
¯ > 10|µ = 10) = P (Z > 10−10
√ ) = P (Z > 0) = 0.5.
100/ 50
22. (a) An increase in the sample size will increase the power of the test.
(b) A decrease in the signifcance level will decrease the power of the test.
(c) An increase in the distance between the true mean and the hypothesized mean
will increase the power of the test.
(d) A decease in the population variance will increase the power of the test.
23. If we choose a one-sided alternative hypothesis, then the test will have greater
power than its two-sided counterpart if the true mean lies in the direction specifed
by the alternative hypothesis. The downside of the one-sided alternative is that if
the true mean lies in the opposite direction, then it will be impossible to detect
this di˙erence from the hypothesized value. If we choose a two-sided alternative
hypothesis, then the test will be able to detect a di˙erence from the hypothesized
value in either direction.
24. This is poor statistical practice. We should not let the sample data infuence our
choice of hypothesis. If we are interested in a di˙erence from the null hypothesis
in either direction, then a two-sided alternative should be chosen at the outset.
Balka ISE 1.09 9.8. STATISTICAL SIGNIFICANCE AND PRACTICAL SIGNIFICANCE 133
25. (a) If the p-value of a hypothesis test is 0.0000001, then there is very strong
evidence that the results of the test have practical importance. False. A
very small p-value means that there is very strong evidence against the null
hypothesis. Whether or not these results have any practical importance is a
di˙erent question (one that cannot be answered with a hypothesis test).
(b) If the sample size is extremely large, then even tiny, meaningless di˙erences
from the hypothesized value will be found statistically signifcant. True.
(c) If the results of a hypothesis test are reported, then there is no need to also
report the associated confdence interval for the parameter. False. It is good
practice to also report a confdence interval for the parameter of interest, to
give an idea of the size of the e˙ect.
26. (a) No. The p-value is less than 0.05, implying 50 would fall outside of the 95%
confdence interval.
(b) Yes. The p-value is greater than 0.01, implying 50 would be contained in the
99% confdence interval.
27. B (the p-value is less than 0.05). Since the 95% confdence interval for µ does not
contain 0, a two-sided test of H0 : µ = 0 would result in a p-value that is less than
0.05.
29. (a) Using software: p-value = 0.0508. From the table: 0.05 < p-value < 0.10.
(b) Using software: p-value = 0.9745. From the table: 0.95 < p-value < 0.975.
(c) Using software: p-value = 0.0254. From the table: 0.025 < p-value < 0.05.
(d) Using software p-value = 0.0833. From the table: 0.05 < p-value < 0.10.
(e) Using software: p-value = 0.0416. From the table: 0.025 < p-value < 0.05.
(f) Using software: p-value = 0.9584. From the table: 0.95 < p-value < 0.975.
H0 : µ = 11.0 (This type of burrito has a true mean fat content of 11.0 g.)
Ha : µ 6= 11.0 (This type of burrito has a true mean fat content that di˙ers from 11.0 g.)
(b) SE(X
¯) = √s
n
= 1√
.45
6
= 0.59196.
X¯ − µ0
t= ¯)
SE(X
11.49 − 11.0
=
0.59196
= 0.828
(c) Since the alternative hypothesis is two-sided, the p-value is double the area
in the tail, beyond the observed value of the test statistic. In this case, the
p-value is double the area to the right of 0.828 under a t distribution with
n − 1 = 6 − 1 = 5 degrees of freedom (see Figure 9.3).
-6 -4 -2 0 2 4 6
0.828
Figure 9.3: The p-value is double the area to the left of 0.828 under a t distribution with
5 degrees of freedom.
Balka ISE 1.09 9.11. MORE ON ASSUMPTIONS 135
Using software, p-value = 0.45. Using a t table, we could fnd only a range:
p-value > 0.40.
(d) Since the p-value is large (0.45), there is no evidence against the null hypoth-
esis. There is no evidence that the true mean fat content in this type of bean
burrito di˙ers from the stated value of 11 grams. (The observed data is con-
sistent with the true mean equalling 11 grams.) N.B. We cannot say that the
true mean does equal 11.0 g, we can only that we have no evidence that it
di˙ers from 11 g.
(e) In order for the t procedure to be valid, we need the sample to be a simple
random sample from the population of interest, and we need the population
to be normally distributed. The sampling design was not clearly stated (we do
not know how the sample was obtained), so we cannot make a truly informed
decision about any possible biases that may have been introduced. While fast-
food restaurants strive for consistency, it is likely that the amount of fat in a
burrito tends to change a little from preparer to preparer, and possibly from
restaurant to restaurant. But given the nature of fast food it is probably not
too far o˙ the mark to think of the sample as a random sample from all bean
burritos from this fast-food chain.
The sample size is small, so the normality assumption is important. We were
not given the data, but we were told that there were no outliers, which is
somewhat comforting. Using the t procedure here is a little questionable, but
still reasonable.
31. No, it would not be reasonable to use the t procedures on this raw data. This plot
shows some right skewness and a large outlier. (Outliers and skewness negatively
a˙ect the t procedures.) The negative impact of skewness and/or outliers can be
bad, especially when the sample size is not large. (The sample size here is 20, which
is not very large.) There are other analysis options, such as using the t procedure
on a transformation of the data (such as the log of the data values), or using a
distribution-free procedure.
32. (a) The t test performs poorly when we are sampling from a strongly skewed
distribution, especially if the sample size is small.
(b) If we use the t test when sampling from skewed distributions, then the true
signifcance level may be quite di˙erent from the reported signifcance level,
and the true p-value may be quite di˙erent from the stated p-value. In short,
the reported results may be very misleading. This is especially true for small
sample sizes.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 136
X̄−µ0
33. (a) t = SE(X̄) = 15.5−20
√
2/ 10
= −7.12. If the null hypothesis is true, the t test statistic
will have a t distribution with n − 1 = 10 − 1 = 9 degrees of freedom.
X̄−µ0
(b) t = SE(X̄) = 15.5−20
√
20/ 10
= −0.712. If the null hypothesis is true, the t test statistic
will have a t distribution with n − 1 = 10 − 1 = 9 degrees of freedom.
(c) Z = X̄−µσ
0
= 35.5−30
√
2/ 40
= 17.39. If the null hypothesis is true, the Z test
X̄
statistic will have the standard normal distribution. (Since we are sampling
from a normally distributed population and σ is known.)
X̄−µ0
(d) t = SE(X̄) = 45.5−40
√
20/ 50
= 1.94. If the null hypothesis is true, the t test statistic
will have a t distribution with n − 1 = 50 − 1 = 49 degrees of freedom. (With
these degrees of freedom, the t distribution will be similar to the standard
normal distribution.)
X̄−µ0
34. (a) t = SE(X̄) = 35.5−30
√
2/ 20
= 12.30. If the null hypothesis is true, the t test statistic
will have a t distribution with n − 1 = 20 − 1 = 19 degrees of freedom.
(b) Z = X̄−µσ
0
= 35.5−40
√
20/ 25
= −1.125. If the null hypothesis is true, the Z test
X̄
statistic will have the standard normal distribution. (Since we are sampling
from a normally distributed population and σ is known.)
X̄−µ0
(c) t = SE(X̄) = 65.5−30
√
2/ 200
= 251.0. If the null hypothesis is true, the t test statistic
will have a t distribution with n − 1 = 200 − 1 = 199 degrees of freedom.
(With these degrees of freedom, the t distribution is very close to the standard
normal distribution.)
X̄−µ0
(d) t = SE(X̄) = 10/32.1−32
√
60000
= 2.45. If the null hypothesis is true, the t test statistic
will have a t distribution with n − 1 = 60000 − 1 = 59999 degrees of freedom.
(With these degrees of freedom, the t distribution is very, very close to the
standard normal distribution.)
35. (a) The null hypothesis is X¯ = 50. False. We make hypotheses about parameters,
not statistics.
(b) The sample size is 24. False. n = 25.
(c) The area to the left of −4.1027 under a t distribution with 24 degrees of
freedom is 2 × 0.0004 = 0.0008. False. Twice the area to the left of −4.1027 is
given in the output as the p-value (0.0004), so the area to the left of −4.1027
is 0.0002.
(d) If we used this data to test the null hypothesis that the population mean is
40.0, against a two-sided alternative hypothesis, the p-value would be less than
0.05. True, since the 95% confdence interval does not contain 40.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 137
(e) The 95% margin of error is greater than 20.0. False. The margin of error is
the half-width of the interval: 48.94322−46.80365
2 = 1.069785.
9.12.2 Concepts
36. For a one-sample Z test about a population mean µ to be valid, the sample must
be a simple random sample from a normally distributed population. (We also need
to know the true value of σ.) The assumption of a normally distributed population
is not very important if the sample size is large.
37. For a one-sample t test about a population mean µ to be valid, the sample must be
a simple random sample from a normally distributed population. The assumption
of a normally distributed population is not very important if the sample size is
large.
38. If the assumptions of a test are violated, then the reported signifcance level and
p-value will be di˙erent from their “true” values, and the reported results may be
misleading.
39. Both tests require the assumption of a normally distributed population, and for
both tests this assumption becomes less important as the sample size increases. We
use a Z test when the population standard deviation σ is known. We use a t test
when the population standard deviation σ is unknown, and must be estimated by
the sample standard deviation s.
40. The signifcance level, the hypothesized mean, and the choice of alternative hy-
pothesis should not be based on the current sample’s data. (One should be able to
arrive at the appropriate choices before looking at the sample data.) The sample
mean, the value of the test statistic, and the p-value do involve the current sample
data (the test statistic and p-value also involve the appropriate hypotheses of the
test).
41. (a) Here the normality assumption of the t test is true. If the null hypothesis is
true, the t statistic will have a t distribution, and will therefore have a mean
of 0.
(b) Here the normality assumption of the t test is true. If the null hypothesis is
true, the p-value will have a continuous uniform distribution on the interval
(0,1). As such, the p-value will have a mean of 0.5.
42. (a) We are carrying out 25 independent tests, and the probability that we make a
Type I error in any individual test is 0.05, so the number of Type I errors we
make will have a binomial distribution with n = 25 and p = 0.05.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 138
(b) Since the mean of a binomial random variable is np, on average we will make
np = 25 · 0.05 = 1.25 Type I errors.
(c) P (At least one Type I error) = 1−P (No Type I errors) = 1−0.9525 = 0.7226.
43. B (0.0358). Since the hypothesized value of 29 is not contained in the 95% conf-
dence interval, the p-value of the two-sided test would be less than 0.05. Since the
hypothesized value of 29 is contained in the 99% confdence interval, the p-value
of the two-sided test would be greater than 0.01. So, 0.01 < p-value < 0.05. Of
the 5 numbers listed, the only one that lies between 0.01 and 0.05 is 0.0358. (It is
possible to verify that this is the correct p-value by carrying out the appropriate
calculations.)
45. All 3 values are possible values of µ. Although there is strong evidence against
the null hypothesis, indicating that the population mean is likely less than 32, any
value of µ is still possible.
46. Whether or not it is reasonable to use the t procedures is debatable in both cases.
The plot on the left shows strong right skewness (a clear violation of the normality
assumption). Skewness negatively impacts the performance of the t procedures, but
in this situation the large sample size of n = 100 would lessen the impact. In this
situation, we would likely be better o˙ doing an analysis involving a transformation
of the data.
The plot on the right shows no clear violation of the normality assumption, but
the sample size is very small. In order to use the t procedures when the sample
size is this small, we need to be very confdent that the normality assumption is
reasonable. (Use of the t procedures is dubious when the small sample size is so
small.)
hypothesis. There is very strong evidence that the true population mean di˙ers
from 50. The point estimate of µ is 52.46, with an associated 95% confdence
interval of (51.26, 53.66), giving strong evidence that the population mean is
in fact greater than 50.
(c) 0.05. The signifcance level of a test is the probability of a Type I error (the
probability of rejecting the null hypothesis, given the null hypothesis is true).
(d) The p-value is less than 0.05, so we would reject the null hypothesis at α = 0.05.
But if µ = 50, the null hypothesis is true. We would have rejected a true null
hypothesis, and thus we would have committed a Type I error.
(e) No. The plots give no indication of a violation of the normality assumption,
so it is reasonable to carry out a t test.
48. (a) Sample B would have the largest p-value of this test (the least evidence that
its population mean di˙ers from 18).
(b) Sample C would have the smallest p-value of this test (the greatest evidence
that its population mean di˙ers from 18).
49. (a) No, the null hypothesis is not rejected (since the p-value is greater than the
signifcance level).
(b) Yes, the null hypothesis is rejected (since the p-value is less than the signif-
cance level).
(c) C (but opinions may di˙er here). Since the p-values are nearly identical in
the two tests, the amount of evidence against the null hypothesis is nearly
identical.
50. A. If a Type I error is much, much worse than a Type II error, we should choose a
very small value for α.
51. (a) A Type I error is rejecting the null hypothesis when it is true. If the null
hypothesis is true, we will get tails (and reject the null hypothesis) with prob-
ability 0.5. The probability of a Type I error is thus 0.5.
(b) For it to be impossible to make a Type II error, it must be impossible to
not reject the null hypothesis when it is false. This is true here only if the
probability of tails is 1.
54.
P (Type II error) = P (Do not reject H0 when it is false)
¯ ≤ 20.1|µ = 20.3)
= P (X
20.1 − 20.3
= P (Z ≤ √ )
0.5/ 100
= P (Z ≤ −4.0)
= 0.00003167
56. B (the reported p-value will be less than it should be). The standard normal
distribution has less area in the tails than the t distribution, and the two-sided p-
value is double the area in the tail, beyond the observed value of the test statistic.
57. (a) All else being equal, the power of a test will increase as the population variance
decreases. True. With lower variability there is less uncertainty and a greater
ability to detect deviations from the null hypothesis.
(b) If the signifcance level of a test is decreased (from 0.10 to 0.05, say), then the
probability of a Type II error will increase. True.
(c) The probability of a Type II error decreases the farther the true population
mean is from the hypothesized mean. True (for two-sided alternatives), and
true in spirit for one-sided alternatives.
(d) If the null hypothesis is true, we cannot make a Type II error. True.
59. (a) If a null hypothesis is rejected at the 5% signifcance level, then it would also
be rejected at the 10% signifcance level. True. If we reject H0 at α = 0.05,
this implies p-value ≤ 0.05, and we would also reject at α = 0.10.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 141
60. (a) If the p-value of a hypothesis test with a two-sided alternative is equal to
exactly 0, we can be certain the null hypothesis is false. (This is a conceptual
question—for all the tests we have done, it is not possible to get a p-value of
exactly 0.) True, since this would mean it was impossible to observe what was
observed if the null hypothesis were true.
(b) The p-value of a test is the probability of getting an insignifcant result, given
the null hypothesis is true. False.
(c) The p-value of a test is less than the signifcance level whenever the null hy-
pothesis is false. False.
(d) The p-value of a test is the probability of rejecting the null hypothesis when
it is false. False.
(e) If the p-value of a test is large (0.99, say), then we can be almost positive that
the null hypothesis is true. False.
61. (a) For a two-sided alternative hypothesis, the p-value of the test will equal 1 if
the sample mean equals the hypothesized mean. True.
(b) If we are sampling from a normally distributed population and conducting
an ordinary t test, then if the null hypothesis is true the p-value will have a
uniform distribution between 0 and 1. True.
(c) If the null hypothesis is true, then on average the p-value will equal 0.5. True.
(d) If the null hypothesis is false, then on average the p-value will equal 0.5. False.
9.12.3 Applications
62. (a) The boxplot and the normal QQ plot both show indications of right-skewness
(which is typical of BMI data). This skewness is not a serious problem in this
scenario, as the skewness is not very strong, and the sample size is fairly large
(94). We can use the t procedures.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 142
(c) SE(X
¯) = √s
n
= 5.4
√
94
= 0.5570.
¯ − µ0
X
t= ¯)
SE(X
23.3 − 22.4
=
0.5570
= 1.616
(d) Since the alternative hypothesis is two-sided, the p-value is double the area to
the right of 1.616 under a t distribution with n − 1 = 94 − 1 = 93 degrees of
freedom (see Figure 9.4). Using software, we can fnd that the p-value is 0.1095.
0 1.616
Figure 9.4: The p-value is double the area to the right of 1.616 under a t distribution
with 93 degrees of freedom.
Using a table, we could fnd only a range of values: 0.10 < p-value < 0.20.
(e) Because the p-value is not small (p-value = 0.1095), there is not strong ev-
idence against the null hypothesis (The evidence against the null hypothesis
would not be signifcant at α = 0.05 or even α = 0.10.) There is not strong
evidence that the population mean BMI for 14 year-old boys in rural Texas
di˙ers from that of 14 year-old boys in the United States as a whole. (It is
plausible that they are the same.)
(f) There is little or no evidence (p-value = 0.1095) against the null hypothesis
that the true mean BMI for 14 year-old boys in rural Texas di˙ers from that
of 14 year-old boys in the United States as a whole. We can be 95% confdent
that the population mean BMI for 14 year-old boys in rural Texas lies between
22.2 and 24.4 kg/m2 . (Note that the 95% confdence interval captures 22.4,
the hypothesized value of the population mean.)
Balka ISE 1.09 9.12. CHAPTER SUMMARY 143
The sample is drawn from a school district in North Central Texas, so very
loosely speaking the population is all 14 year-old boys in north central Texas.
However, there are some sampling biases that make this assessment not quite
true:
• The sample was drawn from only 6 schools in the area. Students in these
schools may di˙er in a meaningful way from those at other schools in the
area (or from children that don’t go to school).
• For students to participate, parents needed to provide informed consent.
Students whose parents provide informed consent may di˙er in a mean-
ingful way from those whose parents do not.
• Students needed to provide self-assent (the student had to agree to be
part of the study). Students who agree to be part of a study like this may
di˙er in a meaningful way from those who do not.
• Students who participated were given a T-shirt. This could introduce
bias, as it is conceivable that there is a relationship between desire for the
t-shirt and BMI.
63. (a) The plots look good, and show no indication of a violation of the normality
assumption. There are no outliers, obvious skewness, or other problems. While
we would prefer a larger sample size (n = 11 here), it is reasonable to use the
t procedures.
(b) H0 : µ = 0. (The true mean di˙erence is 0. On average, the BR estimation
method is equal to the true age at death.)
Ha : µ 6= 0. (The true mean di˙erence is not 0. On average, the BR estimation
method does not equal the true age at death.)
(c) SE(X ¯ ) = √s = 7√.784 = 2.3470.
n 11
X¯ − µ0
t= ¯)
SE(X
11.073 − 0
=
2.3470
= 4.718
(d) Since the alternative hypothesis is two-sided, the p-value is double the area to
the right of 4.718 under a t distribution with n − 1 = 11 − 1 = 10 degrees of
freedom. (see Figure 9.6).
Using software, we can fnd that the p-value is 0.0008. Using a table, we could
fnd only a range: p-value < 0.001.
(e) There is very strong evidence against the null hypothesis (p-value = 0.0008).
There is very strong evidence that the true mean di˙erence between the BR
estimate and the true age di˙ers from 0.
(f) There is very strong evidence against the null hypothesis (p-value = 0.0008).
There is very strong evidence that the true mean di˙erence between the BR
Balka ISE 1.09 9.12. CHAPTER SUMMARY 144
-6 -4 -2 0 2 4 6
4.718
Figure 9.5: The p-value is double the area to the right of 4.718 under a t distribution
with 10 degrees of freedom.
estimate and the true age di˙ers from 0. We can be 95% confdent that the true
mean di˙erence between the BR estimate and the true age lies between 5.84
years and 16.302 years. (Note that the hypothesized value of 0 falls outiside
of the 95% interval.) It appears as though the true mean di˙erence is in fact
greater than the hypothesized value of 0, and therefore the BR estimate tends
to be too large for individuals of this age.
The conclusions apply only to individuals that die at the age of 22 (but they
would likely also hold for people of similar age). The sample is not truly a
simple random sample from any population, and there may be strong sampling
biases present, depending on the specifc nature of the sample. While we are
attempting to use these results to draw inferences about all people that die
at 22, the BR method may have di˙erent performance for di˙erent subgroups.
For example, it is possible that the BR aging method performs di˙erently for
di˙erent ethic groups. It is also conceivable that people who donate their
bodies for educational or medical research di˙er in a meaningful way from
those that do not. Experts in the feld of dental aging would likely be able to
provide guidance about the possible impacts of this type of bias.
-6 -4 -2 0 2 4 6
-1.992
Figure 9.6: The p-value is double the area to the left of −1.992 under a t distribution
with 134 degrees of freedom.
Using software, we can fnd that the p-value is 0.048. Using a table, we could
fnd only a range: 0.02 < p-value < 0.05.
(d) The p-value is less than the given signifcance level (0.048 < 0.05), so there is
statistically signifcant evidence against the null hypothesis. There is strong
evidence that the true mean 2D:4D ratio (on the left hand) for women of
European descent at Western Washington University di˙ers from 1.
(e) The p-value is less than the given signifcance level (0.048 < 0.05), so there is
statistically signifcant evidence against the null hypothesis. There is strong
evidence that the true mean 2D:4D ratio (on the left hand) for women of
European descent at Western Washington University di˙ers from 1. We can
be 95% confdent that the true mean 2D:4D ratio lies between 0.98804 and
0.99996. (Note that the hypothesized value of 1 falls outiside of the 95%
interval.) It appears as though the true mean 2D:4D ratio is in fact less than
the hypothesized value of 1. (Female students of European descent tended to
have an index fnger that is shorter than the ring fnger.)
The sample was of female students of European descent in an introductory
biology course at Western Washington University. So our conclusions apply
to women of European descent enrolled in biology courses at this university,
and to a lesser extent, women of European descent at the university. While we
may look to studies like this to provide information about women of European
descent in general, there are many possible biases to keep in mind, including:
• Students enrolled in an introductory biology course may tend to have
di˙erent physical characteristics than other students.
• Students had to consent to being part of this study, and students who
consent may possibly di˙er in a meaningful way from those that do not.
• Most importantly, women of European descent at this university may
di˙er in a meaningful way from women of European descent in general.
While it is seemingly unlikely that these factors would have a major impact on
the 2D:4D ratio, it is very possible that they would have a meaningful e˙ect.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 146
University students are not a random sample from the general population.
H0 : µ = 30.0 (This type of cheeseburger has a true mean protein content of 30 g.)
Ha : µ 6= 30.0 (This type of cheeseburger has a true mean protein content that di˙ers from 30 g.)
(b) SE(X
¯) = √s
n
= 2√
.72
6
= 1.1104.
X¯ − µ0
t= ¯)
SE(X
33.81 − 30.0
=
1.1104
= 3.431
(c) Since the alternative hypothesis is two-sided, the p-value is double the area
in the tail, beyond the observed value of the test statistic. In this case, the
p-value is double the area to the right of 3.431 under a t distribution with
n − 1 = 6 − 1 = 5 degrees of freedom (see Figure 9.7).
-6 -4 -2 0 2 4 6
3.413
Figure 9.7: The p-value is double the area to the left of 3.431 under a t distribution with
5 degrees of freedom.
Using software, p-value = 0.019. Using a t table, we could fnd only a range:
0.01 < p-value < 0.02.
(d) The p-value is less than the given signifcance level (0.019 < 0.05), so there is
statistically signifcant evidence against the null hypothesis. There is statis-
tically signifcant evidence that the true mean protein content in this type of
cheeseburger di˙ers from the stated value of 30 grams. Since the sample mean
is greater than 30 (33.81), this implies there is strong evidence that the true
mean protein content in this type of cheeseburger is in fact greater than 30
grams.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 147
(e) In order for the t procedure to be valid, we need the sample to be a simple
random sample from the population of interest, and we need the population
to be normally distributed. The sampling design was not clearly stated (we do
not know how the sample was obtained), so we cannot make a truly informed
decision about any possible biases that may have been introduced. While
fast-food restaurants strive for consistency, it is possible that the amount of
protein in a cheeseburger tends to change a little from preparer to preparer,
and from restaurant to restaurant. But given the nature of fast food it is
probably not too far o˙ the mark to think of the sample as a random sample
from all cheeseburgers of this type from this fast-food chain.
The sample size is small, so the normality assumption is important. We were
not given the data, but we were told that there were no outliers, which is
somewhat comforting. Using the t procedure here is a little questionable, but
still reasonable.
66. (a) H0 : µ = 5.0 mg/kg dw. (The true mean lead concentration in caps of Laccaria
amethystina in this area is 5.0 mg/kg of dry weight. (The true mean lead
concentration is at the tolerable value.).)
Ha : µ > 5.0 mg/kg dw. (The true mean lead concentration in caps of Laccaria
amethystina in this area is greater than 5.0 mg/kg of dry weight. (The true
mean lead concentration exceeds the tolerable value.).)
Note that the choice of alternative hypothesis is based on the problem at hand,
and does not depend on the value of the sample mean.
(b) SE(X ¯ ) = √s = √
n
0.25
21
= 0.05455.
X¯ − µ0
t= ¯)
SE(X
0.94 − 5.0
=
0.05455
= −74.421
(c) Since the alternative hypothesis is µ > 5.0, the p-value is the area to the right
of the observed value of the test statistic. In this case, the p-value is the area
to the right of −74.421 under a t distribution with n − 1 = 21 − 1 = 20 degrees
of freedom (see Figure 9.8).
For all intents and purposes, this area is 1. Using software: p-value = 1 −
3.2 × 10−26 . Using a t table, we could fnd only a range: p-value > 0.40. (But
we should have a pretty good idea that the p-value is near 1 just by looking
at the value of the test statistic.)
(d) A p-value this large gives absolutely no evidence against the null hypothesis.
There is no evidence whatsoever that the true mean lead concentration in this
type of mushroom in this area is greater than the tolerable value of 5.0 mg/kg
dw.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 148
Figure 9.8: The p-value is the area to the right of −74.42 under a t distribution with 20
degrees of freedom. This area is very, very close to 1.
67. (a) No, there is no indication of a violation of the normality assumption. The
boxplot looks roughly symmetric, and there is no systematic curvature or
outliers evident in the normal QQ plot. A minor violation of the normality
assumption would not be a big deal in this scenario, as the sample size is not
small (n = 51). We can use the t procedures.
(b) The hypotheses are:
(c) SE(X
¯) = √s
n
= 0.60
√
51
= 0.0840.
X¯ − µ0
t= ¯)
SE(X
1.80 − 1.0
=
0.0840
= 9.522
(d) Since the alternative hypothesis is two-sided, the p-value is double the area to
the right of 9.522 under a t distribution with n − 1 = 51 − 1 = 50 degrees of
freedom (see Figure 9.9).
(e) There is very, very strong evidence against the null hypothesis (p-value =
8.1 × 10−13 ). There is very strong evidence that the true mean speed of
tandem running (under the conditions of the study) di˙ers from 1.0 mm/s.
(f) There is very, very strong evidence against the null hypothesis (p-value =
8.1 × 10−13 ). There is very strong evidence that the true mean speed of
tandem running (under the conditions of the study) di˙ers from 1.0 mm/s.
We can be 95% confdent that the true mean speed of tandem running (under
the conditions of the study) lies between 1.63 mm/s and 1.97 mm/s. (Note
Balka ISE 1.09 9.12. CHAPTER SUMMARY 149
0 9.522
Figure 9.9: The p-value is double the area to the right of 9.522 under a t distribution
with 50 degrees of freedom.
that the hypothesized value of 1.0 mm/s falls outiside of the 95% interval.) It
appears as though the true mean speed of tandem running is in fact greater
than the hypothesized value of 1.0 mm/s.
Although it wasn’t perfectly clear from the article how the tandem running
pairs were chosen, it is reasonable to think that these results apply to Tem-
nothorax albipennis ants (with a young experienced ant leading a young in-
experienced ant) under the conditions of the study. The true mean may be
di˙erent if the conditions change, or if a di˙erent type of ant is studied.
68. (a) The normality assumption looks reasonable, and we can go ahead with the t
procedures on this data.
(b) The 90% interval is given by 1082 ± 1.771 × √6014 , which works out to 1082 ±
28.399, or (1053.6, 1110.4).
(c) H0 : µ = 920, Ha : µ > 920.
(d) The t procedure is appropriate. Both the t and Z procedures assume a nor-
mally distributed population, which is a reasonable assumption here. Since the
standard deviation is based on sample data (the population standard deviation
is unknown), we would use the t procedure.
(e) t = 1082−920
√
60/ 14
= 10.10.
(f) The p-value of the test is the area to the right of 10.10 under a t distribution
with n − 1 = 14 − 1 = 13 degrees of freedom. Using software, the p-value can
be found to be 8.0 × 10−8 . Using the t table, we could say only that the p-value
is very close to 0 (depending on the specifc table, we might say something like:
p-value < 0.0005).
(g) There is very strong evidence (one-sided p-value = 8.0 × 10−8 ) that the popu-
lation mean calorie content is greater than what the company claims. A 90%
Balka ISE 1.09 9.12. CHAPTER SUMMARY 150
69. (a) (ii) is correct (H0 : µ = 12.0, Ha : µ > 12.0). (Note once again that we make
hypotheses about parameters, and never about statistics.)
(b) t = X̄−µ√ 0 = 16.7−12.0
s/ n
√
4.8/ 9
= 2.9375. The p-value is the area to the right of 2.9375
under a t distribution with n − 1 = 9 − 1 = 8 degrees of freedom. Using
software: p-value = 0.0188. Using a t table, we could only give a range:
0.01 < p-value < 0.02.
(c) Statement 2 (ii) is correct (and the rest of the statements are incorrect). Since
the p-value is less than the given signifcance level of 0.05, the evidence against
the null hypothesis is statistically signifcant at α = 0.05. There is very strong
evidence that the population mean arsenic level is greater than 12.0 ppm (the
Environment Canada guideline).
(d) This might not be the most appropriate test for a number of reasons. An
important reason is that the average arsenic concentration may not be the
best summary of the data, since a high arsenic level in one school is very bad,
regardless of whether it is o˙set by low levels in other schools. Also, we may
wish to look at the hypotheses from the other perspective, and test to see if
there is strong evidence that the population mean arsenic level is less than the
Environment Canada guideline.
(e) We rejected a true null hypothesis, and therefore we made a Type I error.
70. (a) There is extremely strong evidence (p-value = 7.7 × 10−11 ) that the mean
response time for this type of rat under these conditions di˙ers from 1.25
seconds. The point estimate of the mean response time is 1.476, with an
associated confdence interval of (1.42, 1.53). This gives strong evidence that
the true mean response time is greater than 1.25 seconds.
(b) Although it is necessary from time to time, it is unwise to compare sample data
to a “historical control”, as it opens up a host of problems. Perhaps the rats
in this experiment have slower response times due to characteristics of the lab
in which the experiment took place, and not because of an e˙ect of the drug.
(The lab might have a poor measuring device, or low oxygen levels, or some
other type of improper setup.) It is much better to conduct a case-control
experiment, comparing two groups of rats under similar conditions.
value given in the output is double the area to the right of 0.9966 under a t
distribution with 44 degrees of freedom. For the alternative Ha : µ > 250, the
p-value is the area to the right of 0.9966, so for this alternative: p-value =
0.3244
2 = 0.1622.
(c) 0.8378. Since the alternative hypothesis in the output is two-sided, the p-
value given in the output is double the area to the right of 0.9966 under a
t distribution with 44 degrees of freedom. For the alternative Ha : µ < 250,
the p-value is the area to the left of 0.9966, so for this alternative: p-value =
1 − 0.3244
2 = 0.8378.
(d) There is no evidence (p-value = 0.32) that the true mean weight of nuts in
packages of this type di˙ers from 250 grams.
(e) The third statement (iii) is a reasonable interpretation of the confdence inter-
val, and the rest of the statements are incorrect.
72. (a) The appropriate hypotheses are given in the frst option: H0 : µ = 0.50, Ha : µ >
0.50.
(b) t = 0.538−0.50
√
0.14/ 150
= 3.32.
(c) The p-value is the area to the right 3.32 under a t distribution with n − 1 =
150 − 1 = 149 degrees of freedom. Using software: p-value = 0.00056. Using
the t table: 0.0005 < p-value < 0.001.
(d) Since the p-value is less than the given signifcance level (0.00056 < 0.05) there
is statistically signifcant evidence against the null hypothesis. There is very
strong evidence that the population mean trans-fat content exceeds 0.50 grams
per serving.
(e) Yes, this hypothesis test would still be informative. The procedure assumes a
normally distributed population, but this assumption becomes less important
as the sample size increases. For a sample size of n = 150, the central limit
theorem tells us that these methods should still work reasonably well. (But
there may be better methods of analysis. For example, we might wish to use
the t procedures on a transformation of the data.)
73. (a) A t statistic is appropriate, since the observations were roughly normally dis-
tributed and the standard deviation is based on sample data.
(b) t = 84.14−82.9
√
0.12/ 100
= 103.33.
(c) The p-value is very close to 0. The p-value is twice the area to right of 103.33
under a t distribution with 100 − 1 = 99 degrees of freedom. This area is very,
very near zero. (While conceptually there is some non-zero area this far out
in the tail, for all intents and purposes this area is 0.)
(d) Since the p-value is less than the given signifcance level of 0.01, we can say
there is statistically signifcant evidence against the null hypothesis. There is
extremely strong evidence that the population mean weight of bolts of this
type exceeds 82.9 grams.
Balka ISE 1.09 9.12. CHAPTER SUMMARY 152
(e) We would have rejected a true null hypothesis, and therefore we would have
made a Type I error.
74. (a) H0 : µ = 800 (the true mean head injury rating for drivers is 800).
Ha : µ < 800 (the true mean head injury rating for drivers is less than 800).
X̄−µ
(b) t = s/ √ = 750−800
n
√
50/ 24
= −4.90.
(c) The p-value is the area to the left of −4.90 under a t distribution with n − 1 =
24 − 1 = 23 degrees of freedom. Using software: p-value = 3.0 × 10−5 . Using
the t table: p-value < 0.0005.
(d) There is very strong evidence (p-value = 3.0 × 10−5 , signifcant at α = 0.05)
that the population mean driver head injury rating for this model of car under
these conditions is less than 800.
J.B.’s strongly suggested exercises: 4, 6, 7, 9, 10, 11, 17, 23, 26, 27, 30, 31, 33
10.1 Introduction
2. (a) Let T be a random variable representing Tom’s time on the frst run, and P
be a random variable representing Pete’s time on the frst run. We are given
that T ∼ N (4.612, 0.048) and P ∼ N (4.528, 0.044).
153
10.3. HYPOTHESIS TESTS AND CONFIDENCE INTERVALS FOR TWO INDEPENDENT SAMPLES (WHEN σ1 AND σ2 ARE KNOWN)
3. (a) The point estimate of the di˙erence in the population means (µ1 − µ2 ) is the
value of the di˙erence in the sample means: X¯1 − X
¯ 2 = 8.8 − 17.2 = −8.4.
2 2
(b) s2p = (10−1)1.42 +(5−1)2.61
10+5−2 = 3.492.
√
(c) SE(X ¯1 − X ¯ 2 ) = 3.492 1/10 + 1/5 = 1.0235.
p
(d) −8.4 ± 2.160 × 1.0235, which works out to −8.4 ± 2.211 or (−10.61, −6.19).
(e) H0 : µ1 = µ2 , Ha :√µ1 = 2.
6 µp
SE(X ¯1 − X¯ 2 ) = 3.492 1/10 + 1/5 = 1.023523. t = 8.8−17.2 = −8.21.
1.0235
The p-value is double the area to the left of −8.21 under a t distribution with
10+5−2 = 13 degrees of freedom. Using software: p-value = 1.7×10−6 . Using
a t table: p-value < 0.001. There is statistically signifcant evidence against
the null hypothesis (p-value = 1.7 × 10−6 ) at α = 0.05. There is statistically
signifcant evidence that the population means are not equal.
7. Opinions di˙er a great deal on the choice between the Welch procedure and the
pooled-variance t procedure. Both procedures assume normally distributed pop-
ulations. The pooled-variance procedure also assumes that the populations have
equal variances. If the sample variances are very di˙erent, then this gives some
indication that the assumption of equal population variances is not reasonable, and
the pooled-variance procedure may not be performing well. The e˙ect of di˙erent
population variances is exaggerated if the sample sizes are also very di˙erent. So
if the sample variances are very di˙erent, and the sample sizes are very di˙erent,
then the pooled-variance procedure is not an appropriate method of analysis. If the
sample variances are similar, and the sample sizes are similar, then the choice be-
tween the Welch procedure and the pooled-variance t procedure is largely a matter
of personal preference.
9. (a) We should use a paired-di˙erence procedure, since the before and after mea-
surements represent two measurements on the same pig. (The before and after
measurements do not represent independent samples.)
Balka ISE 1.09 10.5. PAIRED-DIFFERENCE PROCEDURES 157
(c) X
¯ = 250+155+290+165+120+270+190+190+200 = 203.333 mL. s = 56.1805 mL.
9
(d) SE(X
¯ ) = 56.1805
√
9
= 18.7268 mL.
(e) The boxplot and normal QQ plot look good. There are no obvious outliers,
skewness, or other major problems. The sample size is quite small (n = 9),
but it is not unreasonable to use the t procedures here.
(f) Using software, we can fnd that with 8 degrees of freedom, t0.025 = 2.306004.
(If we must use a t table, we can fnd the rounded t value of 2.306.)
¯ ± t0.025 SE(X
X ¯)
203.333 ± 2.306 × 18.7268
203.333 ± 43.184
We can be 95% confdent that the true mean increase in leg volume, for this
type of pig under the conditions of this experiment, lies between 160.15 mL
and 246.52 mL.
(g) The hypotheses are:
H0 : µ = 0 (The true mean change in leg volume (under the conditions of this experiment) is 0.)
Ha : µ > 0 (The true mean change in leg volume is greater than 0.)
¯ − µ0
X
t= ¯)
SE(X
203.333 − 0
=
18.7268
= 10.8579
Balka ISE 1.09 10.5. PAIRED-DIFFERENCE PROCEDURES 158
The p-value is the area to the right of 10.8579 under a t distribution with
n − 1 = 9 − 1 = 8 degrees of freedom. Using software, we can fnd: p-value =
2.3 × 10−6 . Using a t table, we could fnd only a range: p-value < 0.0005.
A p-value that is this small gives very strong evidence against the null hy-
pothesis. (The evidence would be statistically signifcant at any of the usual
signifcance levels.) There is very strong evidence that, on average, pigs treated
with a dose of venom and antivenom experience swelling in the a˙ected area.
10. (a) The data should be plotted. Plotting the data helps us get a feel for what the
numbers are telling us, and can help us to determine whether the normality
assumption is reasonable.
(b) The hypotheses are:
(In loose terms, the null hypothesis is that the short training program did not
help people discriminate between genuine and fake smiles, and the alternative
hypothesis is that the training program (on average) helped people to discrim-
inate between genuine and fake smiles.)
The sample mean fell in the direction given in the alternative hypothesis, so
the one-sided p-value is half of the reported two-sided p-value. One-sided
p-value = 0.0348
2 = 0.0174.
(c) There is statistically signifcant evidence at α = 0.05 (one-sided p-value =
0.0174) that the training program results in improved discrimination accuracy
scores on average. The point estimate of the true mean change in discrimina-
tion accuracy is 0.618, with an associated 95% confdence interval of 0.048 to
1.188.
Balka ISE 1.09 10.6. INVESTIGATING THE NORMALITY ASSUMPTION 159
10.7.2 Concepts
11. SE(X ¯1 − X¯ 2 ) is the estimate of the standard deviation of the sampling distribution
of X
¯1 − X¯ 2 . (It estimates σ ¯ 2 ). It is a measure of the dispersion of X
X̄1 −X
¯ 2 in
¯1 − X
repeated sampling.
12. No, this would not make sense. Statistical hypotheses always involve parameters
and never statistics.
13. (a) Any hypothesized value outside of the 95% interval would be rejected at α =
0.05. For example, H0 : µ1 − µ2 = 42 would be rejected.
(b) Any hypothesized value within the 95% interval would not be rejected at α =
0.05. For example, H0 : µ1 − µ2 = 20 would not be rejected.
14. Yes, of course X¯ 1 does not equal X¯ 2 . The output shows that X¯ 1 = −0.127 and
X2 = 0.082, so we know the sample means are not equal. However, in the test
¯
of H0 : µ1 = µ2 , there is not any evidence against the null hypothesis (p-value =
0.5763).
15. (a) All else being equal, as µ1 − µ2 gets closer to 0, the power of the test will
decrease (it will be harder to detect a di˙erence if the di˙erence is small).
(b) All else being equal, as the sample size increases, the power of the test will
increase.
17. (a) H0 : µA = µC (II) would have the smallest p-value, since it shows the greatest
evidence against the null hypothesis.
Balka ISE 1.09 10.7. CHAPTER EXERCISES 160
(b) H0 : µA = µB (I) would have the largest p-value, since it shows the least
evidence against the null hypothesis.
18. (a) The boxplots do not give any indication that the pooled-variance t procedure
is inappropriate. (The pooled-variance t procedure is a reasonable choice.)
There is one small outlier in Sample B, which may distort things a little bit,
but it is not very extreme and it is not likely to have a major impact. The
variances of the two samples are similar, so the equal variance assumption may
not be too far o˙ the mark.
(b) No, there is not strong evidence (p-value = 0.29) of a di˙erence in population
means.
(c) Yes. We can be 95% confdent that µ1 − µ2 lies between −1.69 and 0.52.
Since 10 falls far outside of this interval, there is very strong evidence that the
di˙erence in population means does not equal 10.
(d) For the pooled variance procedure, df = n1 + n2 − 2 = 43 (from the output).
This implies that n1 + n2 = 45.
19. (a) Output 1 is from the Welch procedure and Output 2 is from the pooled-
variance procedure. (One can tell by looking at the degrees of freedom.)
(b) The conclusions are nearly identical, so it matters little which procedure is
used. The sample standard deviations are not given in the output, but they
were likely similar.
20. (a) Of the given choices, the paired-di˙erence procedure is the most appropriate
one. We have two measurements on each individual, so in this situation it
would not be appropriate to use Welch’s t or the pooled-variance t (since
they assume that the samples are independent). We need to use the paired-
di˙erence approach.
(b) The fact that the placebo injection was given frst, then the drug two days
later, is a cause for concern. It is conceivable (and not a major stretch), to
think that the individuals may be more nervous on the frst day of the study,
at the frst injection. Perhaps by the second day, the participants were more
comfortable with the study and more relaxed. This may very well infuence
the results. It would be much better to randomize the order in which they
received each treatment (the drug or the placebo).
21. We should use the paired-di˙erence approach, since there is a dependence within
each pair of twins. (Twins are more similar than non-twins.)
q q
2 2
22. Here we require 1.96 3n + 3n ≤ 0.5. This implies that 1.96 18 n ≤ 0.5 =⇒ n ≥
√
( 1.960.5 18 )2 =⇒ n ≥ 276.5. We would need to sample at least 277 members from
each population.
Balka ISE 1.09 10.7. CHAPTER EXERCISES 161
23. (a) When we use the pooled-variance t procedure, it is because we know the pop-
ulations have the same variance. False. We are assuming that the (unknown)
population variances are equal.
(b) The pooled-variance t procedure works well, even when the population vari-
ances are a little di˙erent. This is especially true if the sample sizes are similar.
True.
(c) It would be most appropriate to use the Welch procedure instead of the pooled-
variance t procedure if the sample variances are very di˙erent and the sample
sizes are very di˙erent. True.
(d) If the conclusions from the Welch procedure and the pooled-variance t proce-
dure are very similar, then it does not matter much which procedure is used.
True.
(e) The Welch procedure is an exact procedure, as long as X ¯1 = X ¯ 2 . False. The
Welch procedure is an approximate procedure.
25. (a) Suppose we are constructing a confdence interval for µ1 − µ2 . All else being
equal, the greater the di˙erence between µ1 and µ2 , the wider the interval.
False. The width does not depend on µ1 − µ2 .
(b) Suppose we are constructing a confdence interval for µ1 − µ2 . All else being
equal, the greater the di˙erence between X ¯ 1 and X
¯ 2 , the wider the interval.
¯ ¯ ¯
False. The width does not depend on X1 − X2 . (X1 − X ¯ 2 is the midpoint of
the interval.)
(c) Suppose we are constructing a confdence interval for µ1 − µ2 . All else being
equal, the greater the sample sizes, the narrower the interval. True.
(d) Suppose we wish to test H0 : µ1 = µ2 . We obtain random samples from the
respective populations, run the appropriate test, and fnd that the p-value is
Balka ISE 1.09 10.7. CHAPTER EXERCISES 162
0.00000032. We can be very confdent that our results have important practical
implications. False. There is strong evidence of a di˙erence in population
means, but whether this is important in a practical sense is an entirely di˙erent
question.
(e) If we test H0 : µ1 = µ2 against a two-sided alternative and fnd a p-value of
0.32, then we know that µ1 = µ2 . False. We do not have any evidence of a
di˙erence in population means, but we do not know the population means, and
we do not know if they are equal.
10.7.3 Applications
26. (a) The t procedures should be reasonable here. The boxplots appear to show
that the distributions are roughly symmetric, but there is one (small) outlier
in the OE group. The normal QQ plots are very roughly linear, although
the OE group shows hints of possible left skewness (mainly due to the one
small outlier). Overall the plots look reasonable, and since the two-sample t
procedures are quite robust to violations of the normality assumption, the t
procedures will likely perform reasonably well in this scenario.
(b) The sample standard deviations are not that similar (0.37 vs 0.60), and the
sample sizes are quite di˙erent (15 vs 51), so it is probably best to use the
Welch (unpooled variance) approach. (Opinions may di˙er on this matter.)
(c) Using software, we can fnd that with 37.714 degrees of freedom t0.025 = 2.0249.
(If we used the table or software with 37 degrees of freedom, t0.025 = 2.026.)
¯1 − X
X ¯ 2 ± t0.025 SEW (X ¯2)
¯1 − X
1.14 − 1.80 ± 2.0249 · 0.1272
−0.66 ± 0.258
X¯1 − X¯2
t= ¯1 − X¯2)
SEW (X
1.14 − 1.80
=
0.1272
= −5.19
Balka ISE 1.09 10.7. CHAPTER EXERCISES 163
(f) The p-value is double the area to the left of −5.19 under a t distribution
with 37.714 degrees of freedom. Using software, we can fnd that the p-value
is 7.5 × 10−6 . Using a table, we could fnd only a range for the p-value:
p-value < 0.001. This is a very small p-value, implying very strong evidence
against the null hypothesis.
(g) There is very strong evidence (p-value = 7.5 × 10−6 ) of a di˙erence in the
true mean speed of YE and OE leaders in tandem running for Temnothorax
albipennis under the conditions of the study. The 95% confdence interval
indicates that we can be 95% confdent that the di˙erence (µOE − µY E ) in
true mean speed lies between −0.92 and −0.40 mm/s. In short, there is very
strong evidence that, on average, tandem running pairs led by an old and
experienced ant move more slowly than pairs lead by a young and experienced
ant.
Although it wasn’t clear from the question how the tandem running pairs were
chosen, it is reasonable to think that these results apply to Temnothorax al-
bipennis ants under the conditions of the study. The means of tandem running
speed may very well change if the conditions change, or if a di˙erent type of
ant is studied.
27. (a) There are a variety of options, but we should (at least) plot boxplots and nor-
mal quantile-quantile plots. This would help us visualize possible di˙erences
between the groups, and investigate whether the normality assumption is rea-
sonable. (The sample sizes are fairly large here (246 and 46), so the normality
assumption is not overly important, but we should still investigate the shape
of the distribution and ensure there are no major outliers.)
(b) The sample standard deviations are very similar here (0.035 vs 0.034), so the
pooled-variance procedure would be a reasonable choice. The Welch (unpooled
variance) approach would also be reasonable. (Opinions may di˙er on this
matter.)
(c) There are n1 + n2 − 2 = 246 + 46 − 2 = 290 degrees of freedom. Using software
or a t table, we can fnd that with 290 degrees of freedom t0.025 = 1.968.
¯1 − X
X ¯1 − X
¯ 2 ± t0.025 SE(X ¯2)
0.994 − 0.963 ± 1.968 · 0.005598
0.031 ± 0.0110
(e)
X¯1 − X¯2
t= ¯1 − X
¯2)
SE(X
0.994 − 0.963
=
0.005598
= 5.538
(f) The p-value is double the area to the right of 5.538 under a t distribution
with 290 degrees of freedom. Using software, we can fnd that the p-value
is 6.8 × 10−8 . Using a table, we could fnd only a range for the p-value:
p-value < 0.001. This is a very small p-value, implying very strong evidence
against the null hypothesis.
(g) There is very strong evidence (p-value = 6.8 × 10−8 ) of a di˙erence in the
true mean 2D:4D ratio for white and black women in Manchester. The 95%
confdence interval indicates that we can be 95% confdent that the di˙erence
in true mean 2D:4D ratio (µW − µB ) lies between 0.020 and 0.042. In short,
there is very strong evidence that, on average, white women have a greater
mean 2D:4D ratio than that of black women.
The women were drawn from people attending a sexual health clinic in Manch-
ester, UK. It is reasonable to think that these results would apply to women
who attend this clinic. Extrapolating to an even larger population is some-
what dubious, but it is likely reasonable to think that these results apply to
white and black women in the Manchester, UK area. Although these results
give some indication of the di˙erence in this ratio for white and black women
in general, generalizing beyond the area from which the sample was drawn is
always dubious, and may lead to incorrect conclusions.
28. (a) There are a variety of options, but we should (at least) plot boxplots and
normal quantile-quantile plots. This would help us visualize possible di˙er-
ences between the groups, and investigate whether the normality assumption
is reasonable. (The sample sizes are small here (8 and 7) so the normality
assumption is important. Also, outliers could have a strong e˙ect on the re-
sults.)
(b) The sample standard deviations are very di˙erent here (25.4 vs 7.9), so the
Welch (unpooled variance) approach would likely be the best option. (Opinions
may di˙er on this matter.)
(c) The degrees of freedom are given as 8.512. Using software, we can fnd that
with 8.512 degrees of freedom t0.025 = 2.2821. (If we had to use a t table with
Balka ISE 1.09 10.7. CHAPTER EXERCISES 165
29. (a) There are a variety of options, but we should (at least) plot boxplots and nor-
mal quantile-quantile plots. This would help us visualize possible di˙erences
between the groups, and investigate whether the normality assumption is rea-
sonable. (The sample sizes are not small here (66 and 28), so the normality
assumption is not overly important, but we should still investigate the shape
of the distribution.)
Balka ISE 1.09 10.7. CHAPTER EXERCISES 166
(b) The sample standard deviations are very similar here (1.50 and 1.64), so the
pooled-variance procedure would be a reasonable choice. The Welch (unpooled
variance) approach would also be reasonable. (Opinions may di˙er on this
matter.)
(c) There are n1 + n2 − 2 = 66 + 28 − 2 = 92 degrees of freedom. Using software,
we can fnd that with 92 degrees of freedom t0.025 = 1.9861. (If we had to use
a t table, we may have to look up 90 DF, and fnd t0.025 = 1.987.)
¯1 − X
X ¯ 2 ± t0.025 SE(X ¯2)
¯1 − X
95.71 − 95.41 ± 1.9861 · 0.3479
0.30 ± 0.691
which works out to (−0.39, 0.99). (Had we used t0.025 = 1.987 from the t table,
we’d have gotten the same interval to 2 decimal places.)
(d) H0 : µ1 = µ2 (The true mean 3D symmetry for men is equal to the true mean
3D symmetry for women.)
Ha : µ1 6= µ2 (The true mean 3D symmetry for men is not equal to the true
mean 3D symmetry for women.)
(e)
¯1 − X
X ¯2
t= ¯ ¯2)
SE(X1 − X
95.71 − 95.41
=
0.3479
= 0.862
(f) The p-value is double the area to the right of 0.862 under a t distribution with
92 degrees of freedom. Using software, we can fnd that the p-value is 0.39.
Using a table, we could fnd only a range for the p-value: p-value > 0.20. This
p-value is large, yielding no evidence against the null hypothesis.
(g) There is no evidence (p-value = 0.39) of a di˙erence in the true mean 3D
symmetry for men and women. The 95% confdence interval indicates that we
can be 95% confdent that the di˙erence in true mean 3D symmetry (µM −µW )
lies between −0.39 and 0.99. There is no evidence that, on average, men have
di˙erent mean 3D symmetry than that of women.
The people in the sample were drawn from healthy Italian Caucasians between
31 and 40 years of age. Depending on the sampling design, it is likely reason-
able to think that these results apply to healthy Italian Caucasians between
31 and 40 years old.
30. (a) There are a variety of options, but we should (at least) plot boxplots and nor-
mal quantile-quantile plots. This would help us visualize possible di˙erences
between the groups, and investigate whether the normality assumption is rea-
sonable. (The sample sizes are not small here (47 and 27), so the normality
Balka ISE 1.09 10.7. CHAPTER EXERCISES 167
assumption is not overly important, but we should still investigate the shape
of the distribution.)
(b) The sample standard deviations are similar here (33.53 and 37.55), so the
pooled-variance procedure would be a reasonable choice. The Welch (unpooled
variance) approach would also be reasonable.
(c) There are n1 + n2 − 2 = 47 + 27 − 2 = 72 degrees of freedom. Using software,
we can fnd that with 72 degrees of freedom t0.025 = 1.9935. (If we had to use
a t table, we may have to look up 70 DF, and fnd t0.025 = 1.994.)
¯1 − X
X ¯1 − X
¯ 2 ± t0.025 SE(X ¯2)
256.12 − 245.82 ± 1.9935 · 8.4603
10.3 ± 16.867
which works out to (−6.6, 27.2). (Had we used t0.025 = 1.994 from the t table,
we’d have gotten the same margin of error and interval to 2 decimal places.)
(d) H0 : µ1 = µ2 (The true mean of the BEST index for psychopaths is equal to
the true mean of the BEST index for nonpsychopaths.)
Ha : µ1 6= µ2 (The true mean of the BEST index for psychopaths is not equal
to the true mean of the BEST index for nonpsychopaths.)
(e)
¯1 − X
X ¯2
t= ¯1 − X
¯2)
SE(X
256.12 − 245.82
=
8.4603
= 1.217
(f) The p-value is double the area to the right of 1.217 under a t distribution with
72 degrees of freedom. Using software, we can fnd that the p-value is 0.2274.
Using a table, we could fnd only a range for the p-value: p-value > 0.20. This
p-value is large, yielding little or no evidence against the null hypothesis.
(g) There is no evidence (p-value = 0.22) of a di˙erence in the true mean BEST
index of psychopaths and nonpsychopaths. The 95% confdence interval in-
dicates that we can be 95% confdent that the di˙erence in true mean BEST
index (µP − µN P ) lies between −6.6 and 27.2. There is no evidence that, on
average, psychopaths have a di˙erent mean BEST index than that of nonpsy-
chopaths.
The people in the sample were males drawn from an inpatient psychiatric
treatment facility in Holland. Technically the conclusions apply only to male
psychiatric patients from this facility. (The results give some indication of the
reality of the situation for male inpatient psychiatric patients in general, but
generalizing to a population outside of the one sampled is always dubious.)
Balka ISE 1.09 10.7. CHAPTER EXERCISES 168
We can be 95% confdent that, when this type of pig is Tasered, the true mean
change in mean corpuscular volume lies between 0.814 fL and 5.786 fL.
(b) The hypotheses are:
X¯ − µ0
t= ¯)
SE(X
3.3 − 0
=
1.11559
= 2.958
The p-value is double the area to the right of 2.958 under a t distribution with
n − 1 = 11 − 1 = 10 degrees of freedom. Using software, we can fnd: p-value =
0.0143. Using a t table, we could fnd only a range: 0.01 < p-value < 0.02.
A p-value of this size gives moderately strong evidence against the null hypoth-
esis. (The evidence would be statistically signifcant at the commonly chosen
signifcance level of 0.05.) There is moderately strong evidence that, on aver-
age, Tasered pigs of this type experience an increase in their mean corpuscular
volume.
33. (a) This is an experiment (since the researchers imposed a treatment, the exercise
wheel, on the rats).
Balka ISE 1.09 10.7. CHAPTER EXERCISES 169
(b) H0 : µ1 = µ2 (the tumour size is the same on average for both groups).
Ha : µ1 < µ2 (the exercise group has a lower mean tumour size). (Here the
researchers are only interested if the wheels tend to reduce tumour size, but
one could certainly make an argument for a two-sided alternative hypothesis).
s2p = (10−1)0.08+(20−1)0.12
10+20−2 = 0.10714.
√
¯ ¯
p
SE(X1 − X2 ) = 0.10714 1/10 + 1/20 = 0.12677.
0.12677 = −2.366. The p-value is the area to the left of −2.366 under a t
t = 1.2−1.5
distribution with 28 degrees of freedom. By computer: p-value = 0.0126. By
the table: 0.01 < p-value < 0.025.
Here we do not have a given value of α, but we should still be able to say
something reasonable. There is strong evidence (p-value = 0.0126) of a di˙er-
ence in population mean tumour size. There is strong evidence that those rats
in the exercise group have a lower population mean size of tumour.
(c) Yes, absolutely. A p-value that small gives very, very strong evidence against
the null hypothesis. Since this was a designed experiment, we can say that
there is evidence of a causal relationship (the exercised wheels caused the
observed di˙erence).
(d) 1.2−1.5±1.701×0.12677, which works out to −0.3±0.2156 or (−0.516, −0.084).
We can be 90% confdent that the true di˙erence in mean tumour size lies be-
tween −0.516 and −0.084.
(e) H0 : µ1 = µ2 (the tumour size is the same on average for both groups).
Ha : µ1 < µ2 (the exercise group has a lower mean tumour size). (Here the
researchers are only interested if the wheels tend to reduce tumour size, but
one could certainlyp make an argument for a two-sided alternative hypothesis).
SEW (X ¯ 2 ) = 0.08/10 + 0.12/20 = 0.1183.
¯1 − X
0.1183 = −2.535. The p-value is the area to the left of −2.535 under a t
t = 1.2−1.5
distribution with 21.764 degrees of freedom.
By computer: p-value = 0.0095. By the table (with 20 DF): 0.005 < p-value <
0.01.
Here we do not have a given value of α, but we should still be able to say
something reasonable. There is strong evidence (p-value = 0.0095) of a di˙er-
ence in population mean tumour size. There is strong evidence that those rats
in the exercise group have a lower population mean size of tumour.
(f) By computer (with 21.764 DF): 1.2 − 1.5 ± 1.718 × 0.1183, which works out
to −0.3 ± 0.2032 or (−0.503, −0.097).
By the table (with 21 DF): 1.2 − 1.5 ± 1.721 × 0.1183, which works out to
−0.3 ± 0.2036 or (−0.504, −0.096).
(g) The choice of appropriate procedure here would be debatable. There is not a
large di˙erence between the variances, so the pooled-variance procedure would
not be a terrible choice. But one could make an argument for each one, so the
Balka ISE 1.09 10.7. CHAPTER EXERCISES 170
34. (a) The last statement (v) is the best conclusion. The other statements are not
reasonable.
(b) We can be 95% confdent that the di˙erence in population mean cholesterol
levels between males and females (µM − µF , for students at this university)
lies between −21.9 and 17.1 mg/dl.
36. (a) H0 : µ1 = µ2 (the population mean “fear of negative evaluation” is the same
for bulimics and non-bulimics).
Ha : µ1 6= µ2 (the population mean “fear of negative evaluation” is di˙erent
between bulimics and non-bulimics).
(b) There is signifcant evidence against the null hypothesis. There is signifcant
evidence of a di˙erence in population mean “fear of negative evaluation” score.
It appears as though bulimics tend to have a higher fear of negative evaluation
score.
(c) We can be 95% confdent that the di˙erence in population mean “fear of eval-
uation” scores lies between 0.94 and 8.80.
37. (a) No, these plots give no indication of a violation of the normality assumption.
Although the sample sizes are small, the use of the t procedures is reasonable
here.
2 +(16−1)0.422
(b) s2p = (12−1)0.65
12+16−2 = 0.2805.
(c) H0 : µ1 = µ2 (The population mean resistance is the same for both types of
resistor.)
Ha : µ1 6= µ2 (The population mean resistance di˙ers between the two types
Balka ISE 1.09 10.7. CHAPTER EXERCISES 171
of resistor.)
√
(d) ¯1 − X ¯ 2 ) = 0.2805 1/12 + 1/16 = 0.2023.
p
SE(X
t = 11.6−12.2
0.2023 = −2.967.
(e) The p-value is double the area to the left of −2.967 under a t distribution
with 26 degrees of freedom. By computer: p-value = 0.0064. By the table:
0.002 < p-value < 0.010. There is signifcant evidence at the 5% signifcance
level (p-value < 0.05) of a di˙erence in population mean resistance. It appears
as though the resistors from Manufacturer B have a greater mean resistance.
(f) 11.60−12.20±2.056×0.2023, which works out to −0.6±0.416 or (−1.016, −0.184).
(g) We can be 95% confdent that the di˙erence in the population mean resistance
(µA − µB ) lies between −1.052 and −0.148.
(h) The output tells us that double the area to the left of −2.7904 under the t
distribution is 0.01222. Since for the one-sided alternative Ha : µA < µB we
require the area to the left of −2.7904, the p-value is 0.01222
2 = 0.0061.
2 2
38. (a) s2p = (80−1)23.3 +(75−1)19.8
80+75−2 = 469.9299.
√
¯1 − X ¯ 2 ) = 469.9299 1/80 + 1/75 = 3.4842.
p
SE(X
A 95% confdence interval is given by 664.2 − 624.3 ± 1.976 × 3.4842, which
works out to 39.9 ± 6.88 or (33.02, 46.78). We can be 95% confdent that the
di˙erence in population mean yield strength (µN ew − µStandard ) lies between
33.02 MPa and 46.78 MPa.
(b) H0 : µ1 = µ2 (the population mean yield strength is the same for both meth-
ods).
Ha : µ1 =6 µ2 (the √ populationpmean yield strength di˙ers between the methods).
¯ ¯
SE(X1 − X2 ) = 469.9299 1/80 + 1/75 = 3.4842.
t = 664.2−624.3
3.4842 = 11.45.
The p-value is double the area to the right of 11.45 under a t distribution with
80 + 75 − 3 degrees of freedom. Using software: p-value = 2.6 × 10−22 . Using
a t table: We can say that the p-value is close to 0 (p-value < 0.001).
There is extremely strong evidence (p-value = 2.6 × 10−22 ) of a di˙erence in
population mean yield strength between the two procedures. In fact, there is
very strong evidence that the heat treated method results in a greater mean
yield strength.
39. (a) There is strong evidence (p-value = 0.028) of a di˙erence on average between
the stated and actual weight of the chickens. The point estimate of the dif-
ference (Stated − Actual) is 22.8, with an associated 95% confdence interval
of 4.1 to 41.5 grams. In loose terms, it appears as though the actual weight
tends to be lower than the stated weight.
(b) If the normality assumption is violated then our reported results may be mis-
leading. What we are stating is a 95% confdence interval will have a true
confdence level that di˙ers from 95%. And our reported p-value may be very
Balka ISE 1.09 10.7. CHAPTER EXERCISES 172
40. (a) Here we have 2 measurements on each gas sample, and thus these measure-
ments cannot be considered independent. We need to use a paired-di˙erence
procedure.
(b) We could try plotting a normal quantile-quantile plot of the 6 di˙erences (but
with only 6 points, the plot may not be very informative). It will be a little
sketchy using the t procedures on such a small sample.
(c) The 6 di˙erences (Old − New) are 0.03, 0.10, −0.12, 0.05, −0.08, 0.08. These 6
di˙erences have a mean of X ¯ = 0.01 and a standard deviation of s = 0.08944.
¯
SE(X ) = 0.08944
√
6
= 0.0365. A 95% confdence interval for the population
mean di˙erence (Old − New) is 0.01 ± 2.571 × 0.0365, which works out to
0.01 ± 0.094 or (−0.084, 0.104). We can be 95% confdent that the population
mean di˙erence (Old method − New method) lies between −0.084 and 0.104.
(d) H0 : µ = 0 (There is no di˙erence between the two methods on average).
Ha : µ =6 0 (There is a di˙erence between the two methods on average).
0.01−0
t = 0.0365 = 0.274.
The p-value is double the area to the right of 0.274 under a t distribution
with 5 degrees of freedom. By computer: p-value = 0.80. By the table:
p-value > 0.40. There is no evidence (p-value = 0.80) that the population
mean di˙erence di˙ers from 0. We have no evidence of a di˙erence between
the viscosity measurements on average.
J.B.’s strongly suggested exercises: 2, 5, 6, 8, 10, 12, 16, 17, 19, 20, 24, 25, 26, 27
11.1 Introduction
p(1−p)
1. E(p̂) = p (p̂ is an unbiased estimator of p), and V ar(p̂) = n .
2. The normal approximation is best when n is large and p = 0.5. The normal
approximation is worst when n is small and p is close to 0 or 1.
3. Never. We make confdence intervals for parameters, and never for statistics.
4. (a) p̂ = 62
= 0.31.
200q q
p̂(1−p̂)
(b) SE(p̂) = n = 0.31(1−0.31)
200 = 0.0327.
(c)
p̂ ± zα/2 SE(p̂)
0.31 ± 1.96 × 0.0327
175
Balka ISE 1.0911.3. CONFIDENCE INTERVALS AND HYPOTHESIS TESTS FOR THE POPULATION PROPORTION P 176
5. (a) np̂ = 10 × 0.70 = 7, n(1 − p̂) = 10 × 0.30 = 3. Since these quantities are
not both at least 15, we should not be using the normal approximation. (The
sampling distribution of p̂ cannot be reasonably approximated by a normal
distribution in this situation.)
(b) np̂ = 200 × 0.70 = 140, n(1 − p̂) = 200 × 0.30 = 60. Since these quantities
are both at least 15, we can use the normal approximation. (The sampling
distribution of p̂ can be reasonably approximated by a normal distribution in
this situation.)
(c) np̂ = 200 × 0.99 = 198, n(1 − p̂) = 200 × 0.01 = 2. Since these quantities are
not both at least 15, we should not be using the normal approximation. (The
sampling distribution of p̂ cannot be reasonably approximated by a normal
distribution in this situation.)
(d) np̂ = 10,000×0.01 = 100, n(1−p̂) = 10,000×0.99 = 9,900. Since these quanti-
ties are both at least 15, we can use the normal approximation. (The sampling
distribution of p̂ can be reasonably approximated by a normal distribution in
this situation.)
(e) Here np̂ = 0, and we should not even think about using the normal approxi-
mation in this situation.
p̂ ± 1.96SE(p̂)
0.435 ± 1.96 × 0.0260
0.435 ± 0.051
p̂−p0
Z = SE 0 (p̂)
= 0.435−0.5
0.0262 = −2.48.
Since the alternative hypothesis is two-sided, the p-value is double the area
to the left of −2.48 under the standard normal curve. The area to the left of
−2.48 is 0.0066 (found using software or the standard normal table), and so
the p-value is about 0.013.
The p-value is less than the given signifcance level (0.013 < 0.05), so the
evidence against the null hypothesis is signifcant at α = 0.05. There is statis-
tically signifcant evidence that the true proportion of male births (for babies
born to heavy-smoking Liverpudlian parents) di˙ers from 0.5. Note that the
sample proportion of males was 0.435, with a corresponding 95% confdence
interval of (0.384, 0.486), indicating that there is strong evidence that the true
proportion of male births is in fact less than 0.5.
(h) The sample was of births to Liverpool couples in which both parents were
heavy smokers. As such, our conclusions apply only to the population of Liv-
erpool births in which both parents are heavy smokers. (Technically speaking
our conclusions apply only to the population from which we sampled, but we
may use a study like this to give us a hint of what might be the reality in other
scenarios (all UK births to heavy-smoking parents, for example).)
7. (a) n = ( 1.96
0.03 ) · 0.5(1 − 0.5) = 1067.1, and we round up to 1068. 1068 is the
2
8. SE(p̂1 − p̂2 ) is the standard error of the di˙erence in sample proportions, which is
an estimate of the standard deviation of the sampling distribution of p̂1 − p̂2 . It is
a measure of the variability in p̂1 − p̂2 if samples were to be repeatedly drawn from
the populations.
9. (a) p̂1 = 82
= 0.205, p̂2 = 104
400 400 = 0.260.
q
(b) SE(p̂1 − p̂2 ) = 0.205(1−0.205)
400 + 0.260(1−0.260)
400 = 0.02981.
(c) 0.205 − 0.260 ± 1.96 × 0.02981, which works out to −0.055 ± 0.0584 or (−0.113,
0.003).
(d) H0 : p1 = p2 , Ha : p1 =
6 p2 . The pooled proportion p̂ = 400+400
82+104
= 0.2325.
q
1 1
SE0 (p̂1 − p̂2 ) = 0.2325(1 − 0.2325)( 400 + 400 ) = 0.02987.
Z = 0.205−0.260
0.02987 = −1.841. The resulting p-value is 0.066, which does not give
signifcant evidence against the null hypothesis at α = 0.05.
(c) H0 : pM = pC (The true proportion of pigeons that will navigate to the home
loft under the conditions of this experiment is the same for the magnetic pulse
group and the control group. In short, the magnetic pulse has no e˙ect on the
likelihood of a pigeon arriving home.)
Ha : pM 6= pC (The true proportion of pigeons that will navigate to the home
loft under the conditions of this experiment di˙ers between the magnetic pulse
group and the control group. In short, the magnetic pulse has an e˙ect on the
likelihood of a pigeon arriving home.)
p̂M − p̂C
Z=
SE0 (p̂M − p̂C )
0.5385 − 0.5789
=
0.1132
= −0.357
Since the alternative hypothesis is two-sided, the p-value is double the area
to the left of −0.357 under the standard normal curve. The area to the left
of −0.357 under the standard normal curve is approximately 0.360, resulting
in a p-value of approximately 0.72. This is a large p-value, and it yields
absolutely no evidence against the null hypothesis. There is no evidence that
the proportion of pigeons that successfully navigate home di˙ers between the
magnetic pulse group and the control group. (There is no evidence that the
magnetic pulse has any e˙ect on the likelihood of a pigeon navigating back to
the loft.)
11. (a) 0.919 (the area to the right of −1.40 under the standard normal curve).
(b) 0.081 (the area to the left of −1.40 under the standard normal curve).
(c) 0.162 (twice the area to the left of −1.40 under the standard normal curve).
Balka ISE 1.09 11.6. CHAPTER EXERCISES 180
(d) 0.030 (the area to the left of −1.88 under the standard normal curve).
(e) 0.060 (twice the area to the left of −1.88 under the standard normal curve).
11.6.2 Concepts
12. p̂ is the sample proportion, whereas p represents the true proportion for the entire
population. Once the sample is drawn, we will know the value of p̂, but p is typically
an unknown value that we are trying to estimate.
13. No, this would never make sense. We test hypotheses about parameters, and not
about statistics.
14. SE(p̂) is the standard error of p̂, the estimate of the standard deviation of the
sampling distribution of p̂. It is a measure of the variability in p̂, if we were to
repeatedly draw samples from the population. Since we regard p as a fxed, unknown
quantity that we are trying to estimate, it is simply a constant and it does not have
a standard error.
15. (a) np0 = 10 × 0.20 = 2, n(1 − p0 ) = 10 × 0.80 = 8. Since these quantities are
not both at least 15, we should not be using the normal approximation. (The
sampling distribution of p̂ cannot be reasonably approximated by a normal
distribution in this situation.)
(b) np0 = 200 × 0.20 = 40, n(1 − p0 ) = 200 × 0.80 = 160. Since these quantities
are both at least 15, we can use the normal approximation. (The sampling
distribution of p̂ can be reasonably approximated by a normal distribution in
this situation.)
(c) np0 = 10 × 0.998 = 9.98, n(1 − p0 ) = 10 × 0.002 = 0.02. Since these quantities
are not both at least 15, we should not be using the normal approximation.
(The sampling distribution of p̂ cannot be reasonably approximated by a nor-
mal distribution in this situation.)
(d) np0 = 1000×0.998 = 998, n(1−p0 ) = 1000×0.002 = 2. Since these quantities
are not both at least 15, we should not be using the normal approximation.
(The sampling distribution of p̂ cannot be reasonably approximated by a nor-
mal distribution in this situation.)
(e) np0 = 100,000 × 0.998 = 99,800, n(1 − p0 ) = 100,000 × 0.002 = 200. Since
these quantities are both at least 15, we can use the normal approximation.
(The sampling distribution of p̂ can be reasonably approximated by a normal
distribution in this situation.)
• These types of poll are often conducted by telephone, and will therefore not
include anyone who does not have a phone. In the early days of cell phones,
Balka ISE 1.09 11.6. CHAPTER EXERCISES 181
these types of surveys called only landline phones. People who were cell-phone-
only would be left out (and these people tended to be a younger subgroup
of the population, whose opinions may di˙er in an important way from the
population as a whole).
• If this was an online survey, there are obvious biases. People who never use a
computer (who tend to be older and poorer than those who do), would not be
included.
• The survey does not include people who hang up (refuse to answer questions).
If the opinions of these people tend to di˙er from those who do answer (which
is quite likely), then there will be bias in the survey.
• On phone surveys they typically speak to adults only, and thus the interpre-
tation of the survey must relate to adult Canadians, and not Canadians as a
whole.
18. (a) The normal approximation to the sampling distribution of p̂ works best when
we have a large sample size and p = 0.5. True.
(b) The sampling distribution of p̂ becomes more normal as p tends to 1. False.
The sampling distribution becomes strongly skewed as p → 1.
(c) The sampling distribution of p is approximately normal for large sample sizes.
False. p is a parameter, not a statistic, and as such does not have a sampling
distribution.
(d) All else being equal, the value of SE(p̂) decreases as the sample size increases.
Balka ISE 1.09 11.6. CHAPTER EXERCISES 182
True.
(e) In repeated sampling, exactly 95% of 95% confdence intervals for p will cap-
ture p. False, since we are using a normal approximation and not an exact
procedure. (But if the sample size is very large then the true percentage will be
very close to 95%.)
11.6.3 Applications
19. (a) p represents the true proportion of homing pigeons that would navigate back to
the home loft (for this type of pigeon, under the conditions of this experiment).
(b) p̂ = 22
38 = 0.5789.
(c) Yes, the sample size is large enough. There are 22 successes (pigeons that
made it back) and 16 failures (pigeons that did not make it back), so the at
least 15 guideline is satisfed (barely).
q q
(d) SE(p̂) = p̂(1−p̂)
n = 0.5789(1−0.5789)
38 = 0.0801.
(e)
p̂ ± 1.96SE(p̂)
0.5789 ± 1.96 × 0.0801
0.5789 ± 0.1570
home loft is greater than 0.25. (This should not come as a great surprise, since
we found a 95% confdence interval for p of 0.42 to 0.74.)
20. (a) p represents the true proportion of seatbelt-wearing victims of fatal car acci-
dents that show seat belt marks. (Note that the study involved only accidents
in Sydney, Australia, in which there was no airbag deployment. Any conclu-
sions relate to that population of accidents.)
(b) p̂ = 27
74 = 0.3649.
(c) Yes, the sample size is large enough. (There are 27 successes (individuals with
seat belt marks) and 47 failures (individuals without seat belt marks), so the
at least 15 guideline is satisfed.)
q q
(d) SE(p̂) = p̂(1−p̂)
n = 0.3649(1−0.3649)
74 = 0.0560.
(e)
p̂ ± 1.96SE(p̂)
0.3649 ± 1.96 × 0.0560
0.3649 ± 0.1097
which works out to (0.255, 0.475).
(f) We can be 95% confdent that p lies between 0.255 and 0.475, where p is the
true proportion of seatbelt-wearing victims that show seat belt marks.
(g) No, since the information in the question does not give any indication of a
hypothesized value to be tested. It is conceivable that a researcher may have
a hypothesis they wish to test, but based on the given information this is an
estimation problem, not a testing problem.
21. (a) p represents the true proportion of murders in which the victim was the of-
fender’s domestic partner (for Finnish murders in which the murderer was
male).
(b) p̂ = 24
91 = 0.2637.
(c) Yes, the sample size is large enough. (There are 24 successes (partners) and
67 failures (non-partners), so the at least 15 guideline is satisfed.)
q q
(d) SE(p̂) = p̂(1−p̂)
n = 0.2637(1−0.2637)
91 = 0.0462.
(e)
p̂ ± 1.96SE(p̂)
0.2637 ± 1.96 × 0.0462
0.2637 ± 0.0905
which works out to (0.173, 0.354).
(f) We can be 95% confdent that the true proportion of murders that were of the
domestic partner of the perpetrator lies between 0.173 and 0.354. (For Finnish
murders in which the murderer was male.)
Balka ISE 1.09 11.6. CHAPTER EXERCISES 184
(g) H0 : p = 0.4 (The true proportion of victims that were the domestic partner of
the perpetrator is 0.40.)
Ha : p =
6 0.4 (The true proportion of victims that were the domestic partner of
the perpetrator
q is not 0.40.)
q
p0 (1−p0 ) 0.4(1−0.4)
SE0 (p̂) = n = 91 = 0.0514.
p̂−p0 0.2637−0.4
Z= SE0 (p̂) = = −2.65.
0.0514
Since the alternative hypothesis is two-sided, the p-value is double the area
to the left of −2.65 under the standard normal curve. The area to the left of
−2.65 is 0.0040 (found using software or a standard normal table), and so the
p-value is about 0.008.
The p-value is less than the given signifcance level (0.008 < 0.05), so the
evidence against the null hypothesis is signifcant at α = 0.05. There is statis-
tically signifcant evidence that the true proportion of victims that were the
domestic partner of the perpetrator di˙ers from 0.4. The sample proportion is
0.26, with a 95% confdence interval for p of (0.173, 0.354), indicating strong
evidence that the true proportion of victims that are domestic partners is in
fact less than 0.4.
(h) The sample was of Finnish murder victims in which the perpetrator was male.
As such, our conclusions apply only to the population of Finnish murder vic-
tims in which the perpetrator was male. (Technically speaking our conclusions
apply only to the population from which we sampled, but we may use a study
like this to give us a hint of what might be the reality in other scenarios
(murders committed by European males, for example).)
p̂F − p̂M
Z=
SE0 (p̂F − p̂M )
0.3516 − 0.2637
=
0.0684
= 1.285
Since the alternative hypothesis is two-sided, the p-value is double the area
to the right of 1.285 under the standard normal curve. The area to the right
of 1.285 under the standard normal curve is about 0.099, resulting in a p-
value of approximately 0.20. The p-value is greater than the given signifcance
level (0.20 > 0.05), and so the evidence against the null hypothesis is not
signifcant at the 5% signifcance level. There is not signifcant evidence that
the proportion of murder victims that were domestic partners di˙ers between
male and female murderers. (The data is consistent with the two proportions
being equal.)
(b) We can be 95% confdent that the di˙erence in the true proportions of individ-
uals that express the Leb antigen (pU −pC ) lies between −0.08 and 0.29. (Note
that 0 is contained within this interval and thus it is plausible that pU = pC .
The data is consistent with there being no association between peptic ulcers
and the Leb antigen.)
(c) H0 : pU = pC (The true proportion of individuals expressing the Leb antigen
is the same for peptic ulcer su˙erers and healthy individuals.)
Ha : pU 6= pC (The true proportion of individuals expressing the Leb antigen
di˙ers between peptic ulcer su˙erers and healthy individuals.)
p̂U − p̂C
Z=
SE0 (p̂U − p̂C )
0.7167 − 0.6136
=
0.0931
= 1.107
Since the alternative hypothesis is two-sided, the p-value is double the area to
the right of 1.107 under the standard normal curve. The area to the right of
1.107 under the standard normal curve is approximately 0.134, resulting in a p-
value of approximately 0.27. The p-value is greater than the given signifcance
level (0.27 > 0.05), and so the evidence against the null hypothesis is not
signifcant at the 5% signifcance level. There is not signifcant evidence that
the proportion of individuals expressing the Leb antigen di˙ers between peptic
ulcer su˙erers and healthy individuals. (The data is consistent with the two
proportions being equal.)
p̂H − p̂N
Z=
SE0 (p̂H − p̂N )
0.4353 − 0.5322
=
0.0271
= −3.57
Since the alternative hypothesis is two-sided, the p-value is double the area
to the left of −3.57 under the standard normal curve. The area to the left
of −3.57 under the standard normal curve is approximately 0.0002 (found
using software, or approximated with the standard normal table), resulting in
a p-value of approximately 0.0004.
Since the p-value is less than the given signifcance level (0.0004 < 0.05), there
is statistically signifcant evidence against the null hypothesis. There is strong
evidence that the true proportion of male births to heavy-smoking parents
di˙ers from the true proportion of male births to non-smoking parents. (By
looking at the sample proportions of p̂H = 0.4353 and p̂N = 0.5322, and the
Balka ISE 1.09 11.6. CHAPTER EXERCISES 188
p̂F − p̂C
Z=
SE0 (p̂F − p̂C )
0.230 − 0.135
=
0.0386
= 2.46
Since the alternative hypothesis is two-sided, the p-value is double the area
to the right of 2.46 under the standard normal curve. The area to the right
of 2.46 under the standard normal curve is approximately 0.007 (found using
software or the standard normal table), resulting in a p-value of approximately
0.014.
Since the p-value is less than the given signifcance level (0.014 < 0.05), there
is statistically signifcant evidence against the null hypothesis. There is strong
evidence that the true success rate (of obtaining a phone number) in pleasant
aroma areas di˙ers from the true success rate in areas with no aroma. (By
looking at the sample proportions of p̂F = 0.230 and p̂C = 0.135, and the
resulting 95% confdence interval for pF − pC of (0.02, 0.17), we can state that
there is strong evidence that the true success rate in pleasant aroma areas is
greater than the true success rate in areas with no aroma.)
Note that the conclusions to this study apply only to the specifc conditions of
the experiment (good-looking young males approaching young women walking
alone in a shopping mall). If the conditions of the experiment were to change
(using males who were not quite as good-looking, for example), then the results
might possibly change dramatically.
(d) Since the p-value is very small (2.7 × 10−6 ), there is very strong evidence
of a di˙erence in population proportions. Since the sample proportion in the
placebo group is larger than that of the vaccine group, there is strong evidence
that those in the vaccine group developed polio less often. And since this was
a randomized experiment, we can say the strong conclusion: This study yields
strong evidence that use of this vaccine in children reduces the risk of developing
polio.
0.0007982837
(e) RR = = 1.854. People without the vaccine were 1.85 times
0.0004305446
more likely to develop polio.
q
0.68(1−0.68)
27. (a) p̂ = 100
68
= 0.68. SE(p̂) = 100 = 0.046648. The 95% confdence
interval is given by 0.68 ± 1.96 × 0.046647, which works out to 0.68 ± 0.0914
or (0.589, 0.771). We can be 95% confdent that the proportion of this type
of minnow that would die within 48 hours after exposure to 5 ppm of zinc lies
between 0.589 and 0.771.
(b) Yes. The entire interval lies to the right of 0.50, indicating that 5 ppm of zinc
kills more than 50% of these minnows under these conditions. It appears as
though the LD50 is less than 5 ppm.
(c) H0 : p = 0.50 (5 ppm of zinc will kill 50% of minnows within 48 hours)
Ha : p =6 0.50 (The true percentage of minnows killed within 48 hours with 5
ppm of zinc qdi˙ers from 0.50.)
SE0 (p̂) = 0.50(1−0.50)
100 = 0.05. Z = 0.68−0.50
0.05 = 3.6. The p-value is double the
area to the right of 3.6 under the standard normal curve (p-value = 0.0003).
Since the p-value is less than the given signifcance level of α = 0.05, there is
signifcant evidence against the null hypothesis. There is signifcant evidence
that, under these conditions, the true proportion of minnows killed by 5 ppm
of zinc will di˙er from 0.50. Since the sample proportion is greater than 0.50
(leading to a test statistic in the right tail of the distribution), there is strong
evidence that the true proportion killed is in fact greater than 0.50. (Which
shouldn’t come as a big surprise, since our confdence interval was found to be
(0.589, 0.771) in 27a.)
(d) n = ( 2.576
0.03 ) · 0.5(1 − 0.5) = 1843.3. We would need a sample size of at least
2
1844 minnows.
(e) No. The rough guideline tells us that we need at least 15 successes and 15
failures for the normal approximation to be reasonable. Here, depending on
how we defne success and failure, we have 198 of one and only 2 of the other.
The sampling distribution of p̂ cannot reasonably be approximated by a normal
distribution in this spot.
q
0.64179(1−0.64179)
28. (a) p̂ = 43
67 = 0.64179. The 95% confdence interval is 0.64179±1.96 67 ,
which works out to .64179 ± 0.1148, or (0.527, 0.757). We can be 95% conf-
Balka ISE 1.09 11.6. CHAPTER EXERCISES 191
dence that the proportion of babies born at 25 weeks that live for at least a
year lies between 0.527 and 0.757.
(b) H0 : p = 0.5 (Half of 25 week premature babies die before one year.)
Ha : p 6= 0.5 (The true proportion premature babies that die before one year
di˙ers fromq0.5.)
0.5(1−0.5)
SE0 (p̂) = 67 = 0.06108. Z = 0.64179−0.5
0.06108 = 2.321. The p-value is
twice the area to the right of 2.321 under the standard normal curve, which is
approximately 0.020. There is very strong evidence (p-value = 0.02) that the
true proportion of this type of premature baby that survives to one year di˙ers
from 0.5. In fact, since the sample proportion is greater than 0.50 (leading to
a Z statistic in the right tail of the distribution), there is strong evidence that
the true proportion that survives is in fact greater than 0.50.
q
0.425(1−0.425)
29. p̂ = 17
40 = 0.425. SE(p̂) = 40 = 0.07816. The 95% confdence interval is
given by 0.425 ± 1.96 × 0.07816, which works out to 0.425 ± 0.1532 or (0.272, 0.578).
We can be 95% confdent that proportion of all soft-serve ice cream vendors in this
city that have E. coli counts that exceed recommended guidelines lies between 0.272
and 0.578.
q
0.05(1−0.05)
30. (a) p̂ = 50025
= 0.05. SE(p̂) = 500 = 0.009746. The 90% confdence
interval is given by 0.05 ± 1.645 × 0.009746, which works out to 0.05 ± 0.016 or
(0.034, 0.066). We can be 95% confdent that the proportion of adults (in this
age group, area, and time) that have a university degree lies between 0.034
and 0.066.
(b) H0 : p = 0.23 (The proportion with a university degree was the same as for all
of Canada.)
Ha : p =
6 0.23 (The proportion with a university degree di˙ered from the rest
of Canada.)q
SE0 (p̂) = 0.23(1−0.77)
500 0.01882 = −9.564. The p-value is
= 0.01882. Z = 0.05−0.23
double the area to the left of −9.564 under a standard normal curve, which
is very near 0 (using a computer, we can fnd it to be 1.1 × 10−21 ). There is
extremely strong evidence that the proportion of people that have a university
degree in this age group and area di˙ered from that of Canada as a whole. In
fact, since the sample proportion is less than the proportion for all of Canada
(leading to a test statistic in the left tail of the distribution), there is a strong
indication that the true proportion in this area was in fact less than that of
Canadians as a whole.
31. n = ( 1.645
0.04 ) · 0.5(1 − 0.5) = 422.8. We would need a sample size of at least 423
2
0.0699.
A 90% confdence interval for pF − pM is given by 0.58 − 0.43 ± 1.645 × 0.0699,
which works out to 0.15 ± 0.115 or (0.035, 0.265).
(b) We can be 90% confdent that the true di˙erence between the approval ratings
of women and men (pF − pM ) lies between 0.035 and 0.265.
(c) H0 : pF = pM (The approval ratings for men and women are equal.)
Ha : pF 6= pM (The approval ratings for men and women are di˙erent.)
The pooled proportion is p̂ = 100+100
58+43
= 0.505.
SE0 (p̂F −p̂M ) = 0.505(1 − 0.505)(1/100 + 1/100) = 0.070707. Z = 0.58−0.43
p
0.070707 =
2.121.
The p-value is twice the area to the right of 2.121 under the standard normal
curve, which works out to 0.034. There is signifcant evidence against the null
hypothesis at the given level of α = 0.05. There is signifcant evidence of a
true di˙erence in approval ratings between men and women. It appears as
though women are more likely to approve of the way the President is handling
his job.
q
0.88(1−0.88)
33. (a) p̂1 = 176
200 = 0.88, p̂ 2 = 184
200 = 0.92. SE(p̂1 − p̂ 2 ) = 200 + 0.92(1−0.92)
200 =
0.02993.
A 95% confdence interval for p1 − p2 is given by 0.88 − 0.92 ± 1.96 × 0.02993,
which works out to −0.04 ± 0.0587 or (−0.099, 0.019).
We can be 95% confdent that the di˙erence in population proportion of bed-
bugs killed for heat treatments of 45 degrees and 47 degrees Celsius lies between
−0.099 and 0.019.
(b) H0 : p1 = p2 (The proportion of bedbugs killed is the same for both tempera-
tures),
Ha : p1 < p2 (The proportion of bedbugs killed is lower at the lower tempera-
ture).
p̂ = 176+184
200+200 = 0.90.
SE0 (p̂1 − p̂2 ) = 0.90(1 − 0.90)(1/200 + 1/200) = 0.030. Z = 0.88−0.92
p
0.030 =
−1.333.
p-value = 0.091 (the area to the left of −1.333 under the standard normal
curve).
There is not signifcant evidence (at the given signifcance level of α = 0.01)
of a di˙erence in proportion of bedbugs killed between the two temperatures.
Chapter 12
193
Balka ISE 1.09 194
Chapter 13
J.B.’s strongly suggested exercises: 2, 3, 4, 6, 7, 8, 10, 11, 14, 15, 16, 17, 19, 21, 22, 23
13.1 Introduction
(b)
Observed 52 149 99 100
Expected .25 × 400 = 100 .25 × 400 = 100 .25 × 400 = 100 .25 × 400 = 100
195
Balka ISE 1.09 13.2. χ2 TESTS FOR ONE-WAY TABLES 196
2. (a) Yes, the observed percentage of Q1 birthdays is higher for Canadian NHL
players than for Canadians as a whole (26.5% vs 24.2%). Yes, the observed
percentage of Q4 birthdays is lower for Canadian NHL players than for Cana-
dians as a whole (20.6% vs 23.8%).
(b) H0 : p1 = 0.242, p2 = 0.260, p3 = 0.260, p4 = 0.238 (The distribution of birth
dates for Canadian NHL players is the same as for Canadians as a whole.)
Ha : The given probabilities are not all correct. (The distribution of birth dates
for Canadian NHL players is not the same as for Canadians as a whole.)
(c) 510 × 0.242 = 123.42. This is a fair bit lower than the observed count of 135.
(d) 4 − 1 = 3.
(e)
Birth date Q1 Q2 Q3 Q4 Total
Observed count 135 146 124 105 510
Expected count .242 · 510 = 123.42 .260 · 510 = 132.60 .260 · 510 = 132.60 .238 · 510 = 121.38 510
The test statistic:
(135 − 123.42)2 (146 − 132.60)2 (124 − 132.60)2 (105 − 121.38)2
χ2 = + + + = 5.209
123.42 132.60 132.60 121.38
(f) The p-value is the area to the right of 5.209 under a χ2 distribution with 3
degrees of freedom, as illustrated in Figure 13.1.
χ2 Distribution with 3 DF
0 5.209
Using software, we can fnd that the p-value is 0.157. Using a χ2 table, we can
fnd only that the p-value falls in an interval, such as: p-value > 0.10.
(g) Since the p-value is greater than the given signifcance level (0.157 > 0.05),
the evidence against H0 is not signifcant at α = 0.05. There is not signifcant
evidence that the distribution of birth dates for Canadian NHL players di˙ers
from that of Canadians as a whole. (It is plausible that the null hypothesis
is true.) Note that this does not mean that Malcom Gladwell’s assertion is
wrong, we simply do not have signifcant evidence of a birthday e˙ect in this
analysis.1 The observed percentage of Canadian NHL players born in Q1
(26.5) is greater than the percentage for Canadians (24.2). But we do not
have enough evidence to say this is a signifcant di˙erence.
1
A proper attempt to to investigate Canadian NHL player birthdays would involve data over many years,
and possibly other methods of analysis. The birthday e˙ect in professional hockey is real, but we simply
have not seen much evidence of it here.
Balka ISE 1.09 13.2. χ2 TESTS FOR ONE-WAY TABLES 197
3. (a) H0 : The number of eggs that hatch in a nest follows a binomial distribution
with n = 3 and p = 0.80. Ha : The number of eggs that hatch in a nest does
not follow a binomial distribution with n = 3 and p = 0.80.
(b) We fnd the expected probabilities using the binomial distribution with n = 3
and p = 0.80. The expected probabilities are 0.008, 0.096, 0.384, 0.512. We
multiply these by the total number of nests (500) to get the expected counts:
4, 48, 192, 256.
2 2 2 2
(c) χ2 = (80−4)
4 + (29−48)
48 + (164−192)
192 + (227−256)
256 = 1458.889.
(d) The degrees of freedom are 4 − 1 = 3.
(e) The p-value is the area to the right of 1458.889 under a χ2 distribution with
4 − 1 = 3 degrees of freedom. The p-value is close to 0.
(f) There is very strong evidence that the number of eggs that hatch per nest does
not follow a binomial distribution with p = 0.80.
Using this estimate of p in the binomial formula with n = 3, we fnd the “expected
probabilities” for 0, 1, 2, 3 are 0.02921811, 0.19693766, 0.44247034, 0.33137389.
Multiplying by the number of nests, we obtain the expected counts: 14.60906,
98.46883, 221.23517, 165.68694. We then use our usual χ2 statistic:
(80−14.60906)2 (29−98.46883)2 (164−221.23517)2 (227−165.68694)2
χ2 = 14.60906 + 98.46883 + 221.23517 + 165.68694 = 379.1994.
The p-value is the area to the right of 379.1994 under a χ2 distribution with 4 −
1 − 1 = 2 degrees of freedom (recall that we lose one degree of freedom for every
parameter that must be estimated from the data – hence the loss of an extra degree
of freedom here). This area is near 0.
There is very, very, very strong evidence that the number of eggs that hatch per
nest does not follow a binomial distribution.
The p-value is the area to the right of 185.328 under a χ2 distribution with 4−1 = 3
degrees of freedom. This area is very close to 0. There is extremely strong evidence
that the four possible outcomes are not equally likely.
6. (a) H0 : There is no association (relationship) between the row and column vari-
ables (this can be phrased di˙erently, depending on the sampling design—the
null hypothesis may be that the row and column variables are independent).
C1 C2
R1 200·650
1000 = 130 200·350
1000 = 70
(b)
R2 220·650
1000 = 143 220·350
1000 = 77
R3 580·650
1000 = 377 580·350
1000 = 203
(c) χ2 = 209.5835.
(d) DF = (3 − 1)(2 − 1) = 2.
(e) The p-value is the area to the right of 209.5835 under a χ2 distribution with 2
degrees of freedom. This area is near 0 (p-value < 2.2e-16, as reported by R).
(f) Yes and yes. With such a tiny p-value, the null hypothesis would be rejected
at any reasonable level. There is extremely strong evidence against the null
hypothesis.
7. (a) 76.4% of the buyback guns were of small calibre. 20.3% of the homicide guns
were of small calibre. (Note that the percentage of small calibre guns is much
higher for buyback guns than for homicide guns.)
(b) H0 : The distribution of gun calibre is the same for buyback guns, homicide
guns, and suicide guns.
Ha : The distribution of gun calibre is not the same for buyback guns, homicide
guns, and suicide guns. (In other words, it is not true that the 3 distributions
are the same.)
(c) Row total×Column
Overall total
total
= 834×941
1435 = 546.89.
(d) (Number of rows − 1)(Number of columns − 1) = (4 − 1)(3 − 1) = 6.
(e) The p-value is the area to the right of 422.48 under a χ2 distribution with
6 degrees of freedom. The p-value is minuscule (very, very close to 0). (The
statistical software R reports p-value < 2.2 × 10−16 , which is R’s way of saying
that the p-value is very, very close to 0.)
(f) Here the p-value is minuscule, indicating extremely strong evidence against the
Balka ISE 1.09 13.4. CHAPTER EXERCISES 199
null hypothesis. (It would be nearly impossible to observe what was observed,
if the null hypothesis were true.) There is extremely strong evidence that
the distribution of gun calibre di˙ers between the 3 types of gun (buybacks,
homicides, suicides).
(g) It appears as though buyback guns were more likely than homicide and suicide
guns to be of smaller calibre, and homicide and suicide guns were more likely
than buyback guns to be of medium or large calibre. (We could investigate
this in greater detail using statistical inference techniques.)
(h) The samples were of guns from buyback programs in Milwaukee, and guns
used in homicides and suicides in Milwaukee, so those are the populations to
which the conclusions apply.
13.4.2 Concepts
8. E. The χ2 approximation breaks down when one or more of the expected counts
is small. Depending on how small they are, and how many are small, the ap-
proximation may be very poor. (Some people use the rough guideline that the
approximation is poor if some of the expected counts are less than 5. But this
guideline is quite conservative—if one or two of the expected counts slip a little
under 5, it is not a big problem.)
9. No. Here we have 50 children, but the sum of the observed is 119. The children are
appearing in more than one cell. (In other words, these 119 observations are not
independent.) The χ2 test is not appropriate here.
11. (a) True. The p-value is the area to the right of the test statistic.
(b) False. As the degrees of freedom increase, the χ2 distribution gets closer to a
normal distribution, that part is true. But the mean of a χ2 distribution is
equal to its degrees of freedom (not 0, as for the standard normal distribution).
(c) True. (But the skewness decreases as the degrees of freedom increase.)
(d) True.
(e) False. (Remember that the χ2 test statistic is the sum of squared terms.)
13.4.3 Applications
(b)
Phenotype
Tall/cut Tall/potato Dwarf/cut Dwarf/potato
Observed count 926 288 293 104
9 3 3 1
Expected under 9:3:3:1 16 × 1611 = 906.2 16 × 1611 = 302.1 16 × 1611 = 302.1 16 × 1611 = 100.7
Table 13.1: Observed and expected counts of phenotypes in the second generation.
(The given values are rounded, but you should carry many decimal places
throughout your calculations.)
2 2 2 2
χ2 = (926−906.2)
906.2 + (288−302.1)
302.1 + (293−302.1)
302.1 + (104−100.7)
100.7 = 1.469
(c) 0.69. The p-value is the area to the right of 1.469 under the χ2 distribution
with 4−1 = 3 degrees of freedom. Using software, we can fnd that the p-value
is 0.69. (Using a χ2 table, we could fnd only a range for the p-value, such as
p-value > 0.20.)
(d) Since the p-value is large (0.69), there is no evidence against the null hypoth-
esis. The null hypothesis would not be rejected at any reasonable signifcance
level. There is no evidence that the true ratio of phenotypes di˙ers from 9:3:3:1.
This experiment gives no evidence of genetic linkage between the genes.
Ha : p =
6 0.75 (The true proportion of round seeds is not 0.75.)
(b) Of the 80 plants, 72 had round seeds and 8 had wrinkled seeds (these are the
observed counts). If the null hypothesis is true, we would expect to get, on
average, 0.75 × 80 = 60 plants with round seeds, and 0.25 × 80 = 20 with
wrinkled seeds.
2 2
χ2 = (72−60)
60 + (8−20)
20 = 9.6.
(c) The p-value is the area to the right of 9.6 under the χ2 distribution with
2 − 1 = 1 degrees of freedom. Using software, the p-value can be found to
be 0.0019. (Using a χ2 table, we could give only a range, such as 0.001 <
p-value < 0.005.)
(d) The p-value is less than the given signifcance level (0.0019 < 0.05) and so
the evidence against the null hypothesis is statistically signifcant at α = 0.05.
There is very strong evidence that the true proportion of plants with round
seeds di˙ers from 0.75. (Note that the observed proportion of round seeds
( 72
80 = 0.90) is greater than the hypothesized proportion. It appears as though
the true proportion of round seeds is greater than 0.75.)
15. (a) p̂ = 72
80 = 0.90. Z = q0.90−0.75
.75(1−.75)
= 3.098.
80
(b) p-value = 0.0019. (The p-value is double the area to the right of 3.098 under
the standard normal curve, which can be found using software or the standard
normal table.)
(c) The square of the Z test statistic is exactly equal to the χ2 test statistic.
(3.0982 = 9.6 (other than a little rounding error).)
(d) The p-values are exactly equal. The two tests are equivalent.
18. (a) 18.3% of the 60 of peptic ulcer patients have blood type A. 34.1% of the 44
healthy individuals have blood type A.
(b) H0 : The distribution of ABO blood type is the same for peptic ulcer su˙erers
Balka ISE 1.09 13.4. CHAPTER EXERCISES 203
19. (a) The proportion of male births to non-smoking couples is 2685 5045 = 0.532. The
proportion of male births for couples where the father is a non-smoker but the
mother is a heavy smoker is 175 365 = 0.479.
(b) H0 : Sex of the baby and maternal smoking status are independent variables.
Ha : Sex of the baby and maternal smoking status are not independent vari-
ables.
(c) Row total×Column
Overall total
total
= 3595×5045
6853 = 2646.545.
(d) (Number of rows − 1)(Number of columns − 1) = (2 − 1)(3 − 1) = 2.
(e) The p-value is the area to the right of 5.499 under a χ2 distribution with 2
degrees of freedom. p-value = 0.064 (found using software). If we used a
table instead of software we could only give a range for the p-value, such as
0.05 < p-value < 0.10.
(f) Since the p-value is greater than the given signifcance level of 0.05, the evi-
dence against the null hypothesis is not statistically signifcant at this level.
We do not have strong evidence that male births and maternal smoking status
are not independent. (It is plausible that they are independent.) The smallish
p-value of 0.064 gives a hint of evidence against the null hypothesis, but it is
not statistically signifcant at α = 0.05.
(g) The sample involved births to parents in Liverpool, UK, in which the father
was a non-smoker. As such, our conclusions apply only to the population of
Liverpool births in which the father is a non-smoker. (Technically speaking
our conclusions apply only to the population from which we sampled, but we
may use a study like this to give us a hint of what might be the reality in other
scenarios (all UK births, for example).)
Balka ISE 1.09 13.4. CHAPTER EXERCISES 204
20. (a) The proportion of male births to non-smoking couples is 2685 5045 = 0.532. The
proportion of male births for couples where the mother is a non-smoker but
the father is a heavy smoker is 122 245 = 0.498.
(b) H0 : Sex of the baby and paternal smoking status are independent variables.
Ha : Sex of the baby and paternal smoking status are not independent variables.
(c) Row total×Column
Overall total
total
= 3192×5045
6032 = 2669.7.
(d) (Number of rows − 1)(Number of columns − 1) = (2 − 1)(3 − 1) = 2.
(e) The p-value is the area to the right of 1.461 under a χ2 distribution with 2
degrees of freedom. p-value = 0.482 (found using software). If we used a
table instead of software we could only give a range for the p-value, such as
p-value > 0.10.
(f) Here the p-value is large (0.48), indicating absolutely no evidence against the
null hypothesis. The observed data is consistent with sex of the baby and
paternal smoking status being independent variables.
(g) The sample involved births to parents in Liverpool, UK, in which the mother
was a non-smoker. As such, our conclusions apply only to the population of
Liverpool births in which the mother is a non-smoker. (Technically speaking
our conclusions apply only to the population from which we sampled, but we
may use a study like this to give us a hint of what might be the reality in other
scenarios (all UK births, for example).)
21. (a) For female murderers, 19.8% of the murders were of family members. For
male murderers, 4.4% of the murders were of family members. (Note that the
percentage for female murderers is much higher.)
(b) H0 : The distribution of relationship to the victim is the same for male and
female murderers.
Ha : The distribution of relationship to the victim is di˙erent for male and
female murderers.
(c) Row total×Column
Overall total
total
= 22×91
182 = 11.
(d) (Number of rows − 1)(Number of columns − 1) = (4 − 1)(2 − 1) = 3.
(e) The p-value is the area to the right of 16.638 under a χ2 distribution with 3
degrees of freedom. p-value = 0.00084 (found using software). If we used a
table instead of software we could only give a range for the p-value, such as
p-value < 0.001.
(f) Here the p-value is very small (0.00083), indicating very strong evidence against
the null hypothesis. There is very strong evidence that the distribution of re-
lationship to the victim for female murderers di˙ers from the distribution of
relationship to the victim for male murderers.
(g) It appears as though females were more likely than males to murder family
members or partners, and males were more likely than females to murder
acquaintances. (We could investigate this in greater detail using plots and
statistical inference techniques.)
Balka ISE 1.09 13.4. CHAPTER EXERCISES 205
(h) The samples were of convicted male and female murderers in Finland, so those
are the populations to which the conclusions apply.
23. (a) H0 : Text colour and ID are independent variables. (The likelihood of a stu-
dent flling out their ID number incorrectly is the same for the two colours.)
Ha : Text colour and ID not independent variables. (The likelihood of a stu-
dent flling out their ID number incorrectly is di˙erent between the colours.)
(b) Expected number of white-test writers that incorrectly fll out their ID: 17×423
842 =
8.54
Expected number of yellow-test writers that incorrectly fll out their ID: 17×419
842 =
8.46
Expected number of white-test writers that properly fll out their ID: 825×423
842 =
414.46
Expected number of yellow-test writers that properly fll out their ID: 825×419
842 =
410.54
2 2 2 2
χ2 = (1−8.54)
8.54 + (16−8.46)
8.46 + (422−414.46)
414.46 + (403−410.54)
410.54 = 13.654
(c) Using software, the p-value is 0.00022. (Using the χ table we could give only
2
more likely to fll out an incorrect ID number. (It may be the case that
the observed e˙ect is a real one, but it would be much more convincing if
we had a plausible explanation for the e˙ect that was seen in the sample.)
• This e˙ect was not seen in many other tests, including tests with the same
students in the same course.
25. (a) H0 : The true ratio is 1:1:2:2. (3 and 4 stars are twice as likely as 1 and 2
stars).
Ha : The true ratio is not 1:1:2:2.
(b) Observed counts: 12, 79, 209, 100. Expected counts: 66.667, 66.667, 133.333,
133.333. 2 2 2 2
χ2 = (12−66.667)
66.667 + (79−66.667)
66.667 + (209−133.333)
133.333 + (100−133.333)
133.333 = 98.38.
(c) The p-value is the area to the right of 98.38 under a χ distribution with
2
There is very strong evidence (p-value = 1.15 × 10−6 , signifcant at α = 0.05) that
the favours are not equally preferred.
27. The p-value is the area to the right of 377.42 under a χ2 distribution with (3 −
1)(4 − 1) = 6 degrees of freedom. This area is near 0. There is extremely strong
evidence that age and rating are not independent for viewers of this movie.
28. C is the best option. The tiny p-value gives very strong evidence of an association
between the type of concrete and the type of damage.
The p-value is the area to the right of 97.4709 under a χ2 distribution with (2 −
1) × (3 − 1) = 2 degrees of freedom. Using software, we can fnd that the p-value is
very near 0 (so close to 0 that most software will simply report it as 0). Using the
χ2 table, we can fnd that the p-value is much less than 0.0005.
The null hypothesis is that there is no association between the row and column
variables (in other words, that the methods are all equally e˙ective at killing bed-
bugs). Since we have a tiny p-value, there is very strong evidence against the null
hypothesis. As such, there is very strong evidence that the methods are not all
equally e˙ective at killing bedbugs.
Balka ISE 1.09 13.4. CHAPTER EXERCISES 208
Chapter 14
One-Way ANOVA
J.B.’s strongly suggested exercises: 2, 3, 5, 8, 10, 12, 13, 16, 18, 19, 22, 23, 25, 26, 28
14.1 Introduction
2. (a)
Source DF SS MS F p-value
(800/3)
Groups 4−1=3 800 800
3 ≈ 266.667 (200/36) = 48 p-value < 0.001
Error 40 − 4 =36 1000 − 800 = 200 200
36 ≈ 5.556 — —
Total 40 − 1 = 39 1000 — — —
(b) M SE = sp ≈ 5.556.
2
(c) F = 48.
(d) Yes (since p-value < 0.05).
209
Balka ISE 1.09 14.3. CARRYING OUT THE ONE-WAY ANOVA 210
3. (a) s2p = M SE = SSE n−k , where SSE = (ni − 1)s2i = (10 − 1)2.12 + (10 − 1)4.22 +
P
4. (a) X ¯ 2 = 12, X
¯ 1 = 5, X ¯ 3 = 6, n1 = 2, n2 = 3, n3 = 2, s2 = 2, s2 = 16, s2 = 8.
1 2 3
X¯ = 58 = 8.285714.
7
SST = 2×(5−8.285714)2 +3×(12−8.285714)2 +2×(6−8.285714)2 = 73.42857.
(2 − 1)2 + (3 − 1)16 + (2 − 1)8
(b) s2p = M SE = = 10.5.
2+3+2−3
M ST = SSTk−1 =
73.42857
2 = 36.71428.
(c) F = M SE = 10.5 = 3.497.
M ST 36.71428
The standard deviations for the di˙erent groups are very similar (36, 44, 38),
so the common variance assumption appears to be reasonable. The sample
sizes are large, so the normality assumption is not that important. Overall it’s
very reasonable to use ANOVA here. But we should plot the data, as this can
give us an overall view of what we are dealing with, help to assess normality,
and show any outliers or other problems.
(b) It’s very reasonable to use ANOVA here, even if there is a little right skewness.
ANOVA works well, even if there is a little non-normality in the groups (espe-
cially if the sample sizes are large, and they are fairly large here). We should
Balka ISE 1.09 14.4. WHAT SHOULD WE DO AFTER A ONE-WAY ANOVA? 211
plot the data and investigate the skewness though—if the skewness is strong
we might consider using a transformation before carrying out the ANOVA.
(For example, we might consider carrying out an ANOVA on the logs of the
eye movement measurements.)
(c) There is statistically signifcant evidence (p-value = 0.019 < 0.05) that the
groups do not all have the same population mean eye movement. (In other
words, there is statistically signifcant evidence that the e˙ect on eye movement
di˙ers between the treatment groups.)
(d) No, the analysis is not yet complete. Since there is statistically signifcant
evidence that the groups do not all have the same true mean, we will investigate
the di˙erences between the groups using pairwise comparisons.
14.4.1 Introduction
6. 0.9525 = 0.277.
7. (1 − 0.05)10 = 0.5987.
µT T and µU L − µRL .
(b) A 95% confdence √ interval
p for µi − µj is Xi − Xj ± t.025 SE(Xi − Xj ), where
¯ ¯ ¯ ¯
SE(Xi − Xj ) = M SE 1/ni + 1/nj . The t has 122 degrees of freedom (the
¯ ¯
degrees of freedom for error), yielding t.025 = 1.9796 (found using software –
the value may be slightly di˙erent if you look it up in a t table).
The interval for µU L − µT T is:
¯U L − X
X ¯ T T ± t.025 SE(X ¯T T )
¯U L − X
√ p
63 − 42 ± 1.9796 × 1561 1/42 + 1/41
21 ± 17.17
which works out to (3.8, 38.2).
The interval for µRL − µT T is:
¯ RL − X
X ¯ T T ± t.025 SE(X ¯T T )
¯ RL − X
√ p
64 − 42 ± 1.9796 × 1561 1/42 + 1/41
22 ± 17.17
Balka ISE 1.09 14.4. WHAT SHOULD WE DO AFTER A ONE-WAY ANOVA? 212
9. (a) 52 = 10.
�
(b) If we want to keep α at no more than 0.10, for the individual intervals we use
α0 = 010
.10
= 0.01. Each interval should have a 99% confdence level.
(c) 2.787. t0.01/2 = t0.005 . There are n − k = 30 − 5 = 25 degrees of freedom.
t0.005 = 2.787.
µT T and µU L − µRL .
(b) The family-wise confdence level of 95% implies α = 0.05. For the 3 individual
intervals, α0 = 0.05
3 , with a resulting confdence interval of (1 − 3 )100% =
0.05
98.33%.
(c) The appropriate t value is tα0 /2 = t.05/6 = t0.00833 . With 122 degrees of freedom
(the degrees of freedom for error), t0.00833 = 2.4274 (found using software – it
Balka ISE 1.09 14.4. WHAT SHOULD WE DO AFTER A ONE-WAY ANOVA? 213
11. (a) The Tukey procedure is an exact procedure if the normality assumption is
met, the common variance assumption is met, and the sample sizes are equal.
Although not an exact procedure if the sample sizes are di˙erent, it can still
be used as an approximate procedure. Here, the sample sizes are very close
(41, 42, 42), and so it is very reasonable to use the Tukeyp
procedure.
√
(b) The interval for µU L − µT T is: 63 − 42 ± 2.3725 × √1561p1/42 + 1/41.
The interval for µRL − µT T is: 64 − 42 ± 2.3725 × √1561p1/42 + 1/41.
The interval for µU L − µRL is: 63 − 64 ± 2.3725 × 1561 1/42 + 1/42.
(c) The interval for µU L − µRL contains 0, and so there is not statistically signif-
cant evidence of a di˙erence in mean eye movement between the unrehearsed
lying and the rehearsed lying groups.
The intervals for µU L − µT T and µRL − µT T do not contain 0, and thus there
is statistically signifcant evidence that µU L and µRL both di˙er from µT T .
Since both intervals lie entirely to the right of 0, we have strong evidence that
µU L and µRL are both greater than µT T .
(d) The LSD intervals used a multiplier (t0.025 ) of 1.9796, the Bonferroni intervals
used a multiplier (t0.00833 ) of 2.4274, and the Tukey intervals used a multiplier
1.9796 = 1.198 shows that the Tukey intervals are about
of 2.3725. The ratio 2.3725
20% wider than the LSD intervals. The ratio 2.3725 2.4274 = 0.977 shows that the
Tukey intervals are a little narrower (≈ 2% narrower) than the Bonferroni in-
tervals. Like the Bonferroni method, the Tukey method maintains the family-
wise confdence level of 95%, but it manages to do so with narrower intervals.
In this example, that resulted in one comparison crossing over to statistical
signifcance at the 5% level. (The unrehearsed lying − rehearsed lying di˙er-
ence was statistically signifcant using the Tukey method, but not signifcant
using the Bonferroni approach.)
(e) From the ANOVA, there is statistically signifcant evidence (p-value = 0.019 <
0.05) that the di˙erent lie conditions do not all result in the same true mean
eye movement. Tukey confdence intervals with a family-wise confdence level
of 95% showed that both the unrehearsed lying group and the rehearsed ly-
ing group had mean eye movements that were signifcantly greater than the
truth tellers, but there was not a statistically signifcant di˙erence between
the rehearsed and unrehearsed liars.
Balka ISE 1.09 14.5. EXAMPLES 215
14.5 Examples
12. We assume that we have simple random samples from the k populations, and that
the populations are normally distributed with common variance σ 2 . If the assump-
tions are not true, then our conclusions may be misleading (for example, the stated
Type I error may be very di˙erent from the true probability of rejecting a true null
hypothesis).
14.9 Concepts
14. No. In one-way ANOVA, only large values of the F test statistic give strong evidence
against the null hypothesis.
15. (a) The test statistic will equal 0 if M ST = 0 (provided M SE 6= 0). MST will
equal 0 when there is no variability in the sample means—when the k groups
all have the same value of the sample mean.
(b) The test statistic will equal 1 if M ST = M SE.
16. (a) If we performed a one-way ANOVA on these 3 samples, then the p-value would
be small. True. Visually there is very, very strong evidence against H0 , im-
plying a small p-value .
(b) If we carried out a t-test of H0 : µA = µC against a two-sided alternative, then
the p-value would be small. True. Visually there is very, very strong evidence
against H0 , implying a small p-value .
(c) The use of the ANOVA procedures for the test of H0 : µA = µB = µC would
be a bad idea, as the assumptions are clearly violated. False. There is no
Balka ISE 1.09 14.9. CONCEPTS 216
17. (a) The test statistic will have an F distribution with 4 df in the numerator, and
45 df in the denominator. If the null hypothesis and the assumptions are true,
the F test statistic has an F distribution with k − 1 degrees of freedom in the
numerator and n − k degrees of freedom in the denominator. If there are 10
observations in each of 5 groups, there are 5 − 1 = 4 degrees of freedom in the
numerator, and 50 − 5 = 45 degrees of freedom in the denominator.
(b) If the null hypothesis and the assumptions are true, then the p-value will have
a uniform distribution between 0 and 1.
(c) Since the p-value is uniformly distributed between 0 and 1, the p-value will
equal 0.5 on average (if the null hypothesis and the assumptions are true).
18. (a) If the null hypothesis (and the assumptions) are true, then the test statistic
in one-way ANOVA has an F distribution. True.
(b) In one-way ANOVA, we assume that the observations within each group are
normally distributed, and that all groups have the same population variance.
True.
(c) In one-way ANOVA, we assume that the observations within each group are
normally distributed, and that all groups have the same population mean.
False
(d) The test statistic in one-way ANOVA can be negative. False. The F statistic
is a ratio of variances, and as such cannot be negative.
(e) If the null hypothesis is false, then MST will tend to be bigger than MSE.
True.
19. (a) If the null hypothesis is true, then the F statistic will be infnite. False (of
course).
(b) If the null hypothesis is true, then the F statistic will equal 1. False. If H0 is
true, the F statistic will have a median that is near 1, but it can take on any
value between 0 and ∞.
(c) If the null hypothesis is false, then the F statistic will sometimes be less than
Balka ISE 1.09 14.10. APPLICATIONS 217
one. True. The F statistic can take on any nonnegative value, regardless of
whether the null hypothesis is true or false.
(d) If the null hypothesis is false, then the F statistic will sometimes be less than
0. False. The F statistic cannot be negative.
(e) If the null hypothesis is false, then the p-value will be less than 0.05. False. If
the null hypothesis is false, we can still end up with a large p-value.
20. (a) If M SE = 0, then the p-value will equal the F statistic. False.
(b) If the population means are equal, then the F statistic will equal 1. False. If
the population means are equal, then H0 is true and the F statistic will have
an F distribution.
(c) If the sample means are all equal, then the F statistic will equal 1. False. If
the sample means are equal, the F statistic will equal 0.
(d) If the sample means are all equal, then the null hypothesis is true. False. If
the sample means are all equal, then there is absolutely no evidence whatsoever
against H0 . But that still does not mean it must be true.
(e) The mean square error is the pooled variance. True.
21. (a) If the population means are very di˙erent, we expect the F statistic to be
greater than one. True. If H0 is true (the population means are equal), then
the F statistic will have a median that is somewhere around 1. If H0 is false,
the F statistic will tend to be larger than 1.
(b) We will reject the null hypothesis at the 5% level whenever the sample means
are all equal. False. When the sample means are all equal there is absolutely
no evidence whatsoever against the null hypothesis.
(c) An assumption of one-way ANOVA is that the groups all have di˙erent pop-
ulation variances. False. One-way ANOVA assumes the population variances
are equal.
(d) If the assumptions of the pooled-variance t procedures are met, then the t test
and the F test are equivalent tests, with t2 = F . True.
(e) The equal variance assumption is not important for large sample sizes, due to
the central limit theorem. False. A violation of the equal variance assumption
can be very problematic, even for large sample sizes. The central limit theorem
can help out with a violation of the normality assumption.
14.10 Applications
22. (a) The ANOVA assumptions appear to be reasonable here. There are a few
outliers, which can cause problems sometimes, but there are not many and
they are not that extreme. The standard deviations between the groups are
very similar, so the assumption of equal population variances appears to be
Balka ISE 1.09 14.10. APPLICATIONS 218
1.1922. The t has 325 degrees of freedom (the degrees of freedom for error),
yielding t.025 = 1.9673. The interval is thus:
¯ R2 − X
X ¯ R2 − X
¯ R3 ± t.025 SE(X ¯ R3 )
42.3 − 42.9 ± 1.9673 × 1.1922
−0.6 ± 2.3454
which works out to (−2.9, 1.7).
The interval contains 0, and thus there is not a statistically signifcant di˙er-
ence between X¯ R2 and X¯ R3 . (There is not signifcant evidence that µR2 di˙ers
from µR3 .)
23. (a) There are no outliers or major skewness (nothing indicating non-normality),
which is good. The control group has a much greater sample standard devia-
Balka ISE 1.09 14.10. APPLICATIONS 219
tion than the antivenom groups (113 vs 56 and 45). This is a little troubling,
as the ANOVA assumes equal population variances. However, the sample sizes
are similar, and so the e˙ect of di˙erent variability may not be that great. It
is reasonable to use ANOVA, but it is not the ideal situation.
(b) There is absolutely no evidence (p-value = 0.98) of a di˙erence in the pop-
ulation means of the change in leg volume. There is no evidence that the
antivenom had any e˙ect on the mean swelling.
(c) Since we had no evidence against the null hypothesis in the ANOVA, the
analysis is complete. (There is no need to carry out pairwise comparisons.
The researchers could, if they so desired, compare the two antivenom groups
with a t test, since they had this comparison in mind before the experiment,
but there would not be a signifcant di˙erence in that comparison.)
24. (a) The CBT group shows several outliers, which can be a concern, but they
are not that extreme and it may be the result of having a small IQR. Other
than the outliers, the ANOVA assumptions are well satisfed—the distributions
look roughly symmetric and the standard deviations are very similar. It’s
reasonable to use ANOVA here.
(b) There is very strong evidence (p-value = 0.006) that the groups do not all have
the same population mean change in weight. (There is very strong evidence
that the treatments do not all have the same e˙ect.)
(c) There are 32 = 3 pairwise comparisons (µC − µCBT , µC − µF T , µCBT − µF T ).
�
with 69 degrees of freedom. Using software, we can fnd that with 69 degrees
of freedom t0.0083 = 2.4537.
The interval for µC − µCBT is:
X¯C − X ¯C − X
¯ CBT ± t.025 SE(X ¯ CBT )
√ p
−0.20 − 1.36 ± 2.4537 × 11.66 1/26 + 1/29
−1.56 ± 2.2629
X¯C − X ¯C − X
¯ F T ± t.025 SE(X ¯F T )
√ p
−0.20 − 3.30 ± 2.4537 × 11.66 1/26 + 1/17
−3.50 ± 2.6133
Bonferroni procedures are essentially the same (but the Bonferroni procedure
results in slightly wider intervals).
Non-technical summary: The women in the Family Therapy treatment group
tended to gain more weight than the women in the control group. There was
not strong evidence of a di˙erence in weight gain between the control and CBT
groups, or the CBT and FT groups.
25. (a) The standard deviations are very similar, so there is no issue with the common
variance assumption. Overall, ANOVA is likely reasonable here. But we should
plot the data, as this can give us an overall view of what we are dealing with,
help to assess if the normality assumption is reasonable, and show any outliers
or other problems.
(b) There is no evidence (p-value = 0.64) of a di˙erence in the true mean SDO of
the di˙erent ethnic groups. (These conclusions apply only to the male inmate
population from which the sample was drawn.)
26. (a) Any subsequent analysis would depend on the assumption that it is reasonable
to pool the individuals together. If the distribution of SDO did vary between
the ethnic groups in a meaningful way, then conclusions based on a pooled
analysis would be suspect. For example, if the pooled sample showed no
relationship between age and SDO, this might be because the e˙ect of age
di˙ered between the ethnic groups and the pooling averaged it out. If we pool
over an important variable, the resulting conclusions can sometimes be very
misleading.
(b) There is extremely strong evidence (p-value = 3.8 × 10−8 ) that the age groups
do not all have the same population mean SDO. (These conclusions apply only
to the male inmate population from which the sample was drawn.)
(c) There is extremely strong evidence that there is a relationship between age
and SDO. From the values given in the table, it appears as though younger
inmates tend to have a higher SDO, and are thus more accepting of the notion
that it is reasonable for some groups to dominate� others. (We could investigate
this a little more formally by carrying out the 42 = 6 pairwise comparisons.
And since age is a quantitative variable, we could also investigate this by using
regression techniques instead of ANOVA.)
Source DF SS MS F p-value
Groups 6−1=5 8117.2 − 6862.8 = 1254.4 1254.4
5 = 250.88 250.88
60.2 = 4.167 < .01
Error 120 − 6 = 114 60.2 × 114 = 6862.8 60.2 — —
Total 120 − 1 = 119 8117.2 — — —
(b) F = 4.167. The p-value is the area to the right of 4.167 under an F distribution
with 5 and 114 degrees of freedom. Using a computer: p-value = 0.0016. Using
the table: 0.001 < p-value < 0.01.
(c) There is signifcant evidence that the teaching methods do not all result in the
same mean test performance.
√
(d) SE(X ¯i − X¯ j ) = 60.2 1/20 + 1/20 = 2.4536. We now need t.05 with 114
p
Source DF SS MS F
Treatments 9−1=8 612 − 324 = 288 288
= 36 36
= 3.00
28. (a) 8 12
Error 36 − 9 = 27 27 ∗ 12 = 324 12
Total 36 − 1 = 35 612
(b) Here we need the area to the right of 3 under an F distribution with 8 and
27 degrees of freedom. Using a computer: p-value = 0.015. Using the table:
0.01 < p-value < 0.025.
(c) There is signifcant evidence against the null hypothesis, which means there is
signifcant evidence that not all of the seat belt types have the same population
mean breaking strength.
√ p
(d) 2.052 × 12 1/4 + 1/4 = 5.026
(e) i. The sample means were not all equal. True. If the sample means were all
equal, then MST = 0.
ii. This was a balanced experimental design (same sample size in each group).
True, there were 4 observations in each group.
iii. We know for certain the population mean breaking strengths of the di˙er-
ent groups are not all equal. False. Statistical inference is not about being
certain.
iv. The pooled sample variance is equal to 12. True (the pooled variance is
MSE).
29. (a) H0 : µ1 = µ2 = µ3 = µ4 = µ5 (the mean fuel use is the same for all 5 additives).
Ha : µi 6= µj for at least one i, j combination. (the means of the 5 additives
are not all equal).
(b) We require the di˙erent groups to have normally distributed populations with
common variance σ 2 . Normal quantile-quantile plots can help us check the
normality assumption (although these may not be very informative with the
Balka ISE 1.09 14.10. APPLICATIONS 223
small sample sizes). Summary statistics and boxplots could help us determine
if the equal variance assumption is reasonable.
(c) The ANOVA table is given in Table 14.2.
Source DF SS MS F p-value
Groups 5−1=4 680.2 680.2
4 = 170.05 170.05
18.533 = 9.175 p-value < .001
Error 20 − 5 = 15 958.2 − 680.2 = 278 18.533 — —
Total 20 − 1 = 19 958.2 — — —
(d) F = 9.175. The p-value is the area to the right of 9.175 under an F distribution
with 4 and 15 degrees of freedom. Using a computer: p-value = 0.0006. Using
the table: p-value < 0.001.
(e) There is very strong evidence (p-value = 0.0006) that the additives do not all
result in the same population mean fuel use.
30. (a) H0 : µ1 = µ2 = µ3 = µ4 (the population mean strength is the same for all 4
solder types).
Ha : µi =
6 µj for at least one i, j combination. (the population means of the 4
solder types are not all equal).
(b) There is very strong evidence (p-value = 0.00017) that the types of tin-lead
solder do not allpresult in the same mean shear strength.
√
(c) 2.120 × 4.425 1/5 + 1/5 = 2.820.
Balka ISE 1.09 14.10. APPLICATIONS 224
Chapter 15
J.B.’s strongly suggested exercises: 2, 4, 6, 8, 9, 12, 14, 15, 16, 18, 20, 37, 40, 41, 42, 43
15.1 Introduction
2. (a) Since we wish to use high school GPA to help predict university GPA, high
school GPA will be the explanatory variable and university GPA will be the
response variable.
(b) There are a number of variables that might help to predict university GPA,
including the student’s socioeconomic background, gender, and high school
they attended. Regression analysis can help us to determine which of these
are useful predictors.
3. In simple linear regression there is one explanatory variable (X). In multiple linear
regression there is more than one explanatory variable (X1 , X2 , . . .). (In both cases
there is a single response variable Y .)
225
Balka ISE 1.09 15.2. THE LINEAR REGRESSION MODEL 226
5. β̂1 is a statistic that is calculated using sample data. It estimates β1 , the true slope
(a parameter).
β̂0 is a statistic that is calculated using sample data. It estimates β0 , the true
intercept (a parameter).
βˆ1 is the slope. We know from our grade school days that Slope = RiseRun . When
X = 2, Y is approximately 6, and when X = 10, Y is approximately −10. This
is a drop of approximately 16 units (a rise of approximately −16) over a run of 8.
βˆ1 = Rise
Run ≈ 8 = −2. (The slope of the given line is approximately −2.) β̂0 is the
−16
7. We
P 2use P the least squares method—the parameter estimate are chosen such that
ei = (Yi − (β̂0 + β̂1 Xi ))2 is a minimum. In other words, the parameter esti-
mates are chosen such that the sum of the squared vertical distances from the points
to the line is a minimum.
10. We assume that the observations are independent, and that the term is a random
variable that is normally distributed with a mean of 0 and the same variance (σ 2 )
at every value of X. Under these assumptions, Y ∼ N (β0 + β1 X, σ 2 ).
P 2
ei 3.12 + 2.42 + (−6.1)2 + (−.5)2 + 8.92 + (−12.6)2 + (4.8)2
11. (a) s =
2 = =
n−2 7−2
62.768.
(b) The sum of the residuals is 0. The residuals always sum to 0 in a least squares
regression.
12. (a) Never. We carry out hypothesis tests on parameters, not statistics.
(b) Rarely. This might be a point of interest for an investigator from time to time,
but not often.
Balka ISE 1.09 15.4. STATISTICAL INFERENCE IN SIMPLE LINEAR REGRESSION 228
which works out to (−10.963, −0.507). Note that this 95% confdence interval
lies entirely to the left of 0, which should not be surprising given the results
of the hypothesis test.
15. Plot A shows curvature in the residuals (the residuals tend to be too small for
large and small values of X, and too large near the mean of X). Our simple linear
regression model is not adequate for this data.
Plot B shows that the variance of the residuals increases as X increases. This is
a problem for our model. (In these situations, the slope and intercept estimators
are still unbiased estimators of their respective parameters, but our standard errors
will be o˙.)
Plot C shows a random scattering of points, and no problems with the assumed
linear regression model.
Plot D shows a normal quantile-quantile plot in which the points lie very close to
a straight line. It indicates that the observed residuals are approximately normally
distributed. This indicates that the normality assumption is reasonable.
16. The plots look good, and do not indicate any problems with the assumed model.
The plot of the residuals vs snout-vent length shows a random scattering of points,
without any problems like curvature or increasing variance. The points on the
normal quantile-quantile plot fall roughly on a straight line. (There are 3 residuals
that are a little smaller than would be expected (very mild outliers), but they are
unlikely to have a major impact.) The assumptions of the model are reasonable,
and the inference procedures we carried out in Question 14 are valid.
18. (a) Yes, the correlation coeÿcient would be a very reasonable measure of the
strength of the relationship in each of the four plots, since all of the relation-
ships look roughly linear.
Balka ISE 1.09 15.7. ESTIMATION AND PREDICTION USING THE FITTED LINE 230
(b) From software, the calculated values of the correlation coeÿcients: Plot A:
r = 0.523, Plot B: r = −0.988, Plot C: r = −0.884, Plot D: r = 0.082.
Ranking in terms of increasing values of the correlation coeÿcient r: Plot B <
Plot C < Plot D < Plot A.
(c) Ranking in terms of increasing strength of the linear relationship:
Plot D < Plot A < Plot C < Plot B. (The greater the magnitude of r, the
stronger the linear relationship.)
(d) A change of the scaling of X and/or Y would not change the value of the cor-
relation coeÿcient r. Multiplying X or Y by a constant, or adding a constant,
does not change r.
19. (a) Yes, the coeÿcient of determination would be a very reasonable measure of
the strength of the relationship in each of the four plots, since all of the rela-
tionships look roughly linear.
(b) From software, the calculated values of the coeÿcient of determination: Plot
A: R2 = 0.274, Plot B: R2 = 0.977, Plot C: R2 = 0.781, Plot D: R2 = 0.007.
Ranking in terms of increasing values of the coeÿcient of determination:
Plot D < Plot A < Plot C < Plot B.
(e) A prediction interval for a single value is wider than the corresponding conf-
dence interval for the mean. (There is greater uncertainty when we are pre-
dicting a single value than when we are estimating the mean.)
15.8 Transformations
22. (a) Points A and D have the greatest leverage, since they are furthest from the
mean of X. Points B and C have very little leverage, since they are near the
mean of X.
(b) Points B and D have the residuals with the greatest magnitude. (The vertical
distance from points B and D to the line is far greater than for the other two
points.)
(c) Point D would be the most infuential, since it has both high leverage and falls
far from the overall pattern of the observations. It would defnitely have the
most infuence of any point in this data set.
(d) If point D were removed, the slope would increase.
23. (a) No, as this would be extrapolating far beyond the range of the observed data.
(b) No, the model is not fawed. We view β̂0 as the estimated mean of Y when
X = 0 only if X = 0 is within (or very near) the range of the observed data.
When X = 0 is far beyond the range of the observed data, we typically view
β̂0 simply as the Y -intercept, a necessary part of the line without any practical
meaning beyond that.
Balka ISE 1.09 15.12. A BRIEF MULTIPLE REGRESSION EXAMPLE 232
24. (a) e1 = Y1 − Ŷ1 , where Y1 = 25, Ŷ1 = 1.20 + 3.14(8) = 26.32. e1 = 25 − 26.32 =
−1.32.
(b) 3.14 ± 2.306 × 0.3436, which works out to 3.14 ± 0.7923, or (2.35, 3.93).
(c) Yes, since the interval does not contain 0. Since the entire interval lies to the
right of 0, there is strong reason to believe β1 > 0, and there is an increasing
relationship of Y with X.
(d) Ŷ = 1.20 + 3.14(120) = 378. This prediction is an extrapolation, using the
regression line to estimate Y for an X value that is beyond the range of the
observed data. It is not a reasonable use of the regression line.
(e) Ŷ = 1.20 + 3.14(20) = 64. This is still an extrapolation, and it’s still not
reasonable.
15.13.2 Concepts
27. The term extrapolating refers to using the regression line to predict or estimate
values of Y for values of X that are beyond the range of the observed data. This
should be avoided.
28. (a) Plot 1 shows a strong relationship between X and Y , but it is clearly not a
straight line relationship. Plots 2 and 3 look reasonable.
(b) Plot 1 has the strongest relationship.
(c) Plot 2 shows the strongest linear relationship.
(d) In decreasing order of the strength of the linear relationship: Plot 2 > Plot
3 > Plot 1. Since r2 measures the strength of the linear relationship, .9509
corresponds to Plot 2, .5935 corresponds to Plot 3, and .04392 corresponds to
Plot 1.
29. No. If r is very close to 0, then the sample shows no linear relationship between
X and Y . There could still be a strong relationship between X and Y , but not a
simple straight line relationship. See, for example, the relationship in Figure 15.1.
And if r is close to 0, there is always the chance that there is truly a strong straight
line straight line relationship between X and Y , and just we happened to see a
value of r close to 0.
30. An r2 value of .95 implies a very strong linear relationship between X and Y . For
example, Figure 15.2 shows a relationship with r2 = 0.81. But even though the
relationship is strong, there is not necessarily a causal link.
Yˆ = −1.0 + 5.1(3) = 14.3, Yˆ = −1.0 + 5.1(4) = 19.4. The four residuals are
4.1 − 4.1 = 0, 8.2 − 9.2 = −1.0, 16.3 − 14.3 = 2.0, 18.4 − 19.4 = −1.0. This
leaves a sample variance about the regression line of
e2
P
32. s2 = n−2i is the best estimate of σ 2 . We aren’t given the fourth residual, but we
do know that the residuals must sum to 0 ( e2i = 0). Since the frst 3 resid-
P
uals sum to 4.7, this implies the fourth residual must have been −4.7. s2 =
2.02 +3.42 (−0.7)2 +(−4.7)2
4−2 = 19.07.
33. (a) In reality there is absolutely no relationship between X and Y in this situation
(they are randomly generated values, randomly paired together), so β1 = 0.
The null hypothesis and the assumptions of the t test are true, and so the test
statistic will have a t distribution with n − 2 = 10 − 2 = 8 degrees of freedom.
(b) 0, since that is the mean of a t distribution with 8 degrees of freedom.
(c) In the event the null hypothesis is true (and the assumptions are true), the
p-value has a uniform distribution between 0 and 1. so here the p-value would
have a uniform distribution between 0 and 1.
(d) 0.5 (the mean of a uniform distribution between 0 and 1).
34. The test of H0 : β1 = 0 (the null hypothesis of no linear relationship between X and
Y ).
35. Since 0 lies outside of the 95% confdence interval for β1 , the two-sided p-value of
the test of H0 : β1 = 0 is less than .05.
36. (a) The method used to fnd the sample slope and intercept is called the method
of least cubes. False. We use the method of least squares.
(b) The residuals in a simple linear regression always sum to 0. True.
(c) If X¯ = 0, then β̂1 = 0. False.
(d) If Y = 0, then β̂0 = 0. False.
¯
Balka ISE 1.09 15.13. CHAPTER EXERCISES 235
(e) r has the same sign as β̂1 . True. The sign of r is always the same as the sign
of β̂1 .
37. (a) β̂0 has the same sign as β̂1 . False. They can be equal, but they certainly don’t
have to be.
(b) If all data points fall perfectly on a line, then r = 1 or −1. True.
(c) The least squares regression line always passes through the point (0, 0). False.
The line will only pass through the origin if β̂0 = 0.
(d) The least squares regression line always passes through the point (X ¯ , Y¯ ).
True. (To show this, try putting X ¯ and the formula for β̂0 into the least
squares line).
(e) If there is no linear relationship between Y and X, then r = 1. False. We
can’t have a stronger sample relationship than r = 1 (all points fall perfectly
on a line when r = 1).
38. (a) Tests and confdence intervals on βˆ0 are the most common inference proce-
dures in regression. False, for a couple of reasons. First, we do not carry
out inference procedures for the statistic β̂0 . We may wish to carry out infer-
ence procedures for the parameter β0 , but these procedures are not usually of
practical interest. Inference for β1 is much more common.
(b) If the sample shows no linear relationship between Y and X (a random scat-
tering of points, say), then r and r2 will be close to 0. True.
(c) i = Yi −(β̂0 +β̂1 Xi ). False. ei = Yi −(β̂0 +β̂1 Xi ), not i . i = Yi −(β0 +β1 Xi )
(d) The residuals (the ei values) are uncorrelated. False. We assume that the true
error terms (the values of ) are uncorrelated, but the observed residuals are
correlated.
(e) The sample correlation coeÿcient is the proportion of the variance in Y that
is attributable to the linear relationship with X. False. This is the meaning
of the coeÿcient of determination.
39. (a) Instead of least squares, it would be better to choose a regression line based
on minimizing the sum of the absolute value of the residuals, but statisticians
have not yet fgured out how to do so. False. We can do this, but the least
squares method has some nice mathematical properties.
(b) The estimated standard deviation in a simple linear regression is the sum of the
squared residuals, divided by the sample size. False. s2 = Sum of squared
n−2
residuals
.
(c) If the test of the null hypothesis β1 = 0 results in a very small p-value, then r2
must have been large. False. Statistical signifcance is largely tied to sample
size. A small R2 value can still result in statistical signifcance, if the sample
size is large enough.
(d) If s = 0 and β̂1 > 0, then r = 1. True. s = 0 implies that all points fall
perfectly on a line (there is no variability about the regression line), and r has
the same sign as the sample slope β̂1 .
Balka ISE 1.09 15.13. CHAPTER EXERCISES 236
15.13.3 Applications
40. (a) Overall the simple linear regression model looks to be reasonable here. A
straight line ft looks very reasonable (there is no indication of curvature).
There does appear to be some increasing variance. (As head width increases,
the variance of the residuals increases. We often see this with count data,
as the variance often increases with the mean.) We do have more advanced
statistical methods that deal with this type of data, but overall the simple
linear regression model is not bad.
(b) No, the assumptions of the simple linear regression model are not perfectly true
in this case. The model assumes that for a given X, Y is normally distributed
about the true line. That cannot possibly be true here, as Y represents a
discrete count and thus cannot be perfectly normally distributed. But we can
still use the model. It may not be perfect, but it may very well provide a very
reasonable approximation.
(c) Ŷ = −29.400 + 17.004X, where Ŷ represents the predicted number of larvae
and X represents head width.
(d) A 1 mm increase in head width (X) results in an increase in the estimated
mean number of larvae (Y ) of 17.004.
(e) It is estimated that the larger blowfies carry 0.2 × 17.004 = 3.4 more larvae
on average.
(f) Yes, there is extremely strong evidence of a relationship between female head
width and the number of live larvae that they carry. The p-value of the test
of H0 : β1 = 0 is 9.01 × 10−14 . This is a minuscule p-value, and there is almost
surely a relationship between these variables.
(g) Ŷ = −29.400 + 17.004 × 4.0 = 38.616. (If we were using the model to predict
the count for a single female with a head width of 4.0 mm, our best guess
would be 39 larvae.)
(h) 0.6969 (the value of R2 given in the output).
41. (a) H0 : β1 = 0 (there is no linear relationship between hand grip strength and
facial attractiveness).
Ha : β1 =
6 0 (there is a linear relationship between hand grip strength and facial
attractiveness).
t = β̂1 −0 = 0.01309
0.01845 = 0.709 (note that this t statistic is given in the output).
SE(β̂1 )
The p-value is double the area to the right of 0.709 under a t distribution with
n − 2 = 32 − 2 = 30 degrees of freedom. The p-value of 0.4836 is given in the
output. This is a large p-value, giving no evidence against the null hypothesis.
This sample gives no evidence of a relationship between hand grip strength
and facial attractiveness.
(Side note: The authors of the original paper found that after using more
advanced statistical methods to control for other important variables (e.g.
Balka ISE 1.09 15.13. CHAPTER EXERCISES 237
42. (a) Although the sample size is small and there is nothing defnitive, it appears as
though the relationship between GE and cadence may be more of a curvilinear
relationship. The residuals corresponding to both the small and large values of
cadence are large and positive, whereas the residuals corresponding to values of
cadence near the mean are negative. (The model may be underestimating GE
for large and small cadence values, and overestimating GE for values of cadence
near the mean.) The simple linear regression model may not be adequate.
One option for us would be to ft a curve through the points using multiple
regression techniques. (We crazy statisticians can do that!) For this example,
if we ft a curve by including an X 2 term in our model, we would get the
curvilinear relationship shown in Figure 15.3a. The residuals for this model
are shown in Figure15.3b. This model seems to ft the data much better.
26
1.0
24
0.5
Residuals
GE (%)
22
20 0.0
18
-0.5
16
80 85 90 95 100 80 85 90 95 100
Cadence (rpm) Cadence (rpm)
(a) Scatterplot with a ftted curve. (b) Residuals for the ftted curve.
Figure 15.3: Scatterplot of GE vs cadence and plot of the residuals for the ftted curve.
(b) Yes, there is very strong evidence of a relationship between cadence and GE.
The p-value of the test of H0 : β1 = 0 is given in the R output as 0.00130.
There is strong evidence that as cadence increases, GE tends to decrease.
43. (a) H0 : β0 = 0, Ha : β0 6= 0. This test does not have much practical meaning.
This tests the hypothesis that the mean memory score is 0 when the right
hippocampus has a volume of 0. This doesn’t have any practical meaning.
(b) H0 : β1 = 0, Ha : β1 =
6 0. Yes, this is meaningful. It is a test of the null hypoth-
esis that there is no linear relationship between right hippocampus volume and
score on the memory test.
(c) There is very strong evidence (two-sided p-value = 0.0017) of an increasing
relationship between right hippocampus volume and score on the Wechsler
memory scale.
(d) 0.08835 ± 2.086 × 0.02440, which works out to 0.08835 ± 0.0509 or (0.037,
0.139).
(e) e1 = Y1 − Ŷ1 . Y1 = 28. Ŷ1 = −41.72277 + 0.08835 × 955 = 42.65148.
Balka ISE 1.09 15.13. CHAPTER EXERCISES 238
e1 = 28 − 42.65148 = −14.65.
(f) 0.396 (The value of R2 √ given in the output.)
√
(g) The magnitude of r is r2 = 0.396 = 0.629. And since r has the same sign
as the slope, r = 0.629.
(c) H0 : β1 = 0.30, Ha : β1 =
6 0.30.
t = 0.3620−0.30
0.03211 = 1.931. The p-value is double the area to the right of 1.931
under a t distribution with 25 − 2 = 23 degrees of freedom. By computer:
p-value = 0.066. By the table: 0.05 < p-value < 0.10. There is not signifcant
evidence at α = 0.05 that the true slope di˙ers from 0.30.
SSXY 1100
45. (a) β̂1 = = ≈ 1.1996.
SSXXq 917 q
2
(b) SE(β̂1 ) = SSsXX = 24.935
917 ≈ 0.1649.
(c) 1.1996±2.306×0.1649, which works out to 1.1996±0.3803, or (0.8193, 1.5799).
(d) Yes. The entire interval lies well to the right of 0.
(e) H0 : β1 = 1, H0 : β1 =
6 1. t = 1.1996−1
.1649 = 1.2104. For a two-sided alternative,
0.20 < p-value < 0.30. We have no evidence that β1 di˙ers from 1.
(f) Ŷ = 161.6358 + 1.1996(92) = 271.999.
(g) s2 = 24.935.
2
(h) Under our usual assumptions, β̂1 ∼ N (β1 , SSσXX )
(i) Ŷ = 161.6358 + 1.1996(−2) = 159.2366. This is a nonsensical estimate, as it
estimates the height of a tree that is −2 months old.