Stat 101 Sample Final Exam
Stat 101 Sample Final Exam
Stat 101 Sample Final Exam
1. Conclusions taken from descriptive statistics only apply to the data at hand.
2. In the experiment of tossing two fair coins, the probability that the outcomes are identical is 0.50.
3. In probability sampling, all the elements of the population are given equal chances of inclusion in
the sample.
4. Confidence interval estimate for the difference of the mean of paired data requires us to
compute for the absolute value of the difference of each pair of observations.
5. As the sample size increases, the probability of rejecting the null hypothesis increases.
6. Systematic sampling requires information on the arrangement of the elements in the sampling
frame to determine the reliability of the estimates.
7. Sturge’s formula is used to give an approximation of the class size.
8. If A and B are mutually exclusive events with P(A)=0.3 and P(B)=0.5, then P(A∩B)=0.15.
9. A coefficient of correlation for two variables X and Y equal to zero implies that X and Y are not
correlated.
10. If a and b are constants and X and Y are independent random variables, then Var(aX – bY) is
equal to a2Var(X) + b2Var(Y).
11. In cluster sampling, it is ideal to form clusters so that the elements are heterogeneous with respect
to the characteristic being studied within each cluster.
12. If the set of observations is negatively skewed, the mode is less than the mean.
13. If the observations in the sample of size 100 are all distinct then the 7th decile is greater than the
3rd decile.
14. If the results of an investigation show that one sleeping tablet works better than another at
the 5% level of significance, the conclusion would be similar if tested at the 10% level of
significance.
15. In the simple linear regression model, the random error terms are independent from one another
and are all normally distributed with mean and variance equal to 0 and 1, respectively.
16. In stratified random sampling, we divide the population into groups called strata and randomly
select the strata to which every element of the sampled stratum will be a part of the sample for the
study.
PART II. MUTIPLE CHOICE.
1. A rubbing alcohol company has to make their products within a standard of 70% isopropyl solution so that its germ-
killing qualities are optimal and that there are no side-effects in the skin of their buyers. The quality control
department conducted a survey of their products in the production line and from a sample of 30 units, the average
concentration was 72%. At 0.05 level of significance, there was no sufficient evidence to conclude that the
concentration of the alcohol they produce was not different from 70%. The value 70% is a
2. A nongovernment organization compares the household expenditures of two districts in Quezon City. What method of
data collection is most appropriate for this particular case?
3. Suppose that 𝑍~𝑁(0,1) and 𝑐 is a real number. Which of the following is NOT correct?
4. Suppose that a dataset contains n distinct values (n is an even number) and its distribution is skewed to the right. What
percentage of the observations will be greater than the median?
A. More than 50% B. Less than 50% C. Exactly than 50% D. Indeterminate
5. A researcher wishes to study the relationship between two variables X and Y. Five respondents of the same sex, age,
and income bracket were selected and the researcher made the appropriate measurements for variables X and Y. The
following measurements were obtained:
X 5 4 3 2 1
Y 1 2 3 4 5
A. -1 B. +1 C. -1/2 D. +1/2
6. If a sample has 500 observations, how many observations are in between the 20th and 95th percentiles?
With only the level of measurement as the primary consideration, which central tendency is best and is most applicable to
these data?
8. Assuming a linear relationship between 𝑋 and 𝑌, which of the following is true if the coefficient of correlation 𝑟
equals -0.30?
9. An automobile manufacturer claims that a particular make of car averages at least 20 miles per gallon for highway
driving. The statistician for a consumer magazine believes this statement is erroneous and that the average mileage per
gallon is less than 20. State the hypotheses to use in a statistical test for the statistician to verify her belief.
10. We have a negative relationship between number of drinks consumed and number of marks in a driving test. One
individual scores 3 on number of drinks consumed, another individual scores 5 on the number of drinks
consumed. What will be their respective scores on the driving test if the intercept is 18 and the slope is 3?
11. A scientist is weighing each of the 30 fishes. She obtains a mean of 30 g and a standard deviation of 2 g. After
completing the weighing, she finds that the scale was misaligned, and always under reported every weight by 2 g (i.e.,
a fish that really weighed 26 g was reported to weigh 24 g). What is the mean and standard deviation after correcting
for the error in the scale?
A. 28 g, 2 g B. 30 g, 4 g C. 32 g, 2 g D. 32 g, 4 g
12. Consider rolling a six-sided die once. Let A be the set of outcomes where an odd number comes up. Let B be the set
of outcomes where a 1 or a 2 comes up. In terms of A and B, what is the event getting 2 dots?
A. Ac ∪ B c B. Ac ∩ B c C. Ac ∪ B D. Ac ∩ B
X −µ
16. Suppose we have defined a random variable T = which follows a 𝑡-distribution with 𝑛 − 1 degrees of
S
n
freedom. Which of the following statements is an assumption about the random sample from which 𝑋� and 𝑆 have
been computed?
17. Suppose 95% confidence interval was constructed. Which of the following is (are) the cause(s) of the increase in the
width of the confidence interval?
I. The level of confidence was set to a value higher than 95% when the other factors are held constant.
II. The Type I error was set to a value higher than 5% when the other factors are held constant.
III. The sample size was reduced given that the other factors are held constant.
18. Assume that in your hand you hold an ordinary six-sided die and a one-peso coin. You toss both the die and the coin
on a table. The probability that a tail appears on the coin and any number more than 3 on the die is
19. Let 𝐴 be a random variable with 𝐸(𝐴) = 13 and 𝑉𝑎𝑟(𝐴) = 6. Which among the following is FALSE?
20. If the p-value for your test statistic satisfies p-value > 0.25 then:
22. A pretest-posttest designed experiment is interested in estimating the difference in the true pretest and posttest means
using a fixed sample of 37 units. The confidence level was supposed to be 0.99, but instead a 95% CI for the
difference in means was computed, which of the following statements best describes the miscalculation?
23. High concentration of trace metals in drinking water affects the flavor and poses a health hazard on drinkers. A health
officer wanted to compare the zinc concentration found in bottom water and surface water for six randomly selected
brands of bottled water. The data are as follows
Brand
Location
1 2 3 4 5 6
Bottom water 0.43 0.27 0.39 0.71 0.60 0.59
Surface water 0.42 0.24 0.57 0.62 0.64 0.65
Assuming that the differences in zinc concentration found in bottom water and surface water are normally distributed,
test at 5% level of significance whether the data suggest that the true average zinc concentration of the bottom water
(µ1 ) exceeds that of surface water (µ 2 ) . The appropriate test procedure to be used is
25. The Chi- Square Test of Independence should not be used if more than ___ % of the expected frequencies
are less than 5 or when any expected frequency is less than 1.
A. 80 C. 70
B. 20 D. 30
26. If the correlation coefficient r=0.5 then the coefficient of determination is:
(The coefficient of determination is just r2.)
A. 0.10 C. 1.00
B. 0.25 D. 2.50
27. We have a negative linear relationship between number of drinks consumed and driving test score. One
individual scores 3 on number of drinks consumed, while another individual scores 5 on the number of
drinks consumed. What will be their respective scores on the driving test if the intercept is 18 and the slope
is 3?
A. 51 and 87 C. 27 and 33
B. 9 and 3 D. Impossible to predict