Q3StatisticsProbability LESSONS
Q3StatisticsProbability LESSONS
Q3StatisticsProbability LESSONS
Third Quarter
A random variable X is a variable that assumes numerical values associated with the
random outcomes of an experiment, where one (and only one) numerical value is assigned to
each sample point. Random variables are denoted by uppercase letters and particular values
of the variable are denoted by lower case letters.
A discrete random variable is one that can assume a countable number of values. Mostly,
discrete random variables represent count data, such as number of students in a class. These
data
assume only countable number of values.
Example
Suppose two coins are tossed. Let Z be the random variable representing the number of heads
that occur. Find the values of the random variable Z.
Value of the Random
Possible Outcomes
Variable Z (number of heads)
HH 2
HT 1
TH 1
TT 0
So, the possible values of the random variable Z are 0, 1, and 2.
Example
Suppose three cell phones are tested at random in preparation for the online class. Let D
represent the defective cell phone and let N represent the non-defective ones. If we let X be
the random variable for the number of defective cell phones, construct the probability
distribution in tabular and formula form then draw the corresponding probability histogram.
Solution
Determine the sample space and count the number of defective cell phones in each outcome.
Value of the Random Variable X
Possible outcomes
(number of defective cell phones)
NNN 0
NND 1
NDN 1
DNN 1
NDD 2
DND 2
DDN 2
DDD 3
There are four possible values of the random variable X (0, 1, 2 and 3).
Probability P(X)
0.300
0.250
0.200
0.150
0.100
0.050
0.000
0 1 2 3
Number of Defective Cell Phones
The probability of an event, denoted as P(X), is the numerical measure of the likelihood that an event
will occur. It is calculated by finding the quotient of the number of favorable outcomes and the total number
of possible outcomes, In symbols,
P(X)=𝑛(𝑋)/ n (S)
where n(X) is the number of the elements in the event and n(S) is the number of the elements in the sample
space.
Example 1. Suppose two coins are tossed. If X is a random variable representing the number
of tails that occur, find the probability that 1 head will come out.
Solution. The possible outcomes are HH,HT,TH and TT. Therefore, the probability that 1
head will come out P(1) is 2/4 =1/2 or 50% or P(X)= 𝑛(𝑋)/
n (S) = 2 /4 =1/2 𝑜𝑟 50%.
Example 2. Kenneth had thrown a pair of dice. What is the probability of getting a sum of 6?
9?
Solution: The sample space is the set {(2,4) (4,2), (3,3), (5,1), (1,5)} and there are 36
elements in the sample space. Hence, the probability of getting a sum of 6 is
P(X)= 𝑛(𝑋)
= 𝑜𝑟 13.89%
𝑛 (𝑆)
Example 3. Suppose a card is selected at random from a deck of 52 cards. Compute for the
probability of picking a “red queen” at random.
0Solution: The event “red queen” has two elements: E= { queen of spades, queen of clubs}.
Since there are 52 cards, the probability of picking a red queen is
P(X)= 𝑛(𝑋)/ n / S =
= or 3.85%
Example 5. Maria, a Grade 11 student has a bag with 4 blue, 6 red and 10
yellow balloons. What is the probability that a balloon chosen at random is not
yellow?
Solution: Since there are 10 yellow balloons in a bag, the probability of getting
a yellow balloon is 10
= 1 𝑜𝑟 50%. Using the complement rule, the probability
that the balloon chosen is not yellow is:
P(not yellow)= 1- P(yellow), hence,
P(not yellow)= 1- or 50%
EXERCISE 1
Directions: Determine whether the following value can represent a probability. Show your
solutions and explain you answer.
1. The probabilities that a bias die will fall as 1,2,3,4,5, and 6 are ½, 1/6, 1/12 , 1/12, 1/12 &
1/12 RESPECTIVELY.
2. The probabilities that doctors save 5,6,7,8 or 9 covid-19 patients in one day are 0.18,
0.25,
0.1, 0.3 and 0.17 respectively
3. The probabilities that a businessman will invest P 28,000, P 50,000, or P 75,000 are
, , 𝑎𝑛𝑑 .
4. P(1)= , P(2)= , P(3)= , P(4)=
5. P(1)= 0.1, P(2)= 0.2, P(3)= 0.3, P(4)= 0.4,
EXERCISE 2
Directions: Answer the following problems using the Complement Rule of probability.
1. Ma. Laarni had a box containing 8 white balls, 6 green balls and 4 red balls which she
uses for her experiment. If a ball is chosen at random, what is the probability that a. it
is not white?
b. It is not green?
c. It is not red?
2. Kyla Mae and Mary Joy are two Senior High School students running for a position in
the Student Supreme Government. If the probability that Kyla Mae will win the
election is 0.52, What is the probability that Mary Joy will win?
3. Jomarson has three coins and tossed it simultaneously. Find the probability of showing
at least one head.
EXERCISE 3
Directions: Answer the following problems with accuracy. Show your complete solutions.
1. In a Statistics and Probability class, Kenjie was assigned to draw from an ordinary
deck of cards. What is the probability of drawing; a. an ace?
b. a diamond?
c. a face card?
d. a black jack?
e. a black card?
2. In a Cookery class, students were asked to choose at random a month of the year for
their culminating activity. What is the probability that the name of the months they
chose ends with y?
3. One of the hobbies of Marjorie is to collect local key chains. She has 15 key chains
from
Baguio, 10 from Puerto Princesa, 8 from Ilocos region, and 5 from Tagaytay. Her
brother Jomarson borrows one key chain for his school activity. What is the
probability that the key chain he borrows is from Puerto Princesa?
A Random Variable can either be discrete or continuous. A discrete random variable is one
which takes on only a limited number of values. Hence, in this lesson we will illustrate the
mean and the variance of a discrete random variable.
Example1. If X is the random variable defined by the number of tails in tossing a coin twice,
the sample space would be: S={HH,HT,TH,TT}. Illustrate the process of finding the mean or
the expected value of getting one tail.
STEPS SOLUTION
1. Construct the probability distribution
for the random variable X X P(X)
representing the number of getting 0
one tail.
1
or 1/2
2
STEPS SOLUTION
µ = = 2.43
2. Subtract the mean µ from
each value of the random
variable X X P(X) X.P(X) X-µ
0 0 0-2.43=-2.43
1 1-2.43=-1.43
2 2-2.43= -0.43
3 3-2.43= 0.57
4 4-2.43= 1.57
5 5-2.43= 2.57
3. Square (X-µ)
1 -1.43 2.04
2 -0.43 0.18
3 0.57 0.32
4 1.57 2.46
5 2.57 6.60
Learning Competency
The learner is able to illustrate the Mean and Variance of a Discrete Random Variable
(M11/12SP-111b-1)
EXERCISE 1
Directions: Complete the table below by illustrating the process of finding the mean of the
probability distribution.
1. The probabilities of companies to manufacture medicines for
COVID-19
STEPS SOLUTION
1. Construct the probability distribution
for the random variable X
representing the number of company X P(X)
manufacturing medicines 1
EXERCISE 2
Directions: Complete the table below Illustrating the process of finding the variance of the
probability distribution
1. Smart Load in a Day
STEPS SOLUTION
1. Find the mean or the
expected value of the X P(X) X.P(X)
probability 10
distribution. µ=
ƩX.P(X)
15
20
25
µ=
2. Subtract the mean from
each value of the P(X) X.P(X) X-µ
random variable X
X
10
15
20
25
3. Square (X-µ)
X P(X) X. P(X) X-µ (X-µ)2
10
15
20
25
15
20
25
EXERCISE 3.
Directions: Complete the table below illustrating the process of finding the mean and the
variance of a probability distribution. 10 points
No. of Probability X.P(X) X-µ (X-µ)2 X-µ)2. P(X)
students for P(X)
Online
Learning
X
26 0.20
27 0.20
28 0.25
29 0.25
30 0.10
µ= Ʃ(X-µ)2.
P(X)=
A probability distribution can be described in terms of its mean and variance. In this lesson
you will learn how to compute the mean and variance of a discrete random variable.
Definition: If X is a discrete random with values x1, x2, x3,…xn with probabilities P(x1),P( x2),
P(x3),…P(xn), respectively, then the mean, or expected value of X denoted by
E(X)= x1 P(x1) + x2 P( x2) + x3 P(x3) + …xn P(xn)= Ex P(x), for all elements of X.
It is obtained by multiplying each value of the random variable by the probability of its
occurrence and add all the products obtained.
Illustrative Examples:
Directions: Determine the Mean and or the Expected Value of each Random
Variable 1. Tails in Five Flips of a Coin
No. of tails (X) 0 1 2 3 4 5
Probability P(X) 0.03 0.16 0.31 0.31 0.16 0.03
Solution: 2(0.31) + 3 (0.31) +16) + 5(0.03= 2.5
4(0.
E(X)= 0 (0.03) + 1(0.16) +
The mean distribution is 2.5
35
= 6.3
40
= 4.8
E(X)=27.1
Where 𝑜2 is the variance of the a discrete probability; X is the value of the random
variable ,µ is the mean of the probability distribution, and P(X) is the probability of the
random variable X
Directions: Complete the table below and compute for the variance of the probability
distributions
3. Number of Computers Sold
0 0 -2.73 7.452
(7.452)( ) =0.994
1 -1.73 2.992
(2. 992)( ) =0.199
𝑜2 =2.462
µ= = 2.73
Using formula 1
EXERCISE 1
Directions: Complete the table below and determine the mean or the expected value of the
probability distribution.
1. Number of Errors in a Test
X.(P(x)
X P(x)
4 0.25 1.0
5 0.14 ? (a)
6 0.31 1.86
7 0.2 ? (b)
8 0.1 0.8
4
5
E(X)=
EXERCISE 2
Directions: Complete the table below and find the variance of the probability distributions
1. Number of Text Messages Received in a Day
X P(X) X.P(X) X- µ (X- µ)2 (X- µ)2 . P(X)
15 1.875
18 2.25
20 7.5
22 5.5
25 1/8 3.13
µ= 𝑜2 =
Exercise 1
Compute for the mean, variance and standard deviation of the following probability
distribution. (15 points)
1.
X 1 3 5 7
P(X) 2/7 3/7 1/7 1/7
2.
X 5 10
P(X) .80 .20
3.
X 0 1 2 3 4
P(X) 1/5 0 3/5 1/5 0
X 10 20 30
4.
P(X) 0.40 0.50 0.10
5. The probability distribution of the random variable X, which can take only the values 3,
4, and 6, given that P(3)=7/30, P(4)= 1/3, and P(6)= 13/30.
Exercise 2.
Solve the following problems.
1. Let the random variable Y represents the number of bond paper per ream that are used for
reproducing modules on any given workday. The probability distribution for Department A
is
Y 1 2 3 4 5
P(Y) 0.3 0.5 0.1 0.05 0.05
and for Department B
Y 1 2 3 4 5
P(Y) 0 0.25 0.15 0.30 0.30
Compute for the mean and variance of the two departments. Compare their mean
and show that the variance of the probability distribution for department B is greater than
of Department A.
2. The probabilities that a physician attends to 2, 3, 4, or 5 patients in any day are 0.40, 0.25,
0.15, and 0.20, respectively. Find the average number of patients that a physician attends
on a day. Interpret its value in the context of medical treatment.
3. The probability that a printer produces 0, 1, 2, 3, and 4 misprints are 15%, 22%, 10%, 38%
and 15%, respectively. Construct a probability distribution table and compute the mean
value, variance and standard deviation of the random variable and then interpret each
value.
Exercise 3.
Solve the following problems completely.
1. A roulette wheels contains 50 numbers: 30 are red and 20 are green. The roulette is fair
which means that it is equally likely to land on any of the 50 numbers. You win P500 if
the roulette lands on red numbers; otherwise, you loss P300. What is the expected value or
payoff of the game?
2. In a playing card, a woman is paid P300 if she draws a jack or a queen and P500 if she
draws a king or an ace from an ordinary deck of 52 playing cards. If she draws any other
card, she will pay P200. How much is the expected value of her winning?
3. A raffle offers P 20,000 cash for 1st prize, P10,000 for 2nd prize, and P 5,000 for 3rd prize.
The raffle has 1000 tickets sold at P150 each. Find the expected gain or loss if a person
buys one ticket.
Definition
𝝁 = 𝑬(𝑿) = ∑ 𝒙𝑷(𝒙)
𝒙
The variance of X is
𝟐
𝜎2 = 𝐸[(𝑋 − 𝝁)𝟐] = ∑ (𝑋 − 𝝁) 𝑷(𝒙) 𝒐𝒓 𝜎2 = 𝑬(𝑿𝟐) − 𝝁2
or
and thestandard deviationi s 𝝈 = √𝝈𝟐
Example 1.
Gain, X Win Lose
4,980 -20
Probability P(X)
A private organization conducted a raffle draw to finance their projects. One thousand
tickets are sold at 20.00 each for a flat screen television valued at 5,000. What is the expected
value of the gain if a person purchases one ticket?
Solution.
For a win, the net gain is 4,980, since the person does not get the cost of the ticket (20)
back. While for a loss, the gain is represented by a negative number, in this case (-20). The
solution, then, is
𝜎 5
𝜎
The standard deviation is of the probability distribution is 2.236. This implies that on
the average, the number of balls selected from a bag is 2.236 units from the mean 3.
Exercise 1. Calculate for the mean or expected value, variance and standard deviation of each random
distribution. (2 points each)
1.
X 0 1 2 3
P(X) 3/4 0 0 1/4
2.
Y 1 2 3 4 5 6
P(Y) 1/6 1/6 1/6 1/6 1/6 1/6
3.
R 1 2 3
P(R) 5/10 0 5/10
4.
M 1 2 3 4
P(M) .2 .1 .3 .4
5.
S 2 4
P(S) 20% 80%
Exercise 2. Solve the following problems. (3 points each) 1. Find the mean of the
probability distribution of the random variable X if P(X)=𝑥 −3 for 15 X= 7, 8, and 9.
2. A random number generator produces sequences of independent digits, each of which
is as likely to be any digit from 0 to 9 as any other. If X denotes any single digit, find
the mean or expected value of the probability distribution.
3. A player draws a card at random from a standard deck of 52 cards. If the card drawn is
red, the player wins P500 but if it is black, he loses P250. Find the expected value of
the game.
4. A private organization conducted a raffle draw to finance their projects. One thousand
tickets are sold at 10.00 each for four prizes of 2,000, 1,500, 1,000 and 500. What is
the expected value if a person purchases two tickets?
5. By investing in a stock, Mark can make a profit in one year of P5000 with a
probability of 0.3 or take a loss of P1000 with a probability of 0.7. What is his
expected gain per month?
3. Five COVID patients are tested with a specific medicine. The probabilities for 0, 1, 2,
3, or 4 successes are 0.25, 0.10, 0.15, 0.35, and 0.15. Find the variance and standard
deviation of the probability distribution.
Y 0 1 2 3 4
P(Y) .04 .26 .15 .10 .45
4. The probability distribution of Y, the number of imperfections per 5 meters of a
synthetic fabric in continuous rolls of uniform width, is given by
FIND THE VARIANCE AND STANDARD DEVIATION OF THE PROBABILITY
DISTRIBUTION.
5. The probability that 0, 1, 2, 3, or 4 people will be placed on hold when they call a
customer service representative is shown in the distribution. Find the variance and
standard deviation for the data. The customer service has four phone lines.
C 0 1 2 3 4
P(C) .20 .18 .33 .15 .14
Exercise 2.
Do the following. Sketch the normal distribution/s given the following conditions. (2
points each)
1. Two normal distributions with the same mean and standard deviation.
2. Three normal distribution in one horizontal axis with the same mean but with different
standard deviation.
3. A normal curve with shaded area within 2 standard deviation of the mean.
4. A normal curve with a mean, μ=75 and standard deviation, σ=1.
5. A normal curve with a mean, μ=0 and standard deviation, σ=1.
Exercise 3.
Solve the following problems. Compute for the mean and standard deviation of the
distribution and construct the corresponding normal curve. (5 points each)
1. Compute the mean and standard deviation of the normal distribution where 68.3% of the
values fall between 2.8 and 4.0. Construct the normal curve of the normal distribution.
2. Assume that 95.5% of students have a grade in Statistics and Probability subject
between 85 and 93 and the data are normally distributed. (a.) Compute the mean and
standard deviation of the distribution. (b.) Construct the normal curve of the normal
distribution.
Example 1.
Find the area between z = 0 and z = 1.75.
Solution:
Step 1. Sketch the normal curve.
0 1.75
Step 2. Locate the area for z = 1.75 from the Areas Under the Normal Curve table.
Proceed down the column marked z until you reach 1.7. Then proceed to the right along this
row until you reach the column mark 5. The intersection of the row that contains 1.7 and the
column 5 is the area. The area is 0.4599.
Example 2. Find the area between z= 1.75 and z= 2.7
Solution:
Step 1. Sketch the normal curve.
A2
A1 A
0 1.75 2.7
Step 2.
Solution:
Step 1. Sketch the normal curve.
0.5
A1 A
1.75
ACTIVITY 1
Directions: Find the area that corresponds to each of the following standard normal
values(zvalues).(1 pt. each)
1. Z= 0.6 6. Z= 0.35
2. Z= 1.96 7. Z= 2.13
3. Z= -1.51 8. Z= -1.09
28
4. Z= 2.35 9. Z= -1.18
5. Z = -1.23 10. Z= -2.85 ACTIVITY 2
Directions: Identify the region/area under the normal curve corresponding to each of the
following cases.(2 pts. each)
1. Between z = 0 and z = 1.36
2. Between z = 0 and z = -1.87
3. Between z = 1.36 and z = 2.5
4. Between z = -1.36 and z = -1.87
5. Between z = -1.36 and z = 2.5
6. Between z = 1.36 and z = -2.5
7. To the right of z = 1.36
8. To the right of z = -1.36
9. To the left of z = 2.5
10.To the left of z = -1.87
ACTIVITY 3
Directions: Find the indicated area under the normal curve, then convert it to percentage.
(3 pts.each)
1. What percent of the area under the normal curve is between z = 0.58 and z = 2.8?
2. What percent of the area under the normal curve is between z = -2.00 and z =
2.00?
3. What percent of the area under the normal curve is between z = -1.15 and z =
1.15?
4. What percent of the area under the normal curve is between z = -1.34 and z = -
2.13?
5. What percent of the area under the normal curve is between z = 2.38 and z
= -1.32?
The standard score or z-score measures how many standard deviations is given
value(x) is above or below the mean. The z-scores are useful in comparing observed values.
A positive z-score indicates that the score or observed value is above the mean, whereas a
negative zscore or observed value is below the mean.
The z-score is found by using the following equations: A.
29
For Sample
𝑋 − 𝑋̅
𝑍=
𝑠
Where:
Z = standard score
X= raw score or observed value
𝑋̅= sample mean
S = sample standard deviation
B. For Populations:
𝑍=X
−µ σ Where:
Z = standard score
x= raw score or observed value µ =
population mean
σ = population standard deviation
Example 1.
On a final examination in Mathematics, the mean was 50 and the standard deviation
was 10. Determine the standard score of a student who received a score of s40, assuming that
the scores are normally distributed.
Solution:
𝑋 − 𝑋̅ 40 − 50
𝑍= == −1
𝑠 10
-1 0
This means that 40 is -1, and the standard deviation is below the mean.
Example 2.
On the third quarter examination in Physics, the population mean was 80 and the
population standard deviation was 8. Determine the standard score of student who got a score
of 92, assuming that the scores are normally distributed.
30
Solution:
𝑋 − 𝑋̅ 92 − 80
𝑍= = = 1.5
𝑠 8
-1.5 0 1.5
80 92
This means that 92 is 1.5 standard deviation above the mean.
Example 3.
Ana scored 80 in a Filipino test and 60 in an English test. Scores in Filipino test have a
mean of 70 and a standard deviation of 8. Scores in English test have a mean of 50 and a
standard deviation of 5. In which subject was her standing better assuming that the scores in
her Filipino and English class are normally distributed?
𝑋 − 𝑋̅ 80 − 70
𝑍= == 1.25
𝑠 8
For English
𝑋 − 𝑋̅ 60 − 50
𝑍= == 2
𝑠 5
Her standing in English was better than her standing in Filipino. Her score in Filipino
was 1.25 standard deviation above the mean of the scores in Filipino whereas in English, her
score was 2 standard deviation above the mean of the scores in English.
31
Example 4.
In a Statistics test, the score is 40 and the standard deviation is 4. Assuming the scores
are normally distributed, what percent of the score is
a. greater than 45?
b. Less than 50?
c. Between 35 and 46? Solution:
a. 1. Convert the score 45 to z-score.
𝑋 − 𝑋̅ 45 − 40
𝑍= = = 1.25
𝑠 4
32
2. Sketch the normal curve.
A 2 = 0.5
A1 A
0 1.25
40 4
33
1. 65 6. 56 2.
2.85 7. 80 3.
3.69 8. 62 4.
4. 76 9. 59
5. 78 10. 55
ACTIVITY 2
Directions: Using the formula for z, find the value of the random variable x in each of the
following
1.µ = 86, σ = 15, z = 1.6
2.µ = 50, σ = 8, z = 2
3.µ = 53, σ = 6, z = -1.5
4.µ = 68, σ = 4, z = -2.5
5.µ = 70, σ = 10, z = 1.5
ACTIVITY 3
For any set of measurements, (arranged in ascending or descending order), a percentile (or
a centile) is a point in the distribution such that a given number of cases is below it. A percentile
is a measure of relative standing. It is a descriptive measure of the relationship of a measurement
to the rest of the data.
34
In the previous lesson we learned how to compute the z- scores based on means
and standard deviations for both sample and population. The resulting z-scores allowed us to
read the corresponding areas (or probabilities) using the standard normal table.
0.9
0.5 0.4
0 z
Example 2.
The results of NAT scores in Mathematics are normally distributed with µ = 70 and
𝜎 = 10. What is the percentile rank of a score 75?
Solution:
a. Convert the raw score 75 to z–score
Z= X – 𝜇 / σ = 75 – 70 / 10 = 0.5
35
A
A2 A1
0
0.5
70 75
Example 3.
A certain flashlight battery lasts on the average of 5 years with a standard
deviation of 0.75 years. Assuming that the battery lives and normally distributed, find the
probability that a given battery will last less than 3.5 years?
Solution;
1. Compute for z.
Z= X – 𝜇 / σ = 3.5 – 5 / 0.75 = - 2
36
A2
A1
-2 0
3.5 5
Hence, the probability that a given battery will last less than 3.5 years is 0.0228 or
2.28
37
Computes probabilities and percentiles using the standard
normal table.
ACTIVITY 1
Directions:
Find each of the following percentile points under the normal curve.(1 point each).
1. P98
2. P80
3. P64
4. P42
5. P30
ACTIVITY 2
1. 70
2. 64
3. 58
4. 52
5. 50
ACTIVITY 3
38
Illustrates Random Sampling
1.
Simple Random Sampling
• Each individual is chosen entirely by chance and each member of the
population has an equal chance, or probability, of being selected.
2.
Systematic Sampling
• Individuals are selected at regular intervals from the sampling frame.
The intervals are chosen to ensure an adequate sample size. If you need
a sample size n from a population of size x, you should select
every x/nth individual for the sample.
3.
Stratified Sampling
• In this method, the population is first divided into subgroups (or strata)
who all share a similar characteristic. It is used when we might reasonably expect the
measurement of interest to vary between the
different subgroups, and we want to ensure representation from all the
subgroups.
4.
Cluster Sampling
• In a clustered sample, subgroups of the population are used as the
sampling unit, rather than individuals. The population is divided into
subgroups, known as clusters, which are randomly selected to be included in the study.
Examples
Identify the probability sampling technique used in each of the following situations:
39
1. The pilot testing for the modular distance learning will be conducted. Students will be
classified according to grade level and 25 students will be selected from each grade level
to participate in the pilot testing.
2. A teacher wants to choose 5 students from her 45 students. She will select every 6 th
student from her list.
3. A Principal wants to select 5 teachers to attend a 2-hour webinar on google classroom. To
avoid bias in selecting teachers, she will assign numbers to all teachers based on their
alphabetical list. She will make use of the random number table to choose the 5 teachers.
4. All senior citizens who belong to the 5 chosen barangays will join in the Rapid testing for
Covid-19.
5. Suppose a community consists of 250 families belonging to different income brackets.
We will select samples from families coming from each of the three income brackets,
high-income; average-income; and low-income.
Answers
1. Stratified
2. Systematic
3. Simple Random
4. Cluster
5. Stratified
Exercise 1
Direction: Identify the random sampling technique illustrated in each scenario.
Use:
SI for Simple Random
SY for Systematic
ST for Stratified
CL for Cluster Sampling
40
5. Students are categorized into grade levels and sections. Fifteen representatives
from each grade level and section will be chosen to participate on a survey on
the gadgets they have at home.
6. Mrs. Cruz, a Class Adviser of Grade 11 randomly selects 25 out of her 50
students to participate on a survey on the internet access of students at home.
7. All students belonging to the 8 randomly selected sections will participate in the
survey on the readiness of students for SY 2020-2021.
8. Teachers are grouped into age bracket and sex. Five representatives from each
age bracket and sex will be selected to participate on a survey on the “Online
Readiness and Capability of Teachers for SY 2020-2021.
9. In a school, every 3rd teacher will be selected to participate in the Rapid Test for
Covid-19.
10. The names of 100 televiewers are written on slips of papers. The slips of papers
are placed inside a box, and twenty names are picked from the box to receive a
cash price.
Exercise 2
Direction: Illustrate the following:
2. A study will need a sample of 50 students. There are five sections (Sections A-E) of
Grade 11 students with 50 students per class.
A. Use stratified sampling to choose the 50 samples
B. Use cluster sampling to choose the 50 samples.
3. A researcher would like to investigate the perception of students towards online learning.
A. Use stratified sampling to choose the 100 respondents from the 900 students.
Grade Level Number of Students
Grade 7 160
Grade 8 154
Grade 9 146
Grade 10 138
Grade 11 162
Grade 12 140
Total 900
Exercise 3
Direction: Read and analyze the given situation below. Determine the most appropriate
sampling method to use in each item.
A survey is conducted to determine the readiness of students for distance learning. The class
adviser would like to utilize 20 of her students to participate in the survey.
1. What sampling technique is best to use if she will not consider the gender of her
students?
41
2. What if she will consider the gender of the students in choosing the
participants, what sampling technique will she use?
3. What sampling technique will be used if she wants to have a systematic way of
choosing the respondents regardless the gender?
Examples
Identify if the following examples is a parameter or a statistic
1.The mean grade of all students in a Statistics class is 88.5.
2.The average height of a sample of 50 male students is 5 feet and 7 inches.
3.Sixty (60) percent of all students at a particular high school prefer modular distance
learning.
Answers
1.Parameter
2. Statistic
3. Parameter
Directions/Instructions:
A. Determine if the given statement is a parameter or a statistic.
1. The recovery rate of patients from Covid-19 in the Philippines is 33.49% as of July 20,
2020.
2. Based on the results of the national LESF it showed that 7.2 million enrolees this school
year prefer to use modular distance learning.
3. A sample of 10 public schools in Region 2 showed that 80% of their enrolees prefer
blended learning as revealed in the result of the LESF.
4. The average height of 100 randomly selected 16-year-old women in Region 2 is 5 feet and
4 inches.
5. In a certain secondary school, 49.75% of the entire student population chose online
distance learning.
42
6. Out of 150 sample households, only 34.67% have internet connection.
7. The percentage of students having a smartphone in the Philippines is almost 90%.
8. A sample of 50 Covid-19 patients with mild symptoms obtained a mean of 14 days to
recover from the virus.
9. Covid-19 patients with severe symptoms in the Philippines take an average of three to six
weeks to recover.
10. A sample of 500 randomly selected LESFs from Region 2 revealed that more than 50%
preferred modular distance learning.
B. For each study, identify both the parameter and statistic. Assume that the
following situations are true based on surveys.
1. The number of internet subscribers in the Philippines is 73 million. The digital population
mostly belong to the age group of 16 years old above. A study is interested to estimate the
number of internet users in Region 2. In a random sample of 5000 students aged 16-18 it
revealed that out of the 5000 samples 4218 or 84.36% are internet subscribers.
2. In the Philippines, almost 90% of the Senior High School students play online games
during the lockdown. A research is conducted to determine the proportion of Senior High
School students in Region 2 who had played at least one online game during the
lockdown. Of the 2500 random samples, 2376 or 95.04% of them played at least one
online game during the lockdown.
3. A researcher wants to estimate the average height of 16-year old Filipino men. A random
sample 500 men were taken from the different regions of the country, the researcher
obtained a mean height of 70 inches from the samples.
4. The energy department wants to determine the average monthly kilowatt per hour (KWH)
consumption of households during the lockdown. A sample of 1000 households obtained
a mean of 300 kilowatt per hour (KWH) consumption per month.
5. In the Philippines, 41.34 % of the students prefer the modular distance learning. A survey
is conducted to determine the most preferred learning modality in Region 2. Of the 1500
samples, 822 or 54.8% of them preferred the modular distance learning.
1. A recent survey of 100 students showed that the average number of hours they stay online
is 4 hours a day.
2. All students in a certain school spend an average of 80 minutes in watching youtube
videos.
3. A sample of 50 patients obtained a mean of 5 days to recover from cough.
43
computed from all possible random samples of a specific size taken from a
population.
1. Determine the number of possible samples that can be drawn from the
population using the formula: NCn
Where: N = size of the population
n = size of the sample
2. List all possible samples and compute for the mean of each sample.
3. Construct a frequency distribution of the sample means obtained in Step
2.
Example
A population consists of five numbers 13, 14, 15, 16 and 17. Consider samples of size 2 that
can be drawn from this population.
a. NCn = 5C2
= 10
!
44
Total 10 1.00
Directions/Instructions:
C. How many different samples of size n = 2 can be selected from the following sizes
of population? Write your answer on the blank provided before the number. (1
point each) 1. N = 8
2. N = 9
3. N = 12 4. N = 20 5. N = 30 6. N = 35 7. N =
42 8. N = 53 9. N = 65
10. N = 78
D. Find the mean of the following sets of numbers. Write your answer on the
space provided.
Given Mean
3, 5, 7
5, 6, 6, 7
7, 8, 9, 10, 11
8, 9, 11, 13, 15, 16
17, 18, 19, 20, 21, 22, 23
E.
a. A population consists of five numbers 4, 5, 7, 8 and 10. Consider samples of size 3
that can be drawn from the population.
A. Determine the number of possible samples that can be drawn from the population
B. List all the possible samples and their corresponding mean
C. Construct the sampling distribution of the sample means
b. A population consists of six numbers 2, 4, 6, 8, 10 and 12. Consider samples of size 2
that can be drawn from the population
11. Determine the number of possible samples that can be drawn from the population
12. List all the possible samples and their corresponding mean
13. Construct the sampling distribution of the sample means
In your previous lessons, you have studied about the mean and variance of a probability
distribution and the step-by-step process of constructing the sampling distribution of the
sample means.
45
In this learning activity sheet, you will be able to find the mean and variance of a
sampling distribution of the sample means using formulas for the mean and variance of
probability distribution and define the sampling distribution of the sample mean for normal
population when the variance is: a) known and b) unknown.
Example
Consider a population with values 2, 4, 6, 8, and 10. 1. What is the mean (),
variance (σ2) and standard deviation (σ) of the population?
2. How many samples of size 2 can be taken from the population?
3. List all possible samples of size 2 from the population if sampling is without replacement.
Then calculate the mean of each sample.
4. Construct a sampling distribution of the sample means for this population.
5. Find the mean of the sampling distribution of the sample means (𝜇𝑥̅). Compare this to the
mean of the population.
6. Calculate the variance (𝝈𝟐𝑋̅) and standard deviation (𝜎𝑋̅)of the sampling distribution of
the sample means. Compare this to the variance and standard deviation of the population.
Solution: 1. The mean of the population can be computed using the formula:
= ∑𝑋
So, the mean of the population is 6.
46
Step 3: ∑(X − )2 = 16 + 4 + 0 + 4 + 16 = 40
Step 4:
N
σ2 = 8
The standard deviation of the population can be computed by simply getting the
square root of the variance. That is σ= . So, the standard deviation of the population
is σ=
2. To find the number of samples taken from the population, use the formula C
N 𝑛
47
5. To find the mean of the Sampling Distribution of the Sample Means (𝑥̅),
Step 1: Multiply the sample mean (𝑋̅) to its corresponding probability distribution
P(𝑋̅). Label this as (𝑋̅)• P(𝑋̅); and
Step 2: Get the summation of results in Step 1.
The mean of the sampling distribution of the sample means is equal to the mean of the
population. We can write this as: 𝑥̅ =
6. To find the variance 𝝈𝟐̅𝒙 of the sampling distribution of the sample means , consider
the following steps:
Step 1: Subtract the mean (𝑥̅) of the sampling distribution of the sample means from
each sample mean (𝑋̅). It can be labelled as 𝑋̅-𝑥̅
Step 2: Square the difference of population mean and each sample mean.
It can be labelled as (𝑋̅-𝑥̅)2
Step 3: Multiply the results in step 2 by its corresponding probability. It can be labelled
as (𝑋̅-𝑥̅)2 • P (𝑋̅ )
Step 4: Get the summation of results in step 3
48
5 5-6= -1 (-1)2= 1 2
1• =2
10 10
6 6-6= 0 (0) = 0
2
0• =0
7 7-6= 1 (1)2= 1 2
1• =2
10 10
8 8-6= 2 (2) = 4
2
4• = 1 4
10 10
9 9-6= 3 (3) = 9
2
9• = 1 9
10 10
or 3
10
So, the variance of the sampling distribution of the sample means 𝝈𝟐𝑋̅ is 3.
𝜎 or 1.73
So, the standard deviation of the sampling distribution of the sample means is 1.73
The computed value of the variance and standard deviation of the sampling distribution of the
sample means is smaller than the variance and standard deviation of a population.
b. List all the possible samples of size 2 that can be drawn without replacement and
calculate its corresponding means. (The table below is already given as hint)
[5 points
]
49
c. Construct the sampling distribution of the sample means. (The table below is
already given as hint) [9 points]
Total
2. A group of students got the following scores in a Mathematics quiz: 8,12,16, 20, and
24. Consider sample scores of size 3 that can be drawn without replacement from the
given scores.
a. List all the possible samples of size 3 and calculate the means of each sample.
Round your sample mean into 2 decimal places. [5 points]
50
Total
The mean x̅ of the sampling distribution of the sample means? Compare this
to the mean of the population. [5 points]
c) The standard deviation 𝜎𝑥̅ of the sampling distribution of the sample means?
Compare this to the standard deviation σ of the population. [5 points]
2. The 4 nurses assigned in the RHU in the Municipality of Luna have the following
ages: 24, 26, 28, and 30. Suppose samples of size 2 are drawn from this population.
Let x be the age of the nurses.
a. What is the mean () and standard deviation (σ) of the ages of the population?
[5 points]
How many samples of size 2 can be taken from the population?
[2 points]
List all possible samples of size 2 of the population if sampling is without replacement.
Then calculate the mean 𝑥̅ of each sample. [4 points]
51
Direction: Solve the following problem. Assume that the population is infinite in each item.
Show your solution. [Hint: The variance of the sampling distribution of the sample means
can be solve using the formula: 𝝈𝟐𝑋̅ = 𝜎 𝑛
2
]
2. The height of female Senior High School students of Luna National High School is
normally distributed with a mean of 65 inches and a standard deviation of 2 inches. If
20 students are randomly selected from the population, what would be the mean and
standard deviation of the resulting sampling distribution of the sample means?
[4
points ]
This theorem says that regardless of the distribution of the population from
which a sample is taken, we can choose a sufficiently large n so that the resulting
sampling distribution will approach a normal distribution. In general, n≥ 30 is
large enough for the sampling distribution of the mean to be approximately normal.
𝒛 𝝁
µ = population mean
52
𝝈 = population standard deviation
𝒏 = 𝐬𝐚𝐦𝐩𝐥𝐞 𝐬𝐢𝐳𝐞
Example
The average time it takes a group of students to finish a certain class activity is 39 minutes.
The standard deviation is 8 minutes. Find the probability that if 45 randomly selected
students take the activity, the mean time it takes the group of students to complete the
activity will be less than 41 minutes.
Given:
Steps Solution
Identify the given information ̅𝒙 = 41 µ
= 39
𝝈=8
𝒏 = 𝟒𝟓
Solve the problem Z=
√𝟒𝟓
Directions/Instructions:
Follow the different steps to answer the following problems applying the central limit theorem.
1. The average time it takes a group of senior high school students to finish a periodical
examination is 45 minutes. The standard deviation is 6.5 minutes. Assume that the
variable is normally distributed.
a. What is the probability that if 50 randomly selected students will take the periodical test,
the mean time will be more than 47.5 minutes?
b. What is the probability that if 50 randomly selected will take the periodical test, the mean
time will be less than 47.5 minutes?
c. What is the probability that if 50 randomly selected will take the periodical test, the mean
time will be between 43 minutes and 47.5 minutes?
d. What is the probability that if 50 randomly selected will take the periodical test, the mean
time will be between 46 minutes and 47 minutes?
2. The mean weight of 16-year old females is 120 pounds with a standard deviation of 12.5
53
a. If a sample of 35 female is selected, find the probability that the mean weight of the
samples is less than 125 pounds.
b. If a sample of 30 female is selected, find the probability that the mean weight of the
samples is between 120 pounds and 123 pounds.
c. If a sample of 30 female is selected, find the probability that the mean weight of the
samples is between 121pounds and 123 pounds.
3. The average number of milligrams (mg) of sugar of a certain brand of milk tea is 600 mg
with a standard deviation of 28 mg. Assume that the variable is normally distributed.
a. If a sample of 30 servings of milk tea is selected, what is the probability that the mean
sugar content of the samples will be greater than 590 mg?
b. If a of sample 30 servings of milk tea is selected, what is the probability that the mean
sugar content of the samples will be between 590 mg and 605 mg?
c. If a sample of 50 servings of milk tea is selected, what is the probability that the mean
sugar content of the samples will be less than 605?
SOLVING PROBLEMS INVOLVING SAMPLING DISTRIBUTION OF
THE SAMPLE MEAN
Since, we are done learning about the properties of the sampling distribution of the
sample means in terms of its mean, variance, and standard deviation, this time we shall solve
problems involving sampling distributions of the sample means applying these properties
called the Central Limit Theorem. The Central Limit Theorem tells that the sampling
distribution of the sample means approaches a normal distribution as the sample size gets
larger, regardless the shape of the population distribution. One of the important components
of the Central Limit Theorem is that the average of sample means is the population mean.
If n ≥ 30, the sampling distribution will appear to be normal, thus Central Limit
Theorem applies.
Using the Central Limit Theorem to convert sampling distribution to standard normal
distribution:
Example 1
54
A population distribution has a mean 𝜇 = 84 and standard deviation of 𝜎 = 18. What is the
probability that a random sample size of n = 48 and a mean of 88 or more?
Use the table for Areas of the Normal Curve to find the P( 88).
0.0618
A population distribution
has a mean = 84 and a
standard deviation of =
18. What is the probability
0 1.54 that a random sample size
of n = 96 and a mean of 81
and below?
Find P( 81).
Solution
Convert the values to z-score.
The giver is n = 96 which is 30 Central Limit Theorem applies.
Use the table for Areas of the Normal Curve to find the P( 81).
55
= P(z 0) - P(-1.63 z 0)
= 0.5000 – 0.4484
= 0.0516
Thus, the probability that a random sample size of n = 96 and a mean of 81 and below
is 0.0516 or 5.16%.
Example 3
Use the table for Areas of the Normal Curve to find the P( < 85.5).
P( < 85.5) = P(z < 0.79)
= P(z 0) + P(0 z 0.79)
= 0.5000 + 0.2852
= 0.7852
Thus, the probability that a random sample size of n = 90 and a mean of less than 85.5 is
0.7852 or 78.52%.
0.78
52
56
0 0.79
Example 4
A population distribution has a mean = 84 and a standard deviation of = 18. What is the
probability that a random sample size of n = 120 and a mean of greater than 80?
Find P( > 80).
Solution
Convert the values to z-score.
The given is n = 120 which is 30 Central Limit Theorem applies.
Use the table for Areas of the Normal Curve to find the P( > 80).
P( > 80) = P(z > -2.43)
= P(-2.43 z 0) + P(z 0)
= 0.4925 + 0.5000
= 0.9925
Thus, the probability that a random sample size of n = 120 and a mean of greater than 80 is
0.9925 or 99.25%
0.9925
-2.43 0
Example 5
A population distribution has a mean = 84 and standard deviation of = 18. What is the
probability that a random sample size of n = 210 and a mean of between 83 and 86?
Find .
Solution
Convert the values to z-scores.
In here, n = 210 which 30 Central Limit Theorem applies.
57
Use the table for Areas of the Normal Curve to find the P( ). P(
) = P(-0.81 z 1.61)
= P(-0.81 z 0) + P(0 z 1.61)
= 0.2910 + 0.4463
= 0.7373
Thus, the probability that a random sample size of n = 210 and a mean of between 83 and 86
is 0.7373 or 73.73%
0.7373 Learning Competency
Solves problems involving
sampling distributions of the
sample mean (M11/12SP-
III-f-1)
-0.43 0 1.61
EXERCISE 1
1. Find the probability in each of the following situations. Assume that the variable is
normally distributed.
n Probability
80 7 75
74.5 5 12
42 4.46 53
123 12.3 165
250 16 235
2. A normally distributed variable has a mean = 120 and standard deviation of = 12. If
a sample size of n = 36 is selected, what is the probability of the following means?
1. According to research, Filipinos spend an average of 9.5 hours per day in the internet
and the standard deviation is 2.4 hours. Assume that the variable is normally
distributed. If 50 Filipinos were randomly selected,
a. what is the probability that the average hours spend in the internet is 8.5 hours and
above?
b. what is the probability that the average hours spend in the internet is less than 10
hours?
c. what is the probability that the average hours spend in the internet is between 9 hours
and 10 hours?
2. Suppose that the annual employment rate of private companies in a certain country is
to have a normal distribution with a mean of 4% and a standard deviation of 1.6%. A
random sample of 25 private companies is selected from the population and the
sample mean is calculated.
a. What is the probability that the sample mean will be less than 5%?
b. What is the probability that the sample mean will be more than 3.5%?
c. What is the probability that the sample mean will be between 3% and 5%?
3. According to SHS research teachers, the average time of senior high school research
proposal defense is 28.5 minutes and the standard deviation is 4.3 minutes. Assume
that the variable is normally distributed.
a. What is the probability that a randomly selected senior high school students will
defend their research proposals in less than 28 minutes?
b. What is the probability that a randomly selected senior high school students will
defend their research proposals in more than 28 minutes?
c. What is the probability that 45 senior high school students who are randomly selected
will defend their research proposals with an average time of less than 28 minutes?
d. What is the probability that 50 senior high school students who are randomly selected
will defend their research proposals with an average time of between 27.5 minutes and
29.5 minutes?
The t-DISTRIBUTION
In the previous lessons, you have learned z-distribution. It is used when the
population variance is unknown and the sample size is large (𝑛 ≥ 30). This time you will
encounter a new distribution called t-distribution, it is used in estimating the population
parameter when the sample size is small or less than 30 and the population variance is
unknown.
Like the z-distribution/normal distribution, the t-distribution is bell-shaped and symmetric
about the y-axis, centered at zero and the total area under the normal curve is one.
59
Glimpse from the Past
The t-distribution was also called Student’s t-distribution. William Sealy Gosset
formulated the t-distribution in 1908, he is an Irish brewing employee who was involved in
researching new methods of manufacturing ale (type of beer). He published his finding
using the pseudonym Student because brewing employees were not allowed to publish
findings.
Hence, the t-distribution is sometimes called Student’s t-distribution.
The following formula is used to convert any normal variable to its corresponding
t-value on a t-distribution. 𝑥̅ − 𝜇
𝑡= 𝑠
where: t
= t-value
𝑥̅ = sample mean
𝜇 = population mean n = sample size (Note: the
formula is used when n<30)
60
EXAMPLE 1
Source: https://www.dummies.com/wp-content/uploads/451675.image0.jpg
What is the tvalue (round to the nearest thousandth) when df=21 at α= 0.01?
Answer:
Referring to the t-table, df=21 and α= 0.01, then t=2.518. The t-value is the
intersection between the left side column (degrees of freedom = 21) and the right tail
probabilities (α= 0.01) is 2.518.
61
EXAMPLE 2
What is the t-value (round to the nearest thousandth) when n=16 at α= 0.05?
Answer:
df=n-1, since n=16, df= 16-1= 15
Referring to the t-table, df=15 and α= 0.05, then t=1.753. The t-value is the
intersection between the left side column (degrees of freedom = 15) and the right tail
probabilities (α= 0.05) is 1.753.
62
EXAMPLE 3
Find the t-score below which we can expect 99% of sample means will fall if samples of size 20
are taken a normally distributed population.
Answer:
The cumulative probability is 1- α = 0.99, we have α=0.01. Likewise, the degrees of
freedom is df = n-1 = 20 – 1 = 19. To find the required t-score, move down the first column
of the t-table until you get 19, which is the degree of freedom. Move across the row
containing this entry until you get to the column with heading α=0.01. The value that we get
is 2.539. This shows that 99% of all sample means are expected to have t-scores less than or
equal to 2.539. We can say that 1% of all sample means will have t-scores less than or equal
to -2.539.
63
EXAMPLE 4
Using the t-distribution table. Find the degrees of freedom and the critical values n=25 at
95% confidence coefficient.
Answer:
Since the confidence coefficient is 95%, we have α = 0.05. The degrees of freedom is df =
n-1 = 25 – 1 = 24. The intersection between df=24 and α = 0.05 is 1.711, the critical value
of t.
EXAMPLE 5
64
𝑡 = 2.667
PRACTICE EXERCISE 1
A. Directions: Determine the value of t (round to the nearest thousandth) based on the given
degrees of freedom df and α. [ 2 points each]
1. 𝑑𝑓 = 26, 𝛼 = 0.01
2. 𝑑𝑓 = 15, 𝛼 = 0.025
3. 𝑑𝑓 = 28, 𝛼 = 0.10
4. 𝑑𝑓 = 5, 𝛼 = 0.05
5. 𝑑𝑓 = 23, 𝛼 = 0.005
B. Directions: Determine the value of t (round to the nearest thousandth) based on the given
sample size n and α. [ 2 points each]
1. 𝑛 = 6, 𝛼 = 0.005
2. 𝑛 = 15, 𝛼 = 0.001
3. 𝑛 = 21, 𝛼 = 0.05
4. 𝑛 = 17, 𝛼 = 0.10
5. 𝑛 = 10, 𝛼 = 0.01
PRACTICE EXERCISE 2
Directions: In each of the following, samples of size n are taken from a population with a
population mean µ. Find the t-value of the given sample means. [ 3 points each]
PRACTICE EXERCISE 3
Directions: Solve the following problems. [ 5 points each]
1. A researcher gathers a plasma samples from 20 patients who recovered from COVID-19.
He conducts one-tailed t-test for a mean using an alpha level of 0.05. Once he conducts
his one-tailed t-test and obtains a test statistics t, what critical value should he compare t
to?
2. The average number of milligrams (mg) of cholesterol in a serving of a certain brand of
milk tea is 660 mg, and the standard deviation is 35 mg. Assume the variable is normally
distributed. What is the t-value if a sample of 10 servings of milk tea selected has a
sample mean of 670 mg of cholesterol?
65
3. An electronic company claims that the average life of the smartphone battery it
manufactures is 14 hours with a standard deviation of 3 hours. What is the t-value if a
random sample of 25 smartphones chosen has a sample mean of 15 hours?
EXAMPLE 1
What is the 10th percentile of a t-distribution when the sample size is 6?
Consider the charts below. The 10th percentile is found some place in the lower-end tail on
the left side of zero. The 10th percentile is the value where 10% of the quantities lie
underneath it, and 90% lie above it. It likewise implies that 0.10 is the area to the left of the
10th percentile tvalue, while 90th percentile is located somewhere in the upper-end tail on the
right side of zero. The 90th percentile is where 90% of the quantities lie underneath it, and
10% of the quantities lies above it. This means 0.10 is the area to the right of the 90th
percentile t-value.
66
t-Distribution Table
67
Source: https://www.dummies.com/wp-content/uploads/451675.image0.jp g
Source: https://link.quipper.com
Since the t-distribution table only appears at the right tail area, if the percentile lies to the
right of the mean or above the 50th percentile, the area to the right of the given percentile can
be found by subtracting the percentile from 100% or 1.
The area to the left of the 10th percentile t-value is equal to the area to the right of the
90th percentile t-value. This means that the 10th percentile and the 90th percentile have the
same t-value on the t-distribution table.
Solution:
i. List the known facts of the problem 10th percentile n= 6
df = n- 1 = 6 – 1 = 5
ii. Find the area associated with the percentile by dividing it by 100.
10th percentile is which is equal to 0.10
iii. Identify if the area of probability is higher or lower than 0.50. if the percentile is
higher than 0.50, subtract it from 1.
Since 0.10 is lower than 0.50, there is no need to subtract from 1. Thus, the area
is 0.10.
iv. Find the t-value by finding the intersection of the right tail area column and the row
for the degrees of freedom.
The desired t-value is in the body of the table. Look for the corresponding area
or probabilities column heading 𝑡0.10 and locate it with the degrees of freedom
5. The t-value is 1.476.
68
EXAMPLE 2
Calculate the 90th percentile from the value of the lower cumulative distribution of the student’s t-
distribution when the sample size is 11?
Solution:
i. List the known facts of the problem 90th percentile n= 11
df = n- 1 = 11– 1 = 10
ii. Find the area associated with the percentile by dividing it by 100.
90th percentile is which is equal to 0.90
iii. Identify if the area of probability is higher or lower than 0.50. if the percentile is
higher than 0.50, subtract it from 1.
Since 0.90 is higher than 0.50. It must be subtracted from 1.
1 – 0.90 = 0.10
0.10 is the area to the right of the 90th percentile
iv. Find the t-value by finding the intersection of the right tail area column and the row
for the degrees of freedom.
The desired t-value is in the body of the table. Look for the corresponding area
or probabilities column heading 𝑡0.10 and locate it with the degrees of freedom
10. The t-value is 1.372.
69
____________1. If the sample size is 25, then the degree of freedom is 24.
____________2. 99th percentile can be written as 99/100 or 0.99.
____________3. The percentile increased when the sample size decreased.
____________4. 90th percentile means the number where 90% of the values lie below it and 10%
lie above it.
____________5. The area of the region to the left of the 25th percentile is 0.75.
____________6. -2.064 is the 2.5th percentile of a t-distribution when the degrees of freedom is
24.
____________7. 95th percentile would result in a t-distribution value of 2.528 with 20 degree of
freedom.
____________8. A 99.5th percentile is a result in a t-distribution value of 3.106 with a sample size
of 12.
____________9. If n= 25, at α=0.05, the critical value is 1.719.
____________ 10. Suppose you have a sample size 15 and you want to find the 95th percentile
of its corresponding t-distribution, you should find first the critical values.
PRACTICE EXERCISE 2
Directions: Using the t-table, find the degrees of freedom and t-value corresponding to 95th
percentile for each of the following n. Be mindful of the sign. [ 2 points each]
1. 𝑛 = 4 6. 𝑛 = 19 2. 𝑛 = 6
7. 𝑛 = 23
3. 𝑛 = 12 8. 𝑛 = 25
4. 𝑛 = 14 9. 𝑛 = 26
5. 𝑛 = 17 10. 𝑛 = 28
PRACTICE EXERCISE 3
A. Directions: For each situation, specify the corresponding t-value. Be mindful of the sign and
feel free to use the t-table. [ 3 points each]
1. 90th percentile, df =28
2. df = 13, α = 0.10
3. 2.5th percentile for n = 20
4. 75th percentile, n = 12
5. t-values that bound the middle 60% with df=12
B. Directions: Find the percentile given the following data. [ 3 points each]
1. df = 19, t-value = -2.093
2. n = 13, t-value = -0.259
3. df = 20, t-value = -2.845
4. n= 21, t-value = -1.725
5. n= 27, t-value = 0.684
70
IDENTIFYING THE LENGTH OF CONFIDENCE INTERVAL
Confidence Intervals are results and they are usually numbers i.e. you survey a group of dog
owners to see how many packs of dog food they purchase a year. Your test statistics at the 95%
confidence level and get a confidence interval of (200, 300). This means you think they buy
between 200 and 300 packs of dog food per year.
Definition: Confidence Interval
A confidence interval is how much uncertainty there is with any statistic. Confidence
intervals are often used with a margin of error. It tells you how confident you can be that the
results from a survey reflect what you would expect to find it were possible to survey the
entire population. Confidence intervals are intrinsically connected to confidence level.
General Formula of a Confidence Interval
The general formula of a confidence interval is
𝑃𝑜𝑖𝑛𝑡 𝐸𝑠𝑡𝑖𝑚𝑎𝑡𝑒 ± 𝑇𝑎𝑏𝑢𝑙𝑎𝑟 𝑉𝑎𝑙𝑢𝑒 × 𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑒𝑟𝑟𝑜𝑟 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑜𝑖𝑛𝑡 𝑒𝑠𝑡𝑖𝑚𝑎𝑡𝑒 and
the tabular value depends on the sampling distribution of the point estimate.
The Confidence Interval for an Unknown Population Mean and Known Standard Deviation
The confidence interval for an unknown population mean and known standard deviation
is defined mathematically as
𝜎
𝑿̅
or
𝜎 𝜎
𝑿̅
where, 𝑿̅ is the sample mean (point estimator),
𝑍𝛼 is the critical value of z with a confidence interval of (1 − 𝛼)%,
2
𝑿̅ 𝜎
, and the upper limit of the confidence interval is 𝑿̅ 𝜎
.
Steps in Identifying the Length of a Confidence Interval for an Unknown Population
Mean and Known Standard Deviation
71
Step 1. Identify the given variables:
Population standard deviation, 𝜎
Sample mean, 𝑋
Sample size, 𝑛
Step 2. Determine the confidence level and appropriate 𝑍𝛼
2
Values of z of Importance
Z Confidence Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
ii. Compute for the Upper limit
𝜎
𝑈𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
Step 4. Construct and interpret the confidence interval.
Illustrative Example1.
A survey studying the income of individuals in a certain municipality was conducted.
A population standard deviation of Php 1000 and a mean income of Php 15, 000 was
gathered from 49 individuals.
a. Identify and interpret a 90% confidence interval for the true mean income of the
individuals in a certain municipality.
Solution:
72
(a) Construct and interpret a 90% confidence interval for the true mean income
of the individuals in a certain municipality.
Step 1. Identify the given variables:
Population standard deviation, 𝜎 = 1,000
Sample mean, 𝑋 = 15,000
Sample size, 𝑛 = 49
Step 2. Identify the 𝑍𝛼 with 90% confidence level
2
Values of z of Importance
𝑍𝛼 Confidence
2 Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
ii. Compute for the Upper limit
𝜎
73
𝑈𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
Illustrative Example 2.
The following were observed weights (in kilograms) of a random sample of 20
learners. Assuming that the population standard deviation of the weights of all learners in the
class is 9 kg.
40 45 46 48 48 50 55 55 56 58
50 59 60 60 62 62 64 64 65 66
a. Identify and interpret a 99% confidence interval for the true mean weight of the
learners.
b. Identify and interpret a 95% confidence interval for the true mean weight of the
learners.
Solution:
(a) Construct and interpret a 99% confidence interval for the true mean weight of the
learners.
Step 1. Identify the given variables:
Population standard deviation, 𝜎 = 9
Sample size, 𝑛 = 20 Sample mean,
𝑋 =?
We must solve for the sample mean since it is not found in the given variables.
Using the sample mean formula of ungrouped data:
𝑛𝑖 𝑥𝑖
74
Sample mean, 𝑋 = 56.05
Step 2. Identify the 𝑍𝛼 with 99% confidence level
2
Values of z of Importance
𝑍𝛼 Confidence
2 Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
ii. Compute for the Upper limit
𝜎
75
𝑈𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
Therefore, 99% CI is (50.87, 61.23) 𝑜𝑟 50.87 < 𝜇 < 61.23. Hence, we are
99% confident that the true mean weight of the learners is between 50.87 kg and 61.23
kg.
The Confidence Interval for an Unknown Population Mean and Unknown Standard Deviation
The confidence interval for an unknown population mean and unknown standard deviation
is defined mathematically as
𝑠
𝑿̅ (𝑡
or
𝑠 𝑠
𝑿̅ (𝑡
The lower limit of the confidence interval is 𝑿̅ − 𝑡( 𝛼 , and the upper limit of
2,𝑛−1)
76
∑ =1
Sample mean, 𝑋 = 𝑛
𝑖
𝑋𝑖
𝑛
𝑛 2
∑ (𝑋 𝑖 −𝑋)
= √ 𝑖=1
Standard deviation, 𝑛−1 𝑠
Step 2. Determine the confidence level and appropriate 𝑡( 𝛼
2,𝑛−1)
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 (𝑡
ii. Compute for the Upper limit
𝑠
𝑈𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 (𝑡
Step 4. Construct and interpret the confidence interval.
Illustrative Example 3.
Eleven regions in the Philippines are randomly sampled. In each region, rainfall was
monitored for one year, and the following total yearly rainfalls in centimeters were recorded:
145, 185, 209, 213, 241, 255, 255, 262, 269, 276, 297. Assume that yearly rainfalls within the
region are normally distributed. Identify a confidence interval for the true mean rainfall in the
Philippines with 99% confidence level.
Solution:
Step 1. Identify the given variables:
Sample size, 𝑛 = 11
Sample mean, 𝑥̅ = ∑𝑛𝑖=1 𝑋𝑖
𝑛
∑𝑛𝑖=1 𝑋𝑖
𝑥̅ =
𝑛
𝑥̅
77
185 237 185 − 237 = −52 (−52)2 = 2704
209 237 209 − 237 = −28 (−28)2 = 784
213 237 213 − 237 = −24 (−24)2 = 576
241 237 241 − 237 = 4 (4)2 = 16
255 237 255 − 237 = 18 (18)2 = 324
255 237 255 − 237 = 18 (18)2 = 324
262 237 262 − 237 = 25 (25)2 = 625
269 237 269 − 237 = 32 (32)2 = 1024
276 237 276 − 237 = 39 (39)2 = 1521
297 237 297 − 237 = 60 (60)2 = 3600
𝑛 = 11 ∑𝑛𝑖=1(𝑋𝑖 −)2 =19, 962
∑ 𝑛𝑖=1( 𝑋𝑖 − 𝑥 ) 2
=√
𝑛−1
̅𝑠
𝑠
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
78
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
ii. Compute for the Upper limit
𝑠
𝑈𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
Step 4. Construct and interpret the confidence interval.
Therefore, 99% CI is (194.3098, 279.6902) or 194.3098 < 𝜇 < 279.6902.
Hence, we are 99% confident that the true mean rainfall in all regions in the Philippines is
between 194.3098 cm and 279.6902 cm.
Directions: Identify the length of the confidence interval for an unknown population mean is
and known standard deviation. Show complete solution in a separate sheet of paper.
EXERCISE 2
Directions: Identify the length of the confidence interval for an unknown population mean is
and unknown standard deviation. Show complete solution in a separate sheet of paper.
EXERCISE 3
Directions: Solve the problem. Follow the given steps in finding the length of the confidence
interval. Show complete solution in a separate sheet of paper.
79
breaking strength of the new material is greater than 155. It is known that the
breaking strengths of the new plastic material follow the normal distribution and have
a standard deviation of 10 psi (pounds per square inch). Six samples of the new
plastic materials were randomly selected, and their breaking strengths were
determined. The data obtained were 156, 154, 168, 157, 160 and 158.
a. Identify and interpret a 98% confidence interval for the true mean breaking strength
of the new plastic material.
b. Identify and interpret a 95% confidence interval for the true mean breaking strength
of the new plastic material.
2. The nickel metal Hydride (Nimh) battery is one of the highly advertised rechargeable
batteries today. It is lighter and can last up to 2 to 4 times longer than alkaline or
standard Nickel-Cadmium (NiCd) batteries. To evaluate its performance, a random
sample of 10 Nimh batteries was taken. The number of photos taken using each
battery in a digital camera is given as follows: 405, 564, 342, 456, 435, 543, 473, 452,
462, and 475. Identify the length of a 95% confidence interval for the true mean
number of photos taken using the Nimh battery and interpret.
for the confidence interval of unknown population mean and known standard
deviation or 𝐿 𝜎
for the confidence interval of unknown population mean and
unknown standard deviation.
Steps in Computing for the Length of a Confidence Interval
Step 1. Identify the given variables:
i. Upper limit and Lower limit
ii. standard deviation, 𝜎/𝑠
Sample mean, 𝑋̅
Sample size, 𝑛
80
Tabular value
Step 2. Compute for the length of the confidence interval, L.
Use the formula 𝑳 = 𝑼𝒑𝒑𝒆𝒓 𝑳𝒊𝒎𝒊𝒕 − 𝑳𝒐𝒘𝒆𝒓 𝑳𝒊𝒎𝒊𝒕 if the lower limit and the upper
𝝈
limit are given in the problem. While use 𝑳 = 𝟐(𝒁𝜶⁄𝟐 × ) or 𝑳 = 𝟐 (𝒕(𝜶⁄𝟐,𝒏 × 𝝈 ) if
both the lower limit and upper limit are not given.
Step 3. Interpret the result.
Illustrative Example1.
A survey studying the income of individuals in a certain municipality was conducted.
a. If 90% confidence interval for the true mean income of the individuals in a certain
municipality is (14,765, 15,235) or 14, 765 < 𝜇 < 15, 235, compute for the
length of the confidence interval.
b. If 95% confidence interval for the true mean income of the individuals in a certain
municipality is (14,720, 15,280) 𝑜𝑟 14, 720 < 𝜇 < 15, 280, compute for the
length of the confidence interval.
c. How was the length of the confidence interval affected by the decrease in the
confidence coefficient holding the other factors constant?
Solution:
a) If 90% confidence interval for the true mean income of the individuals in a
certain municipality is (𝟏𝟒, 𝟕𝟔𝟓, 𝟏𝟓, 𝟐𝟑𝟓) or 𝟏𝟒, 𝟕𝟔𝟓 < 𝝁 < 𝟏𝟓, 𝟐𝟑𝟓, compute
for the length of the confidence interval.
Step 1. Identify the given variables:
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 = 14, 765
𝑈𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 = 15, 235
Step 2. Compute for the length of the confidence interval, L.
Since the upper limit and the lower limit is given, we will use the formula:
𝑳 = 𝑼𝒑𝒑𝒆𝒓 𝑳𝒊𝒎𝒊𝒕 − 𝑳𝒐𝒘𝒆𝒓 𝑳𝒊𝒎𝒊𝒕
𝐿 = 15, 235 − 14, 765
𝐿 = 470
Step 3. Interpret the result.
Hence, the length of 90% confidence interval between Php 14,765 and Php 15,235 is 470.
Solution:
b) If 95% confidence interval for the true mean income of the individuals in a
certain municipality is (𝟏𝟒, 𝟕𝟐𝟎, 𝟏𝟓, 𝟐𝟖𝟎) 𝒐𝒓 𝟏𝟒, 𝟕𝟐𝟎 < 𝝁 < 𝟏𝟓, 𝟐𝟖𝟎, compute
for the length of the confidence interval.
Step 1. Identify the given variables:
81
𝐿𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 = 14, 720
𝑈𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 = 15, 280
Step 2. Compute for the length of the confidence interval, L.
Since the upper limit and the lower limit is given, we will use the formula:
𝑳 = 𝑼𝒑𝒑𝒆𝒓 𝑳𝒊𝒎𝒊𝒕 − 𝑳𝒐𝒘𝒆𝒓 𝑳𝒊𝒎𝒊𝒕
𝐿 = 15, 280 − 14, 720
𝐿 = 560
Step 3. Interpret the result.
Hence, the length of 95% confidence interval between Php 14,720 and Php15,280 is 560.
Solution:
c) How was the length of the confidence interval affected by the decrease in the
confidence coefficient holding the other factors constant?
Holding other factors constant, as the confidence coefficient decreases, the length of
the interval becomes narrower and vice versa. In this case, a 90% confidence interval
with a length of 470 is narrower than the 95% confidence interval with a length of
560.
Illustrative Example 2.
The following were observed weights (in kilograms) of a random sample of 20
learners. Assuming that the population standard deviation of the weights of all learners in the
class is 9 kg.
40 45 46 48 48 50 55 55 56 58
50 59 60 60 62 62 64 64 65 66
a. Compute for the length of 99% confidence interval for the true mean weight of the
learners.
b. Compute for the length of 95% confidence interval for the true mean weight of the
learners.
Solution: In example 2, there is unknown population mean and known standard deviation, so
we will use the standard normal distribution.
(a) Compute for the length of 99% confidence interval for the true mean weight of the
learners.
Step 1. Identify the given variables:
Population standard deviation, 𝜎 = 9
Sample size, 𝑛 = 20 Sample mean,
𝑋̅ =?
We must solve for the sample mean since it is not found in the given variables.
Using the sample mean formula of ungrouped data:
82
∑𝑛𝑖=1 𝑥𝑖
̅
𝑋=
𝑛
𝑋̅
𝑋̅
Sample mean, 𝑋̅ = 56.05
Determine the appropriate Zα⁄2 at 99% confidence level
Values of z of Importance
𝑍𝛼⁄2 Confidence
Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
𝑳 = 𝟐 (𝒁𝜶⁄𝟐 ×
𝐿
𝐿 = 2(5.18)
𝐿 = 10.36
Step 3. Interpret the result.
Hence, the length of 99% confidence interval is 10.36.
Solution:
(b) Compute for the length of 95% confidence interval for the true mean weight of
the learners.
83
Step 1. Identify the given variables:
Population standard deviation, 𝜎 = 9
Sample size, 𝑛 = 20
Sample mean, 𝑋̅ = 56.05
Determine the appropriate Zα⁄2 (z-value) at 95% confidence level
Values of z of Importance
𝑍𝛼⁄2 Confidence
Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
𝑳 = 𝟐 (𝒁𝜶⁄𝟐 ×
𝐿
𝐿 = 2(3.94)
𝐿 = 7.88
Step 3. Interpret the result.
Hence, the length of 95% confidence interval is 7.88
Illustrative Example 3.
The heights of 10 Statistics and Probability students in inches are given below. Compute for
the length of the 98% confidence interval for the true mean height of all Statistics and
Probability students.
61.8 66.5 67.4 67.6 68.5 71.4 72.0 72.4 74.7 78.1
Solution: In example 3, there is unknown population mean and unknown standard deviation,
so we will use the student’s t distribution. Step 1. Identify the given variables:
Sample size, 𝑛 = 10 Sample
mean, 𝑋̅ =?
84
̅ = ∑𝑛𝑖=1 𝑥𝑖
𝑋
𝑛
𝑋̅
𝑋̅
Sample mean, 𝑋̅ = 56.05 Standard
deviation, 𝑠 =?
̅ )2
𝑛 ( 𝑋𝑖 − 𝑋
∑ 𝑖=1
=√
𝑠 𝑛−1
𝑋 𝑋̅ 𝑋 − 𝑋̅ (𝑋 − 𝑋̅)2
61.8 70.04 61.8 − 70.04 = −8.21 (−8.21)2 = 67.4041
66.5 70.04 66.5 − 70.04 = −3.4 (−3.4)2 = 11.56
67.4 70.04 67.4 − 70.04 = −2.64 (−2.64)2 = 6.9696
67.6 70.04 67.6 − 70.04 = −2.44 (−2.44)2 = 5.9536
68.5 70.04 68.5 − 70.04 = −1.54 (−1.54)2 = 2.3716
71.4 70.04 71.4 − 70.04 = 1.36 (1.36)2 = 1.8496
72.0 70.04 72.0 − 70.04 = 1.96 (1.96)2 = 3.8416
72.4 70.04 72.4 − 70.04 = 2.36 (2.36)2 = 5.5696
74.7 70.04 74.7 − 70.04 = 4.66 (4.66)2 = 21.7156
78.1 70.04 78.1 − 70.04 = 8.06 (8.06)2 = 64.9636
𝑛 = 10 𝑛
𝛼 = 1 − 𝐶𝐿 = 1 − 0.98 = 0.02
𝛼 0 .
⁄2 = = 0.01
𝑑𝑓 = 𝑛 − 1 = 10 − 1 = 9
85
Using the Student’s t Distribution table, 𝑡(0.01,9) = 2.821 Step
2. Compute for the length of the confidence interval, L.
Since the upper limit and the lower limit is not given, we will use the formula:
𝒔
𝑳 = 𝟐 (𝒕(𝜶⁄𝟐,𝒏 ×)
𝑠
𝐿
𝐿 = 2(4.12)
𝐿 = 8.24
Step 3. Interpret the result.
Hence, the length of 98% confidence interval is 8.24.
Learning Competency: Computes for the length of a confidence interval (M11/12SP-IIIj-2)
EXERCISE 1
Directions: Compute for the length of the confidence interval given its lower limit and upper
limit. Show complete solution in a separate sheet of paper.
1. 95% confidence interval: (0.887, 1.213)
2. 99% confidence interval: (46.3, 53.7)
3. 90% confidence interval: (201.83, 238.81)
4. 98% confidence interval: (3.97, 7.03)
5. 95% confidence interval: (1, 041.65, 1, 096.35)
6. 98% confidence interval: (109.44, 114.56)
7. 99% confidence interval: (9.03, 10.97)
8. 95% confidence interval: (44.73, 47.44)
9. 95% confidence interval: (12.303, 13.298)
10. 908% confidence interval: (25.594, 30.806)
EXERCISE 2
Directions: Compute for the length of the confidence interval given some of the following
variables. Show complete solution in a separate sheet of paper.
86
EXERCISE 3
Directions: Solve the problem. Follow the given steps in computing for the length of the
confidence interval. Show complete solution in a separate sheet of paper.
87
Population standard deviation, 𝜎
Length of the Confidence Interval
Step 2. Compute for the appropriate sample size
𝝈
𝒏 = (𝟐𝒁𝜶⁄𝟐 × )𝟐
𝑳
Illustrative Example1.
A hardware manufacturer produces bolts used to assemble various machines. Assume
that the diameter of bolts produced by this manufacturer has an unknown population mean
and the standard deviation is 0.1 mm. How many samples are needed for a 95% confidence
interval to be between 5.0823 mm and 5.1337 mm of the true mean diameter of all bolts?
Solution:
Step 1. Identify the given variables:
✓ Standard deviation, 𝜎 = 0.1
✓ Zα⁄2 with 95% confidence interval
Values of z of Importance
𝑍𝛼⁄2 Confidence
Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
𝒏 = (𝟐𝒁𝜶⁄𝟐 × 𝝈)𝟐
𝑳
88
𝑛
𝑛 = (7.63)2
𝑛 = 58.21 ≈ 59
Note: When determining the sample size, we always round up the resulting value to the
next whole number.
Hence, the appropriate sample size is 59 bolts.
Illustrative Example 2.
How large should a sample be if we want to estimate the true average weight of
learners enrolled in a school using a sample to be drawn using simple random sampling that
we are 95% confident of our estimate. Assume that the length of the confidence interval is
between 52.11 kg and 59.99 kg and that the population standard deviation of the weights of
all learners in the class is 9 kg.
Solution:
Step 1. Identify the given variables:
✓ Standard deviation, 𝜎 = 9
✓ Zα⁄2 with 95% confidence level
Values of z of Importance
𝑍𝛼⁄2 Confidence
Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
89
𝒏 = (𝟐𝒁𝜶⁄𝟐 × 𝝈 𝟐 )
𝑳
𝑛
𝑛 = (4.48)2
𝑛 = 20.07 ≈ 21
Note: When determining the sample size, we always round up the resulting value to the next
whole number.
Hence, the appropriate sample size is 21 learners.
Directions: Compute for the appropriate sample size. Show complete solution in a separate
sheet of paper.
1. Length of the 95% confidence interval, L is 0.33 and standard deviation of 0.25
2. Length of the 90% confidence interval L is 2.2 and standard deviation of 2.67
3. 95% confidence interval: (19.02, 20.98) and standard deviation of 3
4. 99% confidence interval: (1938.4, 2041.6) and standard deviation of 120
5. 90% confidence interval: (29.01, 30.98) and standard deviation of 3
EXERCISE 2
Directions: Solve for the problem. Follow the given steps in computing for an appropriate
sample size using the length of the confidence interval. Show complete solution in a separate
sheet of paper.
1. How many simple random of dog weights must be included in a study. If the
standard deviation of the population of all dog weights is 18 lbs. Assume that we
are 99% confident that the true mean weight is between 185.27 lbs. and 200.72
lbs.
2. How many random samples of students must be selected from a certain school if
they are given an IQ test which has a standard deviation of 11 and 98% confident
that the true mean intelligence is (109.44, 114.56).
EXERCISE 3
Directions: Analyze and solve the given problem below. Justify your answer.
The appropriate sample size derived from the margin of error, ME is a statistic expressing
the amount of random sampling error in the results of a survey. The Margin of
𝜎
Error is mathematically defined as 𝑀𝐸 . Thus, the sample size, 𝑛 is computed
as 𝒏 = (𝒁𝜶⁄𝟐 × 𝑴𝑬 𝝈 )𝟐.
𝒏 = (𝒁𝜶⁄𝟐 × 𝝈 )𝟐
𝑴𝑬
Step 3. Interpret the result.
Illustrative Example1.
What should be the sample size of the five-year old children to be included in a
feeding program if the mayor wants to estimate the mean weight (in kilograms) of the
children in a certain barangay? He wants to be 99% confident that the estimate of the sample
mean is accurate within 0.06 kg. Assuming that the standard deviation of the weights of the
target population is 0.5 kg.
Solution:
Step 1. Identify the given variables:
Population standard deviation, 𝜎 = 0.5
Margin of Error, 𝑀𝐸 = 0.06
Zα⁄2 with 99% confidence level
Values of z of Importance
91
𝑍𝛼⁄2 Confidence
Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
𝒏 = (𝒁𝜶⁄𝟐 × 𝝈 )𝟐
𝑴𝑬
𝑛
𝑛 = (21.467)2
𝑛 = 460.82 ≈ 461
Note: When determining the sample size, we always round up the resulting value to the next
whole number.
Step 3. Interpret the result.
Hence, a simple random sample of 461 five-year old children should be included in a feeding
program to be 99% confident that the mean weight is within 0.06 kg from the true mean.
Illustrative Example 2.
How large should a sample be if we want to estimate the true average weight of
learners enrolled in a school using a sample to be drawn using simple random sampling to be
within 2kg away from the true value and that we are 99% confident of our estimate. Assume
that the population standard deviation of the weight is 9kg.
Solution:
Step 1. Identify the given variables:
Population standard deviation, 𝜎 = 9
Margin of Error, 𝑀𝐸 = 2
Zα⁄2 with 99% confidence level
Values of z of Importance
𝑍𝛼⁄2 Confidence
92
Level
1.645 90%
1.960 95%
2.326 98%
2.576 99%
3.090 99.8%
3.291 99.9%
𝒏 = (𝒁𝜶⁄𝟐 × 𝝈 )𝟐
𝑴𝑬
𝑛
𝑛 = (11.592)2
𝑛 = 134.37 ≈ 134
Note: When determining the sample size, we always round up the resulting value to the next
whole number.
Step 3. Interpret the result.
Hence, we need 134 learners in estimating the true average weight of learners enrolled in a
school class under simple random sampling scheme with 99% confidence that it is within 2
kg of the true mean.
EXERCISE 1
1. A set of ducks is fed a certain type of grain for a period to estimate the average weight
gain. What sample size is needed if you want the estimate to be within 1.5 kg of the
correct 𝜇 with 95% confidence interval and a standard deviation of 7 kg?
2. Determine the sample size needed to estimate the average height of all grade 1
learners if we want to be within 1 inch with a confidence level of 90%. Assume we
know that the standard deviation of such heights is 3 inches.
93
EXERCISE 2
Directions: Solve the following problem. Follow the given steps in computing for the length
of the confidence interval. Show complete solution in a separate sheet of paper.
1. Textbook authors developed books that are appropriate to reading level of the target
audience. How many randomly selected words should we use to estimate average
text’s word length to be within 0.5 letters with 98% confidence. Assuming that the
population standard deviation is 2.7.
2. A local government official observes an increase in the number of individuals with
cardiovascular and obesity problems in his barangay. To improve the health
conditions of his constituents, he aims to promote an easy and cheap way to reduce
weight. It is known that obesity results in greater risk of having illnesses like diabetes
and heart problems. He encouraged his constituents to participate in his Dance for
Life project every weekend for 3 months. It is known that the weight loss of those
who have completed the dance program follows a normal distribution with variance of
3.24 𝑘𝑔2(the standard deviation is the square root of the given variance).To know if
the program is effective in reducing weight, how many participants must be sampled
in order to be 95% confident that the estimated mean weight loss of the participants
will be within 0.5 kg of the true mean?
EXERCISE 3
Directions: Analyze and solve the given problem below. Show your complete solution in a
separate sheet of paper.
1
1. Find the sample size needed to estimate the population mean to be within ⁄5 of the
standard deviation with 99% confidence.
94