SHS - stat&Prob.Q3.W1 5.52pgs
SHS - stat&Prob.Q3.W1 5.52pgs
SHS - stat&Prob.Q3.W1 5.52pgs
STATISTICS AND
PROBABILITY
Learning Activity Sheets
Quarter 3: Week 1-5
A Random Variable is a numerical quantity that is derived from the outcomes of random
experiments.
The random variable in the activity above is a discrete random variable because the
set of possible outcomes is countable. The possible values of random variable X = {0, 1, 2, 3}
Example 1
Suppose we toss a fair coin 3 times. The sample space for this chance process is :
{ HHH HHT HTH THH HTT THT TTH TTT}
Since there are 8 possible outcomes then probability of each outcome is 1/8.
2
Define the variable X = the number of heads obtained.
What are the outcomes of possible heads?
X=0 TTT
X=1 HTT , THT , TTH
X=2 HHT , HTH , THH
X=3 HHH
PROBABILITY DISTRIBUTION
Probability distribution of a random variable tells all of its possible values along with
their associated probabilities.
Example 1: The random variable X is the number of tails in two tosses of a coin. What are
the probabilities of the possible outcomes? What is the probability distribution of X?
Solution:
Possible outcomes: {HH HT TH TT}
Each has probability ¼ so the probability distribution is:
X=No. of Tails 0 1 2
P(x ) 1 2 1
4 4 4
In Graphical Form:
We can use the probability distribution to answer questions about variable x. In symbols, we
want to find P ( x ≥1). We could add probabilities to find the answer:
3
FINDING POSSIBLE VALUES OF A RANDOM VARIABLE
To understand how one finds the possible values of random variables, do the activity
below...
Suppose two cellphones are tested at random. We want to find out the number of
defective cellphones. Thus, to each outcome in the sample space we shall assign a value.
These are 0, 1, or 2. If there is no defective cellphone, we assign the number 0; if there is 1
defective cellphone, we assign the number 1; and 2, if there are two defective cellphones. The
number of defective cellphones is a random variable.
The possible values of this random variable are 0, 1, and 2. Read and analyze the
given situation below.
Illustration
Let D represent the defective cellphones and N for the non-defective cellphones. If we let Y
be the random variable representing the number of defective cellphones, show the values of
the random variable Y. The table below to show the values of the random variable.
Example 2. Suppose two coins are tossed, let Z be the random variable representing the
number of heads that occur. Find the values of the random variable Z.
1. Determine the sample space. Let H represent head and T for tail
Solution: The sample space for this experiment is: S = {TT, TH, HH, HT}
2. Count the number of heads in each outcome in the sample space and assign
this number to this outcome.
Possible Outcomes Value of the Random
variable Z (Number of
heads)
TT 0
TH 1
HH 2
HT 1
4
Illustrating a Probability Distribution for a Discrete Random Variable and its Properties
A random variable is a set whose elements are the numbers assigned to the outcomes
of an experiment. It is usually denoted by uppercase letters such as X, whose elements are
denoted by lower case letters x1, x2, x3 and so on.
1. The probability of each value of the random variable must be between or equal to
0 and 1. In symbol 0 < P(X) < 1.
2. The sum of all the probabilities of all values of the random variable must be equal
to 1. In symbol, we write it as ΣP(X) = 1
Probability Histogram
STEPS:
1 Determine the sample space. Let R represent the red ball and G represent the
green ball.
Solution : The sample space for this experiment is: S= { RR, RG, GR, GG}
2.Count the number of green balls in each outcome in the sample space and assign this
number to this outcome.
Possible Value of the Random
Outcomes Variable Y (No. Of
Green Balls)
RR 0
RG 1
GR 1
GG 2
5
3. Assign probability values P(Y) to each value of the random variable.
There are 4 possible outcomes, if no green ball occurs once, the probability that
we shall assign to the random variable 0 is ¼
If 1 green ball occurs two times, the probability that we shall assign to the
random variable 1 is 2/4 or ½.
If 2 green balls occur once, the probability that we shall assign to the random
variable 1 is ¼.
Number of Green Probability P(Y)
Balls Y
0 1/4
1 2/4 or ½
2 ¼
Histogram
a) In plotting the probabilities along the vertical axis, you can change fractions to
decimals (ex. ¼=0.25 and ½= 0.5)
0.6
0.5
0.4
P(Y)
0.3
0.2
0.1
0
0 1 2
No. of Green Balls (Y)
Example 2
Let X be a random variable giving the number of girls in a randomly selected three-
child family. Assuming that boys and girls are equally likely , construct the probability
distribution of X and its corresponding histogram.
6
a.
Possible Outcomes Value of the Random Variable X
(No. of Girls)
BBB 0
BBG 1
BGB 1
GBB 1
BGG 2
GBG 2
GGB 2
GGG 3
b. No. of Girls 0 1 2 3
Probability P(X) 1/8 or 0.125 3/8 or 0.375 3/8 or 0.375 1/8 or 0.125
0.2
0.1
0
0 1 2 3
7
GENERAL INSTRUCTION: Write your answer on a separate paper
Practice A
Classify whether the given experiment implies a discrete random variable or a
continuous random variable. Write D if discrete and C if continuous.
_____ 1. The temperature of a solution in the laboratory
_____ 2. Collecting data about the heights of students in senior high schools.
_____ 3. Recording the distance travelled by the bus
_____ 4. Surveying about the number of cases due to Covid -19 pandemic in Angeles City
_____ 5. Number of promoted students at the end of school year
Practice B.
Complete the given table by finding the possible values of random variable.
1. In a box are 2 balls - one white and one yellow. Two balls are picked one at a time with
replacement. Let X be the random variable representing the number of white balls.
Find the values of the random variable X.
Possible Outcomes Value of the random variable X
(Number of white balls)
2. Four coins are tossed. Let Y be the random variable representing the number of tails
that occur. Find the values of the random variable Y.
8
Practice C.
Determine whether the given values can serve as the values of a probability distribution.
1. P(1) = 0.05, P(2)=1.01, P(3)= 0.2
2. P(1) = 3/20 , P(2)= 7/20 , P(3)= ½
3 P(1) = 0.35 , P(3) =0.25 , P(5) 0.28 , P(7)= 0.12
X 1 5 7 8
3.
P(x) 1/4 1/8 1/4 1/8
X 1 3 5 7
4.
P(x) 0.35 0.25 0.22 0.12
References
Albert, J. R. G. (2008). Basic Statistics for the Tertiary Level (ed. Roberto Padua,
Welfredo Patungan, Nelia Marquez), published by Rex Bookstore.
Handbook of Statistics 1 (1st and 2nd Edition), Authored by the Faculty of the
Institute of Statistics, UP Los Baños, College Laguna 4031
R Pierce, Rod. (3 Feb 2020). "Random Variables". Math Is Fun. Retrieved 24 May 2020
from http://www.mathsisfun.com/data/random-variables.html
Malate, Jose S. Statistics and Probability for Senior High School. Vicarisg Publications
and Trading, Inc, 2017.
Lim, Yvette F., Nocon, R., Nocon, E., Ruivivar, L. Math for Engaged Learning Statistics
and Probability. Sibs Publishing House, Inc. 2016.
Statistics and Probability 11, Pivot 4A Learners Material (N/A) Pages 4-53
9
ANSWER KEY
Practice A Practice B Practice C Practice D
1. 1. Probability
1 C 1. not a Distribution
2. 1
2 C Possible Value of probability 3. 0 ≤ P(x) ≤ 1
3 C Outcomes the distribution 4. ∑ P(x) = 1 and
4 D Random 2. probability 0 ≤ P(x) ≤ 1
5 D Variable Y distribution 5. Sample
3. probability Random variable
(No. Of
Probability values
white distribution or P(x)
balls) 4. not a
probability
WW 2 distribution
YY 0 5. not a
probability
WY 1 distribution
YW 1
Possible value of the random
X = { 0,1,2}
2.
Prepared
JOANAH S. BAUTISTA
Secondary School Teacher I - FGNMHS
10
WEEK 2: COMPUTING PROBABILITY CORRESPONDING TO A GIVEN RANDOM
VARIABLE
Background Information
RANDOM VARIABLES
In short:
𝑋 = {0, 1}
Note:
We could choose Heads = 100 and
Tails= 150 or other values if we want! It
is our choice
So:
We have an experiment such as tossing coin.
We give values to each event.
The set of values is a Random Variable.
A Random Variable has a whole set of values, and it could take on any of those values,
randomly.
We use a capital letter, like X or Y, to avoid confusion with the Algebra type of variable.
PROBABILITY
We can show the probability of any one value using this style:
P(X= value) = probability of that value
11
Example:
How many heads do you get when we toss three coins?
X= “The number of the Heads” is the Random Variable
In this case, there could be 0 Heads (if all the coins land Tails
up), 1 Head, 2 Heads or 3 Heads.
But this time the outcomes are NOT all equally likely.
X= “Number of Heads”
12
Let’s count how often each value occurs, and work out the probabilities:
2 occurs just once, so P(X = 2)= 1/36
3 occurs twice, so P(X = 3)= 2/36 or 1/18
4 occurs three times, so P(X = 4)= 3/36 or 1/12
5 occurs four times, so P(X = 5)= 4/36 or 1/9
6 occurs five times, so P(X = 6)= 5/36
7 occurs six times, so P(X = 7)= 6/36 or 1/6
8 occurs five times, so P(X = 8)= 5/36
9 occurs 4 times, so P(X = 9)= 4/36 or 1/9
10 occurs thrice, so P(X = 10)= 3/36 or 1/12
11 occurs twice, so P(X= 11)= 2/36 or 1/18
12 occurs just once, so P(X = 12)= 1/36
Example
1) A card is taken at random from a full pack of 52 playing cards. What is the probability
that the card taken is:
(a) Red card?
(b) a queen?
(c) a red ace?
(d) the seven of hearts?
Solution
As each card is equally likely to be drawn from the pack there are 52 equally likely outcomes.
13
2) A packet of sweets contains 18 red sweets, 12 green sweets and 10 yellow sweets. A
sweet is taken at random from the packet. What is the probability that the sweet is:
(a) Red,
(b) Not green,
(c) Green or yellow?
Solution
The total number of sweets in the packet is 40, so there are 40 equally likely outcomes when
one is taken at random.
(a) There are 18 red sweets, so:
18 9
𝑝(𝑟𝑒𝑑) = =
40 20
(b) There are 28 sweets that are not green in the packet, so:
28 7
𝑝 (𝑛𝑜𝑡 𝑔𝑟𝑒𝑒𝑛) = =
40 10
(c) There are 22 sweets that are green or yellow in the packet, so:
22 11
𝑝 (𝑔𝑟𝑒𝑒𝑛 𝑜𝑟 𝑦𝑒𝑙𝑙𝑜𝑤) = =
40 20
3) You roll a fair dice 120 times. How many times would you expect to obtain:
(a) A6
(b) An even score
(c) A score of less than 5?
Solution:
1
(a) 𝑝 (6) =
6
1
𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 6𝑠 = 𝑥 120 = 20
6
3 1
(b) 𝑝(𝑒𝑣𝑒𝑛 𝑛𝑢𝑚𝑏𝑒𝑟) = =
6 2
1
𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑒𝑣𝑒𝑛 𝑠𝑐𝑜𝑟𝑒𝑠 = 𝑥 120 = 60
2
4 2
(c) 𝑝(𝑠𝑐𝑜𝑟𝑒 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 5) = =
6 3
2
𝑒𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑛𝑢𝑚𝑏𝑒𝑟𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠 𝑙𝑒𝑠𝑠 𝑡ℎ𝑎𝑛 5 = 𝑥 120 = 80
3
A RANGE OF VALUES
We could also calculate the probability that a Random Variable takes on a range of values.
Example:
You rolled two dice, what is the probability that the sum of the scores is 5, 6, 7, or 8?
In other words: What is 𝑃(5 ≤ 𝑥 ≤ 8)?
𝑃(5 ≤ 𝑥 ≤ 8)= P(X=5) + P(X=6) + P(X=7) + P(X=8)
= (4 + 5 + 6 + 5)/36
= 20/36
= 5/9
14
Example:
x 0 1 2 3 4
f(x) 1/5 1/5 1/5 1/5 1/5
What is the expected value?
Answer
1 1 1 1 1
µ = 𝐸(𝑋) = ∑ 𝑥𝑓(𝑥) = 0 ( ) + 1 ( ) + 2 ( ) + 3 ( ) + 4 ( )
5 5 5 5 5
For this example, the expected value was equal to a possible value of X. this may not
always be the case. For example, if we flip a fair coin 9 times, how many heads should we
expect? We will explain how to find this later but we should expect 4.5 heads. The expected
value in this case is not a valid number of heads.
Example
x 0 1 2 3 4
f(x) 1/5 1/5 1/5 1/5 1/5
15
Activity 2: Practice calculating and interpreting the mean and standard deviation of a
discrete random variable.
1. Organizers of a concert are limiting tickets sales to a maximum of 444 tickets per
customer. Let T be the number of tickets purchased by a random customer. Here is
the probability distribution of T:
T=# of tickets 1 2 3 4
P(T) 0.1 0.3 0.2 0.4
Calculate the expected value of T.
2. Choose the best interpretation of the mean (or expected value) that you found in the
previous question.
a. Most customers will purchase 3 tickets.
b. A group of 10 customers will purchase exactly 29 tickets.
c. If we look at a large number of customers, then each customer, on average, will
have purchased about 2.9 tickets.
Reference
https://online.stat.psu.edu/stat500/lesson/3/3.2/3.2.1
Answer Key
Activity 1
1. 64/125
2. 16/25
3. 2/5
4. 21/50
5. 38/75
Activity 2
1. 2.9 tickets
2. C
3. 1.09
Prepared by
16
WEEK 3: MEAN AND VARIANCE OF A DISCRETE RANDOM VARIABLE
Background Information
The mean or expected value of a probability distribution tells the value of a random
variable that we expect to get if the experiment is done repeatedly. In other words, if the
experiment or process is repeated long enough, there is a likelihood or chance that the
average of the outcomes will begin to approach the expected value or the mean value.
The standard deviation of a probability distribution indicates how far, on the average,
is an observed value of a random variable X from its mean. The smaller the standard deviation,
the more likely that an observed value of the random variable will be close to its mean.
Use these concepts to solve problems involving the mean and the variance of
probability distribution.
Example 1:
The following data show the probability distribution of the number of computers sold
daily in a computer shop during the past several months.
Questions:
a. To find the probability that fewer than three computers will be sold, use X = 0, 1, and 2.
𝑃(𝑋 < 3) = 𝑃(0) + 𝑃(1) + 𝑃(2)
= 0.10 + 0.20 + 0.45
𝑃(𝑋 < 3) = 0.75
17
b. To find the probability that, at most, two computers will be sold, use X = 0, 1, and 2.
c. To find the probability that, at least, four computers will be sold, use X = 4 and 5.
𝑃(𝑋 ≥ 4) = 𝑃(4) + 𝑃(5)
= 0.05 + 0.05
𝑃(𝑋 ≥ 4) = 0.10
d. To find the probability that exactly three computers will be sold, use X = 3.
𝑃(𝑋 = 3) = 0.15
𝜇 = ∑[𝑋 𝑃(𝑋)]
𝜇 = 2.00
The mean of the probability distribution is 2. This implies that in the long run,
we expect that the average number of computers that will be sold on a given day will
be two.
X P(X) X2 X2 P(X)
0 0.10 0 0.00
1 0.20 1 0.20
2 0.45 4 1.80
3 0.15 9 1.35
4 0.05 16 0.80
5 0.05 25 1.25
18
𝜎 = √[𝑋2 𝑃(𝑋)] − 𝜇2
= √5.40 − 4
= √1.40
𝜎 = 1.18
The standard deviation of the probability distribution is 1.18. This implies that
on the average, the number of computers sold is 1.18 units from the mean 2.
Example 4:
In a convenient store, the number of tellers (X) busy with customers at 12:00 noon
varies from day to day. Past records indicate that the probability distribution of X is as
follows:
Questions:
1. What is the probability that exactly four tellers are busy at 12:00 noon?
2. What is the probability that, at least, two tellers are busy at 12:00 noon?
3. What is the probability that, fewer than five tellers are busy at 12:00 noon?
4. What is the probability that at least two, but fewer than five tellers are busy at
12:00 noon?
5. What is the mean of the probability distribution?
6. What is the standard deviation of the probability distributions?
Solution:
1. To find the probability that exactly four tellers are busy, use X = 4.
𝑃(𝑋 = 4) = 0.212
2. To find the probability that at least two tellers are busy, use X = 2, 3, 4, 5, and 6.
𝑃(𝑋 ≥ 2) = 𝑃(2) + 𝑃(3) + 𝑃(4) + 𝑃(5) + 𝑃(6)
= 0.078 + 0.155 + 0.212 + 0.262 + 0.215
𝑃(𝑋 ≥ 2) = 0.922
19
3. To find the probability that fewer than five tellers are busy,
use X = 0, 1, 2, 3, and 4.
𝑃(𝑋 < 5) = 𝑃(0) + 𝑃(1) + 𝑃(2) + 𝑃(3) + 𝑃(4)
4. To find the probability that at least two, but fewer than five tellers are busy,
use X = 2, 3, and 4.
𝑃(2 ≤ 𝑋 < 5) = 𝑃(2) + 𝑃(3) + 𝑃(4)
= 0.078 + 0.155 + 0.212
𝑃(2 ≤ 𝑋 < 5) = 0.445
X P(X) X P(X)
0 0.029 0.000
1 0.049 0.049
2 0.078 0.156
3 0.155 0.465
4 0.212 0.848
5 0.262 1.310
6 0.215 1.290
𝜇 = ∑[𝑋 𝑃(𝑋)]
= 0.00 + 0.49 + 0.156 + 0.465 + 0.848 + 1.310 + 1.290
𝜇 = 4.118
The mean of the probability distribution is 4.118. This implies that in the long
run, we expect that the average number of tellers who are busy at 12:00 noon
will be 4.118 or 4.
X P(X) X2 X2 P(X)
0 0.029 0 0.000
1 0.049 1 0.049
2 0.078 4 0.312
3 0.155 9 1.395
4 0.212 16 3.392
5 0.262 25 6.550
6 0.215 36 7.740
20
𝜎 = √[𝑋2 𝑃(𝑋)] − 𝜇2
= √19.438 − 16.958
= √2.48
𝜎 = 1.57
For example, your grades in Mathematics is one of the scores in the distribution, you
can predict the location of that score in the distribution and interpret it with regards to the mean
and standard deviation.
21
4. The width of the curve is determined by the standard deviation of the distribution.
5. The tails of the curve are plotted in both directions and flatten out indefinitely along the
horizontal axis. The tails are thus asymptotic to the baseline. A portion of the graph
that is asymptotic to a reference axis or another graph is called an asymptote, always
approaching another but never touching it.
6. The total area under a normal curve is 1. This means that the normal curve represents
the probability, or the proportion, or the percentage associated with specific sets of
measurement values.
To visualize and understand the properties/characteristics of a normal curve, please
refer to figure as shown below.
The shape of a normal curve is based on the two given parameters, the mean and the
standard deviation of the distribution. When comparing two distributions each described by
the normal curve, the following are the three situations based on the said parameters
a. When the means are not equal, but the standard deviations are equal. (µ1 ≠ µ2 ; ơ1 =ơ2 ),
the curves have a similar shape but centered at different points, as shown in the figure.
22
b. When the means are equal, but the standard deviations are not equal. (µ1 = µ2 ; ơ1 ≠ ơ2
), the curves are centered at the same point but they have different height and spreads. as
shown in the figure.
c. When the means are different and the standard deviations are also different (µ1≠ µ2 ; ơ1 ≠
ơ2 ), the curves are centered at different points and vary in shapes, as shown in the figure.
Every normal curve (regardless of its mean or standard deviation) conforms to the
following "empirical rule"
The empirical rule is better known as 68% - 95% - 99.70% rule. This rule states that
the data in the distribution lies within one (1), two (2), and three (3) of the standard deviation
from the mean are approximately 68%, 95%, and 99.70%, respectively. Since the area of a
normal curve is equal to 1 or 100% as stated on its characteristics, there are only a few data
which is 0.30% falls outside the 3-standard deviation from the mean.
23
Example 1:
The scores of the Senior High School students in their Statistics and Probability quarterly
examination are normally distributed with a mean of 35 and a standard deviation of 5.
Questions:
a. What percent of the scores are between 30 to 40?
b. What scores fall within 95% of the distribution?
Solution:
Draw a standard normal curve and plot the mean at the center. Then, add the standard
deviation to the mean once and mark it to the right of the mean. Add twice the standard
deviation to the mean and put it to the right of the first sum. Then, add thrice the standard
deviation to the mean and mark it to the right of the second sum. Do the same to the left.
This time, instead of adding, subtract the standard deviation from the mean.
Answer:
a. The scores 30 to 40 falls within the first standard deviation from the mean.
Therefore, the scores that fall between 30 and 40 is approximately 68% of the
distribution.
b. Since 95% of the distribution lies within 2 standard deviations from the mean, then
the scores corresponding to this area of the distribution are scores from 25 up to
45.
Example 2:
The district nurse of “Ospital ning Angeles” needs to measure the BMI (Body Mass
Index) of the Alternative Learning System students. She found out that the heights of male
students are normally distributed with a mean of 160 cm and a standard deviation of 7 cm.
Find the percentage of male students whose height is within 153 cm to 174 cm.
24
The mean of this problem is 160, it is subtracted by 7 to the left (160 -7 = 153; 153-7
= 146; 146 – 7 = 139) and add 7 to the right. (e.g. 160 + 7 = 167; 167 + 7 = 174 + 7 = 181).
153 cm falls at 1 standard deviation from the mean to the left and the height of 174 cm
falls at 2 standard deviations from the mean to the right.
Therefore, it covers the whole 68% and 13.5%. of the distribution and the sum of it is 81.5%
ANSWER: 81.5% of the male students have a height between 153 cm to 174 cm.
The probabilities that a market inspector will discover violations of the public health code in a
public market are given in the following table.
a. What is the probability that a market inspector will discover, at least, two violations
of the public health code?
b. What is the probability that a market inspector will discover, at most, three
violations of the public health code?
25
c. What is the probability that a market inspector will discover at least two, but fewer
than four violations of the public health code?
d. What is the mean of the probability distribution?
e. What are the variance and the standard deviation of the probability distribution?
The following are the number of cars sold per day at ABC Motor Corporation during
the past 500 working days.
26
Activity 3: True or False. Write the word TRUE if the statement is correct and FALSE if the
statement is incorrect
Activity 4
Activity 5: Read and analyze the following problem to answer the question.
Our government gave financial support and food, but it is not enough for the people.
Fortunately, we are blessed because the spirit of love and generosity is still present in the
heart of our fellow countrymen. They donated money, food, time, and other essential things.
Suppose you and your friends conducted a donation-drive in your barangay. Each of
the donors gave a certain amount with an average of Php 320.00 and a standard deviation of
Php 60.00. Assuming, it is normally distributed,
1. What percentage of the distribution that the amount falls between Php. 260.00 and
Php 380.00?
2. How much of the donations fall in 95% of the distributions?
3. How much of the donations fall in 99.70% of the distributions?
27
References
Belecina, Rene R., Elisa S. Baccay, and Efren B. Mateo. 2016. Statistics and Probability, 1st
ed. Manila: Rex Bookstore Inc.
Answer Key
Activity 1 Activity 2 Activity 3
= 1.456 f. µ = 3.056
g. 2 = 12.352
= 3.515
Activity 4 Activity 5
1. 68%
2. 200 to 440
3. 140 to 500
Prepared by:
ERVIN A. PEREZ
Malabanias Integrated School
28 SHS Department
WEEK 4: IDENTIFYING REGIONS UNDER NORMAL CURVE CORRESPONDS TO
DIFFERENT STANDARD NORMAL VALUES
Background Information
We have learned that the total area under the curve is equal to 1. Since the standard
normal distribution is defined as normally distributed random variable having a mean of zero (
µ= 0 ) and standard deviation of one (ơ = 1), the fifty (50) percent of the region under the curve
is below the mean and the other 50% is above the mean. Figure 1, as shown below illustrate
the standard normal curve.
You can easily identify the area of the regions under the normal curve by using the
Table of Areas under the Normal Curve which is also known as z-Table (see attachment at
the last page of this module). This table gives the area of any value of z from -3.99 to 3.99.
The value from this table will describe the area of the specific region of the curve to the left of
the given z-value.
Solution: First, split the given z-value into hundredths. We can find the whole
number and the tenths digit (-1.6) at the left side of the table while the hundredths (0.09)
located at the uppermost of the table. Then the intersection of these numbers will be the
area of the normal curve to the left of the z-value.
29
STEPS ON HOW TO FIND THE AREA THAT CORRESPONDS TO Z –VALUE
1. Draw/sketch a normal curve and locate the given z-value on the normal curve
2. Shade the region of the curve according to the condition of z-value whether it is below,
above, or between.
3. Use the table of the area under the normal curve to find the corresponding area.
4.1. When the z-value is to the left or any related terms (e.g. below, less than) just write the
value we obtained in step 3
4.2. When the z-value is to the right or any related terms (e.g. above, greater than), subtract
1 by the obtained value in step 3
4.3. When the shaded region is in between of the two z-value, subtract the biggest by the
smallest value obtained in step 3
Solution:
Step 2: Shade the region of the curve according to the condition of z-value whether it is
below, above, or between.
30
Step 3. Use the table of the area under the normal curve to find the corresponding area
4.1 When the z-value is to the left or any related terms (e.g. below, less than) just write
the value we obtained in step 3 Since the shaded region of the curve is to the left of the
z= -1.35 and the intersection between -1.3 and 0.05 is 0.0885. Therefore, the area of
the shaded region is 0.0885
31
0.9115
0.9115
0.9115
32
CONVERTING A NORMAL RANDOM VARIABLE TO A STANDARD NORMAL
VARIABLE AND VICE-VERSA
Suppose your score on a test in Probability and Statistics is 39 and the scores are
normally distributed with a mean of 33 and standard deviation of 3, then your score is exactly
2 standard deviations above the mean. If you scored 30, then it is exactly 1 standard deviation
below the mean. All values that are above the mean have positive z-scores and all values that
are below the mean have negative z-scores. If you obtained a z-score of -3, this means that
your score is 3 standard deviations below the mean, that is, 33 – 3(3) = 33 – 9 = 24.
Given any value x from a normal distribution with mean μ and standard deviation σ, to
convert x to a z-score (standard normal score), you need to;
33
Example 1.
Suppose IQ scores are normally distributed with a mean of 100 and standard
deviation of 10. If your IQ is 85, what is your z-score? (Round off your answer to the nearest
hundredths)
Solution:
85 − 100 15
𝑧 = = − = −𝟏. 𝟓𝟎
10 10
Example 2.
On a nationwide placement test that is normally distributed, the mean was 125 and
standard deviation was 15. If you scored 149, what was your z-score? (Round off your
answer to the nearest hundredths)
Solution:
149 − 125
𝑧 =
15
24
= = 𝟏. 𝟔𝟎
15
Example 3.
The heights of teachers in Sta. Catalina National High School are normally
distributed with a mean of 150 cm and standard deviation of 15 cm. The height of Sir Victor
has a z-score of 3.25. What is the actual height of Sir Victor? (Round off your answer to the
nearest hundredths)
Solution:
In this example, z-score is given while the normal random variable is unknown. To
solve for the normal random variable x, multiply the z-score (z) by the standard deviation σ,
then add the mean μ. Therefore;
𝑥 = zσ + 𝜇
𝑥 = 48.75 cm + 150 𝑐𝑚
𝒙 = 𝟏𝟗𝟖. 𝟕𝟓 𝐜𝐦
34
Example 4.
The time it takes for a cell to divide is normally distributed with an average of 60
minutes and standard deviation of 5 minutes. How long will it take for a given cell to divide if
its “mitosis” has a z-score of -1.35?
Solution:
𝑥 = zσ + 𝜇
𝑥 = −1.35(5 min) + 60𝑚𝑖𝑛
𝑥 = −6.75 min + 60 𝑚𝑖𝑛
𝑥 = 𝟓𝟑. 𝟐𝟓 𝐦𝐢𝐧𝐮𝐭𝐞𝐬
Example 5.
Suppose you have the population values 40 and 70 and that their corresponding z –
scores are -1 and 2, respectively.
1. Is it possible to determine the population’s mean and standard deviation?
2. If so, what are these values?
Solution:
To begin, let us first identify the given. Let
𝑥1 = 40 𝑥2 = 70
𝑧1 = −2 𝑧2 = 1
Use these given and substitute to the formula
𝑥1 = 𝑧1 σ + 𝜇 𝑥2 = 𝑧2 σ + 𝜇
40 = −2σ + 𝜇 70 = 1σ + 𝜇
Note: In this example, we do not have 𝜎1 and 𝜎2 , and 𝜇1 and 𝜇2 since we are
describing a single population. Thus, 𝜎1 = 𝜎2 = 𝜎 and 𝜇1 = 𝜇 2 = 𝜇. Furthermore, we need to
review our algebra specifically on systems of linear equation. So,
40 = −2σ + 𝜇 equation 1
70 = 1σ + 𝜇 equation 2
Subtract equation 1 from equation 2, we have
40 = −2σ + 𝜇 equation 1
70 = 1σ + 𝜇 equation 2
30 = 1σ − (−2 σ)
30 =3 σ the difference
30 3𝜎
=
3 3
10= σ
The standard deviation of the population is 10. To obtain the mean, substitute it to either
equation 1 or 2.
40 = −2σ + 𝜇 ⟹ 40 = −2(10) + 𝜇 ⟹ 60 = 𝜇
70 = 1σ + 𝜇 ⟹ 70 = 1(10) + 𝜇 ⟹ 60 = 𝜇
35
COMPUTING PROBABILITIES AND PERCENTILES USING THE STANDARD NORMAL
TABLE
Suppose that we know the probability this time. What is the corresponding z-score?
For example, where is 95% under the normal curve?
There are three important things to remember when we are given probabilities and
we want to know their corresponding z-scores.
Thus, when we are given the area and we wish to find the corresponding z-value, we locate
the given area at the body of the table. If the exact area is not available, we take the nearest
area. Then, we look up the corresponding z-value in the Table of Areas Under the Normal
Curve or z- Table.
A normal distribution curve can be used as a probability distribution curve for normally
distributed variables. The area under the standard normal distribution curve can also be
thought of as a probability. That is, if it's possible to select any 𝑧 value at random, the
probability of choosing one, say, below 1.45 would be the same as the area under the curve
at the left of 1.45. In this case, the area is 0.9265. Therefore, the probability of randomly
selecting a 𝑧 value below of 1.45 is 0.9265 or 92.65%. The problems involving probabilities
and percentiles are solved in the same manner as finding the areas under a normal curve.
36
(f) 𝑃(𝑋 > 𝑥) – probability at the right of a normal random variable 𝑥
(g) 𝑃(𝑎 < 𝑋 < 𝑏) – the probability of a normal random variable 𝑋 that is in between two
other normal random variables, say 𝑎 and 𝑏.
(h) 𝑃(𝑋 < 𝑎 ) ∪ 𝑃(𝑋 > 𝑏 ) – the probability of 𝑋 is in the opposite direction of two
values,
say 𝑎 and 𝑏
Example 1.
Find the probabilities for each of the following.
(a) 𝑃(𝑍 < 1.32)
(b) 𝑃(𝑍 < −1.05)
(c) 𝑃(−0.75 < 𝑍 < 1.56)
(d) 𝑃(𝑍 > −0.88)
Solution:
(a) The probability 𝑃(𝑍 < 1.32) means to find
the area under the normal curve at the left of
𝑧 = 1.32. From the table, find the intersection
on row 1.3 and column 0.02. Thus, the
probability is 𝟎. 𝟗𝟎𝟔𝟔 or 𝟗𝟎. 𝟔𝟔%.
Example 2.
Let 𝑋 be a normal random variable with mean 𝜇 = 15 and standard deviation 𝜎 = 3. Find the
probabilities of the following:
(a) (𝑋 < 19)
(b) (10 < 𝑋 < 19)
37
Solution:
(a) To find the probability, convert the normal random variable 19 to a z-score.
𝑥−𝜇 19−15
Thus, (𝑋 < 19) = 𝑃 (𝑍 < ) = 𝑃 (𝑍 < )
𝜎 3
= (𝑍 < 4 3 ) = 𝑃(𝑍 < 1.33).
From the table, the probability at the left of 𝒛 = 𝟏. 𝟑𝟑 is equal to 𝟎. 𝟗𝟎𝟖𝟐 𝒐𝒓 𝟗𝟎. 𝟖𝟐%.
(b) To find the probability, convert the normal random variables 10 and 19 to z values.
10−15 19−15
Thus, 𝑃(10 < 𝑋 < 19) = 𝑃 ( < 𝑍 < )
3 3
= (−1.67 < 𝑍 < 1.33)
= (𝑍 < 1.33) – (𝑍 < −1.67).
From the table, we have 0.9082 – 0.0475 = 𝟎. 𝟖𝟔𝟎𝟕 𝒐𝒓 𝟖𝟔. 𝟎𝟕%. These probabilities can also
be applied to real-life problems.
Identify regions under the normal curve corresponding to different standard normal
values. (M11/12SP-IIIc-3)
Convert a normal random variable to a standard normal variable and vice versa
(M11/12SP-IIIc-4)
Compute probabilities and percentiles using the standard normal table. (M11/12SP-IIIc-d-1)
Activity 1.
Identify the area under the normal curve of the following standard normal values
using the z table (refer to the last page of this module).
1. 𝑧= −3.25 _____________ 6. 𝑧 = −1.33 _________
2. 𝑧= 1.25 _____________ 7. 𝑧 = −2.5 _________
3. 𝑧= 0.56 _____________ 8. 𝑧 = 0.29 _________
4. 𝑧= 3 _____________ 9. 𝑧 = −0.13 _________
5. 𝑧= 1.96 _____________ 10. 𝑧 = −1.96 _________
Activity 2.
Find the area or region that corresponds to each of the following z values. Sketch
and label each curve.
1. to the right of z = 2.18
2. below z= -1.40
3. above z= 0.57
4. between z = -2.34 and z = 1.57
5. between z = 0.32 and z = 2.42
38
Activity 3.
Supply the needed information by solving the missing value.
1. 𝑥 = 25, 𝜇 = 40, σ = 6, 𝐳 = ?
2. 𝑧 = −2.25, 𝜇 = 218, σ = 36, 𝐱 = ?
3. 𝑥 = 39, 𝜇 = 30, σ = 3, 𝐳 = ?
4. 𝑥 = 125, 𝜇 = 105, σ = 8, 𝐳 = ?
5. 𝑧 = 1.11, 𝜇 = 145, σ = 20, 𝐱 = ?
Activity 4.
Solve the following problems.
1. The mean number of hours a Filipino worker spends on the computer is 3.1 hours per
workday. Assume the standard deviation is 0.5 hour and is normally distributed, how long
does a worker spend on the computer if his z-score is 1.2?
2. Each month, a Filipino household generates an average of 28 pounds of newspaper for
garbage or recycling. Assume the standard deviation is 2 pounds. Determine the z-score of
a household that generates 22 pounds of newspaper.
Activity 5.
A. Use the Table of Standard Normal distribution to determine corresponding
probabilities of the following:
1. 𝑃(𝑍 < −1.45)
2. 𝑃(0.5 < 𝑍 < 2.33)
3. 𝑃(𝑍 > 1.78)
References
Belecina, Rene R., Elisa S. Baccay, and Efren B. Mateo. 2016. Statistics and Probability, 1st
ed. Manila: Rex Bookstore Inc.
39
Answer Key
Activity 1
1. 0.0006 6. 0.0918
2. 0.8944 7. 0.0062
3. 0.7123 8. 0.6141
4. 0.9987 9. 0.4483
5. 0.9750 10. 0.0250
Activity 2
0.0146 0.9322
2. below z= -1.40
0.3667
3. above z= 0.57
0.2843
40
STANDARD NORMAL TABLE: Table Values Represent AREA to the LEFT of Z-score
41
Prepared by:
JELENE V. BASILIO
Sto. Domingo Integrated School
SHS Department
42
WEEK 5 : RANDOM SAMPLING
Background Information
43
4. Cluster Random Sampling - is used when
you are unable to sample from the entire
population. You divide the sample into
clusters that approximately reflect the whole
population, and then choose your sample
from a random selection of these clusters.
Parameter
A parameter is a measure that is used to describe the population while statistic is a
measure that is used to describe the sample. To understand more, let us discuss the two
measures. Population mean (µ), population variance (𝜎2) and population standard deviation
(𝜎) are what we called parameters.
Population Mean
The mean is the sum of the data divided by the number of data. The mean is used to
describe where the set of data tends to concentrate at a certain point.
Below are grades in Statistics of Grade 11 students during the third quarter.
94 85 88 79 78
75 89 91 84 77
Let us compute the population mean, population variance, and population standard
deviation.
1 94
2 85
3 88
4 79
5 78
6 75
7 89
8 91
9 84
10 77
N = 10 ∑X = 840
44
ΣX
To compute the population mean (𝜇) we can use this equation 𝜇 = whereas ∑X is the
𝑁
summation of all values of grades (x) and N is the total number of population therefore,
840
𝜇= = 84 the computer population mean (𝜇) is 84
10
Variance and standard deviation determine how to spread or to scatter each data on
the set from the mean.
To compute for the population variance, we apply the formula:
∑(𝑋 − 𝜇)2
𝜎2 =
𝑁
To compute for the population standard deviation, we use the formula:
∑(𝑋 − 𝜇)2
𝜎= √
𝑁
Whereas, X is the given data, µ is the population mean 𝑁 is the number of elements in the
population.
Consider the data given above, to solve for the population variance and population standard
deviation, we have this table:
Student’s number Grade (X) 𝑋−𝜇 (𝑋 − 𝜇)2
1 94 94 – 84 = 10 102 = 100
2 85 85 – 84 = 1 12 = 1
3 88 88 – 84 = 4 42 = 16
4 79 79 – 84 = -5 -52 = 25
5 78 78 – 84 = – 6 -62 = 36
6 75 75 – 84 = – 9 -92 = 81
7 89 89 – 84 = 5 52 = 25
8 91 91 – 84 = 7 72 = 49
9 84 84 – 84 = 0 02 = 0
10 77 77 – 84 = 7 72 = 49
To solve for the population variance and population standard deviation we have to use these
equations:
Population Mean Population Variance Population Standard Deviation
ΣX ∑(𝑋 − 𝜇)2
𝜇= 𝜎2 = ∑(𝑋 − 𝜇)2
𝑁 𝑁 𝜎= √
𝑁
45
Solution:
ΣX 840
Population mean: 𝜇 = = = 84
𝑁 10
∑(𝑋− 𝜇)2 382
Population variance: 𝜎2 = = = 38.2
𝑁 10
∑(𝑋− 𝜇)2 382
Population Standard Deviation: 𝜎 = √ = √ = √38.2 ≈ 6.18
𝑁 10
STATISTIC
From the previous data of the population, suppose that we randomly select only 7 data
(numbers with check) out of the total 10 data in the population.
94 85 88 79 78
75 89 91 84 77
Compute the sample mean, sample variance, and sample standard deviation. Here is the
result:
SAMPLE MEAN
The sample mean is the average of all the data of the samples. The symbol 𝑥̅ ,
(read as “x bar”) is used to represent the sample mean. To compute for the sample mean 𝑥̅
ΣX
we can use this equation: 𝑥̅ =
𝑛
Where: 𝑥̅ is the sample mean
∑x is the summation of x (sum of the measures)
𝑛 is number of elements in the sample
46
To get the sample mean:
ΣX 602
We substitute to the formula 𝑥̅ = = = 86
𝑛 7
Our computed sample mean 𝑥̅ is 86.
Sample variance is the computed variance of the elements of the sample. s 2 is used to
represent sample variance.
Sample standard deviation is the computed standard deviation of the elements of the
sample. s is used to represent sample standard deviation.
1 94 94 94 – 86 = 8 82 = 64
2 85
3 88 88 88 – 86 = 2 22 =4
4 79 79 79 – 86 = – 7 -72 = 49
5 78
6 75
7 89 89 89 – 86 = 3 32 = 9
8 91 91 91 – 86 = 5 52 = 25
9 84 84 84 – 86 = – 2 -22 = 4
10 77 77 77 – 86 = – 9 -92 = 81
∑X = 840 ∑X = 602
∑(x − 𝑥̅)2 = 236
47
To solve for the sample variance and sample standard deviation we have to use these
equations:
Sample Mean Sample Variance Sample Standard Deviation
ΣX ∑(𝑥 − 𝑥̅ )2
𝑥̅ = 𝑠2 = ∑(𝑥 − 𝑥̅ )2
𝑛 𝑛−1 𝑠= √
𝑛−1
Solution:
ΣX 602
Sample mean: 𝑥̅ = = = 86
𝑛 7
∑(𝑥− 𝑥̅ )2 236 236
Sample Variance: 𝑠2 = = = ≈ 39.33
𝑛−1 7−1 6
∑(𝑥− 𝑥̅ )2 236
Sample Standard Deviation: 𝑠 = √ = √ = √39.33 = 6.27
𝑛−1 7−1
Remember that parameters are for population while statistics are for sample.
Sampling distribution of the sample means is a frequency distribution using the computed
sample mean from all the possible random samples of a particular sample size taken from
the given population.
48
3. Construct a frequency and probability distribution table of the sample means indicating
its number of occurrence or the frequency and probability.
Probability P(x)
Sample Means Frequency 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦
𝑁
1
2 1 = .10
10
1
2.33 1 = .10
10
2
2.67 2 = .20
10
2
3 2 = .20
10
2
3.33 2 = .20
10
1
3.67 1 = .10
10
1
4 1 = .10
10
49
6. Which of the following is random sampling?
a. convenience b. non-probability
c. purposive d. stratified
7. As a researcher, you ask the people of your barangay on how they spent their money
during the community quarantine. You decided to group them according to their
socio-economic status and then proportionately and randomly selecting samples
from these groups. Which type of sampling are you using?
a. convenience b. purposive
c. stratified d. systematic
8. As a researcher, you want to know the plans of graduating senior high school
students. You ask all by-passers to participate in your research. Which type of
sampling are you using?
a. convenience b. purposive
c. stratified d. systematic
9. You wish to examine the effects of community quarantine on your fellow students.
You decided to borrow a list from the guidance office and select every 10th student
on it. What sampling technique are you using?
a. convenience b. purposive
c. stratified d. systematic
10. You want to know the effects of the COVID-19 to the Filipino people and you will ask
the medical frontliners. Which type of sampling will you use?
a. convenience b. purposive
c. stratified d. systematic
Practice B
Choose the letter of the best answer. Encircle the letter of the correct answer.
For numbers 4-9, the following are scores of students in their quiz:
10, 8, 7, 20, 11, 8, 15, 10, 6, 14, 16, 20
4 – 5. What is the value of µ? (2 points)
a. 10.4 b. 11
c. 12.08 d. 14.4
50
8– 9. What is the value of the population variance? (2 points)
a. 18.7 b. 25
c. 20 d. 21.5
For numbers 10-15, the following are the seven scores of the students above that are
randomly selected: 8, 7, 20, 15, 10, 14, 16
Practice C
Choose the letter of the best answer. Encircle the letter of the correct answer.
1. What distribution pertains to the frequency distribution of the sample mean from all
the possible random samples of a particular sample size n taken from the given
population?
a. frequency b. normal
c. population d. sampling
2. Which of the following is NOT a step-in creating sampling distribution of the sample
mean?
a. Determine the number of sets of all b. Compute for the standard
possible random samples. deviation and variance of the
samples.
c. Construct a frequency distribution d. List all the possible random
table of the sample means and samples and solve for the sample
probability. mean of each set of samples
3. Which of the following is the mean of sample 6, 10, 21, 25, and 28?
a. 15.17 b. 18
c. 21.2 d. 22
For numbers 4-9, refer to the following set of data of a population {5, 6, 7, 8}
4. How many different samples of size n = 2 can be drawn from the population?
a. 6 b. 5
c. 4 d. 3
5. Which of the following sample mean appears most frequent in the sampling
distribution?
a. 5 b. 5.5
c. 6 d. 6.5
6. What is the lowest value of the sample mean in this sampling distribution?
a. 5.5 b. 6.5
c. 7 d. 8
51
7. What is the frequency of the sample mean 7?
a. 0 b. 1
c. 2 d. 3
8. What is the probability of the sample mean 6 in the sampling distribution?
a. 1/6 b. 2/3
c. 1/2 d. 2/5
For numbers 9-10, refer to the following population consists of the data {3,6,9,12,15}.
9. How many different samples of size n = 3 can be drawn from the population?
a. 6 b. 12
c. 8 d. 10
10. Which of the following sample mean appears most frequent in the sampling
distribution?
a. 6 b. 7
c. 8 d. 12
References:
Statistics and Probability 11, Pivot 4A Learners Material (N/A) Pages 172 - 206
https://www.scribbr.com/methodology/simple-random-sampling/
http://www.stat.yale.edu/Courses/199798/101/sample.htm#:~:text=Simple%20rando
m%20sampling%20is%20the,being%20included%20in%20the%20sample.
Answer Key:
1. b 1. d 1. d
2. b 2. c 2. b
3. c 3. d 3. b
4. d 4– 5. c 4. a
5. b 6-7. a 5. d
6. d 8-9. d 6. a
7. c 10-11. b 7. c
8. a 12-13. c 8. a
9. d 14-15. b 9. d
10. b 10. c
Prepared by:
Jeral S. Reyes
Teacher - II
52