Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

(3rd Month) MATH 112 - Statistics and Probability

Download as pdf or txt
Download as pdf or txt
You are on page 1of 65

STATISTICS &

PROBABILITY

JARISSA N. IMLANI
S.Y. 2021-2022

MIT-Senior High School


STATISTICS AND PROBABILITY Page 1
STATISTICS AND PROBABILITY
SELF-PACED LEARNING MODULE UNIT I

PRELIMINARIES

Module 1: Random Variables and Probability Distributions

Time Allotment: Week 1-3

Module Overview

In this lesson, the concept of a random variable is discussed. The notion of a statistical experiment
is defined as well a s random variables that relate to experiments. Finally, two types of random variables,
discrete and continuous, are described.

Learning Competencies:

At the end of this module, you are expected to:


1. Illustrates a random variable (discrete and continuous).
2. distinguishes between a discrete and a continuous random variable.
3. finds the possible values of a random variable.
4. illustrates a probability distribution for a discrete random variable and its
properties.
5. Computes probabilities corresponding to a given random variable.
6. Illustrates the mean and variance of a discrete random variable.
7. Calculate the mean and the variance of a discrete random variable.
8. Interprets the mean and the variance of a discrete random variable.
9. Solves problems involving mean and variance of probability distribution.

Instructional Materials:

 Oliva M. (2016). DIWA Senior High School Series: STATISTICS & PROBABILITY.
DIWA Learning Systems, Inc.
 1st edition REX BOOK Statistics and Probability Rene R. Belecina, Elisa S. Baccay,
Efren B. Mateo

Pre-Assessment:

Directions: Choose the letter that corresponds to the correct answer. Write only the letter
on the space before each number.

______1. Which of the following is not a true statement?


a. The value of a random variable could be zero.
b. Random variables can only have one value.
c. The probability of the value of a random variable could be zero.
d. The sum of all the probabilities in a probability distribution is always equal to one.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 2
______2. Which of the following is a discrete random variable?
a. the average amount of electricity consumed
b. the number of patients in a hospital
c. the amount of paint used in repainting a building
d. the average weight of female athletes
______3. If two coins are tossed, which is not a possible value of the random variable for the
number of heads?
a. 0 b. 1 c. 2 d. 3
______4. Which formula gives the probability distribution shown by the table?
x 2 3 6
P(x) 1/2 1/3 1/6
a. P(X) = 1/X c. P(X) = 6/X
b. P(X) = X/6 d. P(X) = 1/6

______5. What is the mean of the probability distribution?


a. 1.5 b. 2.0 c. 3.5 d. 4.0

I-Introduction
To find out if you are ready to learn this lesson, do the following:

II- Interaction (Learning Activities)

A Random Variable is a function that associates a real number to each element in the
sample space. It is a variable whose values are determined by chance.

Terminologies:
Probability – is the measure of how likely an event is. It is the likelihood of
occurrence of an event.
Statistics –
An Experiment is a situation involving chance or probability that leads to results called
outcomes. (Ex. Tossing a coin, throwing 2 dice or rolling a die)
An Outcome is the result of a single trial of an experiment. (Ex. Possible outcomes is a
head or a tail appears)
An Event is one or more outcomes of an experiment. (Ex. Pair of numbers that are the
same)
The sample space of an experiment is the set of all possible outcomes of that
experiment. Ex. In tossing a coin, S={H,T}. in rolling a die, S={1,2,3,4,5,6}.
The sample point is an element of sample space. Ex. In rolling a die, there are 6 sample
points.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 3
Success – the event under consideration occurs.
Failure – the event under consideration does not occur.
Tossing Three Coins

Suppose three coins are tossed. Let Y be the random variable representing the number of
tails that occur. Find the values of the random variable Y.

Ok class. Now you know the examples.

So, the possible values of the random variable Y are 0, 1, 2


and 3.

Do you have any question? If you do, please pm me.


Let’s proceed to the next topic class

MIT-Senior High School


STATISTICS AND PROBABILITY Page 4
Constructing Probability Distributions

A discrete probability distribution or a probability mass function consists of the


values a random variable can assume and the corresponding probabilities of the values.

Histogram- A representation of a frequency distribution by means of rectangles


whose widths represent a class intervals and whose areas are proportional to the
corresponding frequencies.

Properties of a Probability Distribution


1. The probability of each value of the random variable must be between or equal to 0
and 1. In symbol, we write it as 0 ≤ P(X) ≤1.
2. The sum of the probabilities of all values of the random variable must be equal to 1.
In symbol, we write it as ⅀P(X) = 1.

This situation requires the use of random variables and probability distribution. Ex. 1)
Number of Tails Suppose three coins are tossed. Let Y be the random variable representing the
number of tails that occur. Find the probability of each of the values of the random variable Y.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 5
MIT-Senior High School
STATISTICS AND PROBABILITY Page 6
Yeheeeyyyy!! You made it 
Now, Let’s proceed to the next topic class

Computing the Mean of a Discrete Probability Distribution

MIT-Senior High School


STATISTICS AND PROBABILITY Page 7
MIT-Senior High School
STATISTICS AND PROBABILITY Page 8
Yeheeeyyyy!! You made it 
Now, Let’s proceed to another topic class

Computing the Variance of a Discrete Probability Distribution

Steps in Finding the Variance and Standard Deviation


1. Find the mean of the probability distribution.
2. Subtract the mean from each value of the random variable X.
3. Square the results obtained in step 2.
4. Multiply the results obtained in step 3 by the corresponding probability.
5. Get the sum of the results obtained in step 4.
6. Get the square root of the variance to get the standard deviation.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 9
Ex. 1)
Number of Cars Sold
The number of cars sold per day at local car dealership, along with its corresponding
probabilities, is shown in the succeeding table. Compute the variance and the standard deviation
of the probability distribution by following the given steps.

Number of Cars Probability P(X)


0 1/10
1 2/10
2 3/10
3 2/10
4 2/10

MIT-Senior High School


STATISTICS AND PROBABILITY Page 10
The variance and standard deviation can be obtained using a shorter formula and procedure.
Alternative Procedure in Finding the Variance and Standard Deviation of a Probability
Distribution
1. Find the mean of the probability distribution.
2. Multiply the square of the value of the random variable X by its corresponding
probability.
3. Get the sum of the results obtained in step 2.
4. Subtract the mean from the results obtained in step 3.

The variance is 1.56 1nd the standard deviation is 1.25.


MIT-Senior High School
STATISTICS AND PROBABILITY Page 11
Did you read and analyze each given example?

III- Integration

Closure/synthesis
To check your learning from the activities you’ve done from exercises and drills. You have to
complete the sentence below. Write your answer on the line after doing the activities.
I learned that___________________________________________________________

Reflection guide
How do you relate to statistics and probability when playing snake and ladder?
______________________________________________________________________________
__________________________________________________________________

CLOSING

IV-Intervention

I will give you an activity to answer. Please read the directions below.

Assignment 1: FILL ME IN
Drawing Balls from an Urn.
Two balls are drawn in succession without replacement from an urn containing 5
red balls and 6 blue balls. Find the values of the random variable Z. Complete the table
below.
Possible Outcomes Value of the Random Variable Z
(number of blue balls)

Evaluation 1:
Direction: Write D if discrete and C if continuous on the space before each number.

A. Identify the following random variables as discrete or continuous.

______1. The number of defective computers produced by a manufacturer.


______2. The weight of newborns each year in a hospital.
______3. The number of siblings in a family of a region.
______4. The number of deaths per year attributed to lung cancer.
______5. The number of dropouts in a school district for a period of 10 years.
______6. The speed of a car.
______7. The number of female athletes.
______8. The time needed to finish the test.
______9. The amount of sugar in a cup of coffee.
______10. The number of people who are playing LOTTO each day.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 12
Study tips 1:
Look after your mind and body by getting some exercise, and
consider meditating to boost your focus and concentration further.

V- Post Assessment

You have to read the questions comprehensively and choose the


best answer among the options. Just encircle the letter of your choice.

1. Which of the following statement is correct?


a. The mean of the sampling distribution of the means is less than the population mean.
b. The mean of the sampling distribution is the sample means is greater than the
population mean.
c. The means of samples drawn from a population are always equal to the population
mean.
d. The means of samples drawn from a population may be equal, greater than or less than
the population mean.

2. How many possible of size n = 3 can be drawn from a population of size 12?
a. 36 b. 1728 c. 144 d. 220

3. If the variance of a population is 10, what is the variance of the sampling distribution of
the means of size 5 drawn from this population?
a. 2 b. 4.47 c. 1.41 d. 10

4. Subsets of population are called?


a. samples b. elements c. estimates d. averages

5. Which formula gives the probability distribution shown by the table?

x 4 5 3
P(x) 2/4 2/5 2/3

a. P(X) = 1/2X c. P(X) = 2/X


b. P(X) = X/6 d. P(X) = 1/6

MIT-Senior High School


STATISTICS AND PROBABILITY Page 13
Mo Module 2: Normal Probability Distribution
Time Allotment: Week 4-5

Module Overview

In this module, some basic concepts in probability and extra concepts on conditional
probability are thought. There are also discussion on the on the problem on how to compute for
the chance or probability of a certain experiments.

Learning Competencies

At the end of this lesson, you are expected to:


a.) Illustrates a normal random variable and its characteristics.
b.) Identifies regions under the normal curve corresponding to different standard normal
values.
c.) Converts a normal random variable to a standard normal variable and vice versa.
d.) Computes probabilities and percentiles using the standard normal table.

Instructional Materials:

 Oliva M. (2016). DIWA Senior High School Series: STATISTICS & PROBABILITY.
DIWA Learning Systems, Inc.
 1st edition REX BOOK Statistics and Probability Rene R. Belecina, Elisa S. Baccay,
Efren B. Mateo

Pre-Assessment

You have to read the questions comprehensively and choose the best answer among
the options. Just write the letter of your choice.

_______1. The standard normal distribution


a. Has a mean of zero (0) and a standard deviation of 1.
b. Has a mean of 1 and a variance of zero (0).
c. Has an area equal to 0.5.
d. Cannot be used to approximate discrete probability distributions.
_______2. If Z has a standard normal distribution, and P (0 < Z < z ) is 0.3770, then the
value of z is

a. 0.18 b. 0.81 c. 1.16 d. 1.47

MIT-Senior High School


STATISTICS AND PROBABILITY Page 14
_______3. Suppose Z has a standard normal distribution with a mean of 0 and a standard
deviation of 1. The probability that Z is less than 1.15 is _________.

a. 0.7849 b. 0.8794 c. 8479 d. 8749

_______4. Suppose Z has a standard normal distribution with a mean of zero and a standard
deviation of 1. The probability that Z values are larger than ________ is 0.3483.

a. 0.39 b. 0.03 c. 0.93 d. 0.09

_______5. Suppose Z has a standard normal distribution with a mean of 0 and a standard
deviation of 1. 85% of the possible Z values are smaller than _________.

a. 1.645 b. 1.96
c. 1.04 d. 1.58

Understanding the Normal Curve Distribution


I-Introduction

ENTRY CARD
Determine whether the statement is true or false by checking the appropriate box.
Statement TRUE FALSE
1. The areas under probability distribution correspond to the
probabilities of a random variable X.
2. A probability distribution is generated by frequencies.
3. The areas under the curve corresponds to all probabilities
for X.
4. Under the curve, there is no area over a point.
5. The probability associated with a particular value of X is
0.

II-Interaction (Learning Activities)

Hello class, how are you doing?


Today’s topic is about Random Variables. Let’s start. Please read the
content below.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 15
 A standard normal curve is a normal probability distribution that has a mean, µ = 0 and a
standard deviation, s = 1.

 A frequency distribution an arrangement of statistical data that exhibits the frequency of


the occurrence of the values of a variable.

 A skewness lack of straightness of symmetry; lack of symmetry in a frequency


distribution.

When scores in a set of data are ordered and their frequencies are noted, a pattern
emerges. Let us do the following activity to identify this pattern.

1. Graph the frequency polygon of the following scores of students in a quiz.


Here, there are 5 students who took the quiz.

X(score) F
3 1
2 3
1 1
N 5

MIT-Senior High School


STATISTICS AND PROBABILITY Page 16
2. Now let us increase the number of students to 14. Observe what will happen to the
shape of the graph.

X(score) F
5 1
4 3
3 5
2 3
1 1
N 14

3. Let us further increase the number of students. From the first graph to the third
X Graph, observe the transformation of the figures.

X(score) F
9 1
8 2
7 4
6 7
5 11
4 7
3 4
2 2
1 1
N 42

4. If we continue to increase the number of students, what will be the shape of the
graph? Take note also of its skewness.

5. For each distribution, compute the three measures of averages: mean, median, and
mode.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 17
The standard normal curve is a normal probability distribution that is most commonly
used as a model for inferential statistics. The equation that describes a normal curve is:

µ = mean of the population


∏ = 3.1416
e = 2.7183

Take note that in a standard normal curve, its mean µ = 0 and standard deviation
σ = 1. By substituting the value of mean and standard deviation in the formula,
mathematicians are able to find areas under the normal curve. Fortunately, these areas
have already been determined for our use. The area between -3 and +3 is almost 100%
(not yet 100%) because the curve almost touches the horizontal line. Thus, there is a small
fraction of the area at the tails of the distribution.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 18
The Table of Areas under the Normal Curve is also known as the z-Table. The z-
score is a measure of relative standing. It is calculated by subtracting mean (or µ) from the
measurement X and then dividing the result by s (or σ). The final result, the z-score,
represents the distance between a given measurement X and the mean, expressed in
standard deviations. Either the z-score locates X within a sample or within a population.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 19
Examples 1) Find the area that corresponds to z = 1.
Finding the area that corresponds to is the same as finding the area between z =0 and z=1.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 20
Do you have any question? If you do, please pm me. Let’s
proceed to the next topic class

Understanding the Z-Scores

MIT-Senior High School


STATISTICS AND PROBABILITY Page 21
Steps Solution
1. Use the computing formula for finding z-scores of
population data. z=

2. Check the given values. Since these are population values,


the z-score locates X within a population. µ = 50, σ = 4, and X = 58

3. Substitute the given values in the computing formula.


z=

4. Compute the z-value.


z=

Thus, the z-value that


corresponds to the raw
score 58 is 2 in a
population distribution.

This conversion from raw score to z-score is shown graphically in fig. 2.4

MIT-Senior High School


STATISTICS AND PROBABILITY Page 22
If you have any question or clarification just send your thoughts in our class group messenger.

Now. Lets proceed!!!!


Identifying Regions of Areas Under the Normal Curve
Ex. 1
Identifying Regions Under the Normal Curve
Task 1.
 Study the curve and identify the shaded region in terms of the z-score.
 Identify the same region in terms of area.
 Can you think of another way to identify the region?

MIT-Senior High School


STATISTICS AND PROBABILITY Page 23
Task 2.
 Draw a sketch of a normal curve.
 Draw a vertical line through the specified z-values and shade the region.
1. z = 0 and z = 1
2. z = 0 and z = -1
3. z = 1 and z = 2
4. z = -1 and z = -2
5. z = 2 and z = 3
6. z = -2 and z = -3

We see that using the z-table, we can determine specific regions under the normal curve.
For example, 50% of the region under the curve is below the mean and 50% is above the mean.
Specific regions can be determined in terms of their usefulness in a situation.

Task 3.
 Read the following section carefully.

Since the z-table provides the proportion of the area (or probability or percentage)
between any two specific values under the curve, regions under the curve can be described in
terms of area. For example, the area of the region z = 0 and z = 1 is given in the z-table to be
.3413. We can do operations with the areas to identify more regions under the curve. So, to find
the area of the region between z = 1 and z = 2, we subtract .3413 from .4772 resulting in .1359. It
is graphically shown below.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 24
MIT-Senior High School
STATISTICS AND PROBABILITY Page 25
Examples:
Find the area that corresponds to each of the following z-values:
1. z = 0.3
2. z = 1.96
3. z = -1.15

MIT-Senior High School


STATISTICS AND PROBABILITY Page 26
Let’s continue….

MIT-Senior High School


STATISTICS AND PROBABILITY Page 27
Determining Probabilities

MIT-Senior High School


STATISTICS AND PROBABILITY Page 28
MIT-Senior High School
STATISTICS AND PROBABILITY Page 29
Let’s pRoceeD to neXt topic….

MIT-Senior High School


STATISTICS AND PROBABILITY Page 30
Locating Percentiles Under the Normal Curve
Recalling basic Statistics, these are expressions of order. They indicate relative standing.
In real life, many people want to belong to a high level in terms of relative standing.

Ex. 1)
Locating Percentile Under the Normal Curve
For any set of measurements (arranged in ascending or descending order), a percentile (or
a centile) is a point in this distribution such that a given number of cases is below it. A percentile
is a measure of relative standing. It is a descriptive measure of the relationship of a measurement
to the rest of the data.

Suppose that we know the probability this time. What is the corresponding z-score? For
example, where is 95% under the normal curve?

There are three important things to remember when we are given probabilities and we
want to know their corresponding z-scores

 First, a probability value corresponds to an area under the normal curve.


 Second, in the table of Areas Under the Normal Curve, the numbers in the extreme left
and across the top are z-scores, which are the distances along the horizontal scale. The
numbers in the body of the table are areas or probabilities.
 Third, the z-scores to the left of the mean are negative values.

Ex. 1) Find the 95th percentile of a normal curve.

Analysis: by definition of P95, means locating an area before (or below) the point. We want to
know what the z-value is at this point.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 31
Ex. 3) The result of a nationwide aptitude test in mathematics are normally distributed with m =
80 and s = 15. What is the percentile rank of a score of 84?

Analysis: Since we are interested in the percentile rank of 84, this means finding the percentage
of scores below 84. To find the required area from the Table of Areas, we transform 84 to z-score
form and then perform the appropriate mathematical operation.

Notice that there is no perspective procedure in the computation of the z-scores and the
probabilities. You have analyze the problem carefully and apply what you have learned about the
concepts and procedures involving the normal curve.

Now, Let us continue….

MIT-Senior High School


STATISTICS AND PROBABILITY Page 32
Applying the Normal Curve Concepts in Problem Solving

Understanding Test Results


Ex. 1)
The results of a nationwide aptitude test in mathematics are normally distributed with
m=80 and s = 15. Find the raw score such that 70% of the cases are below it.

Analysis Questions:
1) What are the given?
2) What are you going to find?
3) What is suggested by the given to help you solve the problem?
4) Will the application of the normal curve concepts be useful?

The problem involves a normal distribution. So, we can apply the normal curve concepts
to solve the problem. The expression 70% implies that we are given the area and that we are
looking for a raw score. We can find this by determining first the corresponding z-score from The
Table of areas of the Normal Curve. If 70% of the cases lie below the raw score, then 30% of the
cases are above it. You can choose which probability to deal with. In this example, 30% is
selected.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 33
III- Integration

Closure/synthesis
To check your learning from the activities you’ve done from exercises and drills you have to
complete the sentences below. Write your answer on the line after doing the activities:
I think that_______________________________________________________________

Reflection guide
Explain why the total area of the region between z = -3 and z = 3 is .9974 or 99.74%.
______________________________________________________________________________
__________________________________________________________

CLOSING

IV-Intervention

Class, you can start answering your activities now.

Assignment 2:
1. Use the z-table to find the area that corresponds to each of the following:
a) z = .56
b) z = 1.32
c) z = -1.05
d) z = -2.18
e) z = -2,58

Evaluation 2:
DIRECTIONS:
1. Find each of the following percentile points under the normal curve.

a) P99
b) P90
c) P68
d) P40
e) P32

Study tips 2:
Be strategic about where you study.

If you have any question or clarification just send your thoughts in our class group messenger.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 34
POST ASSESSMENT

You have to read the questions comprehensively and choose the best answer among the
options. Write only the letter of your choice.

______1. If a particular batch of data is approximately normally distributed, we would find that
approximately
a. 2 of every 3 observations would fall between ± 1 standard deviation around the mean.
b. 4 of every 5 observations would fall between ± 1.28 standard deviation around the
mean.
c. 19 0f every 20 observations would fall between ± 2 standard deviation around the
mean.
d. All of the above.

FOR PROBLEMS @ TO $ CONSIDER THE FOLLOWING CASE:

The length of time it takes a grade 11 student to play the Mobile Legend computer app follows a
normal distribution with a mean of 3.5 minutes and a standard deviation of 1 minute.

______2. The probability that a randomly selected Grade 11 student will play one game of
Mobile Legend in less than 3 minutes is
a. 0.3551 b. 0.3085 c. 0.2674 d. 0.1915
______3. The probability that a randomly-selected grade 11 student will take between 2 and 4.5
minutes to play Mobile Legend is:
a. 0.0919 b. 0.2255 c. 0.4938 d. 0.7745
______4. The point is the distribution of times to play Mobile Legend in which 75.8% of the
grade 11 students exceed when playing Mobile Legend.
a. 2.8 minutes b. 3.2 minutes c.3.4 minutes d. 4.2 minutes

______5. The probability that a standard normal random variable, Z, falls between – 1.50 and
0.81 is 0.7242.
a. true b. false c. not true d. both a and b

MIT-Senior High School


STATISTICS AND PROBABILITY Page 35
Module 3: Sampling and Sampling Distributions
Time Allotment: Week 6-8

Module Overview

In this activity, you have to revisit the coin tossing activity but this time, try to look into
how the probability of getting a head is an unknown constant and needs to be estimated. Other
illustrations on sampling and estimation are also discussed.

Learning Competencies

At the end of this module, you are expected to:


1. Illustrates random sampling.
2. Distinguish between parameter and statistics.
3. Identifies sampling distributions of statistics.
4. Finds the mean and variance of the sampling distribution of the sample mean.
5. Defines the sampling distribution of the sample mean for normal population when the
variance is: (a) known, (b) unknown
6. Illustrates the Central Limit Theorem.
7. Defines the sampling distribution of the sample mean using the Central Limit Theorem.
8. Solves problems involving sampling distributions of the sample mean.

Instructional Materials

 Oliva M. (2016). DIWA Senior High School Series: STATISTICS & PROBABILITY.
DIWA Learning Systems, Inc.
 1st edition REX BOOK Statistics and Probability Rene R. Belecina, Elisa S. Baccay,
Efren B. Mateo

Pre-Assessment

You have to read the questions comprehensively and choose the best answer among the
options. Just write the letter of your choice.

_______1. The process of using sample statistics to draw conclusions about true
population parameters is called.
a. Statistical inference c. sampling
b. The scientific method d. descriptive statistics
_______2. The universe or “totality of items or things” under consideration is called
a. a sample c. a parameter
b. a population d. a statistic
_______3. The portion of the universe that has been selected for analysis is called
a. a sample c. a parameter
b. a frame d. a statistic

MIT-Senior High School


STATISTICS AND PROBABILITY Page 36
_______4. A summary measure that is computed to describe a characteristic from
only a sample of the population is called
a. a parameter c. a statistic
b. a census d. the scientific method
_______5. A summary measure that is computed to describe a characteristic of an
entire population is called
a. a parameter c. a statistic
b. a census d. the scientific method

Sampling Distribution of Sample Means


I-INTRODUCTION

Can you recall the concepts about sampling in our previous


topics?

Don’t skip! Please recall!

Interaction (Learning Activities)

Task: Make a research on the estimates or statistics of the following


variables.
 Average life expectancy of Filipinos
 Average monthly income of Filipino families
 Average height of Filipino women
 Average monthly consumption of rice in the country
 Average number of daily commuters at town proper

Ex. 1)
Sampling Distribution of Sample Means
A population consists of the numbers 2, 4, 9, 10, and 5. Let us list all possible samples of
size 3 from this population and compute the mean of each sample.

Sample Mean
2, 4, 9 5.00
2, 4, 10 5.33
2, 4, 5 3.67
2, 9, 10 7.00
2, 9, 5 5.33
2, 10, 5 5.67
4, 9, 10 7.67
4, 9, 5 6.00
4, 10, 5 6.33
9, 10, 5 8.00

There are 10 possible samples of size 3 that can be drawn from the given population.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 37
Note: The number of samples of size n that can be drawn from a population of size N is given by
NCn. A sampling distribution of sample means is a frequency distribution using the means
computed from all possible random samples of a specific size taken from a population.

This time, let us make a frequency distribution of the sample means. We shall
call this frequency distribution, the sampling distribution of the sample means.

Sample Mean Frequency


3.67 1
5.00 1
5.33 2
5.67 1
6.00 1
6.33 1
7.00 1
7.67 1
8.00 1
Total n=10

Observe that the means vary from sample to sample. Thus, any mean based on the sample
drawn from a population is expected to assume different values for the samples. So, this leads us
to a conclusion that sample mean is a random variable, which depends on a particular sample.
Being a random variable, it has a probability distribution. The probability distribution of the
sample means is also called the sampling distribution of the sample means. Refer Table 3.1.

Observe that the means of the samples are less than or greater than the mean of the
population. The difference between the sample mean and the population mean is called the
sampling error. It is the error due to sampling.

Steps in Constructing the Sampling Distribution of the Means


1.Determining the number of possible samples that can be drawn from the
population using the formula:
NCn
Where N = size of the population
n = size of the sample
2. List all the possible samples and compute the mean of each sample.
3. Construct a frequency distribution of the sample means obtained in Step 2.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 38
Let’s continue….

Finding the Mean and Variance of the Sampling Distribution of Means

Mean and Variance of Sampling Distribution of Sample Means


Consider a population consisting of 1, 2, 3, 4, and 5. Suppose samples of size 2 are
drawn from this population. Describe the sampling distribution of the sample means.
 What is the mean and variance of the sampling distribution of the sample means?
 Compare these values to the mean and variance of the population.
 Draw the histogram of the sampling distribution of the population.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 39
III- Integration

Closure/synthesis
To check your learning from the activities you’ve done from exercises and drills. You
have to complete the sentence below. Write your answer on the line after doing the
activities.
I noticed that_______________________________________________________

Reflection guide
Give an example of a central limit theorem in real life-scenario?
_________________________________________________________________
__________________________________________________________________
________________________________________________________________

MIT-Senior High School


STATISTICS AND PROBABILITY Page 40
CLOSING

IV-Intervention

I will give you an activity to answer. You may start now


Assignment 3: Find the finite population correction factor given the following:

a. N = 200, n = 10
b. N = 2000, n = 10
c. N = 400, n = 40
d. N = 500, n = 10
e. N = 200, n = 20

Evaluation 3:
1. A finite population consists of 8 elements. 10 10 10 10 10 12 18 40
a. How many samples of size n = 2 can be drawn from this population?
b. List all the possible samples and the correspond

Sample Mean

c. Construct the sampling distribution of the sample means.

Sampling Distribution of Sample Means


Sample Mean ̅ Frequency Probability P( ̅ )

d. Draw a histogram of the sampling distribution of the means.

Study tips 3:
Find your most productive time of day (morning, evening etc.) and use it to master the
most difficult topics.

POST ASSESSMENT

You have to read the questions comprehensively and choose


the best answer among the options. Just write the letter of your choice

_______1. Which of the following is most likely a population as opposed to a sample?


a. A respondent to a newspaper survey
b. The first 5 learners completing an assignment
c. Every third person to arrive at the bank
d. Registered voters in a country

MIT-Senior High School


STATISTICS AND PROBABILITY Page 41
_______2. Which of the following is most likely a parameter as opposed to a statistic?
a. The average score of the first five learners completing an assignment
b. The proportion of females registered to vote in a country
c. The average height of people randomly selected from database
d. The proportion of trucks stopped yesterday that were cited for bad
brakes

_______3. Which of the following is NOT a reason for the need for sampling?
a. It is usually too costly to study the whole population.
b. It is usually too time-consuming to look at the whole population.
c. It is sometimes destructive to observe the entire populations.
d. It is always more informative by investigating a sample than the entire
population.
_______4. Which of the following is NOT a reason for drawing a sample?
a. A sample is less time consuming than a census.
b. A sample is less costly to administer than a census.
c. A sample is always a good representation of the target population.
d. A sample is less cumbersome and more practical to administer.
_______5. The Philippine Airlines Internet site provides a questionnaire instrument
that can be answered electronically. Which of the 4 data collection is involved
when people complete the questionnaire?
a. Published sources
b. Experimentation
c. Surveying
d. observation

Module 4: ESTIMATION OF PARAMETERS


Time Allotment: Week 9-10

Module Overview

This module further builds on the discussions on the sampling and sampling distribution to
illustrate one of the basic purposes of statistics – inference. Learners are given description of basic
concepts of estimation; both point estimation and interval estimation.

Learning Competencies:

At the end of this module, you are expected to:


1. Illustrates the T-distribution.
2. Identifies percentiles using the t-table
3. Identifies the length of the confidence interval.
4. Compute for an appropriate sample size using the length of the interval.
5. Solves problems involving sample size determination.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 42
Instructional Materials:

 Oliva M. (2016). DIWA Senior High School Series: STATISTICS & PROBABILITY.
DIWA Learning Systems, Inc.
 1st edition REX BOOK Statistics and Probability Rene R. Belecina, Elisa S. Baccay,
Efren B. Mateo

Pre-Assessment:

You have to read the questions comprehensively and choose the best answer among the options.
Just write the letter of your choice in your notebook.

_______1. The width of a confidence interval estimate for a proportion will be


a. Narrower for 99% confidence than for 95% confidence.
b. Wider for a sample size of 100 than for a sample size of 50.
c. Narrower for 90% confidence than for 95% confidence.
d. Narrower when the sample proportion is 0.50 than when the sample
proportions is 0.20.
_______2. A 99% confidence interval estimate can be interpreted to mean that
a. If all possible samples are taken and confidence interval estimates are
developed, 99% of them would include the true population mean
somewhere within their interval.
b. We have 99% confidence that we have selected a sample whose
interval does include the population mean.
c. Both of the above.
d. None of the above.
_______3. When determining the sample size necessary for estimating the true
population mean, which factor is not considered when sampling with
replacement?
a. The population size.
b. The population standard deviation.
c. The level of confidence desired in the estimate.
d. The allowable or tolerable sampling error.
_______4. Suppose a 95% confidence interval for µ turns out to be (1,000, 2,000). To
make more useful inferences from the data, it is desired to reduce the width of the
confidence interval. Which of the following will result in a reduced interval
width?
a. Increase in the sample size.
b. Decrease in the confidence level.
c. Increase in the sample size and decrease in the confidence level.
d. Increase in the confidence level and decrease in the sample size.
_______5. In the construction of confidence intervals, if all other quantities are
unchanged, an increase in the sample size will lead to a ______________ interval.
a. narrower
b. wider
c. less significant
d. biased

MIT-Senior High School


STATISTICS AND PROBABILITY Page 43
I-Introduction

ENTRY CARD
A. Write the letter of the best answer for each of the following.
______1. What do you call the set of all people, objects, events, or ideas
you want to investigate?
a. sample b. population c. data d. statistics
______2. A sample refers to __________ of a population.
a. a subset b. a list c. a description d. another name
______3. What is the mean of 13, 27, 29, 17, and 14?
a. 29 b. 28 c. 20 d. 13
______4. What do you call a number that describes a population
characteristic?
a. sample statistic b. parameter c. variable d. decimal
______5. Which of the following symbols denotes population mean?
a. µ b. c. σ d. s
______6. Which of the following symbol denotes population standard
deviation?
a. µ b. ̅ c. σ d. s
______7. In statistics, error refers to:
a. mistake resulting in wrong decisions
b. mistake resulting from computation
c. difference between two parameter values
d. difference between an observed value and a parameter

II. Interactions (Learning activities)

Our topic is getting more interesting. Please read the content


below. Ask me if you have any question.

Let’s start!!!

Computing the Point Estimate of a Population Mean


An estimate is a value or a range of values that approximate a parameter. It is based on sample
statistics computed from sample data.

Estimation is the process of determining parameter values.

A point Estimate is a specific numerical value of a population parameter. The sample


mean is the best point estimate of the population mean.

An interval estimate is a range of values that may contain the parameter of a population.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 44
Ex.) 1
Coco Juice
Study the following situation and do the task. Then, answer the following questions or
supply the missing information.

Mr. Santiago’s company sells bottled coconut juice. He claims that a bottle contains 500
ml of such juice. A consumer group wanted to know if his claim is true. They took six random
samples of 10 such bottles and obtained the capacity, in ml, of each bottle. The result is shown as
follows:

Solutions:
Method 1: Using the excel program

A sample of the worksheet for sample 1 is shown as follows.

# A B C D
1 500
2 498
3 497
4 503
5 499
6 497
7 497
8 497
9 497
10 495

Steps:
1. Encode the data in the cells proceeding from cell A1 to cell A10.
2. Select the data
3. Click the insert function fx. The Insert Function dialog will box appears.
4. In the Insert Function dialog box, click the arrow to select a category. A drop window
will appear. In this window, click Statistical.
5. In the Insert Function dialog box, there is another window labeled Select a function.
Click AVERAGE. At the bottom of the box, click OK.
6. Another dialog box will appear. This is the Function Arguments box. There are two
smaller windows in the box labeled Number 1 and Number 2. In the window Number 1,
write A1:A10. The computer reads the numbers encoded in Column a Row 1, Column A
Row 2, and so on to Column A Row 10.
7. Copy the Formula result in the dialog box. For the encoded data of sample 1, the result
is 498. Then click OK.
8. Repeat the procedure for the other samples.
9. Compute the mean of the means also called overall means. This is the point estimate
of µ.

Alternatively, in the MS Excel program, the values may simply be encoded one after the other
Row 1 to Row 60. Next, select the entries. Then repeat steps 3 to 7. With the proliferation of

MIT-Senior High School


STATISTICS AND PROBABILITY Page 45
statistical software, you can choose the most convenient model to compute statistical values for
statistical analysis.

Method 2: Manual Estimation of the Population Mean

When dealing with a large number of values, the mean of small samples may be obtained.
These means constitute a sampling distribution of means. To find the overall mean, simply find
the sum of the mean values. Then, divide this sum by the total number of sample means.

For example, let us consider the six sample rows of the 60 bottles a s excellent samples. Next, we
compute each row mean.

Computing More Means


1. Compute the remaining column means.
2. Compute the mean of the means.
3. Enter your values in the following table.
4. Based on the 10 random samples, what is the point estimate of the population
mean?

Column Sum of Scores Means


1 2996 499.33
2 2984 497.33
3 2987
4 2984
5 2984
6 2988
7 2991 498.5
8 2988
9 2987
10 2981
Overall Mean

MIT-Senior High School


STATISTICS AND PROBABILITY Page 46
Now, Let us continue….

Point Estimate for the Population Proportion ̂

Proportion is a fraction expression where the favorable response is in the


numerator and the total number of respondents is in the denominator. The basic operation
involves division. Thus, the result is a decimal value that can be expressed as percent.

Characteristics of the sampling Distribution ̂


1. The mean of the sampling distribution of ̂ is p; that is, ̂ is the unbiased estimator of p

2. The standard deviation of the sampling distribution of ̂ is √ ; that is,

σp = √ , where q = 1 – p. .
3. For large samples, the sampling distribution of ̂ is approximately normal. A sample
size is considered large if the interval ̂ 3σp does not include 0 (like p = 0.001) or (like
p = 0.99)

Ex.)1
Great Escape

Explore this.
On a typical morning, Anthony asks his TV viewers if they believe or do not believe
the “great escape” story of a group of soldiers from a perceived adversary. At the end of his
show, he reported that 68% of the respondents believe the story.
 What does the report men to you?
 What do you understand by the expression 68%?

The percentage expression 68% is also called a proportion. Percentages are preferred
when reporting frequencies of subsets of populations. But in computing, percentages are first
converted to proportions in decimal form. Proportions also represent probabilities. So, the
probability that all TV viewers favor the “great escape” story is 0.68.

Proportions can be obtained from samples or populations.

To facilitate our discussions, we shall use the following symbols.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 47
III- Integration

Closure/synthesis
To check your learning from the activities you’ve done from exercises and drills. You have to
complete the sentence below. Write your answer on the line after doing the activities.
I learned that___________________________________________________________

Reflection guide
Discuss how could you use the confidence interval estimates to obtain the objective?
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________________________________
______________________________________________________

CLOSING

IV-Intervention

Time for activities! Enjoy making fun craft!

Assignment 4: 1. Fill in the blanks with the correct information to make meaningful statements.

a. A ___________ is a single value used to estimate a population parameter.

b. A range of values called ___________ is also used to estimate a population parameter.

c. Among the three values of central tendency, the best estimator of the population parameter
µ is the ___________.

d. A good estimator has no ___________.

e. To obtain a mean from a sample, the sum of the sample values is divided by ___________.

f. The standard deviation is a measure of ___________.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 48
Evaluation 4:
1. Find (a) the point estimate of the population parameter µ, and the
(b) standard deviation for each of the following sets of data.

Scores in a long test in Science

78 75 86 82 70 85 83 86
80 92 82 85 80 88 84 86
90 88 90 78 83 90 86 84
75 85 77 88 85 90 85 83
83 86 83 84 86 92 85 80
76 88 79 84 80 88 80 88

Study tip 4:
Get help from people around you – friends, parents, teachers… don’t be afraid to ask for
help.

POST ASSESSMENT

You have to read the questions comprehensively and choose the best answer among the options.
Just write the letter of your choice.

_______1. Which of the following is not true about the student’s distribution?
a. It has more area in the tails and less in the center than does the normal
distribution.
b. It is used to construct confidence intervals for the population mean
when the population standard deviation is known.
c. It is bell-shaped and symmetrical.
d. As the number of degrees of freedom increases, the t-distribution
approaches the normal distribution.
_______2. The t distribution
a. Assumes the population is normally distributed.
b. Approaches the normal distribution as the sample size increases.
c. Has more area in the tails than does the normal distribution.
d. All of the above.
_______3. A major department store chain is interested in estimating the average
amount its credit card customers spent on their first visit to the chain’s new store
in the mall. Fifteen credit card accounts were randomly sampled and analyzed
with the following result:
̅ = 2525 pesos and S2 - 20,000. Assuming the distribution of the amount spent
on their first visit is approximately normal, what is the shape of the sampling
distribution of the sample mean that will be used to create the desired confidence
interval for µ?

MIT-Senior High School


STATISTICS AND PROBABILITY Page 49
a. Approximately normal with a mean of Php 2525
b. A standard normal distribution
c. A t distribution with 15 degrees of freedom
d. A t distribution with 14 degrees of freedom
_______4. A major department store chain is interested in estimating the average
amount its credit card customers spent on their first visit to the chain’s new store
in the mall. Fifteen credit card accounts were randomly sampled and analyzed
with the following results:
̅ = 2525 pesos and S2 - 20,000. Construct a 95% confidence interval for the
average amount its credit card customers spent on their first visit to the chain’s
new store in the mall.
a. 2525 pesos ± 454.5 pesos
b. 2525 pesos ± 506 pesos
c. 2525 pesos ± 550 pesos
d. 2525 pesos ± 554 pesos
_______5. As an aid to the establishment of personnel requirements, director of a
hospital wishes to estimate the mean number of people who are admitted to the
emergency room during a 24-hour period. The director randomly selects 64
different 24-hour periods and determines the number of admissions for each. For
this sample, ̅ = 19.8 and S2 = 25. Which of the following assumptions is
necessary in order for a confidence interval to be valid?
a. The population sampled from has an approximate normal distribution.
b. The population sampled from has an approximate t distribution.
c. The mean of the sample equals the mean of the population.
d. None of these assumptions are necessary.

Stay safe. Everyone!

MIT-Senior High School


STATISTICS AND PROBABILITY Page 50
STATISTICS AND PROBABILITY
SELF-PACED LEARNING MODULE UNIT I1

Module 1: TEST OF HYPOTHESIS


Time Allotment: Week 1-7

Module Overview

In this module you will learn the definition of statistical hypothesis, the difference of null
hypothesis from alternative hypothesis, the consequences of making a decision and the two
possible errors that could be committed in a test of hypothesis.

Let’s proceed to our topic.

Learning Competencies

At the end of this module, yQou are expected to:


 illustrates: (a) null hypothesis; (b) alternative hypothesis; (c) level of significance; (d)
rejection region; and (e) types of errors in hypothesis testing.
 identifies the parameter to be tested given a real life problem.
 formulates the appropriate null and alternative hypotheses on a population mean.
 identifies the appropriate form of the test-statistic when: (a) the population variance is
assumed to be known; (b) the population variance is assumed to be unknown; and (c) the
Central Limit Theorem is to be used.
 identifies the appropriate rejection region for a given level of significance when: (a) the
population variance is assumed to be known; (b) the population variance is assumed to be
unknown; and (c) the Central Limit Theorem is to be used.
 computes for the test-statistic value (population mean).
 solves problems involving test of hypothesis on the population mean.
 formulates the appropriate null and alternative hypotheses on a population proportion.
 identifies the appropriate rejection region for a given level of significance when the
Central Limit Theorem is to be used.
 computes for the test-statistic value (population proportion).
 solves problems involving test of hypothesis on the population proportion.

Instructional Materials

 Oliva M. (2016). DIWA Senior High School Series: Disaster Readiness and Risk
Reduction (STEM Track). DIWA Learning Systems, Inc.
 paper and pen

MIT-Senior High School


STATISTICS AND PROBABILITY Page 51
 Pre-Assessment:

You have to read the questions comprehensively and choose the best answer among the
options. Just write the letter of your choice in your notebook.

_______1. Which of the following would be an appropriate null hypothesis?


a. The mean of a population is equal to 50.
b. The mean of a sample is equal to 50.
c. The mean of a population is greater than 50.
d. Only (a) and (c) are true.
_______2. Which of the following would be an appropriate null hypothesis?
a. The population proportion is less than0.45
b. The sample proportion is less than 0.45
c. The population proportion is no less than 0.45
d. The sample proportion is no less than 0.45
_______3. Which of the following would be an appropriate alternative hypothesis?
a. the mean of a population is equal to 50
b. the mean of a sample is equal to 50.
c. the mean of a population is greater than 50.
d. the mean of a sample is greater than 50
_______4. Which of the following would be an appropriate alternative hypothesis?
a. The population proportion is less than0.45
b. The sample proportion is less than 0.45
c. The population proportion is no less than 0.45
d. The sample proportion is no less than 0.45
_______5. A type II error is committed when
a. we reject null hypothesis that is true.
b. we don’t reject a null hypothesis that is true.
c. we reject a null hypothesis that is false.
d. we don’t reject a null hypothesis that is false.

I-Introduction

ENTRY CARD
Determine whether the statement is true or false by checking ( ) the appropriate box.
Statement True False
1. The value resulting from 1 - .05 is 95.
2. The area under the normal curve is 1.
3. The areas under the curve are considered the probability values.
4. The z-distribution applies to large sample cases.
5. Under the normal curve, there are many z-values.
6. The notations µ and σ are sample values.

II-Interaction (Learning essentials)

You still ok? Again, if you have any question, please do


message me in our class group messenger.

Ok! Let’s learn a new topic for today.


Please read the content below.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 52
Understanding Hypothesis Testing

Great Escape
Explore this.
On a typical morning, Anthony asks his TV viewers if they believe or do not believe
the “great escape” story of a group of soldiers from a perceived adversary. At the end of his
show, he reported that 68% of the respondents believe the story.
 What does the report men to you?
 What do you understand by the expression 68%?

The percentage expression 68% is also called a proportion. Percentages are preferred when
reporting frequencies of subsets of populations. But in computing, percentages are first
converted to proportions in decimal form. Proportions also represent probabilities. So, the
probability that all TV viewers favor the “great escape” story is 0.68.

Proportions can be obtained from samples or populations.

To facilitate our discussions, we shall use the following symbols.

n = number of observations in a simple random sample.


̂ = sample proportion (read as “p hat”)
p = population proportion
̂ =

̂= and = ̂ = 1 - ̂
Where: X is the number of sample elements that posses the desired characteristics; and n is the
sample size.

Are you done reading? Did you understand? If you’re


confused you can ask question from me.

Now, Let us continue with our


discussion….

TEST ON POPULATION MEAN


Let us consider this example and let us emphasized also that the population variance is unknown. The
problem can be stated as follows:

The father of a senior high school student lists down the expenses he will incur when he sends his
daughter to the university where he wants her to study. He hypothesizes that the average tuition fee is at
least Php20,000 per semester. He knows the variable of interest, which is the tuition fee, is measured at
least in the interval scale or specifically in the ratio population mean and variance are unknown. The
father asks, at random, 25 students of the university about their tuition fee per semester. He is able to get
an average of Php20,050 with a standard deviation of Php500.

 In this problem, the appropriate null and alternative hypothesis remain the same as in the previous
lesson and are stated as:

Ho: the average tuition fee in the targeted university is at least Php20,000. In symbols, Ho: µ 20,000
pesos.
Ha: the average tuition fee in the targeted university is less than Php20,000. In symbols, Ha: µ 20,000
pesos.

 With the assumption of unknown population variance ( 2) and the variable of interest is measured
at least in the interval scale and follows the normal distribution, the appropriate test statistic,

MIT-Senior High School


STATISTICS AND PROBABILITY Page 53
̅
denoted as tC = where ̅ and s, are sample mean and sample standard deviation,

respectively, computed from a simple random sample of n observations; and o is the


hypothesized value of the parameter. The test statistic follows the student’s t-distribution with
n – 1 degrees of freedom which means the tabular value in the student’s t-table will be used as
critical or tabular value. With this, the decision rule can be one of the following possibilities:

1. Reject Ho if tC -ta, n-1. Otherwise, we fail to reject Ho.


2. Reject Ho if tC ta, n-1. Otherwise, we fail to reject Ho.
3. Reject Ho if tC l ⁄ , n-1. Otherwise, we fail to reject Ho.

For the problem, the first is the appropriate decision rule. Suppose the level of significance ( ) is set as
0.05, then the decision rule for the problem can be stated as “ Reject Ho if the tC -t,24 = -2.064.
otherwise, we fail to reject Ho.”Note that this test procedure is referred to as “one-tail t-test for the
population mean” and the rejection region is illustrated as follows:

 The next step is to use the decision rule to make a decision. With the computed value of the test
statistic equal to 0.50 and the rule dictates that our decision is not to reject or fail to reject the
null hypothesis.
 Lastly, as a consequence of the decision, conclusions are to be stated. With the acceptance of
the null hypothesis, the father can say that the average tuition fee at the university where he
wanted his daughter to study is at least Php20,000.

We proceed to the next lesson by asking learners what they will do in case the variable of interest cannot
be assumed to follow a normal distribution. Is there a way to test the hypotheses?

The answer to this question is: Yes, there is a way to do it but they must be assured that the sample size
is large enough to invoke the Central Limit Theorem they learned under the lesson on sampling
distribution of the sample mean. Let us say that for a given problem, a random sample of size 36 is
sufficient for us to invoke the theorem. Hence, we could restate the problem as follows. Notice that we
emphasize the change in the sample size to invoke the theorem.

The father of a senior high school student lists down the expenses he will incur when he sends
his daughter to the university, where he wanted her to study. He hypothesizes that the average tuition
fee is at least Php20,000 per semester. He knows the variable of interest, which is the tuition fee, is
measured at least in the interval scale or specifically in the ratio scale. He assumes that the variable of
interest follows a distribution with unknown population mean and variance. The father asks, at
random, 36 students of the university about their tuition fee per semester. He is able to get an average
of Php20,200 with a standard deviation of 400 pesos.

 In this problem, the appropriate null and alternative hypotheses remain the same as in the
previous lesson and are stated as follow:

Ho: The average tuition fee in the targeted university is at least Php20,000. In symbols, Ho: 20,000
pesos.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 54
Ha: The average tuition fee in the targeted university is less than Php20,000. In symbols, Ha:
20,000 pesos.

 With the assumption of unknown distribution of the variable of interest as well as its population
variance ( 2) but with a sample size large enough to invoke the Central Limit Theorem, the test
statistic, denoted as tC which was used earlier, is still appropriate to use. This test statistic is
̅
computed as tC = where ̅ and s, are the sample mean and sample standard deviation,

respectively, computed from a simple random sample of n observations; and is the


hypothesized value of the parameter. However, this time with the Central Limit Theorem, we can
assume that the test statistic follows the standard normal distribution which means the tabular
value in Z-table will be used as critical or tabular value. With this, the decision rule can be one of
the following possibilities:

1. Reject Ho if tC - . Otherwise, we fail to reject Ho.


2. Reject Ho if tC . Otherwise, we fail to reject Ho.
3. Reject Ho if tC l ⁄ , n-1. Otherwise, we fail to reject Ho.

For the problem, the first is the appropriate decision rule. Suppose the level of significance ( ) is set at
0.05, then the decision rule for the problem can be stated as “ Reject Ho if t C -Z0.05, = -1.645. otherwise,
we fail to reject Ho.” The rejection region is illustrated as follows:

MIT-Senior High School


STATISTICS AND PROBABILITY Page 55
MIT-Senior High School
STATISTICS AND PROBABILITY Page 56
III-Integration

Closure/synthesis
To check your learning from the activities you’ve done from exercises and drills. You have to
complete the sentence below. Write your answer on the line after doing the activities.
I learned that___________________________________________________________

Reflection guide
Formulate the appropriate null and alternative hypotheses on the average tuition fees of a college
students in MIT?
______________________________________________________________________________
__________________________________________________________________

CLOSING

IV-Intervention

Assignment 5:

1. A teenager who wanted to lose weight is contemplating on following a diet she read
about in the facebook. She wants to adopt it but, unfortunately, following the diet requires buying
nutritious, low calorie yet expensive food. Help her decide by stating a null hypothesis and an
alternative hypothesis.
_________________________________________________________________
_________________________________________________________________
_________________________________________________________________

Evaluation 5:

1. After senior high school , Lilifut is pondering whether or not to pursue a degree in
statistics. She was told that if she graduates with a degree in statistics, a life of fulfillment and
happiness awaits her. Assist her in making a decision.

_____________________________________________________________________
_____________________________________________________________________
_____________________________________________________________________

Study tip 5: Reading your notes is not studying, reading info is


not studying. Create your own study guide, quizzes for yourself,
pretend you’re teaching someone else. Use the information you’re
learning.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 57
V-POST ASSESSMENT

You have to read the questions comprehensively and choose the best answer among the
options. Just write the letter of your choice.
Hkk kk

_______1. A type I error is committed when


a. we reject a null hypothesis that is true.
b. we don’t reject a null hypothesis that is true.
c. we reject a ull hypothesis that is false.
d. we don’t reject a null hypothesis that is false.
_______2. Suppose we wish to test H0: versus H1: what will result if
we conclude that the mean is greater than 47 when its true value is really 52?
a. we have made a type I error.
b. we have made a type II error.
c. we have made a correct decision.
d. none of the above is correct.
_______3. if, as a result of a hypothesis test, we reject the null hypothesis when it is
false, then we have committed
a. a type II error.
b. a type I error
c. no error.
d. an acceptance error.
_______4. The owner of a local restaurant has recently surveyed a random sample of
n = 250 customers of the restaurant. She would now like to determine whether or
not the mean age of her customers is over 30. If so, she planned to provide
background music to appeal to an older appropriate hypotheses to test are:
a. H0 : versus H1
b. H0 : versus H1
c. H0 : ̅ versus H1 ̅
d. H0 : versus H1
_______5. A major telco is considering opening a new telecom center in an area that
currently does not have any such centers. The telco will open the center if there is
evidence that more than 5,000 of the 20,000 households in the area use the telco. It
conducts a poll of 300 randomly selected households in the area and finds that 96
subscribe to the telco. State the test of interest to the rental chain.
a. H0 : versus H1
b. H0 : versus H1
c. H0 : versus H1
d. H0 : versus H1

MIT-Senior High School


STATISTICS AND PROBABILITY Page 58
PRELIMINARIES

Module 2: Correlation and Regression Analysis


Time Allotment: Week 7-10

Module Overview

In this module, you will provide discussions on basic concepts and tools in exploring
relations between two variables. In addition, an activity is conducted to give you hands-
on experience in exploring relationships between two variables using a random sample
data to show the relationship of interest and finally, estimate the correlation coefficient.
After all you have interpreted their correlation, they pool their findings as a class to
explore the variability in the correlations found. You construct approximations to the
sampling distributions of the correlation coefficient and use the sampling distributions
of the correlation coefficient and use the sampling distributions to make assertions about
the values of the population parameters.

Learning Competencies:

At the end of this module, you are expected to:


1. Illustrate the nature of bivariate data;
2. Construct a scatter plot;
3. Describe shape(form), trend (direction), and variation (strength) of bivariate
relationships based on a scatter plot;
4. Estimate strength of association between two variables based on a scatter plot;
5. Calculate the Pearson’s sample correlation coefficient;
6. Solve problems involving correlation analysis;
7. Identifies the independent and dependent variables; and
8. Calculates the the slope and y-intercept of the regression line.

Instructional Materials:

 Oliva M. (2016). DIWA Senior High School Series: STATISTICS & PROBABILITY.
DIWA Learning Systems, Inc.
 1st edition REX BOOK Statistics and Probability Rene R. Belecina, Elisa S. Baccay,
Efren B. Mateo

Pre-Assessment:

Before we start our fun and thrilled lesson, I will test your prior knowledge first. Don’t worry!
This is just a simple test to know how far you know this topic.
Choose one of the following questions to explore:
a. Does your daily allowance (x) increase or decrease with the number of text messages
you send in a day (y)?

MIT-Senior High School


STATISTICS AND PROBABILITY Page 59
b. Does your daily allowance (x) increase or decrease with how happy you are (y)?
c. Does your weight (x) increase or decrease with your height (y)?

To answer the question you choose, you are going to take a random sample of 30
observations from the database collected at beginning of the Statistics and Probability course.

I-Introduction

1. Motivation
In module 1, you are guided through the basic tools used for describing
data pertaining to one variable. A number of variables are collected per
data item, such as information on an individual, household,
establishment, farm, country, etc.

II- Interaction (Learning Activities)

How to Generate the Scatterplot:

Before starting with the lesson, we may review the concepts of population,
sample, population parameter and sample statistic to reinforce the knowledge and
competence on these basic concepts.

Data you obtained from lesson 1 may be used to create a scatterplot and to
conduct a correlation analysis.

For this lesson, the population of interest being discussed is the population of
senior high school learner’s who have filled out the Activity Sheet Number 1 in lesson 1

The statistical questions of interest for the activity are:


1. Does your daily allowance (x) increase or decrease with the number of text message
you send in a day(y).
2. Does your daily allowance (x) increase or decrease with how happy you are (y)?
3. Does your weight (x) increase or decrease with your height (y)?
Your have to choose the statistical question you would like to focus on.

Suppose that we have the following snapshot of a sample data set:

Student # Sex Height Weight Daily Usual Happiness


(in meters) (in kg) allowance number of
in school text/day
1 F 1.64 40 0 10 5
2 F 1.52 50 50 7
3 F 1.52 49 0 5 5
4 F 1.65 45 150 18 9
5 F 1.02 60 0 4 5
6 F 1.626 45 0 60 7
7 F 1.5 38 200 20 7
8 F 1.6 51 100 6
9 F 1.42 42.2 500 200 9

MIT-Senior High School


STATISTICS AND PROBABILITY Page 60
10 F 1.52 54 0 15 4
11 F 1.48 46 100 10 8
12 F 1.62 54 20 2 4
13 F 1.5 36 0 25 6
14 F 1.54 50 0 30 7
15 F 1.67 63 0 60 9
16 M 1.72 55 200 80 8
17 M 1.65 61 0 30 5
18 M 1.56 60 50 1 6
19 M 52 250 80 8
20 M 1.7 90 0 30 4
21 M 1.53 50 250 100 9
22 M 1.62 90 100 0 6
23 M 1.79 80 100 6 7
24 M 1.57 58 50 0 7
25 M 1.7 68 20 0 4
26 M 1.77 27 100 8
27 M 1.478 50 300 55 7
28 M 1.727 94 100 50 7
29 M 1.56 66 50 5 6
30 M 1.75 50 0 3 5

MIT-Senior High School


STATISTICS AND PROBABILITY Page 61
MIT-Senior High School
STATISTICS AND PROBABILITY Page 62
Processing question:
1. An ESP experiment was done wherein a participant guesses which of the four cards
the researcher has randomly picked, where each card is equally likely. This is repeated for 100
trials. The null hypothesis is that the subject is guessing, while the alternative is that the subject
has ESP and can guesss at higher than the chance rate.

III- Integration

1. Closure Activity/Synthesis
Let’s check your learning from the activities you’ve done. You have to
complete the sentence below.
I learned that_____________________________________________________
____________________________________________________________
____________________________________________________________

2. Reflection guide
Try to reflect on your learning/s. Write your answer on your notebook.

Based on the question you chose, generate a scatterplot to create a visual


representation of data. Does the relationship appear to be linear?
__________________________________________________________________
__________________________________________________________________
__________________________________________________________________
__________________________________________________________________
__________________________________________________________________

CLOSING

IV-Intervention

1. Assignment
Create a dot plot for the correlation.

2. Evaluation
One more step to go!
Look at the dot plot that pertains to your question. This dot plot represents an
Look at the
approximation to the sampling distribution of the correlation coefficient. What do
you notice about the dot plot? What is the range of the correlation coefficient? What
seems to be the most common correlation? If you are to guess what the correlation
was for the entire population, what would be your guess? Explain

MIT-Senior High School


STATISTICS AND PROBABILITY Page 63
V- Post Assessment

Congratulations! You are at the final part of your module. Again, let’s challenge
your mind to know how much information it got. Hope you get it all correct!
Good luck!

1. A study was done to investigate the relationship between the amount


of protix ( anew protein-vitamin-mineral supplement) on fortified-vitamin rice,
known as FVR, and the weight gain of children. Ten randomly chosen sections
of grade one pupils were fed with FVR containing protix; different amount (x)
of protix were used for the ten sections. The increase in weight of each child was
measured after a given period. The average gain (y) in weight for each section
with a prescribed protix level (x) is as follows:

Section Protix Gain Section Protix Gain


1 50 92.6 6 100 106.2
2 60 97.5 7 110 108.9
3 70 96.5 8 120 108.4
4 80 102.3 9 130 110.2
5 90 105.8 10 140 110.8

a. Create a scatter diagram based on the data.

b. Does the scatter diagram suggest a linear relationship? What other


relationships may be tenable?

c. What is the correlation between the protix level and the average weight gain?

3. Study tips

Do not memorize words. Understand them.


- Pay attention to the ideas you’re learning, don’t try to
Remember what the words look like.

(If you know the concept, you’ll recognize it on the test no matter how it’s
worded)

MIT-Senior High School


STATISTICS AND PROBABILITY Page 64
Stay safe Everyone.

ALWAYS REMEMBER:
LAMANG ANG MAY ALAM.

MIT-Senior High School


STATISTICS AND PROBABILITY Page 65

You might also like