Ch5 MMW BSN
Ch5 MMW BSN
Ch5 MMW BSN
Disclaimer
Introduction
A test of hypothesis is a method of using sample data to decide between two
competing claims (hypotheses) about a population characteristic. It concerns itself
with the decision-making rules for choosing alternatives while controlling and
minimizing the risks of wrong decisions.
Once a hypothesis has been formulated, we need a method for evaluating the
sample data to determine whether H o should be accepted or rejected. The decision
rule that we use for this purpose is called a test procedure.
The hypothesis testing procedure is simple a decision rule that specifies for
every possible value of a statistic observable in a simple random sample of size n
whether the null hypothesis should be rejected or accepted. The set of possible
values of the sample statistic is referred to as the sample space. Therefore, the test
procedure divides the sample space into mutually exclusive parts called the
acceptance region and the rejection (or critical) region.
In carrying out a test of null hypothesis versus alternative hypothesis, the null
hypothesis will be rejected in favor of the alternative hypothesis only if the sample
evidence strongly suggests that Ho is false. If the sample does not contain
such evidence, Ho will not be rejected.
Two basic types of decision problems can be considered for hypothesis testing
procedures. The first problem is one in which we want to know whether the
population parameter has changed or differs from a particular value. Here, we are
interested in detecting whether the population parameter is either larger or smaller
than a particular value. The second type of hypothesis test is one in which we want to
find out whether
1) the sample came from a population that has a parameter less than the
hypothesized value, or 2) from a population that has a parameter more than a
hypothesized value.
These situations give rise to a one-tailed test. The type of test in which we want to
determine whether a population parameter has changed is referred to as two-tailed
test, since the null hypothesis can be rejected by observing a statistic that falls in
either of the two tails of the sampling distribution.
In summary, a one-tailed test of hypothesis is a test where the
alternative hypothesis specifies a one-directional difference from the parameter of
interest while a two-tailed test specifies a difference that can fall in either of the
two tails of the
distribution. In other words, based on the alternative hypothesis, the one-tailed test is
directional, while the two-tailed test is non-directional.
What is a Hypothesis?
A hypothesis is a claim or statement about the population parameter
Examples of parameters are population mean and population
proportion
The parameter must be identified before analysis
Example of Hypothesis
1. The mean body temperature for patients admitted to elective surgery is
not equal to 370C.
The parameter of interest here is μ which is the mean body temperature for
patients admitted to elective surgery.
2. The proportion of registered voters in Quezon City favoring Candidate A
exceeds 0.60.
The parameter of interest here is p which is the proportion of registered voters
in Quezon City favoring Candidate A.
Null Hypothesis
denoted by Ho
the statement being tested
it represents what the experimenter doubts to be true
must contain the condition of equality and must be written with the symbol =,
≤, or ≥.
The null hypothesis corresponding to the common belief that the mean body
temperature is 37°𝐶 is expressed as
Ho: 𝜇 = 37°𝐶
We test the null hypothesis directly in the sense that we assume it is true and
reach a conclusion to either reject Ho or fail to reject Ho.
Alternative Hypothesis
denoted by Ha
Is the statement that must be true if the null hypothesis is false
the operational statement of the theory that the experimenter believes to be true and
wishes to prove
Is sometimes referred to as the research hypothesis
Solution:
a. 𝐻�: 𝜇 = 14 and 𝐻𝑎: 𝜇 > 14
b. 𝐻�: 𝜇 = 18 and 𝐻𝑎: 𝜇 < 18
c. 𝐻�: 𝜇1 = 73 and 𝐻𝑎: 𝜇1 ≠ 73
Some Notes:
In this context of trying to support the goal of the research, the alternative hypothesis
is sometimes referred to as the research hypothesis.
Also in this context, the null hypothesis is assumed true for the purpose of conducting
the hypothesis test, but it is hoped that the conclusion will be rejection of the
null hypothesis so that the research hypothesis is supported.
There are two kinds of errors that can be made in significant testing. The type I error
(alpha error), considers a conclusion that is drawn stating that the null hypothesis FALSE
when in fact it is TRUE. This means that a true hypothesis can be incorrectly rejected. This
probability of a Type I error is called the significance level and is usually set by a researcher.
On the other hand, the type II error (beta error) occurs when the hypothesis is not rejected
when in fact it is false.
True State of Null Hypothesis
Statistical Decision
Ho: True Ho: False
Reject Ho Type I Error Correct
Do not reject Ho Correct Type II Error
Level of Significance
The level of significance is the maximum probability of committing a type I error.
This
probability is symbolized by α (Greek letter alpha). That is, P(type I error)=α.
Statisticians generally agree on using three arbitrary significance levels: the 0.10,
0.05, and 0.01 levels. That is, if the null hypothesis is rejected, the probability of a
type I error will be 10%, 5%, or 1%, depending on which level of significance is used.
Here is another way of putting it: When 𝛼 = 0.10, there is a 10% chance of rejecting a
true null hypothesis; when 𝛼 = 0.05, there is a 5% chance of rejecting a true
null
hypothesis; and when 𝛼 = 0.01, there is a 1% chance of rejecting a true null
hypothesis.
Test Statistic
The critical or tabular value for a hypothesis test is a threshold to which the
value of the test statistic in a sample is compared to determine whether or not the null
hypothesis is rejected. The critical value can be on the right side of the mean or on
the left side of the mean for a one-tailed test. Its location depends on the inequality
sign of the alternative hypothesis. For example, in situation A, where the
alternative hypothesis is 𝐻𝑎: 𝜇 > 14. Since the inequality sign is >, the null
hypothesis will be
rejected only when the sample mean is significantly greater than 14. Hence, the critical
value must be on the right side of the mean.
The critical region or rejection region is a set if values if the test statistic for which the
null hypothesis is rejected in a hypothesis test; that is, the sample space for the test statistic
is partitioned into regions. The critical region will lead us to reject the hull hypothesis and
accept the other region represented by the alternative hypothesis. Therefore, if the observed
values of the test statistic is a member of the critical region, we conclude “reject Ho.” If it is
not a member of the critical region, then we can conclude “do not reject Ho.”
The p-value - the smallest level of significance at which Ho will be rejected based on
the information contained in the sample.
An alternative form of decision rule (based on the p-value) is to reject Ho if the p-
value is less than or equal to the level of significance (𝛼).
p-value Decision
0.01 Reject Ho
0.05 Reject Ho
0.10 Do not reject Ho
Figure 1
Procedure for
Finding p value
*Important Note:
Accept/Fail to Reject
A few textbooks continue to say “accept the null hypothesis” instead of “fail to reject
the null hypothesis.” The term accept is somewhat misleading, because it seems to imply
incorrectly that the null hypothesis has been proved, but we can never prove a null
hypothesis. The phrase fail to reject says more correctly that the available evidence isn’t
strong enough to warrant rejection of the null hypothesis. In this text we use the terminology
fail to reject the null hypothesis, instead of accept the null hypothesis.
Figure 2
Wording of Final Conclusion
Exercise 5A
1. Define null and alternative hypotheses, and give an example of each.
2. What symbols are used to represent the null hypothesis and the alternative
hypothesis?
3. What is meant by a type I error? A type II error? How are they related?
4. In hypothesis testing, why can’t the hypothesis be proved true?
5. For each conjecture, state the null and alternative hypotheses.
a. The average age of community college students is 24.6 years.
b. The average age of attorneys is greater than 25.4 years.
c. The average score of high school basketball games is less than 88.
d. The average pulse rate of male marathon runners is less than 70 beats
per minute.
e. The average weight loss for a sample of people who exercise 30 minutes
per day for 6 weeks is 8.2 pounds.
6. Given the following value of alpha (𝛼), decide whether to reject or do not reject the
null hypothesis (Ho)
If the sample size is 30 or morte sample units, we shall use the z-test formulas, as
follows:
A. z-Test using One-sample Mean
( 𝑥 −𝜇 ) √�
𝑧= 𝑠
𝑥 1 −𝑥 2
𝑧=
𝑠 2 𝑠 2
√ 𝑛1 + 2
1 𝑛2
Example:
A manufacturer claims that the average life of batteries used in their electronic games
is 150 hours. It is known that the standard deviation of this type of battery is 20 hours. A
consumer wishes to test the manufacturer’s claim and accordingly tests 100 electronic
games using this battery and found out that the mean life is equal to 169 hours. Test the
hypothesis by using a level of significance 𝛼 = 5%.
Solutions:
Given:
𝜇 = 150 hours 𝑥 = 169 hours
� = 20 hours 𝑛 = 100 electronic games
Since the population standard deviation is known and the sample size is greater than
30, the z-test will be used.
To have a better understanding of the application of this test, we shall present both
the one-tailed and two-tailed tests.
A. One-Tailed Test Using One-Sample Mean
Step 5. Decision Rule: Reject Ho if computed z-value is greater than or equal to the tabular
value. Otherwise, do not reject.
Decision: Since the computed value is greater than the tabular value, we reject the
null hypothesis.
Step 6. Conclusion: The manufacturer’s claim that the average life of its batteries at 150
hours is not true at 𝛼 = 0.05.
z-test using Two-Sample Means
Example:
A bank is opening a new branch in one of two neighborhoods. One of the
factors considered by the bank was whether the average monthly family income (in thousand
pesos) in the two neighborhoods differed. From census records, the bank drew two random
samples of 100 families each and obtained the following information:
Neighborhood
Sample A Sample B
𝑥1 = 10,800 𝑥2 = 10,300
�1 = 300 �2 = 400
𝑛1 = 100 𝑛2 = 100
The bank wishes to test the null hypothesis that the two neighborhoods are the same
mean income. What should the bank conclude? Test the hypothesis using 𝛼 = 0.05.
Solutions: For the purpose of discussion, we present both the one-tailed and the two-tailed
tests in solving the problem.
10 ,800 −10,300
=
3002 4002
√ 100 + 100
500
=
√900+1,600
500
= 50
Computed 𝑧 = 10
Step 5. Decision: Since the computed value is greater than the tabular value, we reject the
null hypothesis.
Step 6. Conclusion: The average family income of neighborhood A is higher than that
of
neighborhood B at 𝛼 = 0.05.
10 ,800 −10,300
=
3002 4002
√ 100 + 100
500
= 900+1,600
√
500
= 50
Computed 𝑧 = 10
Step 5. Decision: Since the computed value is greater than the tabular value, we reject the
null hypothesis.
Step 6. Conclusion: The two neighborhoods, A and B, do not have the same average family
income at 𝛼 = 0.05.
When the sample size is less than 30 sample units, we shall use the t-test formulas,
as follows.
Example:
The personnel department of a company developed an aptitude test for a certain
group of semi-skilled workers. The individual test scores were assumed to be normally
distributed. The department asserted a tentative hypothesis that the arithmetic mean grade
obtained by
this group of semi-skilled workers is 100. It was agreed that this hypothesis would be
subjected to a two-tailed test at 5% level of significance. The aptitude test was given to a
sample of 16 semi-skilled workers and the results are:
𝑥 = 94 �=5 𝑛 = 16 𝜇 = 100
Is the personnel department’s tentative hypothesis correct at 𝛼 = 5%?
Solution: Since the sample size is less than 30 units, we shall use the t-test using one-
sample mean.
Step 1. State the null and alternative hypothesis.
Ho: The arithmetic mean obtained by the semi-skilled workers is 100.
(𝐻�: 𝜇 = 100)
Ha: The arithmetic mean obtained by the semi-skilled workers is not equal to 100.
(𝐻𝑎: 𝜇 ≠ 100)
Step 2. Specify the level of significance. Use 𝛼 = 0.05.
Step 3. Determine the test statistic to be used and the degrees of freedom (df)
�� = 𝑛 − 1 (for one-sample mean)
�� = 𝑛1 + 𝑛2 − 2 (for two groups of samples)
The t-test using one-sample mean shall be used since 𝑛 = 16 (𝑛 < 30). At 𝛼 = 0.05,
and �� = 𝑛 – 1 = 16 – 1 = 15, the t-tabular value is ±2.131 for a two-tailed test.
Step 4. Compute the value of t.
( 𝑥 −𝜇 ) √�
Computed � = 𝑠
( 94−100 )
=
√16
5
−6(4)
=
5
Computed � = −4.80
At 0.05 level of significance, can we conclude that cars equipped with radial tires and
belted tires provide the same fuel consumption? Assume that the data are normally
distributed.
Solution:
Step 1. State the null and alternative hypothesis.
Ho: There is no significant difference on the fuel consumption of cars using radial and
belted tires.
(𝐻�: 𝜇1 = 𝜇2)
Ha: There is no significant difference on the fuel consumption of cars using radial and
belted tires.
(𝐻𝑎: 𝜇1 ≠ 𝜇2)
Step 2. Specify the level of significance. Use 𝛼 = 0.05.
Step 3. Determine the test statistic to be used and the degrees of freedom (df).
The t-test using two-sample mean shall be used since 𝑛1 and 𝑛2 are both 12 , (n < 30)
for a two-tailed test. At 𝛼 = 0.05 using the degrees of freedom of 22 based on �� = 𝑛1 +
𝑛2 – 2 = 12 + 12 – 2 = 22, we obtain the t-tabular value of ±2.074 for a two-tailed test.
𝑥 1 −𝑥 2
Computed � =
[ 𝑠1 2 (𝑛1 −1 )] +[ 𝑠2 2 ( 𝑛2 1 1
√[ −1 )] ][ + ]
[𝑛1+𝑛2]−2 𝑛1 𝑛2
5 .75−5 .61
=
( )2 2
√[ [ 1.10 (12−1)]+[(1.3) (12−1)] ][ 112+ 112]
[12+12]−2
0.14
= 13. 31+ 18 . 59 1
√[ 22 ][6]
0.14
=
√[ 31.90] [ 1]
22 6
0.14
=
√(1.45)(0.1667)
0.14
=
√0.2417
0.14
=
0.4916
Computed � = 0.2848
Step 5. Decision
Since the |computed � = 0.2848| < |����𝑙�� ��𝑙�� = ±2.131|,
we shall not reject the null hypothesis.
Step 6. Conclusion
There is no significant difference on the fuel consumption of cars using the
radial tires and the belted tires at 𝛼 = 0.05, with �� = 22.
Exercise 5B
A. z-test Using One-Sample Mean
2. In a particular community, it is claimed that the mean household water usage for a
particular month is 45 cubic meters. The following year, a countrywide water
conservation campaign was conducted. Fifty homes were randomly selected
and found that the mean consumption is 47 cubic meters with a standard deviation of
2.5 cubic meters. Is there enough evidence to accept the claim that the mean
household water usage per month is 45 cubic meters at 0.01 level.
Work Schedule 𝑥 s n
Fixed Shift 7.29 3.12 49
Rotating Shift 6.08 2.96 42
Gender 𝑥 s n
Men 20 35.5 11.9
Women 25 36.6 12.3
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes