Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Chapter 8 Hypothesis Testing

Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

Chapter 8: Hypothesis Testing P a g e | 112

CHAPTER 8
HYPOTHESIS TESTING

Introduction

A statistical test of hypothesis is a fairly clear-cut procedure that enables an


experimenter to either reject or accept null hypothesis with measure risks .
The experimenter can control the risk of falsely rejecting the null hypothesis by
selecting an appropriate value of . On the other hand, the value of depends on the
sample size and the values of the parameter under the test that are of practical
importance to the experimenter. When this information is not available, an
experimenter may decide to select an affordable sample size, in the hope that the
sample will contain sufficient information to reject the null hypothesis. The chance
that this decision is in error is given by , whose value has been set in advance. If
the sample does not provide sufficient evidence to reject , the experimenter may
wish to state the results of the test as “The data do not support the rejection of ”
rather than accepting the without knowing the chance of error .

A hypothesis is a conjectural statement of the relation between two or more


variables. Hypotheses are always declarative sentence form, and they relate either
generally or specifically.

According to Deveza (1996), the following are the criteria of “good” hypotheses and
hypotheses statements:

1. Hypotheses are statements about the relations between variables.

2. Hypotheses carry clear implications for testing the stated relations.

These two criteria mean that hypothesis statements contain two or more variables
that are measurable or potentially measurable and that they specify how the
variables are related.

A research hypothesis is a predictive statement that represents a very specific


proposed answer to the problem statement. Hypotheses should be stated in an
empirically testable form, and should indicate the specific nature of the connection
between or among variables that was initially identified in the problem statement.

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 113

Types of Hypothesis

1. Null Hypothesis is primarily used for statistical purposes. It is the hypothesis


to be tested and it represents what the investigation doubts to be true.

2. Alternative Hypothesis states the nature of the connection between or


among variables that we expect on theory and/or prior research. In other words, it
is the operational statement of the theory that the experimenter or researcher
believes to be true and wishes to prove.

The alternative hypothesis is a statement that contradicts or disagrees with the null
hypothesis. If the null hypothesis is rejected, the conclusion is reflected by the
alternative hypothesis. If there is no enough statistical evidence to reject the null
hypothesis, it is retained, but not accepted. In this case, the null hypothesis is not
necessarily true, but it just cannot be rejected from the current evidence from the
sample data.

Example 1

If we are going to test the correlation of Physics and Mathematics subjects, we


formulate the null and alternative hypotheses as follows:

: There is no correlation between the Physics and Mathematics subjects

or : Physics and Mathematics education subjects are not correlated.

: There is a correlation between the Physics and Mathematics subjects

or : Physics and Mathematics education subjects are correlated.

Example 2

A computer shop receives a delivery of a big number of laptops from a salesman.


The delivery is regarded as satisfactory if the percentage of defective laptops does
not exceed 3%(i.e., ); otherwise, it is regarded as unsatisfactory. The
percentage of defective sets is an unknown parameter hence, the sampling
inspection is done to decide whether the delivery is to be accepted or rejected. The
following portray the hypotheses of this problem.

: The delivery is satisfactory; i.e.,

: The delivery is satisfactory; i.e.,

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 114

Directional and Nondirectional Tests

A test of hypothesis is categorized into two:

a) One-tailed test is also known as the directional test. It occurs when researchers
do not have a priori expectation about the ample value they expect to observe. This
is usually done if the researcher just want to test whether a mean is larger or
smaller the hypothesized population mean.

Figure 8.1
Region of Rejection and Acceptance in One-tailed Test

b) Two-tailed test is also known as the nondirectional test of a hypothesis. It is a


test where alternative hypothesis does not specify a directional difference for the
parameter of interest. It is appropriate when researchers do not have a priori
expectation regarding the value they expect to observe in the sample. The
researchers would simply want to see if the sample mean is different from the
hypothesized population in either direction.

Figure 8.2
Region of Rejection and Acceptance in Two-tailed Test
Statistics and Probability by CMDAncheta and SMAGalman
Chapter 8: Hypothesis Testing P a g e | 115

Level of Confidence

In hypothesis testing, the conclusion reached is not based on certainty, but


on a probability estimate. This probability estimate or risk of committing an error in
the conclusion is determined by the researcher. This is known as the alpha level.

If we set the alpha level at 5% , then we are taking a 5% risk of


being wrong and 95% risk of being right. That is, we have a probability error of 5%
that our conclusion is wrong.

Furthermore, the level of significance determines the critical value of the test
statistic. This value serves as the reference point for rejecting or not rejecting a null
hypothesis. In a two-tailed test, the level of significance is given by . For instance, if
the indicated level of significance is 5% and the test is a two-tailed test, the level of
significance would be determined by 2.5%.

Types of Errors

Whichever hypothesis is correct. But there is a chance that sampling error


will cause a researcher to reject the hypothesis. We may reject the null hypothesis
that is true or we may accept a null hypothesis that is false. These two types of
mistakes in hypothesis testing are called type I and type II error, respectively.

Steps in Hypothesis Testing

The logical steps that should be followed in order to arrive at an objective


and meaningful result are the following:

1. State the null hypothesis .

2. Determine the measurement level of the variables under study. The level of
measurement of the variable determines to a large extent the statistical test the
hypothesis. Parametric tests are generally used for interval and ratio data of
measurement while nonparametric tests are usually applied to nominal and ordinal
types.

3. Choose the statistical test appropriate to test the hypothesis.

4. Specify a significance level and the sample size.

5. Compute the value of a statistical test.

6. Determine the significance of the computed value.

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 116

7. Interpret and discuss the result.

8.1 z-test

This section aims to:

1. determine the conditions when to use the z-test; and

2. test hypothesis using z-test.

This parametric test is used when the given data is population mean and
population standard deviation and . The formula is given by

Where:

Use z test when:

 Your sample size should be greater than 30. Otherwise, use a t test.
 Data points should be independent from each other.
 Your data should be normally distributed. However, for large sample sizes
(over 30) this doesn’t always matter.
 Your data should be randomly selected from a population, where each item has
an equal chance of being selected.
 Sample sizes should be equal if at all possible.

Example

In a recent survey of Information Technologists in Nueva Ecija, it was found out that
the average monthly net income of IT experts is P12, 542. Suppose a researcher
wants to test this figure by taking a random sample of 165 IT experts in the province
to determine whether the monthly net income has changed. Suppose further that

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 117

the average net monthly income of 165 IT experts is P13,861 and the population
standard deviation was found out to be P1,459. Test the significance at 5% level.

Solution:

Following the steps in hypothesis testing,

Step 1. The average monthly net IT experts is still P12, 542

The average monthly net IT experts is no longer P12, 542

Step 2. two-tailed test (nondirectional)

Step 3. z-test for a single mean

Step 4. Reject if the computed z-value is greater than 1.96 or less than -1.96

Step 5. Given :

Step 6. Reject since .

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 118

Step 7. Interpretation: The average monthly income of IT experts is no longer


P12,542.

Name: ___________________________________________________________ Date: _____________________

Section: _________________________________________________________ Score: ____________________

Exercises 8.1
z-test

1. In the population, the average IQ is 100 with a standard deviation of 15. A team of
scientists wants to test a new medication to see if it has either a positive or negative
effect on intelligence, or no effect at all. A sample of 30 participants who have taken
the medication has a mean of 140. Did the medication affect intelligence, using alpha
= 0.05?

2. The mean Verbal SAT score for the population of first students at NEUST is 520.
The standard deviation of scores in this population is 95. An investigator believes
that the mean Verbal SAT of first year psychology majors is significantly different
from the mean score of the population. The mean of a sample of 36 first year
psychology majors is 548. Please test the investigator's prediction using an alpha
level of .05.

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 119

3. The mean number of close friends for the population of people living in the
Philippines. is 5.7. The standard deviation of scores in this population is 1.3. An
investigator predicts that the mean number of close friends for introverts will be
significantly different from the mean of the population. The mean number of close
friends for a sample of 26 introverts is 6.5. Do these data support the investigator's
prediction? Use an alpha level of .05.

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 120

8.2 t-test

This section aims to:

1. determine the conditions when to use t-test; and

2. test hypothesis using t-test.

Use t-test when the sample variance is known instead of the population
variance and the sample is equal to or less than 30.

The formula for computing the t-value is given by

Where:

Example
A researcher is interested to find out whether the mental ability of the children in a
large community inhabited mostly by farmers in an agricultural area differs from
that of the population. He took a sample of 28 elementary and high school students
from the school in the community. Using a standardized IQ test, he found the mean
equal to 95 and the standard deviation equal to 14. He set the alpha level at 0.05. the
population mean is 100.

Solution:

Step 1. There is no significant difference between the mean IQ of the children


in the community and that of the population.
Statistics and Probability by CMDAncheta and SMAGalman
Chapter 8: Hypothesis Testing P a g e | 121

There is a significant difference between the mean IQ of the


children in the community and that of the population.

Step 2.variable under study: mental ability (interval)

Step 3. t-test (nondirectional)

Step 4.

Hence, the critical values of t at the 0.05 level of confidence with 29 degrees of
freedom is given by

Step 5. Given :

14

Step 6. Reject since the .

Step 7. Interpretation: There is a significant difference between the mean IQ of the


children in the community and that of the population.

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 122

Name: ___________________________________________________________ Date: _____________________

Section: _________________________________________________________ Score: ____________________

Exercises 8.2
t-test

1. A certain brand of bath soap is advertised to have a net weight of 200 grams. If the
net weights of 10 boxes are 195, 155, 175, 210, 225, 150, 145, 190, 215, and 220
grams, can it be concluded that the average net weight of the boxes is less than the
advertised amount?

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 123

2. The daily wages of factory workers in the country today is assumed as normally
distributed with a mean of P190.00. Factory ABC employs 25 workers and pays an
average of P180.00 a day with a standard deviation of P23.50. Can factory ABC be
accused of paying inferior wages to its employees at 0.05 level of significance.

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 124

8.3 One Factor Analysis of Variance (ANOVA)

This section aims to:

1. determine the conditions that guarantee the use of ANOVA; and

2. test hypothesis using ANOVA.

The Analysis of Variance (ANOVA) is a method for dividing the variation


observed into different parts, each part assignable to a known source, course, or
factor (Punzalan, 1995). It is used when we wish to test the significance of the
difference between two or more means obtained from independent samples. It is a
parametric test so that the same assumptions ascribed for a parametric test are
made:

1) random solution of subjects from normal population with equal variances;

2) samples or groups are independent; and

3) data being analyzed must be interval.

One-way ANOVA or one-factor ANOVA is called such because there is only


one factor being studied as the independent variable.

1) Within groups Sum of Squares

Where is a deviation score .

2) Between groups Sum of Squares

Where:
Statistics and Probability by CMDAncheta and SMAGalman
Chapter 8: Hypothesis Testing P a g e | 125

3) Total Sum of Squares

4) Mean Square (MS)

Where:

Where:

Degrees of Freedom

Where:

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 126

5. The F-test or ratio

Example

Test the hypothesis that teaching aptitude varies by academic performance. Data
are as follows.

Above Average Above Average Below Average

95 86 78
91 87 76
88 87 78
96 84 79
92 81 72

Solution:

Procedure:

1) : The Above Average, Average and Below Average teaching aptitude


groups do

not differ with respect to academic performance.

: The Above Average, Average and Below Average teaching aptitude


groups

differ with respect to academic performance.

2) level of significance = 0.05, critical value of F-ratio:

3) ANOVA (F-test)

4) Computation:

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 127

4.1 Computing the mean:

85

Thus,

Above Average Average Below Average

95 2.6 6.76 86 1 1 78 1.4 1.96


91 -1.4 1.96 87 2 4 76 -0.6 0.36
88 -4.4 19.36 87 2 4 78 1.4 1.96
96 3.6 12.96 84 -1 1 79 2.4 5.76
92 -0.4 0.16 81 -4 16 72 -4.6 21.16
41.2 26 31.2

4.2 Compute

4.3 Compute

4.4 Total Sum of Squares

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 128

4.5 Degrees of Freedom

4.6 Compute the Mean Square (MS)

4.7 compute the F-ratio

5. Decision: Since the computed F-ratio of 5.18 is greater than the critical F value of
3.88 for , the null hypothesis is rejected. It is significant at 0.05 level of
confidence.

Summary Table for One-Factor ANOVA of the Academic Performance of the


Three Sample Groups

Source Sum of Degrees Mean Compute Tabula Decisio Interpretatio


of Square of Square dF r n n
Variatio s Freedo s F(0.05
n m )
Betwee 624.6 2 312.45 5.18 3.88 Reject Significant
n
Column
Within 98.4 12 60.28
Column
Statistics and Probability by CMDAncheta and SMAGalman
Chapter 8: Hypothesis Testing P a g e | 129

6. Interpretation

The significant F-ratio reveals the rejection of null hypothesis. Thus the there is
enough statistical evidence to accept the alternative hypothesis. This means that the
Above Average, Average and Below Average teaching aptitude groups differ with
respect to academic performance. The mean academic performance of the above
average group is the highest while the mean academic performance of the below
group is the lowest. Thus it can be said that the academic performance varies on
their teaching aptitude.

Name: ___________________________________________________________ Date: _____________________

Section: _________________________________________________________ Score: ____________________

Exercises 8.3
Analysis of Variance (ANOVA)

1. The table below shows the test scores of the top five students in Statistics. Use the
analysis of variance to test the hypothesis that the mean scores are the same for five
students at 5% level of significance.

Test Scores of Students in Statistics Exam

Exam Student
A B C D E
1 24 29 27 23 22
2 27 22 34 22 34
3 25 25 33 17 23
4 31 27 32 21 25
5 36 28 31 20 26

Statistics and Probability by CMDAncheta and SMAGalman


Chapter 8: Hypothesis Testing P a g e | 130

2. In an experiment, five groups of nursing students were randomly assigned to


each of the three popular brands of shampoo: Brand A, B and C. Bacterial count after
1 hour of using the shampoo was recorded as shown in the table below. Find if
there’s a significant difference among the different brands of shampoo using ANOVA
at 1% level of significance.

Statistics and Probability by CMDAncheta and SMAGalman

You might also like