Testing of Hypothesis
Testing of Hypothesis
Testing of Hypothesis
STATISTICS
Population:
Sample:
A sample is proportion of the population that is studied to learn about the characteristics of the
population.
Random sample:
A random sample is one in which each item of a population has an equal chance of being
selected.
Sampling:
Sample size:
The number of items selected in a sample is called the sample size and it is denoted by ‘n’. If
n 30, the sample is called large sample and if n 30, it is called small sample
Sampling distribution:
Consider all possible samples of size’ n’ drawn from a given population at random. We calculate
mean values of these samples.
If we group these different means according to their frequencies, the frequency distribution so
formed is called sampling distribution.
The statistic is itself a random variate. Its probability distribution is often called sampling
distribution.
All possible samples of given size are taken from the population and for each sample, the statistic
is calculated. The values of the statistic form its sampling distribution.
Standard error:
The standard deviation of the sampling distribution is called the standard error.
Notation:
Note
Statistic S.E (Standard Error)
x
n
Difference of sample 1 1
p1' p2 ' pq
proportions n1 n2
2 2
Difference of sample 1 2
x1 x2
means n1 n2
pq
p' (Sample proportion)
n
Null Hypothesis ( H 0 )
The hypothesis tested for possible rejection under the assumption that it is true is usually called
null hypothesis. The null hypothesis is a hypothesis which reflects no change or no difference. It is
usually denoted by H 0
Alternative Hypothesis ( H1 )
The Alternative hypothesis is the statement which reflects the situation anticipated to be correct if
the null hypothesis is wrong. It is usually denoted by H1 .
For example:
If H0 : 1 = 2 (There is no diff' bet' the means) then the formulated alternative hypothesis is
H1 : 1 2
Level of significance
It is the probability level below which the null hypothesis is rejected. Generally, 5% and 1% level
of significance are used.
The shaded portion in the following figure is the critical region which corresponds to 5% LOS
The sample values of the statistic beyond which the null hypothesis will be rejected are called
critical values or significant values
Level of significance
Types of test 1% 5% 10%
When two tails of the sampling distribution of the normal curve are used, the relevant test is
called two tailed test.
When only one tail of the sampling distribution of the normal curve is used, the test is described
as one tail test H1 : 1 2 (or) 1 2
H0 1 2
two tailed test
H1 1 2
1. Formulate H 0 and H1
2. Choose the level of significance
3. Compute the test statistic Z, using the data available in the problem
4. Pick out the critical value at % level say Z
5. Draw conclusion: If |Z| < Z , accept H0 at % level. Otherwise reject H0 at % level
Let {x1, x 2 ,. . . .xn } be a sample of size (n 30) taken from a population with mean and
S.D . Let x be the sample mean. Assume that the population is Normal.
To test whether the difference between Population mean and sample mean x is significant or
not and this sample comes from the normal population whose mean is or not.
H0 : = a specified value
H1 : a specified value
x x
Z= N(0.1) for large n.
S .E ( x)
n
Note:
s
1. If is not known, for large n, S.E x = where 's' is the sample S.D
n
Problems:
1. A sample of 900 members is found to have a mean 3.5cm. Can it reasonably regarded as a simple
sample from a large population whose mean is 3.38 and a standard deviation 2.4cm?
Solution:
We formulate the null hypothesis that the sample is drawn from population whose mean is
3.38cm.
i.e., H 0 : = 3.38
H1 : 3.38
x
Test statistic Z =
3.5 3.32
Z= 1.5
2.4
900
Critical value:
Conclusion:
2. A manufacturer claims that his synthetic fishing line has a mean breaking strength of 8kg and S.D
0.5kg. Can we accept his claim if a random sample of 50 lines yield a mean breaking of 7.8kg. Use 1%
level of significance.
Solution:
We formulate H 0 : =8
H1 : 8
L.O.S = 0.01
x
Test statistic Z =
7.8 8
Z= 2.828
0.5
50
|Z| = 2.828
Critical value:
Conclusion:
3. A random sample of 200 Employee’s at a large corporation showed their average age to be 42.8 years,
with a S.D of 6.8 years. Test the hypothesis H 0 : = 40 versus H1 : > 40 at = 0.01 level of
significance.
Solution:
We set up H 0 : = 40
H1 : 40
L.O.S = 0.01
x
Test statistic Z =
Critical value:
Conclusion:
Type - II:
Consider two samples of sizes n1 and n2 taken from two different populations with population
means 1 and 1 and S.D's 1 and 2
Let x1 and x2 be the sample means and S1 and S2 be the S.D's of the samples
H0 : 1 = 2
H1 : 1 2
x1 x2
Z=
S .E ( x1 x2 )
x1 x2
ie., Z = N(0,1)
2 2
1 2
n1 n2
In many situations, we do not know the S.D’s of the populations (or) population from which the
samples are drawn.
In such cases, we can subs the S.D’s are of samples S1 and S2 in place of 1 and 2
x1 x2
The test statistic Z =
s12 s2 2
n1 n2
Problems
The mean of two sample large samples of 1000 and 200 members are 67.5 inches and 68 inches
respectively. Can the samples be regard as drawn from the population of standard deviation of 2.5 inches?
Test at 5% Los
Solution
we set up H0 : 1 2
H1: 1 2
x1 x2
The test statistic Z =
1 1
n1 n2
67.5 68
Z= 5.164
1 1
2.5
1000 2000
|Z| = 5.164
Conclusion:
2. Samples of students were drawn from two universities and from the weights is kilogram. The
means and S.D’s are calculated. Test the significance of the difference between the means of two
samples
Mean S.D Sample Size
University A 55 10 400
University B 57 15 100
Solution:
we set up H0 : 1 2
H1: 1 2 ; = 0.05
x1 x2
The test statistic Z = `
s12 s2 2
n1 n2
55 57
Z= 1.265
102 152
400 100
|Z| = 1.265
Critical value:
The table values of Z at 5% Los is Z = 1.96
Conclusion:
Since |Z| < 1.96, H 0 is accepted at 5% Los. We conclude that the difference between the means is
not significant.
3. The average hourly wage of a sample of 150 workers is plant A was Rs. 2.56 with a S.D of
Rs.1.08. The average wage of a sample of 200 workers in plant B was Rs. 2.87 with a S.D of Rs.
1.28. Can an applicant safety assume that the hourly wages paid by plant B are greater than those
paid by plant A?
Solution:
Let x1 and x2 denote the hourly wages paid to workers in plant A and plant B respectively.
= 0.05
x1 x2
Z =
s12 s2 2
n1 n2
2.56 2.87
Z= 2.453
(1.08)2 (1.28) 2
150 200
|Z| = 2.453
Critical value:
The hourly wage paid by Plant B are greater than those paid by Plant A
4. A sample of size 30 from a normal population yielded 80 and variance 150. A sample of
size 40 from a second normal population yielded the sample mean 71 and variance 200.
Test H 0 : 1 2 2. Versus H1 : 1 2 2
Solution:
H0 : 1 2 2.
x1 x2 1 2
Test Statistic Z =
s12 s2 2
n1 n2
(80 71) 2
Z= 2.215
150 200
30 40
Critical value:
Conclusion:
5. A buyer of electric bulbs purchases 400 bulbs; 200 bulbs of each brand. Upon testing these
bulbs be found that brand A has an average of 1225 hrs with a S.D of 42 hrs. where as brand B
had a mean life of 1265 hrs with a S.D of 60 hrs. Can the buyer be certain that brand B is
Superior than brand A in quality?
Solution:
H0 : 1 2 ;
L.o.s : = 0.05
x1 x2 x1 x2
Test Statistic Z =
S .E x1 x2 s12 s2 2
n1 n2
1225 1265 40
Z= 7.72
(42) 2 (60) 2 5.18
200 200
| Z | = 7.72
Critical value:
Conclusion:
Type - III:
p: population proportion.
p' p
Test statistic Z =
pq
n
Problems
1. A coin is tossed 400 times and it turns up head 216 times. Discuss whether the coin may be
regarded as unbiased one.
Solution
1 1
ie., p = q = 1- p =
2 2
H1 : coin is biased
= 0.05
p' p
Test statistic Z =
pq
n
216
Here p' ;n 400
400
0.54 0.5
Z= 1.6
1
600
Conclusion:
Solution:
1
we set up H0 : p =
2
1
H1 : p
2
p' p
Test statistic Z =
pq
n
280
Here p' 0.56; n 500; p=0.5
500
q = 1-p = 0.5
0.56 0.5
Z= 2.68
0.5 0.5
500
Conclusion:
To test the significance of the difference between the sample proportions p1' and p2' .
1 1
The standard error of p1' p2' pq
n1 n2
x x n1 p1' n2 p2 '
Where p= 1 2
n1 n2 n1 n2
p1' p2'
The test statistic is Z = N (0,1)
1 1
pq
n1 n2
Problems:
1. If a sample of 300 units of a manufactured product 65 units were found to be defective and in
another sample of 200 units, there were 35 defectives. Is there significant difference in the
Solution:
H0: p1 = p2 (ie., There is no significant difference in the proportion defectives in the samples)
Los: = 0.05
p1' p2'
The test statistic is Z =
1 1
pq
n1 n2
65
p1' 0.22; p 2 ' =0.175
300
100 1 4
p= q
500 5 5
1 1 4 1 1
pq 0.0365
n1 n2 25 300 200
0.22 0.175
Z= 1.233
0.0365
Critical value :
Conclusion:
2. A machine puts out 16 imperfect articles in a sample of 500. After the machine is over-hauled
in puts out 3 imperfect articles in a batch of 100. Has the machine improved?
Solution:
ie., H0 : p1 = p2
Los: = 0.05
p1' p2'
The test statistic is Z =
1 1
pq
n1 n2
16
Here p1' 0.032; p 2' =0.03
500
n1 = 500; n 2 =100
19 581
p= and q
600 600
0.032 0.03
Z= 0.104
19 581 1 1
600 600 500 100
| Z | = 0.104
Critical value :
The table value of Z for one tailed test Z = 1.645 at 5 Los
Conclusion:
Z < 1.645, H 0 is accepted at 5% Los.
The Machine has not improved due to overhaulding.
3. Before an increase in excise duty on tea, 800 perons out of a sample of 1000 persons were
found to be tea drinkers. After an increse is excise duty. 800 people were tea drinkers in a
sample of 1200 people. Test whether there is a significant decrease in the consumption of
Solution:
H0: the proportion of tea drinkers before and after the increase in excise duty are equal
ie., p1 = p 2
H1: p1 p2
Los: = 0.05
p1' p2'
The test statistic is Z =
1 1
pq
n1 n2
800 800
n 2 =1200; p1' 0.8; p2' = 0.67
1000 1200
x1 x2 1600 8 3
p= = = q
n1 n2 2200 11 11
1 1 24
pq 0.001 0.0008 0.0189
n1 n2 121
0.13
Z= 6.88 |Z| 6.88
0.0189
Conclusion:
There is a significance decrease in the consumption of tea due to increase in excise duty.
Type - V: (Test of significance for the difference of S.D’s of two large samples)
Let S1 and S2 be the S.D's of two indepedent samples of sizes n1 and n2 respectively
S1 S2
the test statistic is Z = N (0,1) for large 'n'
S.E (S1 S2 )
S1 S2
Z= N (0,1)
2 2
1 2
2n1 2n2
S1 S2
Z=
S12 S2 2
2n1 2n2
Problems:
1. The sample of sizes 1000 and 800 gave the following results
Mean S.D
Sample I 17.5 2.5
Sample II 18 2.7
Assuming that the samples are indepedent, test whether the two samples may be
Solution:
We set up H0 : 1 2 ;
ie., two samples maybe regarded as drawn from the universe with same S.D’s
H1 : 1 2
S1 S2
Test statistic Z =
S12 S2 2
2n1 2n2
| Z | =2.282
Critical value :
At 1% Los, the tabulated value is 2.58
Conclusion:
Since Z < 2.58, H 0 is accepted at 1% Los.
The two samples may be regarded as drawn from the universe with the same S.D's
2. In a survey of incomes of two classes of workers, two random samples gave the following
results. Examine whether the differences between (i) the means and (ii) the S.D's are
significant.
I 100 582 24
II 100 546 28
Examine also whether the samples have been drawn from a population with same S.D
Solution:
(i) We set up H0 : 1 2 ;
H1 : 1 2
x1 x 2 582 546
Test statistic Z =
2 2
S1 S2 (24)2 (28)2
n1 n2 100 100
360
Z = =9.76
(24) 2 (28) 2
| Z | = 9.76
Critical value :
Conclusion:
(ii) H 0 : 1 2
H1 : 1 2
S1 S2 24 28
Test statistic Z =
S12 S2 2 (24)2 (28)2
2n1 2n2 200 200
40
Z = = 1.53
288 392
| Z | = 1.53
Critical value :
Conclusion:
Hence we conclude that the two samples have been drawn from population with the same S.D's
3. Two machines A and B produced 200 and 250 items on the average per day with a S.D of
20 and 25 items reply on the basis of records of 50 day's production. Can you regard both
Solution:
Los: = 0.05
S1 S2
Test statistic Z =
S12 S2 2
n1 n2
n1 =200 50; S2 = 25
n2 =250 50; S1 20
(20 25) 50 5 50
Z = = 23.57
400 625 1 1.25
400 500
| Z | = 23.57
Critical value :
At 1% Los, the table value of Z is 2.58
Conclusion:
Since Z > 2.58, H 0 is rejected at 1% Los.
We conclude that the both machines are not equally efficient at 1% Los
n 1
|x |
The student's t-statistic is defined as t = n , Where n = sample size
s
The degree of freedon of this statistic
V=n 1
Type I:
x x
Test Statistic t =
S .D S
n 1 n
1 ns 2
S2 ( x x)2 (or) S=
n 1 n 1
If the computed value of t is greater than the critical value t ,Ho is rejected
10 parts shows a mean diamter of 0.742 inch with a S.D of 0.40. Test whether the work is
Solution:
ns 2 10 (0.40)2
= 0.700 inches S= = 0.4216
n 1 9
H0 : 0.700inches
H1 : 0.700inches
|x |
Test Statistic t = n =0.316
s
2. Ten individuals are chosen at random from a population and their heights are found to be
in inches 63, 63, 66, 67, 68, 69, 70, 70, 71, 71. In the light of this data, discuss the suggestion
Solution:
x : 63 63 66 67 68 69 70 70 71 71
(x- x)2 : 23.04 23.04 3.24 0.64 0.04 1.44 4.84 4.84 10.24 10.24
(x - x) 2 81.6
S= 3.011
9 9
x1 x2
Test Statistic t =
1 1
S
n1 n2
n1s12 n2 s2 2
Where S2 (or)
n1 n2 2
2
( x1 x1 ) 2 ( x2 x 2 )2
S
n1 n2 2
The calculated value of t is less than the table value of t for d.f = n1 n2 2, H0 is accepted
1. Two independent samples from normal pop's with equal variances gave the following
results
2 12 24.9 2.8
Solution:
H1 : 1 2
Los: = 0.05
x1 x2 n1s12 n2 s2 2
Test Statistic t = Where S2
1 1 n1 n2 2
S
n1 n2
100 94.08
= =7.465
26
S = 2.732
23.4 24.9
t= = -1.438
1 1
2.732
16 12
| t | = 1.438
Critical value :
t 0.05 2.056
Conclusion:
H0 is accepted at 5% Los.
2. Two independent samples of 8 and 7 items respectively had the following values
Sample I : 9 13 11 11 15 9 12 14
Sample II : 10 12 10 14 9 8 10
Solution:
We set up H0 : 1 2
H1 : 1 2
Los: = 0.05
x1 x2
Test Statistic t =
1 1
S
n1 n2
2
( x1 x1 ) 2 ( x2 x 2 )2
Where S
n1 n2 2
Computation of t:
d1 ( x1 x1 ) d2 ( x2 x2 )
x1 d1 x1 11.75 d1 2
( x1 x1 ) 2 x2 d2 x2 10.43 d 22 ( x2 x2 ) 2
9 -2.75 7.5625
10 -0.43
0.1849
13 1.25 1.5625
12 1.57
2.4649
11 -0.75 0.5625
10 -0.43
0.1849
11 -0.75 0.5625
14 3.57
12.7449
15 3.25 10.5625
9 -1.43
2.0449
9 -2.75 7.5625
8 -2.43
5.9049
12 0.25 0.0625
10 -0.43
0.1849
14 2.25 5.0625
6
x1 11 11.75
8
3
x2 10 10.43
7
2
2 2
d1 36
( x1 x1 ) d 1 38 33.5
n1 8
2
2 2
d2 9
( x2 x2 ) d2 25 33.5
n2 7
33.5 23.71
S2 S = 2.097
8 7 2
x1 x2
t=
1 1
S
n1 n2
11.75 10.43
=
1 1
2.097
8 7
t = 1.218
d.f = 8 + 7 - 2 = 13
Critical value:
Conclusion:
Type III:
When the two samples are of the same sizes and the data are paired
d
the test statistic is t =
S
n
Where d mean of differences
(d d ) 2
and S =
n 1
Degrees of freedom = n-1
1. Elevan school boys were given a test in painting. They were given a month's further tution
and a second test of equal difficulty was held at the end of the month. Do the marks give
evidance that the students have beneifit by extra coaching?
Boys: 1 2 3 4 5 6 7 8 9 10 11
Solution:
ie., The mean of the difference between the marks of the two tests is zero
ie., H0 : d 0
H1: d 0
d
the test statistic is t =
S
n
S. No 1 2 3 4 5 6 7 8 9 10 11
d=x-y -1 1 -3 3 -2 -2 -2 -3 0 -4 2
d-d 0 2 -2 4 -1 -1 -1 -2 1 -3 3
2 0 4 4 16 1 1 1 4 1 9 9
d-d
d 11
d 11; d 1
n 11
2
d-d 50
(d d ) 2 50
S= 5 2.236
n 1 10
d 1
t=
S 2.236
n 11
1
|t|= 1.48
0.625
Critical value:
Conclusion:
2. The scores of 10 candidates prior and after training are given below,
Prior : 84 48 36 37 54 69 83 96 90 65
After : 90 58 56 49 62 81 84 86 84 75
Is the training effective?
Solution:
ie., H0 : d 0
H1: d 0
d
the test statistic is t =
S
n
S. No 1 2 3 4 5 6 7 8 9 10
d=x-y -6 -10 -20 -12 -8 -12 -1 10 6 -10
d-d 0.3 -3.7 -13.7 -5.7 -1.7 -5.7 5.3 16.3 12.3 -3.7
2 0.09 13.69 187.69 32.49 2.89 32.49 28.09 265.69 151.29 13.69
d-d
d 63
d 63; d 6.3
n 10
2
d-d 728.1
(d d ) 2 728.1
S= 80.9 8.994
n 1 9
S 8.994
6.3 6.3
t 2.21
8.994 2.844
10
| t | = 2.21
Critical value:
Conclusion:
This test is used to test the significance of two or more sample estimates of population variance
2
x2 x2
S2 2
n2 1
S12
The distribution of F = 2 S12 S2 2 is given by the following p.d.f
S2
If S12 and S22 are the variances of two sample of sizes n1 and n 2 respectively, the estimate
n1s12 n 2s 2 2
S12 ; S2 2
n1 1 n2 1
d.f V1 = n1 1 &V2 = n2 1
While defining the statistic F, the large oftwo variances is always placed in the numerator and
Consider two independent R, samples x1 ,x 2 ,.......x n1 & y1 ,y 2 ,.......y n 2 from normal populations
The hypothesis to be tested is
2 2
we set up: H0 : 1 = 2
2 2
& H1: 1 2
S12
The test statistic F= 2 S12 S2 2
S2
n n
1 2 1 2
S12 xi x and S12 yj y
n1 1 i 1 n1 1 j 1
Problems:
1. It is known that the mean diameters o rivets produced by two firms A and B are practically the
same but the standard deviations may differ.
For 22 rivets produced by A, the S.D is 2.9 m, while for 16 rivets manufactured by B, the S.D is
3.8 m. Test whether the products of A have the same variability as those of B
Solution:
2 2
H0 : 1 = 2
ie., variability for the two types of products are same.
Los: = 0.05 (or) 5%
S12
The test statistic F= 2 S12 S2 2
S2
Given, n1 = 22; n 2 = 16
S1 = 2.9; S2 = 3.8
n1s12 22(2.9) 2
S12 8.81
n1 1 22 1
n 2s 2 2 16(3.8)2
S2 2 15.40
n2 1 16 1
S2 2
F = S2 2 S12
S12
15.40
=
8.81
F = 1.748
Number of degrees of freedom are V1 16 1 15
V2 22 1 21
Critical value:
At 5% Los, the table value of F at d.f (15,21) is F = 2.18
Conclusion:
F< 2.18, H0 is accepted at 5% Los.
Variability for two types of products may be same.
2. Two random samples of sizes 8 and 11, drawn from two normal populations are characterized
as follows
You are to decide if the two populations can be taken to have the same variance.
Solution:
2 2
we set up: H0 : 1 = 2
2 2
& H1: 1 2
For sample I
2
2
x2 x
s 1
n n
2
61.52 9.6
=
8 8
s12 6.25
For sample II
2
2
y2 y
s2
n n
2
73.26 16.5
=
11 11
s22 4.41
n1s12 8(6.25)
S12 7.143
n1 1 7
n 2s 2 2 11(4.41)
S2 2 4.851
n2 1 10
S2 2
F = S2 2 S12
S12
7.143
= 1.472
4.851
F = 1.472
Chi-Square Test
Definition
2
is defined as
2
n
2 Oi Ei
i 1 Ei
Problems
1. The following table gives the number of aircraft accident that occured during the various days
of the week. Test whether the acidents are uniformly distributed over the week.
Solution:
Los α = 0.05
2
n
2 Oi Ei
Test Statistic
i 1 Ei
84
The expected frequency of the on each day = 14
6
Oi : 14 18 12 11 15 14
Ei : 14 14 14 14 14 14
2 2 2 2 2 2
2 14 14 18 14 12 14 11 14 15 14 14 14
14 14 14 14 14 14
Critical value:
2
The tablulated value of at 5% for 6 d.f is 12.59
Conclusion:
2
Since < 12.59, we accept the null hypothesis
We conclude that the accidents are uniformly distributed over the week.
2. The theory predicts the population of beans in the four groups A, B , C and D should be
9:3:3:1. In an experiment among 1600 beans, the number in the four groups were 882, 313,
287 and 118. Does the experimental result support the theory?
Solution:
9
E(882) = 1600 900
16
3
E(313) = 1600 300
16
3
E(287) = 1600 300
16
1
E(118) = 1600 100
16
2
n
2 Oi Ei
Test Statistic
i 1 Ei
2 2 2 2
2 882 900 313 300 287 300 118 100
900 300 300 100
Critical Value:
2
The table value of at 5% for 3 d.f is 7.815
Conclusion:
2
Since < 7.815, H0 is accepted at 5% Los.
We conclude that there is a very good correspondent between theory and experiment
3. 4 coins were tossed 160 times and the following results were obtained.
No. of heads : 0 1 2 3 4
Frequency : 19 50 52 30 9 160
0 50 104 90 36 280
2
Test the goodness of fit with the help of on the assumption that the coins are unbiased
Solution:
1
The probability if getting the success of heads is p =
2
1
q = 1- p =
2
When 4 coins are tossed, the probability of getting 'r' heads is given by,
P(x = r) = n Cr pr qn-r ; r = 0, 1, 2, 3, 4
r 4-r
1 1
= 4Cr
2 2
4
1
= 4Cr
2
1
P(x = r) 4C r r = 0, 1, 2, 3, 4
16
1
The expected frequencies of getting 0, 1, 2, 3, 4 heads are given by 1604Cr
16
= 104Cr , r = 0, 1, 2, 3, 4
Oi : 19 50 52 30 9
Ei : 10 40 60 40 10
26 48 43 26 12
2
n
2 Oi Ei
Test Statistic
i 1 Ei
2 2 2 2 2
2 19 10 50 40 52 60 30 40 9 10
10 40 60 40 40
D.f V= n-1=5-1=4
Critical value:
2
The table value of for 4 d.f at 5% Los is 9.488
Conclusion:
2
Since > 9.488, H 0 is rejected at 5% Los
The coins are biased
No. of goals : 0 1 2 3 4 5 6 7
Solution:
x: 0 1 2 3 4 5 6 7
f : 95 158 108 63 40 9 5 2
fx 812
x 1.7
f 480
1.7
e (1.7) r
= 480 r = 0, 1, 2, 3, 4, 5, 6, 7
r!
2
n
2 Oi Ei
Test Statistic
i 1 Ei
Oi : 95 158 108 63 40 9 5 2
16
Ei : 88 150 126 72 30 10 3 1
14
2 2 2 2 2 2 2
2O E 95 88 158 150 108 126 40 30 16 14 63 72
=
E 88 150 126 30 14 72
= 0.56 + 0.43 + 2.57 + 3.33 + 1.12 +0.29
2
= 8.30
Critical value:
2
The table value of at 5% Los for 4 d.f is 9.483
Conclusion:
2
Since < 9.483, H 0 is accepted at 5% Los.
2
5. Apply the test of goodness of fit to the follwoing data
Oi : 1 5 20 28 42 22 15 5 2
Ei : 1 6 18 25 40 25 18 6 1
Solution:
= 0.05 (or) 5%
Oi : 1 5 20 28 42 22 15 5 2
6 7
Ei : 1 6 18 25 40 25 18 6 1
7 7
n=7
2
n
2 Oi Ei
Test Statistic
i 1 Ei
2 2 2 2 2 2 2
2 6 7 20 18 28 25 42 40 22 25 15 18 7 7
7 18 25 40 25 18 7
d.f V= n-1 = 7 - 1 = 6
Critical value:
2
At 5% Los, the table value of for 6 d.f is 12.592
Conclusion:
2
Since <12.592, H0 is accepted at 5% Los.
No. of days : 12 39 47 40 20 17 3 2
2
Use , examine whether the data are poisson distributed.
Solution:
x: 0 1 2 3 4 5 6 7
f : 12 39 47 40 20 17 3 2
fx : 0 39 94 120 80 85 18 14
fx 450
x 2.5
f 180
2.5
e (2.5) r
= 180 r = 0, 1, 2, 3, 4, 5, 6, 7
r!
r = 0 1 2 3 4 5 6 7
Oi : 12 39 47 40 20 17 3 2
5
Ei : 15 37 46 38 24 12 5 2
7
2
Test Statistic
2 2 2 2 2 2 2
2 12 15 39 37 47 46 40 38 20 24 17 12 5 7
15 37 46 38 24 12 7
d.f V = n-1 = 7 - 1 = 6
Critical value:
2
At 5% Los, the table value of for 6 d.f is 12.592
Conclusion:
2
Since < 12.592, H0 is accepted at 5% Los.
Attribute A
B B2
. .
. .
Bs Os1 Os 2 ................Osj.................. Ost (Bs )
Attribute A
A1 A 2 ................A j .................. A t Total
Attribute B1 O11 O12 ................O1j.................. O1t (B1 )
B B2
.
.
.
Bi O i1 Oi 2 ................O ij.................. Oit (Bi )
. .
. .
. .
Bs O s1 O s 2 ................O sj.................. O st (B s )
Total (A1 ) (A 2 )................(A i ).................(A t ) N
j = 1, 2 ........... t
a b 2
In a 2 2 consistency table where in the frequencies are , the value of is
c d
2 (a b c d )(ad bc)2
(a b)(a c)(c d )(b d )
Problems :
1. An opinion poll was conducted to find the reaction to a proposed civic reform in 100 members
Solution:
Total
40 30 30 100
42 28 30 100
Total 82 58 60 200
2
n
2 Oi Ei
i 1 Ei
Oi : 40 30 30 42 28 30
Ei : 41 29 30 41 29 30
2 2 2 2 2 2
2 40 41 30 29 30 30 42 41 28 29 30 30
41 29 30 41 29 30
Critical value:
2
At 5% Los, the table value of for 2 d.f is 5.99
Conclusion:
2
Since < 5.99, H0 is accepted at 5% Los.
2. In a locality 100 persons were randomly selected and asked about their educational
Education
Middle High School College
Male 10 15 25
Female 25 10 15
Can you say that education depends on sex?
3. The following table gives the classification of 100 workers according to sex and the nature
of work. Test whether nature of work is independent of the sec of the worker.
Male 40 20 60
Sex Female 10 30 40
Total 50 50
Solution:
40 50 40 50
E(10) = 20; E(30) = 20
100 100
2
Test Statistic
2
n
2 Oi Ei
i 1 Ei
Oi : 40 20 10 30
Ei : 30 30 20 20
2 2 2 2
2 40 30 20 30 10 20 30 20
30 30 20 20
= 3.333 + 3.333 + 5 + 5
2
= 16.67
Critical value:
2
The table value of at 5% Los, for 1 d.f is 3.841
Conclusion:
2
Since > 3.841, H0 is rejected at 5% Los.
4. From the following data, test whether there is any association between intelligency and
economics conditions
Intelligences
Excellent Good Medium Dull Total
Economic Good 48 200 150 80 478
Conditions Not Good 52 180 190 100 522
Total 100 380 340 180 1000
Solution:
2
Test Statistic
2
n
2 Oi Ei
i 1 Ei
2 2 2 2 2
48 47.8 150 162.52 52 52.2 190 177.48 200 181.64
47.8 162.52 52.2 177.48 181.64
2 2 2
80 86.04 180 198.36 100 93.96
86.04 198.36 93.96
Conclusion:
2
Since < 7.815, H0 is accepted at 5% Los.
5. From the following data, test the hypothesis that the flower color is independent of flatness
of leaf
Solution:
We set up: H0 : flower color is independent of flatness of leaf. Los =0.05 (or) 5%
2
we use the formula to find is
2 (a b c d )(ad bc)2
(a b)(a c)(c d )(b d )
2
= 0.4919
Critical value:
2
The table value of at 5% Los for 1 d.f is 3.841
Conclusion:
2
Since < 3.841, H0 is accepted at 5% Los.
2 ns 2
The test statistic 2
s = sample variance
= population variance
Note:
2
The test statistic z= 2 2n 1 ~ N (0,1)
1. A random sample of size 9 from a normal population have the following values 72, 68, 74, 77,
61, 63, 63, 73, 71. Test the hypothesis that the population variance is 36.
Solution:
2
Null hypothesis H0 : 36
2
Alternative hypothesis H1: 36
2 ns 2
The test statistic 2
x : 72 68 74 77 61 63 63 73 71
x 622
x 622; x 69.11
n 9
x x : 2.9 -1.1 4.9 7.9 -8.1 -6.1 -6.1 3.9 1.9
2
x x : 8.41 1.21 24.01 62.41 65.61 37.21 37.21 15.21 3.61
2
x x 254.89
2 ns 2 254.89
2
7.08
36
d. f n-1 = 9-1 = 8
Critical value:
2
The table value of for 8 d.f at 5% Los is 15.51
Conclusion:
2
Since < 15.51, H0 is accepted at 5% Los.
2. Test the hypothesis that 10, given that s = 15 for a random sample of size 50 from a
normal population
Solution:
2 ns 2 50 225
2
= 112.5
100
2
Since 'n' is large (n > 30, the test statistic z = 2 2n 1
= 225 99 15 9.95
z = 5.05
Critical value:
Conclusion:
We conclude that 10
3. The standard deviation of the distribution of times taken by 12 workers for performing a Job
Solution:
Let H0: 10
H1: 10
Given n = 12; s = 11
2 ns 2
2
12 121
= 14.52
100
2
14.52
Degrees of freedom = n - 1 = 12 - 1 = 11
Critical value:
2
The table value of for 11 d.f at 5%Los is 19.675.
www.rejinpaul.com
www.rejinpaul.com
Conclusion:
2
Since < 19.675, H0 is accepted at 5% level