MSF Full Chapters PDF
MSF Full Chapters PDF
MSF Full Chapters PDF
(AK20 Regulation)
(Common to CSE, AI&DS)
(AK20 Regulation)
(Common to CSE, AI&DS)
MEASURES OF DISPERSION
Introduction:
Dispersion is defined as deviation or scattering of values from their central values i.e,
average (Mean, Median or Mode but preferably Mean or Median). In other words, dispersion
measures the degree or extent to which the values of a variable deviate from its average.
Two distributions may have:
i. Same central tendency and same dispersion
ii. Different central tendency but same dispersion
iii. Same central tendency but different dispersion
iv. Different central tendency and different dispersion
Definition: The degree to which numerical data tend to spread about an average value is
called variation or dispersion or spread of the data.
Problem1: Compute the range for the following observation 15, 20, 25, 25, 30, 35.
Solution: Range = Largest value – Smallest value
i.e., 35-15=20
Problem 2: The following table gives the daily sales (Rs.) of two firms A and B for five
days.
Firm A 5050 5025 4950 4835 5140
Firm B 4900 3100 2200 1800 13000
Solution: The sales of both the firms in average are same but distribution pattern is not
similar. There is a great amount of variation in the daily sales of the firm B than that of the
firm A
Range of sales of firm A = Greatest value – Smallest value = 5140-4835=305
Range of sales of firm B = Greatest value – Smallest value = 13000-1800=11200
MEAN DEVIATION:
Mean deviation is defined as arithmetic average of absolute values of the deviations
of the variates measured from an average (median, mode or mean).
The absolute value of the deviation denoted by | deviation | is the numerical value of
the deviation with positive sign.
Note: Mean deviation can be similarly calculated by taking deviations from the median or
mode.
x i x
⸫ Mean deviation = i 1
n
Problem 1: Calculate the mean deviation of the variates 40, 62, 54, 68, 76 from A.M
40 62 54 68 76 300
Solution: A.M = x = 60
5 5
n 5
xi x x 60i
⸫ Mean deviation = i 1
i 1
=
n 5
40 60 62 60 54 60 68 60 76 60
5
= 52/5 = 10.4
Problem 2: Find the mean deviation from the mean for the following data: 38, 70, 48,
40, 42, 55, 63, 46, 54, 44.
38 70 48 40 42 55 63 46 54 44 500
Solution: Mean x = 50
10 10
10
x
i 1
i x
Mean deviation from the mean = =
10
38 50 70 50 48 50 40 50 42 50 55 50 63 50 46 50 54 50 44 50
10
= 84/10 = 8.4
Problem 3: Find the mean deviation from the median for the data 34, 66, 30, 38, 44, 50,
40, 60, 42, 51.
Solution : Arranging the data in ascending order, we have :
30, 34, 38, 40, 42, 44, 50, 51, 60, 66 (n=10 terms)
th th
n n
2 term 2 1 term
Now Median ( M d )
2
= (42+44)/2 = 43.
10 10
xi Median x 43 i
i 1
i 1
30 43 34 43 38 43 40 43 42 43 44 43 50 43 51 43 60 43 66 43
10
= 87/10 = 8.7
xi x f1 xi x f 2 xi x f 3 ....... xi x f n f i xi x
=
i 1
f1 f 2 f 3 .......... f n n
fi 1
i
f i xi x n
where f i N
i 1
=
N i 1
Problem 1: Find the mean deviation about the mean for the following data
xi 2 5 7 8 10 35
,f i 6 8 10 6 8 2
35 2 70 27 54
∑,fi=N=40 ∑xi,fi=320 140
Thus A.M= x
f i xi 320
8
f i 40
n
f i xi x
⸫ Mean deviation = i 1
n =140/40 = 3.5
i 1
fi
Problem 2: Find the mean deviation about the median for the following data
xi 6 9 3 12 15 13 21 22
,fi 4 5 3 2 5 4 4 3
f i xi Median
149
i 1
4.97
⸫ Mean deviation from the median = f i 30
Problem 3: Find the mean deviation about the mean for the following data
xi 5 10 15 20 25
,f i 7 4 6 3 5
Solution:
Mean=14;
Mean deviation about the mean= 6.32
Problem 4: Find the mean deviation from median for the following data
xi 6 7 8 9 10 11 12
,f i 3 6 9 13 8 5 4
Solution : Median = 9 ;
Mean deviation about the median = 1.25
Mean Deviation from mean for Continuous Frequency Distribution: A continuous frequency
distribution is a series in which the data is classified into different class intervals along with
respective frequency. We calculate the A.M. of a continuous frequency distribute, we take xi
as the mid value of the class interval.
Problem 1:The following table gives the sales of 100 companies. Find the mean deviation
from the mean.
Sales in thousands 40-50 50-60 60-70 70-80 80-90 90-100
,Number of companies 5 15 25 30 20 5
Solution: we construct the following table for the given data
Sales Number of Midpoint of xi fi |xi - x | |xi - x | ,fi
companies fi the class xi
40-50 5 45 225 26 130
50-60 15 55 825 16 240
60-70 25 65 1625 6 150
70-80 30 75 2250 4 120
80-90 20 85 1700 14 280
90-100 5 95 475 24 120
∑fi = N= ∑xi,fi = ∑|xi - x |,fi
100 7100 =1040
Now x
fx i i
7100
71
f i 100
n
f
i 1
i xi x
Mean Deviation from mean= n =1040/40 = 10.4
i 1
fi
Problem 1: Find the mean deviation about the mean for the following data
Classes 0-100 100-200 200-300 300-400 400-500 500-600 600-700 700-800
Freq. 4 8 9 10 7 5 4 3
Solution : Assumed mean a = 350
Now x a
fi di 4
class size = 350 x100 358
f i 50
n
f
i 1
i xi x
Mean Deviation from mean= n = 7896 / 50 = 157.92
i 1
fi
Problem 2: Find the mean deviation about the mean for the following data
Classes 0-10 10-20 20-30 30-40 40-50
Freq. 5 8 15 16 6
Solution :
Classes Mid Frequency(fi) di fidi |xi - x | |xi - x |,fi
values(xi)
0-10 5 5 -2 -10 22 110
10-20 15 8 -1 -8 12 96
20-30 25 15 0 0 2 30
30-40 35 16 1 16 8 128
40-50 45 6 2 12 18 108
50 10 472
Now x a
fi di
h = 25
10
10 27 and
xi x xi 27
f i
50 h 10
f
i 1
i xi x
Mean Deviation from mean= n = 472 / 50 = 9.44
f
i 1
i
Problem 3: Find the mean deviation from median for the following data
Age of 20-25 25-30 30-35 35-40 40-45 45-50 50-55 55-60
workers
No. of 120 125 175 160 150 140 100 30
workers
f i xi Median
8175
i 1
8.175
⸫ Mean deviation from the median = f i 1000
Problem 4: Find the mean deviation from median for the following data
Wages/week 10-20 20-30 30-40 40-50 50-60 60-70 70-80
(Rs.)
No. of 120 125 175 160 150 140 100
workers
Solution : Here N/2=30 ; Median = 45 ;
Mean deviation from median = 11.33
Let x1 and x2 be the sample means of two independent large random samples sizes n1 and
n2 drawn from two populations having means 1 and 2 and standard deviations 1 and 2 .
To test whether the two population means are equal.
Note : If the two samples are drawn from population with unknown Standard deviations 12
and 22 , then 12 and 22 can be replaced by sample by variances s12 and s22 provided both the
samples n1 and n2 are large.
s12 s22
n1 n2
Solved Problems
1. The means of two large samples of sizes 1000 and 2000 members are 67.5 inches and 68.0
respectively. Can the samples be regarded as drawn from the same population of S.D 2.5
inches.
Step:1 Null Hypothesis H 0 : The samples have been drawn from the same population of S.D
0.5
z 5.16
0.0968
z 5.16
Step:5 Hence the null hypothesis H 0 is Rejected at 5% level of significance and we conclude
that the samples are not drawn from the sample population of S.D. 2.5 inches.
Step: 1 Null Hypothesis H 0 : The groups have been came from the same population.
z 2.7116
3. Samples of students were drawn from two universities and from their weights in
kilograms, mean and S.D. are calculated and shown below. Make a large sample test to the
significance of the difference between the means.
University B 57 15 100
z 1.26
Step:5 Hence the null hypothesis H 0 is accepted at 5% level of significance and we conclude
that there is no significant difference between the means.
4. The average marks scored by 32 boys is 72 with S.D. of 8. While that for 36 girls is70 with
a S.D. of 6. Does this indicate that the boys perform better than girls at level of significance
0.05?
z 1.1547
Step:5 Hence the null hypothesis H 0 is accepted at 5% level of significance and we conclude
that the performance of boys and girls is the same.
5. A sample of the height of 6400 Englishmen has a mean of 67.85 inches and a S.D. of 2.56
inches while a simple sample of heights of 1600 Austrians has a mean of 68.55 inches and
S.D. of 2.52 inches. Do the data indicate the Austrians are on the average taller than the
Englishmen? (Use as 0.01).
z 9.9
Step:5 Hence the null hypothesis H 0 is rejected at 5% level of significance and we conclude
that the Austrians are taller than Englishmen.
6. In a certain factory there are two independent processes for manufacturing the same item.
The average weights in a sample of 700 items produced from one process is found to be 250
gms with a S.D. of 30 gms while the corresponding gures in a sample of 300 items from the
other process are 300 and 40. Is there significant difference between the mean at 1% level of
significance.
z 19.43
Step:5 Hence the null hypothesis H 0 is Rejected at 5% level of significance and we conclude
that there is a significant difference between the means.
7. The mean yield of wheat from a district A was 210 pounds with S.D. 10 pounds per acre
from a sample of 100 plots. In another district the mean yield was 220 pounds with S.D. 12
pounds from a sample of 150 plots. Assuming that the S.D. of yield in the entire state was 11
pounds, test whether there is any significant difference between the mean yield of crops in the
two districts.
NOTE: 1. Find a critical value for a 95% confidence level (Two-Tailed Test).
Procedure: Confidence limit = 95%
i.e, (1 )100% 95%
(1 )100 95
95
(1 ) 0.95 0.05 0.025
100 2
1 0.025 0.975, z value at 0.975 is 1.96
z 1.96
2
Sample Proportion:
Large Proportion:
Step:1 Let us set the Null Hypthesis be H 0 : P P0 (P0 is a particular value of P).
pP
Step:4 The test statistic z ; where p is the sample proportion is approximately
PQ
n
normally distributed.
Step:5 The critical Rejection for z depending on the nature of H1 and level of significance
is given in the following table.
Note: 1. without any reference to the level of significance, we may reject the Null
Hypothesis H 0 when z 3 .
pq
2. (i) Limits for population proportion P are given by p 3 where q 1 p .
n
pq pq
(ii) Confidence interval for proportion P are given by p z z p z
2 n 2 n
where Q 1 P .
Solved Problems
1. A manufacturer claimed that at least 95% of the equipment which he supplied to a factory
conformed to specifications. An examination of a sample of 200 pieces of equipment
revealed that 18 were faulty. Test his claim at 5% level of significance.
x 182
p =Proportion of pieces confirming to specifications= 0.91
n 200
95
Let P = Population proportion= 0.95 ; Q=1-P=1-0.95=0.05
100
i.e., P 95% .
z 2.59
Since alternative hypothesis is left tailed, the tabulated value of z at 5% level of significance
is 1.645.
Step:5 Hence the null hypothesis H 0 is Rejected at 5% level of significance and we conclude
that the manufacture’s claim is rejected.
2. In a sample of 1000 people in Karnataka 540 are rice eaters and the rest are wheat eaters.
Can we assume that both rice and wheat are equally popular in this state at 1% level of
significance?
540
Let p =Sample proportion of rice eaters = 0.54
1000
1
Let P = Population proportion of rice eaters = 0.5 .
2
Q 1 P 1 0.5 0.5.
Step:1 Null Hypothesis H 0 : Both rice and wheat are equally popular in the state.
i.e., P 0.5 .
z 2.532
Since alternative hypothesis is left tailed, the tabulated value of z at 1% level of significance
is 2.58.
Step:5 Hence the null hypothesis H 0 is accepted at 1% level of significance and conclude
that both rice and wheat are equally popular in the state.
3. A random sample of 500 pineapples was taken from a large consignment and 65 were
found bad. Find the percentage of bad pineapples in the consignment.
65
Let p = Proportion of bad pineapples in the sample = 0.13
500
q 1 p 0.87 .
pq 0.13 0.87
p3 0.13 3 0.13 0.045 (0.085,0.175)
n 500
The percentage of bad pineapples in the consignment lies between 8.5 and 17.5.
4. A manufacturer claims that only 4% of his products are defective. Test the hypothesis at
random sample of 500 were taken among which 100 were defective. Test the hypothesis at
0.05 level.
x 100
Let p = 0.2
n 500
z 18.26
Since alternative hypothesis is right tailed, the tabulated value of z at 5% level of significance
is 1.96.
5. In a random sample of 100 packages shipped by air freight 13 had some damage.
Construct 95% confidence interval for the true proportion of damage package.
x 13
Let p =sample proportion of damage packages= 0.13
n 100
q 1 p 1 0.13 0.87
PQ pq 0.13 0.87
p 0.034 ( Pis not known, we take p for P)
n n 100
pq
p 1.96 0.13 1.96(0.034) 0.13 0.067 (0.063,0.197)
n
Hence the 95% confidene limits for the true proportion of damage packages is (0.063, 0.197)
6. In a hospital 480 females and 520 male babies were born in a week. Do these figures
confirm the hypothesis that males and females are born in equal number?
x 480
Let p =proportion of females born= 0.48
n 1000
1
Step:1 Null Hypothesis H 0 : The probability of equal proportion i.e, P 0.5 .
2
z 1.265
7. In a random sample of 125 cool drinkers, 68 said they prefer thumsup to pepsi. Test the
null hypothesis P=0.5 against the alternative hypothesis P>0.5.
x 68
Solution: Given sample size, n=125, x=68 and p = 0.544
n 125
z 0.9839
Since alternative hypothesis is right tailed, the tabulated value of z at 5% level of significance
is 1.645.
Step:5 Hence the null hypothesis H 0 is accepted at 5% level of significance.
8. In a big city 325 men out of 600 men were found to be smokers. Does this information
support the conclusion that the majority of men in this city are smokers?
9. A die was thrown 9000 times and of these 3220 yielded a 3 or 4. Is this consistent with the
hypothesis that the die was unbiased?
10. In a random sample of 160 workers exposed to a certain amount of radiation, 24
experienced some ill effects. Construct a 99% confidence interval for the corresponding true
percentage.
p1 p2
The test statistic z
PQ PQ
1 1
2 2
n1 n2
(b) When the population proportions P1 and P2 are not known but sample proportions
p1 and p2 are known .
In this case we have two methods to estimate P1 and P2 .
In this method, the estimated value for the two population proportions is obtained by
pooling the two sample proportions p1 and p2 into a single proportion p by the formula given
below.
Sample proportion of two samples or estimated value of p is given by
n p n p x x
p 1 1 2 2 1 2 ; q 1 p
n1 n2 n1 n2
p1 p2
The test statistic z
1 1
pq
n1 n2
Solved Problems
x1 45 x 34
p1 0.25, p2 2 0.283
n1 180 n2 120
n1 p1 n2 p2 x1 x2 45 34 79
p 0.263 ;
n1 n2 n1 n2 180 120 300
q 1 p 1 0.263 0.737
p1 p2 0.25 0.283
The test statistic z 0.647
1 1 1 1
pq (0.263)(0.737)
n1 n2 180 120
z 0.647
Since alternative hypothesis is two tailed, the tabulated value of z at 5% level of significance
is 1.96.
2. In two large population, there are 30% and 25% respectively of fair haired people. Is this
difference likely to be hidden in samples of 1200 and 900 respectively from the two
populations?
Solution: We have n1 1200 , n2 900 , x1 30 and x2 25
30
P1 Pr oportion of fair haired people in the first population 30% 0.3,
100
25
P2 Pr oportion of fair haired people in the sec ond population 25% 0.25
100
Q1 1 P1 1 0.3 0.7;
Q2 1 P2 1 0.25 0.75
P1 P2 0.3 0.25
Step:4 The test statistic z 2.56
PQ PQ 0.3 0.7 0.25 0.75
1 1
2 2
n1 n2 1200 900
z 2.56
Since alternative hypothesis is two tailed, the tabulated value of z at 5% level of significance
is 1.96.
x1 17 x 22
p1 0.045, p2 2 0.049
n1 375 n2 450
n1 p1 n2 p2 x1 x2 17 22 39
p 0.047 ;
n1 n2 n1 n2 375 450 825
q 1 p 1 0.047 0.953
p1 p2 0.045 0.049
The test statistic z 0.015
1 1 1 1
pq (0.047)(0.953)
n1 n2 375 450
Since alternative hypothesis is right tailed, the tabulated value of z at 5% level of significance
is 1.96.
4. During a country wide investigation the incidence of tuberculosis was found to be 1%. In a
college of 400 students 3 reported to be affected, where as in another college of 1200 students
10 were affected. Does this indicate any significant difference?
Solution: We have n1 400 , n2 1200 , x1 3 and x2 10
x1 3 x 10
p1 0.0075, p2 2 0.0083
n1 400 n2 1200
Given
p1 p2 0.0075 0.0083
The test statistic z 0.14
1 1 1 1
pq (0.01)(0.99)
n1 n2 400 1200
z 0.14 1.96
Since alternative hypothesis is two tailed, the tabulated value of z at 5% level of significance
is 1.96.
5. A sample poll of 300 voters from district A and 200 voters from district B showed that
56% and 48% respectively, were in favour of a given candidate. At a 0.05 level of
significance, test the hypothesis that there is a difference in the districts.
P1 P2 0.56 0.48
Step:4 The test statistic z 1.78
PQ PQ 0.56 0.44 0.48 0.52
1 1
2 2
n1 n2 300 200
z 1.78
Since alternative hypothesis is two tailed, the tabulated value of z at 5% level of significance
is 1.96.
6. A random sample of 300 shoppers at a supermarket includes 204 who regularly use cents
off coupons. In another sample of 500 shoppers at a supermarket includes 75 who regularly
use cents off coupons. Construct confidence interval for the probability that any one shopper
at the supermarket, selected at random, will regularly use cents off coupons.
Solution: Here n1 300 , n2 500 , x1 204 and x2 75
x1 204
p1 Pr oportion of shoppers who use cents off coupons in the first sample 0.68,
n1 300
x2 75
p2 Pr oportion of shoppers who use cents off coupons in the first sample 0.15
n2 500
q1 1 0.68 0.32; q2 1 0.15 0.85
The 98% confidence interval for the probability that any one shopper in sample selected at
random is
7. A study shows that 16 of 200 tractors produced on one assembly line required extensive
adjustments before they could be shipped, while the same was true for 14 of 400 tractors
produced on another assembly line. At the 0.01 level of significance, does this support the
claim the second production line superior work?
8. On the basis of their total scores, 200 candidates of a civil service examination are divided
into two groups, the upper 30% and the remaining 70%. Consider the first question of the
examination. Among the first group, 40had the correct answer, where as among the second
group, 80 had the correct answer. On the basis of these results, can one conclude that the first
question is not good at discriminating ability of the type being examined here?
https://getcalc.com/statistics-fdistribution-table.htm 1/2
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
Grammarly
https://getcalc.com/statistics-fdistribution-table.htm 2/2
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
Objective of F-Test:
To find out whether the two independent estimates of population variance differ
significantly.
(OR)
To find out whether the two samples may be regarded as drawn from the normal
populations having same variance.
xi x yi y
2 2
n s2 n s2
S12 1 1 (or ) and S22 2 2 (or ) , where s12 and s22 are the
n1 1 n1 1 n2 1 n2 1
variances of the two samples.
Assuming that H 0 is true, the test statistic is
S12 S 22
F when S1
2
S 2
2 (or) F when S22 S12 follows F-distribution with
S 22 S12
( n1 1, n2 1) degrees of freedom.
Step:4 Set the level of significance .
Step:5 If the calculated value of F> the tabulated value of F at , we reject the null
hypothesis H 0 and conclude that the variances 12 and 22 are not equal . Otherwise,
we accept the null hypothesis H 0 and conclude that the variances 12 and 22 are
equal.
Note:
In numerical problem, we take the greater of the two variances S12 and S 22 in the
numerator and the other in the denominator.
Greater var iance
i.e, F
Smaller var iance
When F is close to 1, the two sample variances S1 and S 2 are nearly same.
Properties of F-distribution:
1. F-distribution curve is Skewed towards right with range 0 to and having the roughly
median value 1.
2. Value of F will always be more than 0.
3. Shape of F-distribution curve is dependent on – d.f. of numerator & d.f. of denominator.
4. F-distribution curve is never symmetrical, but if d.f. will be increased then it will be more
similar to the symmetrical shape.
5. F cannot be negative, and it is a continuous distribution.
f(F)
F
0 F (1 , 2 )
F-distribution
Solved Problems
1. In one sample of 8 observations from a normal population, the sum of the squares of
deviations of the sample values from the sample mean is 84.4 and in another sample of 10
observations it was 102.6. Test at 5% level whether the populations have same variance.
Solution: Let 12 and 22 be the variances of the two normal populations from which the
samples are drawn.
Here n1 8, n2 10
Step:1 Let the null hypothesis be H 0 : 12 22 .
Step:2 Then the Alternative hypothesis is H1 : 12 22 .
Step:3 The estimates of 12 and 22 are given by
x x yi y
2 2
i 84.4 102.6
S12 12.057 and S22 11.4 .
n1 1 8 1 n2 1 10 1
S12 12.057
Assuming that H 0 is true. Since S12 S22 , the test statistic is F 1.057
S22 11.4
with ( (1, 2 ) n1 1, n2 1 )=(8-1,10-1)=(7,9) degrees of freedom.
Step:4 Set the level of significance 0.05 .
Step:5 The tabulated value of F at 5% level for (7,9) degrees of freedom is 3.29.
The calculated value of F< the tabulated value of F at 0.05 , we accept the null
hypothesis H 0 and conclude that the variances 12 and 22 are equal.
2. Two random samples reveal the following results:
Sample Size Sample Mean Sum of Squares of deviations from the mean
1 10 15 90
2 12 14 108
Test whether the samples came from the same normal population.
Solution: Let 12 and 22 be the variances of the two normal populations from which the
samples are drawn.
Here we have to use two tests (i) To test equality of variances by F-test (ii) To test equality of
means by t-test.
(i) F-test (equality of variances)
Given n1 10, n2 12, x 15, y 14
x x y y
2 2
i 90 i 108
S12 10 and S22 9.82
n1 1 10 1 n2 1 12 1
S12 10
F 2
1.018
S2 9.82
i.e, Calculated F= 1.018. Assuming that H 0 is true. Since S12 S22 , the test statistic is
with ( n1 1, n2 1 )=(9,11) degrees of freedom. Set the level of significance 0.05 .
The tabulated value of F at 5% level for (9,11) degrees of freedom is 2.89.
The calculated value of F< the tabulated value of F at 0.05 , we accept the null
hypothesis H 0 and conclude that the samples came from the same normal populations
with same variances.
(ii) t-test (to test equality of means):
Null hypothesis: H 0 : 1 2
Given x 15, y 14, n1 10, n2 12
1
x x y y 1
90 108 9.9
2 2
Now S 2
n1 n2 1
10 12 1
S 3.15
x y 15 14
The test statistic is t 0.74 .
1 1 3.15
S
n1 n2
Tabulated value of t for 20 d.f. (n1+n2-2) at 5% level of significance is 2.086.
Since calculated value of t < tabulated value of t , we accept the null hypothesis.
Hence from (i) and (ii), the given samples have been drawn from the same normal
populations. Hence we accept the null hypothesis that 1 2 and 12 22
3. The nicotine contents in milligrams in two samples of tobacco were found to be as
follows:
Sample A 24 27 26 31 25 ---
Sample B 27 30 28 31 22 36
Can it be said that the two samples have come from the same normal population.
4. The measurements of the output of two units have given the following results. Assuming
that both samples have been obtained from the normal population at 10% significant level,
test whether the two populations have the same variances.
Unit-A 14.1 10.1 14.7 13.7 14.0
Unit-B 14.0 14.5 13.7 12.7 14.1
x
x 1 (14.1 10.1 14.7 13.7 14.0) 13.32
Now
n5
y
y 1 (14.0 14.5 13.7 12.7 14.1) 13.8
n 5
x x x =x-13.32 ( x x)2 y y y =y-13.8 ( y y)2
14.1 0.78 0.6084 14.0 0.2 0.04
10.1 -3.22 10.3684 14.5 0.7 0.49
14.7 1.38 1.9044 13.7 -0.1 0.01
13.7 0.38 0.1444 12.7 -1.1 1.21
14.0 0.68 0.4624 14.1 0.5 0.09
x =66.6 ( x x)2 =13.488 y 69 ( y y)2 =1.84
x x yi y
2 2
i 13.4888 1.84
S12 3.372 and S22 0.46
n1 1 5 1 n2 1 5 1
S12 3.372
F 7.33
S22 0.46
Since S12 S22 , the test statistic is with ( n1 1, n2 1 )=(4,4) degrees of freedom.
Tabulated value of F for (4,4) d.f. at 10% =0.01 level of significance is 15.97.
5. Two independent samples of 8 & 7 items respectively had the following values of the
variables.
Sample I 9 11 13 11 16 10 12 14
Sample II 11 13 11 14 10 8 10 ---
Do the estimates of the population variance differ significantly?
Given n1 8& n2 7
x
x 1 (9 11 13 11 16 10 12 14) 96 12
n1 8 8
Now
y
y 1 (11 13 11 14 10 8 10) 11
n2 7
x x x =x-12 ( x x)2 y y y =y- ( y y)2
13.8
9 -3 9 11 0 0
11 -1 1 13 2 4
13 1 1 11 0 0
11 -1 1 14 3 9
16 4 16 10 -1 1
10 -2 4 8 -3 9
12 0 0 10 -1 1
14 2 4 --- --- ---
x =96 ( x x)2 =36 y 77 ( y y)2 =24
x x
yi y
2 2
i 36 24
S12 5.14 and S22 4
n1 1 8 1 n2 1 7 1
S12 5.14
F 1.285
S22 4
Since S12 S22 , the test statistic is with ( n1 1, n2 1 )=(7,6) degrees of freedom.
6. It is known that the mean diameters of rivets produced by two firms A and B are
practically the same, but the standard deviation may differ. For 22 rivets produced by firm A,
the S.D. is 2.9 mm, while for 16 rivets manufactured by firm B, the S.D. is 3.8 mm, compute
the statistic you would use to test whether the products of firm A have the same variability as
those of firm B and test its significance.
Assuming that H 0 is true and S22 S12 then the test statistic is
S22 15.393
F 1.74
S12 15
Tabulated value of F- with (21, 15) d.f. at 0.05 level of significance is 2.31.
Step:4 The calculated value of F< the tabulated value of F at 0.05 ,we accept the null
hypothesis H 0 i.e, the products of both the firms A and B have the same variability.
So we may conclude that the products of firm A are not superior to those of firm B .
7. Pumpkins were grown under two experimental conditions. Two random samples of 11 and
9 pumpkins, show the sample standard deviations of their weights as 0.8 and 0.5 respectively.
Assuming that the weight distributions are normal, test hypothesis that the true variances are
equal.
8. The time taken by workers in performing a job by Method I and Method II is given below.
Method I 20 16 26 27 23 22 ---
Method II 27 33 42 35 32 34 38
Do the data show that the variances of time distribution from population from which these
samples are drawn do not differ significantly?
To make the generalization about the population from the sample, statistical tests are used. A
statistical test is a formal technique that relies on the probability distribution, for reaching the
conclusion concerning the reasonableness of the hypothesis. These hypothetical testing
related to differences are classified as parametric and nonparametric tests.
The parametric test is one which has information about the population.
On the other hand, the nonparametric test is one whether no exact information about the
population . Chi-square test is commonly used non-parametric test.
2 - test:
Properties:
1. 2 - distribution curve is not symmetrical, lies entirely in the first quadrant, and hence
not a normal curve, since 2 varies from 0 to .
2. As the number of degrees of freedom increases, the chi-square distribution becomes
more symmetric.
3. It depends only on the degree of freedom .
4. The values are non-negative. i.e, the values of are greater than or equal to 0.
5. Mean= and variance=2 .
f( )
2
Applications of 2 distribution:
1.
To test the goodness of fit.
2.
To test the independence of attributes.
3.
To test the homogeneity of independent estimation of the population variances.
4.
To test the homogeneity of independent estimation of the population Correlation
coefficient.
Conditions of validity :
Following are the conditions which should be satisfied before 2 test can be applied.
1.
The sample observations should be independent.
2.
N, the total frequency is large, i.e, >50.
3.
The constraints on the cell frequencies, if any, are linear.
4.
No theoretical (or expected) frequency should be less than 10. If small theoretical
frequencies occur, the difficulty is overcome by regrouping 2 or more classes together
before calculating (O-E). Note that the degrees of freedom is determined with the
number of classes after regrouping.
Definition: If a set of events A1, A2,…..,An are observed to occur with frequencies O1,
O2,…..,On respectively and according to probability rules A1, A2,…..,An are expected to
occur with frequencies E1, E2,…..,En respectively with O1, O2,…..,On are called observed
frequencies and E1, E2,…..,En are called expected frequencies.
If Oi (i=1,2,…..,n) is a set of observed (experimental) frequencies and E i (i=1,2,…..,n)
is the corresponding set of expected (theoretical ) frequencies, then 2 is defined as
(Oi Ei )2
n
2
with (n-1) degrees of freedom.
i 1 Ei
Note:
If the data is given in a series of ‘n’ numbers then degrees of freedom = n-1.
O E
i 1
i
i 1
i (or ) (O E ) 0
i 1
i i
Solved Problems
1. The number of automobile accidents per week in a certain community are as follows:
12,8,20,2,14,10,15,6,9,4. Are these frequencies in agreement with the belief that accident
conditions were the same during this 10 week period.
100
Solution: Expected frequency of accidents each week= 10 .
10
Null hypothesis H0: The accident conditions were the same during the 10 week period.
Alternative hypothesis H0: The accident conditions are different during 10 week period.
Since Calculated > Tabulated , therefore the Null hypothesis is rejected and concluded
2 2
that the accident conditions were not the same during the 10 week period.
2. The following figures show the distribution of digits in numbers chosen at random from a
telephone directory.
Digits 0 1 2 3 4 5 6 7 8 9
Frequency 1026 1107 997 966 1075 933 1107 972 964 853
Test whether the digits may be taken to occur equally in the directory.
Solution: Null hypothesis H0: The digits occur equally frequently in the directory.
Alternative hypothesis H1: The digits occur differently frequently in the directory.
Since Calculated > Tabulated , therefore the Null hypothesis is rejected and concluded
2 2
3. A die is thrown 264 times with the following results. Show that the die is biased. [Given
0.05
2
11.07 for 5 d. f ]
Male Births 4 3 2 1 0
Observed frequency 10 55 105 58 12
Solution: Null hypothesis H0: The male and female births are equally probable.
1
i.e, p q
2
The expected frequency x of male births is given by
x 4 3 2 1 0
f(x) 15 60 90 60 15
Let us now apply 2 test to examine the goodness of fit of the given data to the above
Binomail distribution.
Since Calculated 2 < Tabulated 2 , therefore the Null hypothesis is accepted and conclude
that the male and female births are equally probable.
5. 4 coins were tossed 160times and the following results were obtained.
No.of Heads 0 1 2 3 4
Observed frequencies 17 52 54 31 6
Under the assumption that coins are balanced, find the expected frequencies of 0,1,2,3 or 4
heads and test the goodness of fit 0.05 .
6. Fit a Poisson distribution to the following data and for its goodness of fits at level of
significance 0.05?
x 0 1 2 3 4
f(x) 419 352 154 56 19
Solution:
x f f.x
0 419 0
1 352 352
2 154 308
3 56 168
4 19 76
N= f =1000 fx 904
Mean
fi xi 904
0.904
f 1000
e x
Theoretical distribution is given by N p( x) 1000
x!
Hence the theoretical frequencies are given by
e0.904 (0.904) x 1000 0.4049 (0.904) x
f ( x) 1000 (1)
x! x!
Putting x=0,1,2,3,4, we get
x 0 1 2 3 4
f(x) 406.2 366 165.4 49.8 12.6
Now,
n
(Oi Ei )2 (419 406.2) 2 (352 366) 2 (154 165.4) 2 (56 49.8) 2 (19 12.6)2
2
i 1 Ei 406.2 366.6 165.4 49.8 12.6
5.748
Degrees of freedom (d.f)=n-2=5-2=3. Tabulated value at 0.05 with 3 d.f. is 7.82
2
7. A sample analysis of examination results of 500 students was made. It was found that 220
students had failed, 170 had secured a third class, 90 were placed in second class and 20 got a
first class. Do these figures commensurate with the general examination result which is in the
ratio of 4:3:2:1 for the various categories respectively.
Solution: Null Hypothesis: H0: The observed results commensurate with the general
examination results.
Total frequency=500
If we divide the total frequency 500 in the ratio 4:3:2:1, we get the expected frequencies as
4 3 2 1
500 200 ; 500 150 ; 500 100 and 500 50
10 10 10 10
Since Calculated 2 >Tabulated 2 , therefore the Null hypothesis is rejected and conclude
that the observed results are not commensurate with the general examination.
8. A pair of dice is thrown 360 times and the frequency of each sum is indicated below.
Sum 2 3 4 5 6 7 8 9 10 11 12
Frequency 8 24 35 37 44 65 51 42 26 14 14
Would you say that the dice are fair on the basis of the Chi-Square test at 0.05 level of
significance?
https://getcalc.com/statistics-one-tailed-tdistribution-table.htm 1/4
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-one-tailed-tdistribution-table.htm 2/4
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-one-tailed-tdistribution-table.htm 4/4
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-two-tailed-tdistribution-table.htm 1/4
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-two-tailed-tdistribution-table.htm 2/4
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-two-tailed-tdistribution-table.htm 4/4
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-chisquared-distribution-table.htm 1/7
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-chisquared-distribution-table.htm 2/7
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-chisquared-distribution-table.htm 5/7
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
https://getcalc.com/statistics-chisquared-distribution-table.htm 7/7
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
0.6 0.7257 0.7291 0.7324 0.7357 0.7389 0.7422 0.7454 0.7486 0.7517 0.7549
0.7 0.7580 0.7611 0.7642 0.7673 0.7704 0.7734 0.7764 0.7794 0.7823 0.7852
0.8 0.7881 0.7910 0.7939 0.7967 0.7995 0.8023 0.8051 0.8078 0.8106 0.8133
0.9 0.8159 0.8186 0.8212 0.8238 0.8264 0.8289 0.8315 0.8340 0.8365 0.8389
1.0 0.8413 0.8438 0.8461 0.8485 0.8508 0.8531 0.8554 0.8577 0.8599 0.8621
1.1 0.8643 0.8665 0.8686 0.8708 0.8729 0.8749 0.8770 0.8790 0.8810 0.8830
1.2 0.8849 0.8869 0.8888 0.8907 0.8925 0.8944 0.8962 0.8980 0.8997 0.9015
1.3 0.9032 0.9049 0.9066 0.9082 0.9099 0.9115 0.9131 0.9147 0.9162 0.9177
1.4 0.9192 0.9207 0.9222 0.9236 0.9251 0.9265 0.9279 0.9292 0.9306 0.9319
1.5 0.9332 0.9345 0.9357 0.9370 0.9382 0.9394 0.9406 0.9418 0.9429 0.9441
1.6 0.9452 0.9463 0.9474 0.9484 0.9495 0.9505 0.9515 0.9525 0.9535 0.9545
1.7 0.9554 0.9564 0.9573 0.9582 0.9591 0.9599 0.9608 0.9616 0.9625 0.9633
1.8 0.9641 0.9649 0.9656 0.9664 0.9671 0.9678 0.9686 0.9693 0.9699 0.9706
1.9 0.9713 0.9719 0.9726 0.9732 0.9738 0.9744 0.9750 0.9756 0.9761 0.9767
2.0 0.9772 0.9778 0.9783 0.9788 0.9793 0.9798 0.9803 0.9808 0.9812 0.9817
2.1 0.9821 0.9826 0.9830 0.9834 0.9838 0.9842 0.9846 0.9850 0.9854 0.9857
2.2 0.9861 0.9864 0.9868 0.9871 0.9875 0.9878 0.9881 0.9884 0.9887 0.9890
2.3 0.9893 0.9896 0.9898 0.9901 0.9904 0.9906 0.9909 0.9911 0.9913 0.9916
2.4 0.9918 0.9920 0.9922 0.9925 0.9927 0.9929 0.9931 0.9932 0.9934 0.9936
2.5 0.9938 0.9940 0.9941 0.9943 0.9945 0.9946 0.9948 0.9949 0.9951 0.9952
2.6 0.9953 0.9955 0.9956 0.9957 0.9959 0.9960 0.9961 0.9962 0.9963 0.9964
2.7 0.9965 0.9966 0.9967 0.9968 0.9969 0.9970 0.9971 0.9972 0.9973 0.9974
2.8 0.9974 0.9975 0.9976 0.9977 0.9977 0.9978 0.9979 0.9979 0.9980 0.9981
https://getcalc.com/statistics-normal-distribution-table.htm 1/3
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
3.1 0.9990 0.9991 0.9991 0.9991 0.9992 0.9992 0.9992 0.9992 0.9993 0.9993
3.2 0.9993 0.9993 0.9994 0.9994 0.9994 0.9994 0.9994 0.9995 0.9995 0.9995
3.3 0.9995 0.9995 0.9995 0.9996 0.9996 0.9996 0.9996 0.9996 0.9996 0.9997
3.4 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9998
3.5 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998 0.9998
-0.6 0.2743 0.2709 0.2676 0.2643 0.2611 0.2578 0.2546 0.2514 0.2483 0.2451
-0.7 0.2420 0.2389 0.2358 0.2327 0.2296 0.2266 0.2236 0.2206 0.2177 0.2148
-0.8 0.2119 0.2090 0.2061 0.2033 0.2005 0.1977 0.1949 0.1922 0.1894 0.1867
-0.9 0.1841 0.1814 0.1788 0.1762 0.1736 0.1711 0.1685 0.1660 0.1635 0.1611
-1.0 0.1587 0.1562 0.1539 0.1515 0.1492 0.1469 0.1446 0.1423 0.1401 0.1379
-1.1 0.1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 0.1170
-1.2 0.1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985
-1.3 0.0968 0.0951 0.0934 0.0918 0.0901 0.0885 0.0869 0.0853 0.0838 0.0823
-1.4 0.0808 0.0793 0.0778 0.0764 0.0749 0.0735 0.0721 0.0708 0.0694 0.0681
-1.5 0.0668 0.0655 0.0643 0.0630 0.0618 0.0606 0.0594 0.0582 0.0571 0.0559
-1.6 0.0548 0.0537 0.0526 0.0516 0.0505 0.0495 0.0485 0.0475 0.0465 0.0455
-1.7 0.0446 0.0436 0.0427 0.0418 0.0409 0.0401 0.0392 0.0384 0.0375 0.0367
-1.8 0.0359 0.0351 0.0344 0.0336 0.0329 0.0322 0.0314 0.0307 0.0301 0.0294
-1.9 0.0287 0.0281 0.0274 0.0268 0.0262 0.0256 0.0250 0.0244 0.0239 0.0233
-2.0 0.0228 0.0222 0.0217 0.0212 0.0207 0.0202 0.0197 0.0192 0.0188 0.0183
https://getcalc.com/statistics-normal-distribution-table.htm 2/3
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
-2.1 0.0179 0.0174 0.0170 0.0166 0.0162 0.0158 0.0154 0.0150 0.0146 0.0143
-2.2 0.0139 0.0136 0.0132 0.0129 0.0125 0.0122 0.0119 0.0116 0.0113 0.0110
-2.3 0.0107 0.0104 0.0102 0.0099 0.0096 0.0094 0.0091 0.0089 0.0087 0.0084
-2.4 0.0082 0.0080 0.0078 0.0075 0.0073 0.0071 0.0069 0.0068 0.0066 0.0064
-2.5 0.0062 0.0060 0.0059 0.0057 0.0055 0.0054 0.0052 0.0051 0.0049 0.0048
-2.6 0.0047 0.0045 0.0044 0.0043 0.0041 0.0040 0.0039 0.0038 0.0037 0.0036
-2.7 0.0035 0.0034 0.0033 0.0032 0.0031 0.0030 0.0029 0.0028 0.0027 0.0026
-2.8 0.0026 0.0025 0.0024 0.0023 0.0023 0.0022 0.0021 0.0021 0.0020 0.0019
-2.9 0.0019 0.0018 0.0018 0.0017 0.0016 0.0016 0.0015 0.0015 0.0014 0.0014
-3.0 0.0013 0.0013 0.0013 0.0012 0.0012 0.0011 0.0011 0.0011 0.0010 0.0010
-3.1 0.0010 0.0009 0.0009 0.0009 0.0008 0.0008 0.0008 0.0008 0.0007 0.0007
-3.2 0.0007 0.0007 0.0006 0.0006 0.0006 0.0006 0.0006 0.0005 0.0005 0.0005
-3.3 0.0005 0.0005 0.0005 0.0004 0.0004 0.0004 0.0004 0.0004 0.0004 0.0003
-3.4 0.0003 0.0003 0.0003 0.0003 0.0003 0.0003 0.0003 0.0003 0.0003 0.0002
-3.5 0.0002 0.0002 0.0002 0.0002 0.0002 0.0002 0.0002 0.0002 0.0002 0.0002
Grammarly
https://getcalc.com/statistics-normal-distribution-table.htm 3/3
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
AK 20 Regulations
Course Outcomes:
1. Interpret the association of characteristics and through correlation and regression tools.
2. Make use of the concepts of probability and their applications.
3. Apply discrete and continuous probability distributions.
4. Design the components of a classical hypothesis test for large sample.
5. Design the components of a classical hypothesis test for small samples.
UNIT -1
Measures of Central Tendency, Measures of Dispersion,
Correlation,Regression
10 marks
1.Compute the arithmetic mean of the following data:
Roll 1 2 3 4 5 6 7 8 9 10
No.s
Marks(x) 40 50 55 78 58 60 73 35 43 48
Page 1
Page 2
Blind 55 40 40 40 36 22 18 15
.15. Following are the ranks obtained by 10 students in two subjects, Statistics and Mathematics.
To what extent the knowledge of the students in two subjects is related?
Statistics 1 2 3 4 5 6 7 8 9 10
Mathematics 2 4 1 5 3 9 7 10 6 8
16. The ranks of 16 students in Mathematics and Statistics are as follows:
(1,1) (2, 10) (3, 3) (4, 4) (5, 5) (6, 7) (7, 2) (8, 6) (9, 8) (10, 11) (11, 15) (12, 9) (13, 14) (14,
12) (15, 16) (16, 13) . Calculate the rank correlation coefficient for proficiencies of this group in
Mathematics and Statistics
17. Ten competitors in a musical test were ranked by the three judges A, B and C in the
following order.
Ranks 1 6 5 10 3 2 4 9 7 8
by A
Ranks 3 5 8 4 7 10 2 1 6 9
by B
Ranks 6 4 9 8 1 2 3 10 5 7
by C
Using rank correlation method, discuss which pair of judges has the nearest approach to common
likings in music.
18. A sample of 12 fathers and their elder sons gave the following data about their elder sons.
Calculate the rank correlation coefficient
X 65 63 67 64 68 62 70 66 68 67 69 71
Y 68 66 68 65 69 66 68 65 71 67 68 70
19.Find the standard deviation of the following data (use the step deviation method):
Wages 125- 175- 225- 275- 325- 375- 425- 475- 525-
(Rs.) 175 225 275 325 375 425 475 525 575
no. of 2 22 19 14 3 4 6 1 1
workers
20.Goals scored by two teams A and B in foot ball season are as follows.
Page 3
Section of A 40 25 19 80 38 8 67 121 66 76
xi
Section of A 28 70 31 0 14 111 66 31 25 4
yi
22. Calculate Karl Pearson’s coefficient of skewness for the following data: 25, 15, 23, 40, 27,
25, 23, 25, 20.
23. Calculate Karl Pearson’s coefficient of skewness for the following data:
Variable 0-5 5-10 10-15 15-20 20-25 25-30 30-35 35-40
Frequency 2 5 7 13 21 16 8 3
24. From the following distribution, calculate (i) First 4 moment about the mean (ii) Skewness
based on moments (iii) Kurtosis
Income (Rs) 0-10 10-20 20-30 30-40
Frequency 1 3 4 2
2Marks
Problem 1: According to the census of 1991, following are the population figure, in thousands, of
10 cities : 1400, 1250, 1670, 1800, 700, 650, 570, 488, 2100, 1700. Find the median
4. Find the variance and standard deviation of the following data: 5, 12, 3, 18, 6, 8, 2, 10.
5.Find the variance and standard deviation of the following data: 45, 60, 62, 60, 50, 65, 58,
68,44, 48.
6.Calculate the standard deviation for the following distribution:
xi 4 8 11 17 20 24 32
fi 3 5 9 5 4 3 1
7. Calculate the mean and standard deviation for the following frequency distribution:
xi 6 10 14 18 24 28 30
fi 2 4 7 12 8 4 3
Compute the range for the following observation 15, 20, 25, 25, 30, 35
Page 4
8. The following table gives the daily sales (Rs.) of two firms A and B for five days.
Firm A 5050 5025 4950 4835 5140
Firm B 4900 3100 2200 1800 13000
Calculate the mean deviation of the variates 40, 62, 54, 68, 76 from A.M
9. Find the mean deviation from the mean for the following data: 38, 70, 48, 40, 42, 55, 63, 46,
54, 44.
10.Find the mean deviation about the mean for the following data
xi 2 5 7 8 10 35
,f i 6 8 10 6 8 2
11.Find the mean deviation about the median for the following data
xi 6 9 3 12 15 13 21 22
,f i 4 5 3 2 5 4 4 3
12.The following table gives the sales of 100 companies. Find the mean deviation from the
mean.
Sales in thousands 40-50 50-60 60-70 70-80 80-90 90-100
,Number of companies 5 15 25 30 20 5
13.Find the mean deviation about the mean for the following data
Classes 0-100 100-200 200-300 300-400 400-500 500-600 600-700 700-800
Freq. 4 8 9 10 7 5 4 3
:
14.Find the mean deviation from median for the following data
Age of 20-25 25-30 30-35 35-40 40-45 45-50 50-55 55-60
workers
No. of 120 125 175 160 150 140 100 30
workers
21.Define Co-Variance.
22.Define Skewness
Page 5
23.Define kurtosis
PROBABILITY
10 Marks
1. A class consists of 6 girls and 10 boys. If a committee of 3 is chosen at random from the
class, find the probability that (a) 3 boys are selected,(b) exactly two girls are selected.
2. A and B throw alternately with a pair of ordinary dice. A wins if he throws 6 before B throws
7 and B wins if he throws 7 before A throws 6.If A begins , show that his chance of winning is
30/61.
3. A,B and C in order toss a coin. The first one to toss head wins the game. What are the
probabilities of winning, assuming that the game may continue indefinitely.
4. Two cards are selected at random from 10 cards numbered 1 to 10.Find the probability that
the sum is even if (a)The two cards are drawn together. (b)The two cards are drawn one after
other with replacement.
5. A box contains n tickets marked 1 through n. Two tickets are drawn in succession without
replacement. Determine the probability that the number on the tickets are consecutive integers.
6. Determine the probability for each of the following events: A non-defective bolt will be found
if out of 600 bolts already examined 12 were defective.
7. What is the probability of picking an ace and a king from a 52 cards deck?
8. Out of 15 items 4 are not in good condition 4 are selected at random. Find the probability that
(a) All are not good (b) Two are not good
9. Five persons in a group 20 are engineers. If three persons are selected at random, determine
the probability that all engineers and the probability that at least one being an engineer.
10. Three students A,B,C are in running race. A and B have the same probability of winning and
each is twice as likely to win as C. Find the probability that B or C wins.
11. A class has 10 boys and 5 girls. Three students are selected at random one after another.
Find the probability that (a) first two are boys and third is girl (b) first and third are of same sex
and the second is of opposite sex.
Page 6
12. Two aeroplanes bomb a target in succession. The probability of each correctly scoring a hit
is 0.3 and 0.2 respectively. The second will bomb only if the first misses the target. Find the
probability that (a)target is hit (b) both fails to score hits.
13. Box A contains 5 red and 3 white marbles and box B contains 2 red and 6 white marbles. If a
marble is drawn from each box, what is the probability that they are both of same colour.
14. Two marbles are drawn in succession from a box containing 10 red, 30 white,20 blue and 15
orange marbles, with replacement being made after each draw. Find the probability that (a) both
are white (b) first is red and second is white.
15. A can hit a target 3 times in 5 shots, B hits target in 5 shots ,C hits target 3 times in 4 shots.
Find the probability of the target being hit when all of them try.
16. Two dice are thrown. Let A be the event that the sum of the points on the faces is 9.Let B be
the event that at least one number is 6.Find
17. If the probability that a communication system will have high fidelity is 0.81 and the
probability that it will have high fidelity and selectivity is 0.81.What is the probability that a
system with high fidelity will also have high selectivity?
18. Suppose 5 men out of 100 and 25 women out of 10,000 are colour blind. A colour blind
person is choosen at random. What is the probability of the person being a male(Assume male
and female to be in equal numbers)?
19. In a bolt factory machines A,B,C manufacture 20%,30% and 50% of the total of their output
and 6%,3% and 2% are defective. A bolt is drawn at random and found to be defective. Find the
probabilities that it is manufactured from (a)Machine A. (b)Machine B. (c)Machine C.
20. Of the three men,the chances that a politician,a business man or an academician will be
appointed as a vice-chancellor (V.C) of a University are 0.5,0.3,0.2 respectively.Probability that
research is promoted by these persons if they are appointed as V.C are 0.3,0.7,0.8 respectively.
22. Two dice are thrown. Let X assign to each point (a,b) in S the maximum of its numbers i.e.,
X(a,b)=max.(a,b). Find the probability distribution. X is a random variable with X(s) =
{1,2,3,4,5,6}. Also find the mean and variance of the distribution.
(OR)
Page 7
X 1 2 3 4 5 6
P(x) 1/36 3/36 5/36 7/36 9/36 11/36
Find (a) the mean (b) variance (c) P(1<X<6)
x 0 1 2 3 4 5 6 7
2 2 2
P(x) 0 k 2k 2k 3k 𝑘 2𝑘 7𝑘 + 𝑘
(a) Determine K (b) Evaluate P(X<6),P(X>6),P(0<X<5) and P(0<X<4) (c) if P(X<K)
>1/2, find the minimum value of K and, (d) Determine the distribution of X (e) Mean (f)
Variance.
X 0 1 2 3 4 5 6
P(X) K 3k 5k 7k 9k 11k 13k
(a) Find K (b) Find P(X<4), P(X>5), P(3<X<6) (c) what will be the minimum value of k so
that P(X<2)>0.3 ?
𝑥𝑖 -3 -2 -1 0 1 2 3
𝑃(𝑥𝑖 ) K 0.1 K 0.2 2K 0.4 2K
Find (a) K (b) Mean (c) Variance
26. From a lot of 10 items containing 3 defectives, a sample of 4 items is drawn at random.Let
the random variable X denote the number of defective items in the sample. Find the probability
distribution of X when the sample is drawn without replacement.
27. Let X denote the minimum of the two numbers that appear when a pair of fair dice is thrown
once. Determine the
28. A sample of 4 items is selected at random from a box containing 12 items of which 5 are
defective. Find the expected number E of defective items.
29. A fair coin is tossed until a head or five tails occurs. Find the expected number E of tosses of
the coin.
30. Find the mean of the probability distribution of the number of heads obtained in three flips
of a balanced coin.
Page 8
𝑓 (𝑥 ) = 𝑐𝑒 −|𝑥| , −∞ < 𝑥 < ∞. Show that c=1/2 and find that the mean and variance of the
distribution. Also find the probability that the variate lies between 0 and 4.
1
𝑠𝑖𝑛𝑥 , 𝑓𝑜𝑟 0 ≤ 𝑥 ≤ 𝜋
3. Probability density function of a random variable X is 𝑓 (𝑥 ) = {2 .
0, 𝑒𝑙𝑠𝑒𝑤ℎ𝑒𝑟𝑒
Find the mean,mode and median of the distribution and also find the probability between
𝜋
0 and ?
2
4. A continuous random variable X has the distribution function
0, 𝑖𝑓 𝑥 ≤ 1
𝑓 (𝑥 ) = {𝑘 (𝑥 − 1)4 , 𝑖𝑓 1 < 𝑥 ≤ 3. Determine (i) f(x) (ii) k (iii) Mean
1, 𝑖𝑓 𝑥 > 3
8. The daily consumption of electric power (in millions of kW-hours) is a random variable
1 𝑥
𝑥𝑒 − ⁄3 , 𝑥 > 0
having the probability density function 𝑓 (𝑥 ) = {9 . If the total production is
0, 𝑥 ≤ 0
12 million kW-hours, determines the probability that there is power cut on any given day.
𝑒 −𝑥 , 𝑥 ≥ 0
9. The density function of a random variable X is 𝑓(𝑥 ) = { . Find E(X), E(X2),
0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
Var(X).
10. The cumulative distribution function for a continuous random variable X is
1 − 𝑒 −2𝑥 , 𝑥 ≥ 0
𝐹 (𝑥 ) = { . Find (i) the density function f(x) (ii) Mean (iii) Variance of the
0, 𝑥 < 0
density function.
BINOMIAL DISTRIBUTION
Page 9
1. 10 coins are thrown simultaneously. Find the probability of getting at least (i) 7 heads (ii)
6 heads
2. 2 dice are thrown 5 times. Find the probability of getting 7 as sum (i) at least once (ii)
two times (iii) p(1<X<5)
3. In 256 sets of 12 tosses of a coin, how many cases one can expect 8 heads and 4 tails
4. Out of 800 families with 4 children each, how many families would be expected to have
(a) 2 boys and 2 girls (b) at least one boy (c) no girl (d) at most 2 girls? Assume equal
probabilities for boys and girls
5. In a binomial distribution consisting of 5 independent trails, probabilities of 1 and 2
success are 0.4096 and 0.2048 respectively. Find the parameter p of the distribution.
6. Fit a binomial distribution to the following data
x 0 1 2 3 4 5
f 2 14 20 34 22 8
7. The mean of binomial distribution is 3 and variance is 9/4. Find (i) the value of n (ii) p(x
≥ 7) (iii) p(1 ≤ X < 6)
8. The probability that the life of a bulb is 100 days is 0.05. find the probability that out of 6
bulbs (i) at least one (ii) greater than 4 (iii) none, will be having a life of 100 days
POISSON DISTRIBUTION
1. A car-hire firm has two cars which it hires out day by day. The number of demands for a
car on each day is distributed as a poisson distribution with mean 1.5. Calculate the
proportion of days (a) on which there is no demand (b) on which demand is refused.
2. A hospital switch board receives an average of 4 emergency calls in a 10 minute interval.
What is the probability that (a) there are at most 2 emergency calls in a 10 minute interval
(b) there are exactly 3 emergency calls in a 10 minute interval.
3. If a random variable has a poisson distribution such that P(1)=P(2), find (a) mean of the
distribution (b) P(4) (c) P(x>1) (d) P(1<x<4).
4. The average number of phone calls / minute coming into a switch board between 2 p.m
and 4 p.m is 2.5. Determine the probability that during one particular minute there will be
(a) 4 or fewer (b) more than 6 calls.
Page
10
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
5. Average number of accidents on any day on a national highway is 1.8. Determine the
probability that the number of accidents are (a) at least one (b) at most one
6. If X is a poisson variate such that P(x=0)=P(x=1),find P(x=0) and using recurrence
formula find the probabilities at x=1,2,3,4 and 5.
7. If the variance of a poisson variate is 3,then find the probability that (a) x=0 (b) 0<x<3
(c) 1<x<4.
8. If X is a poisson variate such that 3P(x=4)=1/2 P(x=2) + P(x=0), find (a) the mean of x
(b) P(x<2)
9. Wireless sets are manufactured with 25 soldered joints each. On the average 1 joint in
500 is defective. How many sets can be expected to be free from defective joints in a
consignment of 10,000 sets.
NORMAL DISTRIBUTION
10. If X is a normal variate with mean 30 and standard deviation 5. Find the probabilities
that (a) 26<X<40 (b) X>45
11. In a Normal distribution, 7% of the items are under 35 and 89% are under 63. Determine
the mean and variance of the distribution.
(OR)
Find the mean and standard deviation of a normal distribution in which 7% of items are
under 35 and 89% are under 63.
12. The mean and standard deviation of the marks obtained by 1000 students in an
examination are respectively 34.5 and 16.5. Assuming the normality of the distribution,
find the approximate number of students expected to obtain marks between 30 and 60.
13. The marks obtained in mathematics by 1000 students is normally distributed with mean
78% and standard deviation 11%.Determine
(a) How many students got marks above 90%
(b) What was the highest mark obtained by the lowest 10% of the students
(c) Within what limits did the middle of 90% of the students lie.
14. Suppose the weights of 800 male students are normally distributed with mean µ=140
pounds and standard deviation 10 pounds. Find the number of students whose weights are
(a) between 138 and 148 pounds (b) more than 152 pounds.
Page
11
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
15. The marks obtained in statistics in a certain examination found to be normally distributed.
If 15% of the students > 60 marks,40% < 30 marks, find the mean and standard deviation.
16. A sales tax officer has reported that the average sales of the 500 business that he has to
deal with during a year is Rs. 36,000 with a standard deviation of 10,000. Assuming that
the sales in these business are normally distributed,find
(a) the number of business as the sales of which are Rs.40,000.
(b) the percentage of business the sales of which are likely to range between
Rs.30,000 and Rs.40,000.
17. If the masses of 300 students are normally distributed with mean 68 kgs and standard
deviation 3kgs, how many students have masses
(a) greater than 72 kg
(b) less than or equal to 64 kg
(c) between 65 and 71 kg inclusive.
18. Given that the mean height of students in a class is 158cms with standard deviation of
20cms. Find how many students heights lie between 150cms and 170cms, if there are 100
students in the class.
2marks
1. A box contains n tickets marked 1 through n. Two tickets are drawn in succession without
replacement. Determine the probability that the number on the tickets are consecutive integers.
2. Determine the probability for each of the following events: A non-defective bolt will be found
if out of 600 bolts already examined 12 were defective.
3. What is the probability of picking an ace and a king from a 52 cards deck?
4. Out of 15 items 4 are not in good condition 4 are selected at random. Find the probability that
(a) All are not good (b) Two are not good.
5. Let X denote the number of heads in a single toss of 4 fair coins. Determine (a) P(X<2) (b)
P(1<X<3).
6. The mean and variance of a binomial distribution are 4 and 4/3 respectively. Find p(x ≥ 1).
Page
12
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
7. Using recurrence formula find the probabilities when x=0,1,2,3,4 and 5;if the mean of poisson
distribution is 3.
8. Find the probability of getting an even number 3 or 4 or 5 times in throwing 10 dice. Using
binomial distribution.
9. Determine the probability that getting an even number on face 3 to 5 times in throwing 10
dice together.
UNIT-IV
TEST OF HYPOTHESIS
1.A sample of 64 students have a mean weight of 70 kgs. Can this be regarded as a sample from a
population with mean weight 56 kgs and standard deviation 25kgs.
2.A sample of 900 members has a mean of 3.4 cms and S.D 2.61 cms. Is this sample has been taken
from a large population of mean 3.25 cm and S.D 2.61 cms. If the population is normal and its mean
is unknown find the 95% fiducial limits of true mean.
3.A sample of 400 items is taken from a population whose standard deviation is 10. The mean of the
sample is 40. Test whether the sample has come from a population with mean 38. Also calculate
95% confidence interval for the population.
4.An ambulance service claims that it takes on the average less than 10 minutes to reach its
destination in emergency calls. A sample of 36 calls has a mean of 11 minutes and the variance of 16
minutes. Test the claim at 0.05 level significance.
5.It is claimed that a random sample of 49 tyres has a mean life of 15200 km. This sample was drawn
from a population whose mean is 15150 kms and a standard deviation of 1200km. Test the
significance at 0.05 level.
6.The means of two large samples of sizes 1000 and 2000 members are 67.5 inches and 68.0 inches
respectively. Can the samples be regarded as drawn from the same population of S.D 2.5 inches.
7.A researcher wants to the intelligence of students in a school. He selected two groups of students.
In the first group there 150 students having mean IQ of 75 with a S.D. Of 15 in the second group
there are 250 students having men IQ of 70 with S.D. of 20.
8.The mean life of a sample of 10 electric bulbs ( or motors ) was found to be 1456 hours with S.D.
Of 423 hours. A second sample of 17 bulbs chosen from a different batch showed a mean life 1280
hours with S.D. of 398 hours. Is there a significant difference between the means of two batches.
Page
13
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
9.The mean height of 50 male students who participated in sports is 68.2 inches with a S.D of 2.5.
The mean height of 50 male students who have not participated in sport is 67.2 inches with a S.D of
2.8. Test the hypothesis that the height of students who participated in sports is more than the
students who have not participated in sports.
10.A manufacturer claimed that at least 95% of the equipment which he supplied to a factory
conformed to specifications. An examination of a sample of 200 pieces of equipment revealed that
18 were faculty. Test his claim at 5% level of significance.
11.In a sample of 1000 people in Karnataka 540 are rice eaters and the rest are wheat eaters. Can
we assume that both rice and wheat are equally popular in this state at 1% level of significance.
12.In a big city 325 men out of 600 men were found to be smokers. Does this information support
the conclusion that the majority of men in this city are smokers?
13.Experience had shown that 20% of a manufactured product is of the top quality. In one day’s
production of 400 articles only 50 are of top quality. Test the hypothesis at 0.05 level.
14.In a sample of 500 from a village in Rajasthan, 280 are found to be wheat eaters and the rest rice
eaters. Can we assume that the both articles are equally popular.
15.20 people were attacked by a disease and only 18 survived. Will you reject the hypothesis that
the survival rate if attacked by this disease is 85% in favour of the hypothesis that is more at 5%
level.
16.A random sample of 500 apples was taken from a large consignment of 60 were found to be bad,
obtain the 98% confidence limits for the percentage number of bad apples in the consignment.
17.Random samples of 400 men and 600 women were asked whether they would like to have a
flyover near their residence. 200 men and women in favour of the proposal are same, at 5% level.
18.On the basis of their total scores, 200 candidates of a civil service examination are divided into
two groups, the upper 30% and the remaining 70%. Consider the first question of the examination.
Among the first group, 40 had the correct answer, whereas among the second group, 80 had the
correct answer. On the basis of these results, can one conclude that the first question is not good at
discriminating ability of the type being examined here?
19.In two large populations, there are 30%, and 25% respectively of fair haired people. Is this
difference likely to be hidden in samples of 1200 and 900 respectively from the two populations.
20.In a random sample of 1000 persons from town A, 400 are found to be consumers of wheat. In a
sample of 800 from town B, 400 are found to be consumers of wheat. Do these data reveal a
significant difference between town A and town B, so far as the proportion of wheat consumers is
concerned?
Page
14
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
21.Before an increase on excise duty on tea 500 people out of a sample of 900 found to have the
habit of having tea. After an increase on excise duty 250 are have the habit of having tea among
1100. Is there any decrease in the consumption of tea. Test at 5% level.
2Marks
1Find the value of the finite population correction factor for n=10 and N=100.
2.A random sample of size 81 was taken whose variance is 20.25 and mean is 32, construct 98%
confidence interval .
3.In a random sample of 100 packages shipped by air freight 13 had some damage. Construct 95%
confidence interval for the true proportion of damage package.
9.Write the confidence limits for (i) single mean (ii) Difference of Means (iii)Single Proportion (iv)
Difference of Proportion.
UNIT-V
Page
15
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
2> A random of sample of size 25 from a normal population has the mean𝑥 =4.75 and the standard
deviation S = 8.4 .Does this information tend to support or refute the claim that mean of the population
is µ = 42.5?
3> Ten bearings made by a certain process have a mean diameter of 0.5060cm with a standard deviation
of 0.0040cm. Assuming that the data may be taken as a random sample from a normal distribution,
construct a 95% confidence interval for the actual average diameter of the bearings?
4> A sample of size 10 was taken from a population S.D of sample is 0.03.Find maximum error with 99%
confidence.
5>A sample of 11 rats from a central population had an average blood viscosity of 3.92 with a S.D of0.61.
Estimate the 95% confidence limits for the mean blood viscosity of the population.
6>A mechanist is making engine parts with axle diameters of 0.700inch.A random sample of 10 parts
shows a mean diameter of 0.742inch with a S.D of 0.040inch. Compute the statistics you would use to
test whether the work is meeting the specification at 0.05 level of significance.
7>A sample of 26 bulbs gives a mean life of 990 hours with a S.D of 20 hours. The manufacturer claims
that the mean life of bulbs is 1000 hours. Is the sample not upto the standard .
8> The average breaking strength of the steel rods is specified to be 18.5 thousand pounds. To test this
sample of 14 rods were tested .The mean and standard deviations obtained were 17.85 and 1.955
respectively. Is the result of experiment significant?
9>> A random of sample of size 16 values from a normal population showed a mean of 53 and a sum of
squares of deviations from the mean equals to 150. Can this sample be regarded as taken from
population having 50 as mean? Obtain 95% confidence limits of the mean of the population.
10>A random of sample of six steel beams has a mean compressive strength of 58,392 p.s.i (pounds per
square inch) with a standard deviation of 648 p.s.i. Use this information and the level of significance α =
0.05 to test whether the true average compressive strength of the steel from which this sample came is
58,000 p.s.i. Assume normality.
11> A random of sample of 10 boys had the following I.Q’s : 70,120,110,101,88,83,95,98,107 and 100.
(𝑎)Do these data support the assumption of a population mean I.Q of 100
(𝑏)Find the reasonable range in which most of the mean I.Q values of samples of 10 boys lie.
11>Producer of ‘gutkha’, claims that the nicotine content in his gutkha on the average is 1.83mg.Can this
claim accepted if a random sample of 8 gutkha of this type have the nicotine contents of
2.0,1.7,2.1,1.9,2.2,2.1,2.0,1.6mg? Use a 0.05 level of significance.
12>Two horses A and B were tested according to the time (in sec) to run a particular track with the
following results.
Horse A 28 30 32 33 33 29 34
Page
16
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
Horse B 29 30 30 24 27 29
Test whether two horses have the same running capacity.
13>To examine the hypothesis that the husbands are more intelligent than the wives, an investigator
took a sample of 10 couples and administered them a test which measures the I.Q. The results are as
follows:
14>To compare two kinds of bumper guards,6 of each kind were mounted on a car and then the car was
run into a concrete wall. The following are the costs of repairs.
15>Scores obtained in a shooting competition by 110 soliders before and after intensive training are
given below:
Before 67 24 57 55 63 54 56 68 33 43
After 70 38 58 58 56 67 68 75 42 38
Test whether the intensive training is useful at 0.05 level of significance.
16>The blood pressure of 5 women before and after intake of a certain drug are given below:
17>Memory capacity of 10 students were tested before and after training. State whether the training
was effective or not from the following scores.
Before 12 14 11 8 7 10 3 0 5 6
training
After 15 16 10 7 5 12 10 2 3 8
training
18>In one sample of 8 observations from a normal population, the sum of the squares of deviations of
the sample values from the sample mean is 84.4 and in another sample of 10 observations it was 102.6.
Test at 5% level whether the populations have the same variance.
19>The nicotine contents in milligrams in two samples of tobacco were found to be as follows:
Sample A 24 27 26 21 25 —
Page
17
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
Sample B 27 30 28 31 22 36
Can it be said that the two samples have come from the same normal population?
20>The measurements of the output of the two units have given the following results. Assuming that
both samples have been obtained from the normal populations at 10% significant level, test whether the
two populations have the same variance.
21>In one sample of 10 observations, the sum of the squares of the deviations of the sample values
from sample mean was 120 and in the other sample of 12 observations, it was 314. Test whether the
difference is significant at 5% level?
22> Two independent samples of 8 and 7 items respectively had the following values of the variables
Sample I 9 11 13 11 16 10 12 14
SampleII 11 13 11 14 10 8 10 —
Do the estimates of the population variance differ significantly.
23>The number of automobile accidents per week in a certain community are as follows:
12,8,20,2,14,10,15,6,9,4. Are these frequencies in agreement with the belief that accident conditions
were the same during this 10 week period.
24>A die is thrown 264 times with the following results. Show that die is biased. [Given χ20.05 =11.07 for 5
d.f]
25> A pair of dice are thrown 360 times and the frequency of each sum is indicated as below:
Sum 2 3 4 5 6 7 8 9 10 11 12
Frequency 8 24 35 37 44 65 51 42 26 14 14
Would you say that the dice are fair on the basis of the chi-square test at 0.05 level of significance?
26>4 coins were tossed 160 times and the following results were obtained:
No. of heads 0 1 2 3 4
Observed frequencies: 17 52 54 31 6
Under the assumption that coins are balanced, find the expected frequencies of 0,1,2,3 or 4 heads, and
test the goodness of fit (α= 0.05).
27> Fit a poisson distribution to the following data and for its goodness of fit at level of significance
0.05?
Page
18
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
x 0 1 2 3 4
f 419 352 154 56 19
28>The following table gives the classification of 100 workers according to gender and nature of work.
Test whether the nature of work is independent of the gender of the worker.
29>Given the following contingency table for hair colour and eye colour. Find the value of χ2 . Is there
good association between the two?
Hair colour
Fair Brown Black Total
Eye colour Blue 15 5 20 40
Grey 20 10 20 50
Brown 25 15 20 60
Total 60 30 60 150
30> From the following data find whether there is any significant liking in the habit of taking soft drinks
among the categories of employees.
Employees
Soft Drinks Clerks Teachers Officers
Pepsi 10 25 65
Thumsup 15 30 65
Fanta 50 60 30
31> In an investigation on the machine performance, the following results are obtained:
32>A firm manufacturing rivets wants to limit variations in their length as much as possible. The lengths
(in cms) of 10 rivets manufactured by a new process are
Page
19
Downloaded by Palla Srija (srijapalla01@gmail.com)
lOMoARcPSD|37321784
Examine whether the new process can be considered superior to the old if the old population has
standard deviation 0.145 cm?
2Marks
Page
20
Downloaded by Palla Srija (srijapalla01@gmail.com)