Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
269 views

Module 4

The document discusses probability distributions, including the Poisson and normal distributions. It provides examples and practice problems related to these distributions. It also covers topics like sampling distribution, test of hypotheses, and chi-square tests.

Uploaded by

Niramay K
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
269 views

Module 4

The document discusses probability distributions, including the Poisson and normal distributions. It provides examples and practice problems related to these distributions. It also covers topics like sampling distribution, test of hypotheses, and chi-square tests.

Uploaded by

Niramay K
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 22

1

Notes and Practice Problem Set on Module 4

Module-4: Probability Distribution and Sampling Theory

4.1 Probability Distribution: Poisson and Normal distribution

4.2 Sampling distribution Test of Hypothesis, Level of Significance, Critical


region, One-tailed, and two-tailed test, Degree of freedom.

4.3 Students’ t-distribution (Small sample). Test the significance of mean


and Difference between the means of two samples. Chi-Square Test: test of
goodness of fit and independence of attributes, Contingency table.

In a Random Experiment, all the possible outcomes are known, however,


its exact outcome cannot be precisely predicted in advance. There are
certain terms associated with random experiments that are given as follows:

Outcome: An outcome is a possible result of the random experiment.

Sample space: A sample space can be defined as the list of all possible
outcomes of a random experiment.

Event: An event is a possible outcome of an experiment and forms a subset


of the sample space.

Trial: When a random experiment is repeated many times each one is


known as a trial.

An example of random experiment: Suppose a coin is tossed. The two


possible outcomes are getting a head or a tail. The outcome of this
experiment cannot be predicted before it has been performed. Furthermore,
it can be conducted many times under the same conditions. Thus, tossing a
coin is an example of a random experiment.

Random variable is a variable that is used to quantify the outcome of a


random experiment.

Example-1: Suppose 2 dice are rolled and the random variable, X, is used to
represent the sum of the numbers. Then, the smallest value of X will be
equal to 2 (1 + 1), while the highest value would be 12 (6 + 6). Thus, X could
take on any value between 2 to 12 (inclusive)

Prof. Divesh Singh KJSIT, Sion


2

Types of Random Variables

Examples of discrete variables include:

Years of schooling

Number of goals made in a soccer match

The number of workers in an office

The number of steps you take in a day

The number of babies born each day

Votes for a particular politician

Number of times a coin lands on heads after ten coin tosses.

Examples of continuous variables include:

Prof. Divesh Singh KJSIT, Sion


3

The time it takes for office employees to commute to work

The distance you walk in a day

The weight of baby elephants

The body temperature of patients with the flu

Probability distribution for a discrete random variable

The probability distribution for a discrete random variable X is a formula,


table, or graph that gives all the possible values of x and the probability P (x)
associated with each value of X.

Poisson Distribution
𝑒 −𝑚 (𝑚)𝑥
𝑃(𝑋 = 𝑥) = , where m is mean
𝑥!

Problems

1. The mean and variance of the Poisson distribution is 2. Write down


𝑒 −2 (2)𝑥
the distribution [Ans: 𝑃(𝑋 = 𝑥) = , 𝑥 = 0,1,2,3 ….]
𝑥!
2. In a Poisson distribution P(x=3) is 2/3 of P(x=4) . Find the mean and
standard deviation. [Ans: 6,√6]
3. If X is Poisson variate such that 𝑃(𝑋 = 1) = 𝑃(𝑋 = 2), find 𝐸(𝑋 2 ).
[Ans: 6]
4. If the mean of Poisson distribution is 2. Find the probabilities of
x=1,2,3,4 from the recurrence relation of probability.
5. Assume that the probability of an individual coal miner being killed in
a mine accident during a year is 1/2400 . Use appropriate statistical
distribution to calculate the probability that in a mine employing 200
miners there will be at least one fatal accident every year.
[Ans:1/12,0.08]
6. Suppose that a local appliances shop has found from experience that
the demand for the tube lights is roughly distributed as Poisson with
mean of 4 tube lights per week. If the shop keeps 6 tube lights during
a particular week. What is the probability that the demand will exceed
the supply during that week. [Ans: 0.0183]
7. If the probability that an individual suffers a bad reaction from a
particular injection is 0.001, determine the probability that out of
2000 individuals (i) exactly three (ii) more than 2 individuals will
suffer a bad reaction. [Ans: (i) 0.1804 (ii) 0.3233]
8. Fit a Poisson distribution to the following data
X: 0 1 2 3 4 Total
f: 192 100 24 3 1 320
[Ans: m=0.5, Frequencies: 194,97,24,4,1]

Prof. Divesh Singh KJSIT, Sion


4

9. Fit a Poisson distribution to the following data


X: 0 1 2 3 4 Total
f: 211 90 19 5 0 325
[Ans: m=0.44, Frequencies: 209,92,20,3,1]
10. Fit a Poisson distribution to the following data
X 0 1 2 3 4 5 6 7 8
f 56 156 132 92 37 22 4 0 1
[Ans: Frequencies: 70,137,135,89,44,17,6,2,0]
11. Fit a Poisson distribution to the following data
X 0 1 2 3 4 5
f 142 156 69 27 5 1
[Ans: Frequencies: 147,147,74,24,6,2]
Normal Distribution

The normal distribution, also known as the Gaussian distribution is a


continuous probability distribution that is symmetrical around its mean.

Normal probability density function

𝟏 𝟏 𝒙−𝒎 𝟐
𝒆 𝟐 𝝈 )
− (
𝑓(𝑥) =
√𝟐𝝅𝝈𝟐
Where the parameter m is the mean and the parameter σ is the standard
deviation

Properties

The normal distribution is a continuous distribution that is:

 bell-shaped

 unimodal

 symmetric

 asymptotic

 area under the curve is equal to 1

 The mean, median, and mode are all equal

 Half of the population is less than the mean and half is greater than
the mean

Prof. Divesh Singh KJSIT, Sion


5

X∼N (100, 15): Random variable X is normally distributed with mean 100
and standard deviation 15.

Normal curve with two different means and same standard deviation

Normal curves with two different standard deviations and same mean

Prof. Divesh Singh KJSIT, Sion


6

There are an infinite number of normal distribution models, as m can take


on any real number and σ any positive real number. In order to make
finding probabilities associated with the normal distribution easier, we
generally compute what is known as z -score.

Calculation of z-score
𝒙−𝒎
𝒛= 𝝈

Suppose random variable X which is normally distributed has mean 20 and


variance 5. If X takes two values 28 and 35 then corresponding z-scores are

x − m 28 − 20 8
z= = = = 1.6
σ 5 5

Prof. Divesh Singh KJSIT, Sion


7

x − m 35 − 20 15
z= = = =3
σ 5 5

 about 68% of a normal distribution is within 1 standard deviation of the


mean

 about 95% of a normal distribution is within 2 standard deviations of


the mean

 Almost the entire curve (99.7%) of the normal distribution is within 3


standard deviations of the mean.

Prof. Divesh Singh KJSIT, Sion


8

Prof. Divesh Singh KJSIT, Sion


9

Problems:

1. For a normal variate with mean 2.5 and standard deviation 3.5 , find
the probability that
(i) 2 ≤ 𝑋 ≤ 4.5 (ii) −1.5 ≤ 𝑋 ≤ 5.5 [Ans: (i) 0.2714 (ii) 0.6610]
2. If X is a normal variate with mean 10 and standard deviation 4 , find
(ii) 𝑃(5 ≤ 𝑋 ≤ 18) (ii) 𝑃(𝑋 ≤ 12) (iii) 𝑃(|𝑋 − 14| < 1)

[Ans: (i) 0.8716 (ii) 0.6914 (iii) 0.6826]

3. If Z is standard normal variate , find c such that


(i) 𝑃(−𝑐 ≤ 𝑧 ≤ 𝑐)=0.95 (ii) ) 𝑃(|𝑧| > 𝑐) = 0.01

If X is a normal variate with mean 120 and standard deviation 10,

find c such that (𝑖𝑖𝑖)𝑃(𝑋 > 𝑐) = 0.02 (iv) 𝑃(𝑋 < 𝑐) = 0.05

[Ans: (i) 1.96 (ii) 2.58 (iii) 140.5 (iv) 103.6]

4. If X is normal variate with mean 5 and standard deviation 3. What is


the probability that the deviation from the mean of an item taken at
random will be negative? [Ans: 0.0575]
5. If X is normally distributed with mean and standard deviation 4 , find
(iii) 𝑃(5 ≤ 𝑋 ≤ 10) (ii) 𝑃(𝑋 ≥ 15) (iii) 𝑃(10 ≤ 𝑋 ≤ 15) (iv) 𝑃(𝑋 ≤ 5)

[Ans: (i) 0.3326 (ii) 0.003 (iii) 0.1557 (iv) 0.05987]

6. The daily sales of a firm are normally distributed with mean ₹ 8000
and variance of ₹ 10,000 (i) what is the probability that on a certain
day the sales will be less than ₹8210? (ii) What is % of days on which
the sales will be between ₹ 8100 and ₹ 8200?
[Ans: (i) 0.5832 (ii) 14%]
7. The mean height of soldiers is 68.22” with variance 10.8”. Find the
expected number of soldiers in a regiment of 1000 whose height will
be more than 6” (Area from z=0 to z=1.15 is 0.3749)[Ans: 125]
8. A sample of 100 dry battery cell is tested to find the length of life ,
produced the following results. 𝑋̅ = 12 ℎ𝑜𝑢𝑟𝑠 and 𝜎 = 3 ℎ𝑟𝑠 .
Assuming normal distribution what percentage of cells is expected to
have life (i) more than 15 hrs (ii) between 10 and 14 hrs . [Ans: (i)
15.87% (ii) 49.72%]
9. The income distribution of workers in a certain factory was found to
normal with mean of ₹ 500 and standard deviation ₹ 50. There were
228 persons above ₹600. How many persons were there in all? (Area

Prof. Divesh Singh KJSIT, Sion


10

under the S.N curve between height at 0 and 2 is 0.4772) [Ans:


10,000]
10. The life of army shoes is normally distributed with mean 8
months and standard deviation 2 months . If 5000 pairs are issued ,
how many pairs would be expected to need replacement after 12
months . [Ans: 2386]
11. In an intelligence test administered to 1000 students the
average was 42 and standard deviation was 24. Find the number of
students (i) exceeding 50 (ii) between 30 and 54 (iii) the least score of
top 100 students. [Ans: (i) 371 (ii)383 (iii) 72.72 ]
12. The marks obtained by students in a class are normally
distributed with mean 75 and standard deviations. If top 5% got grade
A and bottom 25% got grade B, what are the marks of the lowest of A
and what are the marks of highest of B? Also find the percentage of
the students who got marks between 60 and 70.
[Ans:(i) 83(ii)72(iii)15.74 ]
13. The local authorities in a certain city installed 10,000 electric
lamps in the streets of the city. If these lamps have average life of
1000 burning hrs with a standard deviation of 200 hrs , what number
of lamps might be expected to fail (i) in the first 800 hrs (ii) between
800 and 1200 hrs ?
After what period of burning hrs would you expect that (i) 10% of lamp
would fail (ii) 10% of lamp would be still burning? (The area between
the ordinates corresponding to S.N.V z=0 and z=1 is 0.34134 and 80%
of the area lies between the ordinate corresponding to𝑧 = ±1.25)
[Ans: (i) 1587 (ii) 6827 (iii) 750 hrs 9 (iv) 1250 hrs ]
14. In a test of 2000 electric bulbs , it was found that the life of a
particular make was normally distributed with an average life of 2040
hrs and standard deviation of 60 hrs . Estimate the number of bulbs
likely to burn for (i) more than 2150 hrs (ii) less than 1950 hrs
[Ans: (i)67 (ii) 184 ]
15. Assuming that the diameter of 100 brass plugs taken
consecutively from a normal distribution with mean 0.7515cm. and
standard deviation 0.0020 cm . how many plugs are likely to be
rejected if the approved diameter is 0.752±0.004 cms ? [Ans: 53]
16. The marks of 1000 students of a university are found to be
normally distributed with mean 70 and standard deviation 5 .
Estimate the number of students whose marks will be (i) between 60
and 75 (ii) more than 75 (iii) less than 68 [Ans: (i) 910 (ii) 23 (iii) 37]
17. In a normal distribution 31 % items are under 45 and 8% are
over 64. Find the mean and standard deviation. Find also the
percentage of items lying between 30 and 75.(Given : S.N.V.Z area
from z=0 to z=0.5 is 0.19 and that from z=0 to z=1.4 is 0.42 )

Prof. Divesh Singh KJSIT, Sion


11

[Ans: m=50 , σ = 10: 0.957]


18. Marks obtained by students in an examination follow a normal
distribution. If 30% of students got below 35 marks and 10% got
above 60 marks. Find the mean and variance of the normal
distribution.% of students who got marks between 40 and 50 .
[Ans: m=42.25 , σ = 13.81: 28%]
19. Suppose that the length in hrs , say X of light bulbs
manufactured by accompany A are normally distributed with mean
800 hrs and standard deviation of 120 hrs and those of B with mean
850 hrs and standard deviation of 50 hrs . One bulb is selected from
the production of each company and is burned till death .Find the
probability that the length of life of the bulb from company A exceeds
the length of the bulb from the company B at least by 15 hrs . [Ans:
0.2979]
20. The marks obtained by students in a college are normally
distributed with mean 65 and variance 25. If 3 students are selected
at random from this college what is the probability that at least one of
them would have scored more than 75 marks? [Ans: 0.07]
21. A manufacturer knows from his experience that the resistant of
resisters he produces is normal with mean 𝜇 = 100 𝑜ℎ𝑚𝑠 and standard
deviation σ = 2 𝑜ℎ𝑚𝑠 .What percentage of resisters will have
resistance between 98 𝑜ℎ𝑚𝑠 and 102 𝑜ℎ𝑚𝑠 ?[Ans: 68.26%]
22. If 𝑋1 and 𝑋2 are two independent random variates with mean 30
and 25 and variance 16 and 12 and if 𝑌 = 3𝑋1 − 2𝑋2 ,
find 𝑃(60 ≤ 𝑌 ≤ 80) [Ans: 0.0730]
23. In an examination marks obtained by students in Mathematics,
Physics and Chemistry are normally distributed with mean 51,53,and
46 with SD 15,12,16 respectively . Find the probability of securing
total marks (i) 180 or above (ii) 80 or below. [Ans: 0.1151 ,0.0082]
24. In a competitive examination the top 15% of the students
appeared will get grade A, while the bottom 20% will be declared fail. If
the grades are normally distributed with mean % of marks 75 and SD
10 , determine the lowest % of marks to receive grade A and the lowest
% of marks that passes . [85.4,66.6]
25. In a distribution exactly normal 7% of items are under 35 and
89% of the items are under 63. Find the probability that an item
selected at random lies between 45 and 56 . [Ans: 0.4038]

Prof. Divesh Singh KJSIT, Sion


12

Statistics: When we select a random sample from a population, the


numerical descriptive measures [mean, median, mode, variance, SD etc.] we
calculate from the sample are called statistics.

Parameters are numbers that describe the numerical descriptive measures


of entire populations.

Sampling Distribution:

The sampling distribution of a statistic is the probability distribution for


the possible values of the statistic that results when random samples of size
n are repeatedly drawn from the population.

The sampling distribution of a statistic is the probability distribution of that


statistic.

Hypothesis testing:

Making a decision about the value of a parameter based on some


preconceived idea about what its value might be.

A statistical test of hypothesis consists of five parts:

1. The null hypothesis, denoted by 𝑯𝟎

2. The alternative hypothesis, denoted by 𝑯𝒂 or 𝑯𝟏

3. The test statistic and its p-value

4. The significance level and the rejection region

5. The conclusion

Prof. Divesh Singh KJSIT, Sion


13

Level of Significance:

Prof. Divesh Singh KJSIT, Sion


14

t- test was developed by William Sealy Gosset in 1908

Assumptions for t-test:

1. Sample size should be small i.e 𝑛 ≤ 30.


2. Population is normally distributed
3. Sample should be random.

Student’s t-distribution (Small Sample)

If 𝑋̅ is the sample mean and 𝜇 is the population mean then


̅ −𝝁
𝑿
𝒕 = 𝒔/
√𝒏

̅ : Sample mean
𝑿

𝝁: Population mean

𝒔: sample standard deviation

n: sample size

Standard deviation of population can be calculated by


̅ )𝟐
∑(𝑿𝒊 −𝑿
𝒔𝟐 = , where n-1 is degree of freedom
𝒏−𝟏

Type-1 Testing the hypothesis that the population mean is 𝝁

Prof. Divesh Singh KJSIT, Sion


15

Problems:

1. Vanaspati oil is marketed in tins of 10 kg., a sample of 20 tins showed


the mean weight as 9.5 kg. with standard deviation of 3kg. . Does the
sample justify the claim that the mean weight is 10kg ? Mention the
level of significance you use.[Ans:t=0.726,Yes at 5% level of sign.]
2. A random sample of 16 observations has mean 103.75 cm. The sum of
the squares of the deviation from the mean is 843.75 cm. Can this
sample be regarded as coming from population having 108.75 cm as
the mean?
3. A company supplies tooth-paste in a packing of 100 gm. A sample of
10 packing gave the following weights in gms.
100.5, 100.3, 100.1, 99.8, 99.7, 99.7, 100.3, 100.4, 99.2, 99.3
Does the sample support the claim of the company that the packing
weighs 100 gms? .[Ans: ̅ X = 99.93, 𝑠 2 = 0.2112,t=0.48,Yes ]

4. A machine is designed to produce insulating washers for electrical


devices of average thickness of 0.025 cms. A random sample of 10
washers was found to have average thickness of 0.024 cms., with SD
of 0.002 cms . Test the significance of the deviation.
[Ans: |𝑡| = 1.5: 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0 ]
5. A certain drug administered to 12 patients resulted in the following
changes in their blood pressure.
5, 2, 8, -1, 3, 0, 6, -2, 1, 5, 0, 4
Can we conclude that the drug increases the blood pressure?
[Ans: t=2.89,One tailed test X ̅ > 𝜇 is to be accepted. There is increase
in B.P]
6. Nine items of a sample had the following values
45, 47, 50, 52, 48, 47, 49, 53, 51
Does the mean of 9 items differ significantly from the assumed
population mean 47.5? [Ans: |𝑡| = 1.84: 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0 ]
7. Ten individuals are chosen at random from a population and their
heights are found to be 63, 63, 64, 65, 66, 69, 69, 70, 70, 71 inches.
Discuss the suggestion that the mean height of the universe is 65
inches. [Ans: |𝑡| = 2.02: 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0 ]
8. Tests made on breaking strength of 10 pieces of a metal wire gave the
following results.
578, 572, 570, 568, 572, 570, 570, 572, 596 and 584 in kgs. Test if
the breaking strength of the metal wire can be assumed to be 577kg. ?
[Ans: |𝑡| = 0.65: 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0 ]

Prof. Divesh Singh KJSIT, Sion


16

Type-2 Testing the difference between means (Independent samples)

1. Two independent sample of size 8 and 7 gave the following results.


Sample 1: 19 17 15 21 16 18 16 14
Sample 2: 15 14 15 19 15 18 16
Is the difference between sample means significant ? [Ans: t=0.93 ,
accept 𝐻0 ]
2. The mean and standard deviation of height of 8 randomly chosen
soldiers are 166.9 cm and 8.29 cm respectively. The corresponding
values of 6 randomly chosen sailors are 170.3 cm and 8.5 cm
respectively. Based on this data can we conclude that, in general , the
soldiers are shorter than the sailors ? Find 95% confidence limits for
the static used. [Ans: |𝑡| = 0.6967: 𝑁𝑜 ]
3. Samples of electric tubes of two companies were tested for lengths of
their life and the following information was obtained ,

Company A Company B
No. of sample 8 7
Mean life in hrs 1210 1314
Standard deviations in hrs 36 42
Test at 5% level of significance whether the difference in the

sample means is significant. [Ans: |𝑡| = 4.81: 𝑅𝑒𝑗𝑒𝑐𝑡 ]

4. Two types of anti-biotics were tested on two groups of patients for


curing a particular disease and the following data were obtained.
Type-A Type-B
No. of patients 6 6
Mean period in days 13.55 10.10
Unbiased SD in days 3.2 2.8
Use 5% level of significance to test the null hypothesis that the difference
in the mean period of the two drugs is significant. [Ans: |𝑡| = 1.40: 𝑅𝑒𝑗𝑒𝑐𝑡 ]

5. Two kind of manures were used in 17 plots of the same size other
conditions being the same. The yields in the quintals are given below.
Manure I 35, 42, 40, 42, 34, 24, 42
Manure II 34, 44, 32, 40, 52, 41, 50, 40, 42, 45
Test at 5% level of significance whether the two manures differ as
regards their mean yields. [Ans: |𝑡| = 1.68: 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑖𝑠 𝑛𝑜𝑡 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑡 ]
6. The following are the gain in weights of cows fed on two types of diets
X and Y.
Diet X: 30, 37, 35, 37, 29, 19, 37
Diet Y: 29, 39, 27, 35, 47, 37, 45, 35, 37, 40

Test at 5% level of significance whether the diets differ as regards their

Prof. Divesh Singh KJSIT, Sion


17

effect on mean increase in weight. [Ans: |𝑡| = 1.62: 𝑑𝑜 𝑛𝑜𝑡 𝑑𝑖𝑓𝑓𝑒𝑟 ]

7. The mean of two random samples of size 9 and 7 are 196 and 199
respectively. The sum of the squares of the deviations from the mean
are 27 and 19 respectively . Can the samples be regarded to have been

drawn from the same normal population? [Ans: |𝑡| = 3.30: 𝑁𝑜 ]

8. A sample of 8 students of 16 years each shown up a mean systolic BP


of 118.4 mm of Hg and SD of 12.17mm. While a sample of 10 student
of 17 years each showed the mean systolic B.P of 121.0 mm with SD
of 12.88 during an investigation. The investigator feels that the
systolic BP is related to age. Do you think that the data provides
enough reasons to support investigators feeling at 5% LOS(Assume
the distribution of systolic BP to be normal [Ans: |𝑡| = 0.14: 𝑁𝑜 ]
9. The means of two random samples of size 9 and 7 are 196.42 and
198.82 respectively. The sums of squares of the deviations from the
means are 26.94 and 18.73 respectively. Can the samples be
considered to have been drawn from the sample population?
[Ans: |𝑡| = 2.64: 𝑁𝑜]
10. Six guinea pigs injected with 0.5 mg of a medication took on an
average 15.4 secs. to fall asleep with an unbiased standard deviation
2.2 secs. , while six other guinea pigs injected with 1.5 mg of the
medication took on an average 11.2 secs. to fall asleep with an
unbiased standard deviation 2.6 cms. Use 5% los to test the null
hypothesis that the difference in dosage has no affect.
[Ans: : |𝑡| = 5.288 , the difference is significant]
11. The height of six randomly chosen sailors are in inches : 63, 65,
68, 69, 71, 72. The heights of ten randomly chosen soldiers are: 61,
62, 65, 66, 69, 69, 70, 71, 72, 73.
Discuss in the light that these data throw on the suggestion that the
soldiers on an average are taller than sailors.
[Ans: |𝑡| = 0.099, suggestion cannot be accepted . ]

Type-3 Testing the difference between means (Dependent samples)

1. A drug was administered to 5 persons and the systolic blood


pressure before and after was measured. The result are given below
Candidates: I II III IV V
B.P. before: 140 130 132 150 140
B.P after: 132 126 133 144 133
Test whether the drug is affective in lowering the systolic blood
pressure . [Ans: |𝑡| = 3,the drug is not affective.

Prof. Divesh Singh KJSIT, Sion


18

2. A drug was administered to 10 patients and the changes in the


sugar content in the blood was recorded as under 10, -6, -4, 2, -8,
6, -5, -3, -6 . Is it reasonable to believe that the drug has no effect
on change of sugar? [Ans: |𝑡| = 0.289,Drug has no affect ]
3. The sales-data of an item in six shops before and after a special
promotional campaign are as under
Shops: A B C D E F
Before campaign: 53 28 31 48 50 42
After campaign: 58 29 30 55 56 45
Can the campaign be judged to be a success at 5% level of
significance? [Ans: |𝑡| = 3.14,Yes ]
4. The following data relates to the marks obtained by 11 students in
two tests, one held at the beginning of the year and the other at the
end of the year after giving intensive coaching.
Test-I 19 23 16 24 17 18 20 18 21 19 20
Test-II 17 24 20 24 20 22 20 20 18 22 18
Do the data indicate that the students are benefited by coaching?
[Ans: |𝑡| = 4,Yes]
5. An IQ test was administered to 5 persons before and after training .
The results are given below.

1 2 3 4 5
IQ before training 110 120 123 132 125
IQ after training 120 118 125 136 121
Test whether there is any change in IQ after training programme. Use
1% level of significance.

[Ans: |𝑡| = 0.82, The value of t for 𝜈 = 4 at 1% level of significance =4.6.


,Null hypothesis accepted]

6. A certain injection administered to 12 patients resulted in the


following changes of blood pressure:
5, 2, 8, -1, 3, 0, 6, -2, 1, 5, 0, 4
Can it be concluded that the injection will be in general
accompanied by an increase in blood pressure? [Ans: |𝑡| = 2.89, 𝐻0
rejected]

7. Ten school boys were given a test in Mathematics and their scores
were recorded. They were given a month special coaching and
second test was given to them in the same subject at the end of the
coaching period. Test if the marks given below give evidence to the
fact that the students are benefitted by coaching,
Marks in Test-I 70 68 56 75 80 90 68 75 56 58
Marks in Test-II 68 70 52 73 75 78 80 92 54 55

Prof. Divesh Singh KJSIT, Sion


19

[Ans: |𝑡| = 0.036, 𝐻0 accepted]


8. In a certain experiment to compare two types of pig-tools A and B ,
the following results of increasing weights were obtained .
Pig No. 1 2 3 4 5 6 7 8
Increase in weight X kg by A: 49 53 51 52 47 50 52 53
Increase in weight X kg by B: 52 55 52 53 50 54 54 53

(i) Assuming that the two sample of pigs are independent, can
we conclude that food B is better than food A.
(ii) Examine the case if the same set of pigs were used in both
the cases.

[Ans:(i)|𝑡| = 2.17, B is superior to food A(ii) |𝑡| = 4.17, B is superior to food A]

𝝌𝟐 𝒕𝒆𝒔𝒕

(𝑶 − 𝑬)𝟐
𝝌𝟐 = ∑
𝑬
O- Observed frequency, 𝑬 – expected frequency

Type-I : Independence of attributes

1. In an experiment on immunisation of cattle from Tuberculosis


the following results were obtained.
Affected Not-affected Total
Inoculated 267 27 294
Not Inoculated 757 155 912
Total 1024 182 1206
Use 𝝌 − test to determine the efficacy of vaccine in preventing
𝟐

tuber-culosis [Ans: Calculated 𝝌𝟐 =10.19 , 𝝌𝟐 table is 3.84 ,effective]

2. Based on following data determine if there is relation between


literacy and smoking.
Smokers Non-smokers
Literates 83 57
Illiterates 45 68
[Ans: Calculated 𝝌 =9.19, Yes]
𝟐

Prof. Divesh Singh KJSIT, Sion


20

3. A total of 3759 individuals were interviewed in a public opinion


survey on a political proposal. Of them 1872 were men and the
rest were women. A total of 2257 individuals were in favour of
the proposal and 917 were opposed to it. A total of 243 men
were undecided and 442 women were opposed to it. Do you
justify or contradict the hypothesis that there is no association
between sex and attitude at 5% level of significance.
[Ans: Calculated 𝝌𝟐 =18.76, Sex and attitude are associated]
4. A sample of 400 students of under-graduate and 400 of post-
graduate classes was keen to know their opinion about
autonomous colleges. 290 of the undergraduate and 310 of the
post-graduate students favoured the autonomous status.
Present these facts in the form of a table and test at 5% level ,
that the opinion regarding autonomous status of colleges is
independent of the level of classes of students .
[Ans: Calculated 𝝌𝟐 =2.66, Opinions are independent]
5. Investigate the association between the darkness of eye colour
in father and son from the following data.

Colour of father’s eye


Colour Dark Not-dark total
of son’s Dark 48 90 138
eyes Not-dark 80 782 862
total 128 872 1000

[Ans: Calculated 𝝌𝟐 =66.88, Yes there is association]


6. The following table gives the number of accounting clerks not
committing errors among trained and untrained clerks working
in an organisation.

No. of clerks No. of clerks not Total


Committing errors Committing errors
Trained 70 530 600
Untrained 155 745 900
Total 225 1275 1500
Test the effectiveness of training in preventing errors.

[Ans: Calculated 𝝌𝟐 =8.7, training is effective in preventing errors]

7. In a survey of 200 boys of which 75 were intelligent, 40 had


educated fathers while 85 of the unintelligent boys had
uneducated father. Do this figures support the hypothesis that
educated fathers have intelligent boys.
[Ans: Calculated 𝝌𝟐 =11.89, Yes there is association]

Prof. Divesh Singh KJSIT, Sion


21

Problems on Goodness of Fit

1. The following table gives the number of accidents in a city during a


week. Find whether the accidents are uniformly distributed over a
week.
Day: Sun, Mon, Tue, Wed, Thu, Fri, Sat Total

No of accidents: 13, 15, 9, 11, 12, 10, 14, 84.

[Ans: Calculated 𝝌𝟐 =2.33, occurs equally on all working days]


2. A die was thrown 132 times and following frequencies were observed.
No. Obtained 1 2 3 4 5 6 Total
Frequency 15 20 25 15 29 28 132
Test the hypothesis that the die is unbiased. [Ans: Calculated 𝝌𝟐 =8.91,

the die is unbiased]

3. The number of car accidents in a metropolitan city was found to be


20,17,12,6,7,15,8,5,16,14 per month respectively. Use 𝝌𝟐 -test to
check whether these frequencies agreement with the belief that
occurrence of accidents was the same during 10 months period. Test
at 5% level of significance.

[Ans: Calculated 𝝌𝟐 =20.33, do not occur equally on all months]

4. 300 digits were chosen at random from a table of random numbers.


The frequencies of digits were as follows.
Digit: 0 1 2 3 4 5 6 7 8 9 Total
Frequency: 28 29 33 31 26 35 32 30 31 25 300
Use 𝝌 -test examine the hypothesis that the digits were distributed in
𝟐

equal numbers in the table.


[Ans: Calculated 𝝌𝟐 =2.87, digits are equally distributed in the table]
5. Theory predicts that proportion of beans in the four groups A, B, C, D
should be 9:3:3:1. In an experiment among 1600 beans the number in
the four groups were 882,313,287 and 118 . Do the experimental
results support the theory? [Ans: Calculated 𝝌𝟐 =4.72, the proportion
9:3:3:1 is correct.]
6. In an experiment of pea breeding the following frequencies were
obtained
Round and Wrinkled Round and Wrinkled Total
yellow and yellow green and green
315 101 108 32 556

Prof. Divesh Singh KJSIT, Sion


22

Theory predicts that the frequencies should be in proportion of


9:3:3:1. Examine the correspondence between theory and experiment
using Chi-square test. [Ans: Calculated 𝝌𝟐 =0.51, the proportion
9:3:3:1 is correct .]
7. The figures given below are (a) the observed frequencies of a
distribution , (b) the frequencies of the normal distribution having the
same mean , SD and the total frequency as in (a) .
(a) 1,12,66,220,495,792,924,792,495,220,66,12,1.
(b) 2,15,66,210,484,799,943,799,484,210,66,15,2.

Apply 𝝌𝟐 test of goodness of fit. [Ans: Calculated 𝝌𝟐 =3.84, the fit is good]

8. The following figures show the distribution of digits in numbers


chosen at random from a telephone directory.
Digit: 0 1 2 3 4 5 6 7 8 9
Freq: 1026 1107 997 966 1075 933 1107 972 964 853
Test at 5% level whether digits may be taken to occur equally
frequently in the directory. [Ans: Calculated 𝝌𝟐 =59.36, No]

Prof. Divesh Singh KJSIT, Sion

You might also like