BST503Lec4 (Autosaved)
BST503Lec4 (Autosaved)
BST503Lec4 (Autosaved)
1 751.5
2098.0
2
951.5
3
1628.0
4
4679.0
5
1240.0
6
7 1966.0
8 2219.0
9 4121.0
10 1133.0
Direct Method
1 751.5
2098.0
2
951.5
3
1628.0
4
4679.0
5
1240.0
6
7 1966.0
8 2219.0
9 4121.0
10 1133.0
N = 10 ∑ X = 20787
Short cut Method
20 8
12
30
20
40
10
50
6
60
70 4
Direct method for Discrete series
12 630
30
20 800
40
10 500
50
6 360
60
70 4 280
N = 60 ∑fX = 2460
Short cut method for Discrete series
12 -10 -120
30
20 0 0
40
10 10 100
50
6 20 120
60
70 4 30 120
N = 60 ∑fd = 60
AM for Continuous Series
• Obtain the mid point (m)
• Multiply m with frequency (f) and get sum of fm
• Divide the total sum of fm by total frequency
N = 100
Direct Method for continuous series
15 10 150
10-20
25 25 625
20-30
35 30 1050
30-40
45 20 900
40 - 50
50-60 55 10 550
N = 100 ∑fm =
3300
Short cut Method for continuous series
15 10 -20 -200
10-20
25 25 -10 -250
20-30
35 30 0 0
30-40
45 20 10 200
40 - 50
50-60 55 10 20 200
N = 100 ∑fd = -200
Advantages of Mean:
• It is easy to understand & simple calculate.
• It is based on all the values.
• It is rigidly defined .
• It is easy to understand the arithmetic average even if some of
the details of the data are lacking.
• It is not based on the position in the series.
Disadvantages of Mean:
• It is affected by extreme values.
• It cannot be calculated for open end classes.
• It cannot be located graphically
• It gives misleading conclusions.
• It has upward bias.
PROPERTIES OF ARITHMETIC
MEAN
1. The algebraic sum of the deviations of all the
variates from their arithmetic mean is zero
Geometric Mean
• The geometric mean is a type of average , usually used for
growth rates, like population growth or interest rates. While
the arithmetic mean adds items, the geometric
mean multiplies items. Also, you can only get the geometric
mean for positive numbers.
• Use to find average per cent increate in sale, population,
production etc.
• A geometric mean, unlike an arithmetic mean, tends to
dampen the effect of very high or low values, which might
bias the mean if a straight average (arithmetic mean) were
calculated.
Geometric Mean
• Geometric mean is a type of mean or average, which
indicates the central tendency or typical value of a set of
numbers by using the product of their values (as opposed to
the arithmetic mean which uses their sum). The geometric
mean is defined as the nth root of the product of n numbers,
i.e., for a set of numbers x1, x2, ..., xn, the geometric mean is
defined as
𝐺𝑀= √ 𝑥 1 𝑋 𝑥 2 𝑋 𝑥3 𝑋 …………. 𝑋 𝑥𝑛
𝑛
Individual Series
Discrete Series
Continuous Series
Geometric mean of discrete series
X f
-------------------------
9.5 – 14.5 10
14.5 – 19.5 15
19.5 – 24.5 17
24.5 – 29.5 25
29.5 – 34.5 18
34.5 – 39.5 12
39.5 – 44.5 8
-----------------
Geometric mean of continuous series
Individual series
1
=
[
1 1 1 1 1
+ + + +……………
𝐻 𝑁 𝑥 1 𝑥2 𝑥 3 𝑥 4
1
𝑥𝑛 ]
continuous series
Discrete series
Harmonic mean of discrete series
∑f = 50 ∑f /X = 2.95
--------------------------------
Harmonic mean of continuous series
x f
0 – 10 5
10 – 20 15
20 – 30 10
30 – 40 15
40 – 50 5
Solution : First of all, we shall find out mid points of the various
classes. They are 5, 15, 25, 35 and 45.
Then we will calculate the H.M. by applying the following formula :
H.M. =
Calculation of Harmonic Mean
--------------------------------------------
X f X f c.f.
15000 24 15000 24 24
15500 26 15500 26 50
18000 16 16800 20 70
16800 20 17800 30 100
18500 6 18000 16 116
17800 30 18500 6 122
Calculation of median – Continuous series
Where
L1 = the lower boundary of the class in which the middle item of the distribution lies
cf = the cumulative frequency class preceding the median class
n = the total frequency
f = the frequency of the class median
I = class interval
• Use N/2 as the rank/size of median in place of (n+1)/2
• Now look at the cumulative frequency column and find that total which is either equal
to n/2 or next higher and determine the value of the variable corresponding to it. It
gives the size of median
Example: Median of a set Grouped Data in a
Distribution of Respondents by age
Age Group Frequency of Cumulative
Median class(f) frequencies(cf)
0-20 15 15
20-40 32 47
40-60 54 101
60-80 30 131
80-100 19 150
Total 150
Median (M)=40+
= 40+
= 40+0.52X20
= 40+10.37
= 50.37
Practical Problem 9
• The following data pertains to the number of
members in a family. Find median size of the
family.
Number of
1 2 3 4 5 6 7 8 9 10 11 12
members x
Frequency f 1 3 5 6 10 13 9 5 3 2 2 1
The cumulative frequency just
greater than 30.5 is 38.and the
value of x corresponding to 38 is
6.Hence the median size is 6
members per family.
Practical - 10
• For the frequency distribution of weights of sorghum ear-
heads given in table below. Calculate the median.
Practical Problem 7
An incomplete distribution is given below
x 0-10 10-20 20-30 30-40 40-50 50-60 60-70
f 10 20 ? 40 ? 25 15
a) You are given the median value is 35 and total frequency is 170. Find the
missing frequency
b) Calculate the AM for the complete table
0-10 10
10-20 20 Let missing freq. are f1 and f2.
20-30 ? Total Frequency = 170
30-40 40 The freq. of class other then missing freq = 10+20+40+25+15 = 110
40-50 ? F1+f2 = 170-110= - 60
50-60 25
60-70 15
Z =2000+
Z=2000+0.8 ×500=400
Z=2400
Mode for Bimodal Distribution
• For a bimodal distribution ,
the value of the mode can
not be determined with the
help of the formula given.
• When mode is ill-defined,
mode is calculated on the
formula
Mode = 3 Median – 2 mode
Locating Mode Graphically
Steps
1. Draw a histogram for the data
2. Draw two lines diagonally in the inside of the modal class
bar starting from each upper corner of the bar to the upper
corner of the adjacent bar
3. Draw a perpendicular line from the intersection of the two
diagonal line to the X-axis (horizontal scale), which gives the
modal value
Locating Mode Graphically
Practical
• Determine the Modal Weight for the following
data
Weight f
100-110 4
110-120 6
120-130 20
130-140 32
140 -150 33
150-160 17
160-170 8
170-180 2
Advantages of Mode :
• Mode is readily comprehensible and easily
calculated
• It is the best representative of data
• It is not at all affected by extreme value.
• The value of mode can also be determined
graphically.
• It is usually an actual value of an important
part of the series.
Disadvantages of Mode :
• It is not based on all observations.
• It is not capable of further mathematical
manipulation.
• Mode is affected to a great extent by
sampling fluctuations.
• Choice of grouping has great influence
on the value of mode.
Conclusion
• A measure of central tendency is a measure that
tells us where the middle of a bunch of data lies.