Mean Median Mode
Mean Median Mode
Mean Median Mode
Number of 9 6 15 3 1 2 2 1 1
Students
Solution:
The given distribution has discontinuous class intervals, so we need to make them
continuous.
Class intervals Number of students (fi) Class mark (xi) di = xi – a fidi
40.5 – 45.5 15 43 = a 0 0
45.5 – 50.5 3 48 5 15
50.5 – 55.5 1 53 10 10
55.5 – 60.5 2 58 15 30
60.5 – 65.5 2 63 20 40
65.5 – 70.5 1 68 25 25
70.5 – 75.5 1 73 30 30
Total ∑fi = 40 ∑fidi = 35
Here, ∑fi = 40 and ∑fidi = 35
By Assumed mean method,
Mean = a + (∑fidi/∑fi)
= 43 + (35/40)
= 43 + 0.875
= 43.875
Therefore, the mean weight of the students is 43.875 kg.
5. Find the mean for the following distribution.
xi 15 21 27 30 35
fi 3 5 6 7 8
Solution:
xi fi fixi
15 3 45
21 5 105
27 6 162
30 7 210
35 8 280
Number of Students 7 10 10 20 20 15 8
Solution:
Class interval Number of students (frequency) Cumulative frequency
10 – 20 7 7
20 – 30 10 17
30 – 40 10 27 = cf
40 – 50 20 = f 47
50 – 60 20 67
60 – 70 15 82
70 – 80 8 90
N/2 = 90/2 = 45
Cumulative frequency greater and nearer to 45 is 47, which lies in the interval 40 – 50
Median class is 40 – 50.
Lower limit of the median class = l = 40
Class size = h = 10
Frequency of the median class = f = 20
Cumulative frequency of the class preceding the median class = cf = 27
As we know,
������=�+(�2−���)×ℎ
Median = 40 + [(45 – 27)/20] × 10
= 40 + (18/2)
= 40 + 9
= 49
Hence, the median marks of the students = 49.
Mode formula for grouped data
����=�+(�1−�02�1−�0−�2)×ℎ
Here,
l = Lower limit of the modal class
h = Class size
f1 = Frequency of the modal class
f0 = Frequency of the class preceding the modal class
f2 = Frequency of the class succeeding the modal class
7. The given distribution shows the number of runs scored by some top batsmen of the
world in one-day international cricket matches.
Run Scored Number of Batsman
3000-4000 4
4000-5000 18
5000-6000 9
6000-7000 7
7000-8000 6
8000-9000 3
9000-10000 1
10000-11000 1
Find the mode of the above distribution.
Solution:
From the given,
Modal class = 4000 – 5000
Lower limit of the modal class = l = 4000
Class width (h) = 1000
Frequency of the modal class = f1 = 18,
Frequency of the class preceding the modal class = f0 = 4
Frequency of the class succeeding the modal class = f 2 = 9
Mode = l + [(f1 – f0)/ (2f1 – f0 – f2)] × h
Mode = 4000 + [(18 – 4)/(36 – 4 – 9)] × 1000
= 4000 + (14000/23)
= 4000 + 608.695
= 4608.695
= 4608.7 (approximately)
Thus, the mode of the given data is 4608.7 runs
8. If the median of a distribution given below is 28.5, then find the value of x and y.
CI 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 Total
Frequency 5 x 20 15 y 5 60
Solution:
From the given,
N/2 = 60/2 = 30
Median of the given data = 28.5
Median class is 20 – 30 with a cumulative frequency = 25 + x.
Lower limit of median class = l = 20
Frequency of the median class = f = 20
Cumulative frequency of the class preceding the median class = cf = 5 + x
Class size = h = 10
������=�+(�2−���)×ℎ
28.5 = 20 + [(30 − 5 − x)/ 20] × 10
28.5 – 20 = (25 – x)/2
8.5 × 2 = 25 – x
17 = 25 – x
x = 25 – 17
Therefore, x = 8
Also,
60 = 5 + 20 + 15 + 5 + x + y
60 = 5 + 20 + 15 + 5 + 8 + y
y = 60 – 53
y=7
Therefore, the value of x = 8 and y = 7.
Empirical Relation between Mean Median and Mode
Mode = 3 Median – 2 Mean
(or)
Mean – Mode = 3 (Mean – Median)
9. For a moderately skewed distribution, the mean and median are respectively 26.8
and 27.9. What is the mode of the distribution?
Solution:
Given,
Mean = 26.8
Median = 27.9
Using the relationship between mean, median and mode,
Mode = 3 Median – 2 Mean
= 3 × 27.9 – 2 × 26.8
= 83.7 – 53.6
= 30.1
Therefore, the mode of the distribution is 30.1.
10. If the mean of the given frequency distribution is 35, then find the missing frequency
y. Also, calculate the median and mode for the distribution.
Class 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60
Frequency 2 4 7 y 1
Solution:
Class Frequency (fi) Class mark (xi) fixi
10 – 20 2 15 30
20 – 30 4 25 100
30 – 40 7 35 245
40 – 50 y 45 45y
50 – 60 1 55 55
Total ∑fi = 14 + y ∑fixi = 430 + 45y
As we know,
Mean = ∑fixi/∑fi
Given that the mean of the distribution is 35.
So, (430 + 45y)/(14 + y) = 35
430 + 45y = (14 + y)35
430 + 45y = 490 + 35y
45y – 35y = 490 – 430
10y = 60
y=6
Thus, the missing frequency is 6.
Now, we can calculate the mode as follows:
Class 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60
Frequency 2 4 = f0 7 = f1 6 = f2 1
Here,
l = 30
h = 10
Mode = l + [(f1 – f0)/ (2f1 – f0 – f2)] × h
Mode = 30 + [(7 – 4)/(14 – 4 – 6)] × 10
= 30 + (30/4)
= 30 + 7.5
= 37.5
Now, using the formula Mode = 3Median – 2 Mean, we can get the value of median.
37.5 = 3 Median – 2 (35)
3 Median = 37.5 + 70
Median = 107.7/3
= 35.9
3,752
2,340
Practice Questions on Mean Median Mode
1. Find the mean, median and mode of the following data:
23, 18, 24, 23, 31, 37, 28, 30, 25, 40, 35, 35, 27, 25
2. The mileage (km per litre) of 50 cars of the same model was tested by a
manufacturer, and details are tabulated as given below:
Mileage (km/l) 10 – 12 12 – 14 14 – 16 16 – 18
Number of cars 7 12 18 13
Find the mean mileage.
The manufacturer claimed that the mileage of the model was 16 km/litre. Do you agree
with this claim?
3. Find the median of the following frequency distribution.
xi 15 20 22 17 30 25
fi 6 9 4 4 10 6
4. Calculate the mode for the following distribution.
Cl 40-45 45-50 50-55 55-60 60-65 65-70 70-75 75-80
Frequency 4 4 13 5 6 5 2 1
5. Suppose the mean and median of distribution are 10.14 and 8, respectively. Find the
mode of the distribution using the empirical relationship between mean, median and
mode.
Mean
The arithmetic mean of a given data is the sum of all observations divided by the
number of observations. For example, a cricketer's scores in five ODI matches are as
follows: 12, 34, 45, 50, 24. To find his average score in a match, we calculate the
arithmetic mean of data using the mean formula:
Mean = Sum of all observations/Number of observations
Mean = (12 + 34 + 45 + 50 + 24)/5
Mean = 165/5 = 33
Mean is denoted by x (pronounced as x bar). Mean is calculated a little differently
when the data is grouped or ungrouped. Let's find the mean in both cases.
Mean of Ungrouped Data
Let x1, x2, x3 , . . . , xn be n observations. We can find the arithmetic mean using the
mean formula:
Mean, x = (x1 + x2 + ... + xn)/n
Example: If the heights of 5 people are 142 cm, 150 cm, 149 cm, 156 cm, and 153 cm.
Find the mean height.
Mean height, x = (142 + 150 + 149 + 156 + 153)/5
= 750/5
= 150
Mean, x = 150 cm
Thus, the mean height is 150 cm.
Mean of Grouped Data
When the data is present in tabular form, we use the following formula:
Mean, x = (x1f1 + x2f2 + ... + xnfn)/(f1 + f2 + ... + fn)
Consider the following example.
Example 1: Find the mean of the following distribution:
x 4 6 9 10 15
f 5 10 10 7 8
Solution:
Calculation table for arithmetic mean:
xi fi xifi
4 5 20
6 10 60
9 10 90
10 7 70
15 8 120
∑ fi = 40 ∑ xi fi = 360
0-10 2
10-20 6
20-30 9
30-40 7
40-50 4
50-60 2
Solution:
In this case, we find the classmark (also called as mid-point of a class) for each class.
Note: Class mark = (lower limit + upper limit)/2
Let x1, x2, x3 , . . . , xn be the class marks of the respective classes.
Hence, we get the following table:
5 2 10
15 6 90
25 9 225
35 7 245
45 4 180
55 2 110
Frequency 2 12 22 8 6
Solution:
We need to calculate the cumulative frequencies to find the median.
Calculation table:
0-10 2 2
10-20 12 2 + 12 = 14
20-30 22 14 + 22 = 36
30-40 8 36 + 8 = 44
40-50 6 44 + 6 = 50
N = 50
N/2 = 50/2 = 25
Median Class = (20 - 30)
l = 20, f = 22, c = 14, h = 10
Using Median formula:
Median = l+[n2−cf]×h�+[�2−��]×ℎ
= 20 + (25 - 14)/22 × 10
= 20 + (11/22) × 10
= 20 + 5 = 25
∴ Median = 25
Mode
The value which appears most often in the given data i.e. the observation with the
highest frequency is called a mode of data.
Case 1: Ungrouped Data
For ungrouped data, we just need to identify the observation which occurs maximum
times.
Mode = Observation with maximum frequency
For example in the data: 6, 8, 9, 3, 4, 6, 7, 6, 3, the value 6 appears the most number of
times. Thus, mode = 6. An easy way to remember mode is: Most Often Data Entered.
Note: A data may have no mode, 1 mode, or more than 1 mode. Depending upon the
number of modes the data has, it can be called unimodal, bimodal, trimodal, or
multimodal.
The example discussed above has only 1 mode, so it is unimodal.
Case 2: Grouped Data
When the data is continuous, the mode can be found using the following steps:
Step 1: Find modal class i.e. the class with maximum frequency.
Step 2: Find mode using the following formula:
Mode = L+[fm−f12fm−f1−f2]×h�+[��−�12��−�1−�2]×ℎ
where,
L = lower limit of modal class,
fm = frequency of modal class,
f1 = frequency of class preceding modal class,
f2 = frequency of class succeeding modal class,
h = class width
Mode formula equivalently is written as follows as well:
Number of students 5 10 12 6 3
Solution:
The highest frequency = 12, so the modal class is 40-60.
L = lower limit of modal class = 40
fm = frequency of modal class = 12
f1 = frequency of class preceding modal class = 10
f2 = frequency of class succeeding modal class = 6
h = class width = 20
Using the mode formula,
Mode = L+[fm−f12fm−f1−f2]×h�+[��−�12��−�1−�2]×ℎ
= 40 + [12−102×12−10−6]×20[12−102×12−10−6]×20
= 40 + (2/8) × 20
= 45
∴ Mode = 45
Mean, Median and Mode Formulas
We covered the formulas and methods to find the mean, median, and mode for a
grouped and ungrouped set of data. Let us summarize and recall them using the list of
mean, median, and mode formulas given below,
Mean formula for ungrouped data: Sum of all observations/Number of
observations
Mean formula for grouped data: x = (x1f1 + x2f2 + ... + xnfn)/(f1 + f2 + ... + fn)
Median formula for ungrouped data: If n is odd, then use the formula: Median =
(n + 1)/2th observation. If n is even, then use the formula: Median = [(n/2) th obs.+
((n/2) + 1)th obs.]/2
Median formula for grouped data: Median = l+[n2−cf]×h�+[�2−��]×ℎ, where
l = lower limit of median class
c = cumulative frequency of the class preceding the median class
f = frequency of the median class
h = class size
Mode formula for ungrouped data: Mode = Observation with maximum
frequency
Mode formula for grouped data: Mode
= L+[fm−f12fm−f1−f2]×h�+[��−�12��−�1−�2]×ℎ, where
L = lower limit of modal class,
fm = frequency of modal class,
f1 = frequency of class preceding modal class,
f2 = frequency of class succeeding modal class,
h = class width
Take a quick look at the figure below with mean mode median formulas.
Relation Between Mean, Median and Mode
The three measures of central values i.e. mean, median, and mode are closely
connected by the following relations (called an empirical relationship).
Mean - Mode = 3 (Mean - Median) (or) 2Mean + Mode = 3Median
For instance, if we are asked to calculate the mean, median, and mode of continuous
grouped data, then we can calculate mean and median using the formulas as
discussed in the previous sections and then find mode using the empirical relation.
For example, we have data whose mode = 65 and median = 61.6.
Then, we can find the mean using the above mean, median, and mode relation.
2Mean + Mode = 3 Median
⇒ 2Mean = 3 × 61.6 - 65
⇒ 2Mean = 119.8
⇒ Mean = 119.8/2
⇒ Mean = 59.9
Difference Between Mean and Average
The term average is frequently used in everyday life to denote a value that is typical for
a group of quantities. Average rainfall in a month or the average age of employees of
an organization is a typical example. We might read an article stating "People spend
an average of 2 hours every day on social media. " We understand from the use of the
term average that not everyone is spending 2 hours a day on social media but some
spend more time and some less.
However, we can understand from the term average that 2 hours is a good indicator of
the amount of time spent on social media per day. Most people use average and
mean interchangeably even though they are not the same.
Average is the value that indicates what is most likely to be expected.
They help to summarise large data into a single value.
An average tends to lie centrally with the values of the observations arranged in
ascending order of magnitude. So, we call it an average measure of the central
tendency of the data. Averages are of different types. What we refer to as mean i.e.
the arithmetic mean is one of the averages. Mean is called the mathematical average
whereas median and mode are positional averages.
Difference Between Mean and Median
Mean is known as the mathematical average whereas the median is known as the
positional average. To understand the difference between the two, consider the
following example. A department of an organization has 5 employees which include a
supervisor and four executives. The executives draw a salary of ₹10,000 per month while
the supervisor gets ₹40,000.
Mean = (10000 + 10000 + 10000 + 10000 + 40000)/5 = 80000/5 = 16000
Thus, the mean salary is ₹16,000.
To find the median, we consider the ascending order: 10000, 10000, 10000, 10000,
40000.
n = 5,
so, (n + 1)/2 = 3
Thus, the median is the 3rd observation.
Median = 10000
Thus, the median is ₹10,000 per month.
Now let us compare the two measures of central tendencies.
We can observe that the mean salary of ₹16,000 does not give even an estimated
salary of any of the employees whereas the median salary represents the data more
effectively.
One of the weaknesses of the mean is that it gets affected by extreme values (known
as outliers).
Look at the following graph to understand how extreme values affect mean and
median:
So, mean is to be used when we don't have extremes in the data. If we have extreme
points, then the median gives a better estimation. Here's a quick summary of the
differences between the two.
Mean Vs
Mean Median
Median
Arrange data in
Add all values and divide
ascending / descending
Calculation by the total number of
order and find the middle
observations
value
To find the Median Alex places the numbers in value order and finds the middle
number.
So 2 runners took between 51 and 55 seconds, 7 took between 56 and 60 seconds, etc
Oh No!
The groups (51-55, 56-60, etc), also called class intervals, are of width 5
The midpoints are in the middle of each class: 53, 58, 63 and 68
We can estimate the Mean by using the midpoints.
So, how does this work?
Think about the 7 runners in the group 56 - 60: all we know is that they ran somewhere
between 56 and 60 seconds:
Maybe all seven of them did 56 seconds,
Maybe all seven of them did 60 seconds,
But it is more likely that there is a spread of numbers: some at 56, some at 57, etc
So we take an average and assume that all seven of them took 58 seconds.
Let's now make the table using midpoints:
Midpoint Frequency
53 2
58 7
63 8
68 4
Our thinking is: "2 people took 53 sec, 7 people took 58 sec, 8 people took 63 sec and 4
took 68 sec". In other words we imagine the data looks like this:
53, 53, 58, 58, 58, 58, 58, 58, 58, 63, 63, 63, 63, 63, 63, 63, 63, 68, 68, 68, 68
Then we add them all up and divide by 21. The quick way to do it is to multiply each
midpoint by each frequency:
Midpoint Frequency Midpoint ×
x f Frequency
fx
53 2 106
58 7 406
63 8 504
68 4 272
Totals: 21 1288
And then our estimate of the mean time to complete the race is:
Estimated Mean = 128821 = 61.333...
Very close to the exact answer we got earlier.
Estimating the Median from Grouped Data
Let's look at our data again:
Seconds Frequency
51 - 55 2
56 - 60 7
61 - 65 8
66 - 70 4
The median is the middle value, which in our case is the 11th one, which is in the 61 - 65
group:
We can say "the median group is 61 - 65"
But if we want an estimated Median value we need to look more closely at the 61 - 65
group.
We call it "61 - 65", but it really includes values from 60.5 up to (but not including) 65.5.
Why? Well, the values are in whole seconds, so a real time of 60.5 is measured as 61.
Likewise 65.4 is measured as 65.
At 60.5 we already have 9 runners, and by the next boundary at 65.5 we
have 17 runners. By drawing a straight line in between we can pick out where the
median frequency of n/2 runners is:
Example: You grew fifty baby carrots using special soil. You dig them up and measure
their lengths (to the nearest mm) and group the results:
Length
Frequency
(mm)
150 - 154 5
155 - 159 2
160 - 164 6
165 - 169 8
170 - 174 9
175 - 179 11
180 - 184 6
185 - 189 3
Mean
Length Midpoint Frequency
(mm) x f fx
150 - 154 152 5 760
155 - 159 157 2 314
160 - 164 162 6 972
165 - 169 167 8 1336
170 - 174 172 9 1548
175 - 179 177 11 1947
180 - 184 182 6 1092
185 - 189 187 3 561
Totals: 50 8530
Mode
The Modal group is the one with the highest frequency, which is 175 - 179:
L = 174.5 (the lower class boundary of the 175 - 179 group)
fm-1 = 9
fm = 11
fm+1 = 6
w=5
Estimated Mode= 174.5 + 11 − 9(11 − 9) + (11 − 6) × 5
= 174.5 + 1.42...
= 175.9 mm (to 1 decimal)
Age Example
Age is a special case.
When we say "Sarah is 17" she stays "17" up until her eighteenth birthday.
She might be 17 years and 364 days old and still be called "17".
This changes the midpoints and class boundaries.
Example: The ages of the 112 people who live on a tropical island are grouped as
follows:
Age Number
0-9 20
10 - 19 21
20 - 29 23
30 - 39 16
40 - 49 11
50 - 59 10
60 - 69 7
70 - 79 3
80 - 89 1
A child in the first group 0 - 9 could be almost 10 years old. So the midpoint for this
group is 5 not 4.5
The midpoints are 5, 15, 25, 35, 45, 55, 65, 75 and 85
Similarly, in the calculations of Median and Mode, we will use the class boundaries 0, 10,
20 etc
Mean
Age Midpoint Number
x f fx
0-9 5 20 100
10 - 19 15 21 315
20 - 29 25 23 575
30 - 39 35 16 560
40 - 49 45 11 495
50 - 59 55 10 550
60 - 69 65 7 455
70 - 79 75 3 225
80 - 89 85 1 85
Totals: 112 3360
Median
The Median is the mean of the ages of the 56th and the 57th people, so is in the 20 - 29
group:
L = 20 (the lower class boundary of the class interval containing the median)
n = 112
B = 20 + 21 = 41
G = 23
w = 10
Estimated Median= 20 + (112/2) − 4123 × 10
= 20 + 6.52...
= 26.5 (to 1 decimal)
Mode
The Modal group is the one with the highest frequency, which is 20 - 29:
L = 20 (the lower class boundary of the modal class)
fm-1 = 21
fm = 23
fm+1 = 16
w = 10
Estimated Mode= 20 + 23 − 21(23 − 21) + (23 − 16) × 10
= 20 + 2.22...
= 22.2 (to 1 decimal)
Summary
For grouped data, we cannot find the exact Mean, Median and Mode, we can
only give estimates.
To estimate the Mean use the midpoints of the class intervals:
Estimated Mean = Sum of (Midpoint × Frequency)Sum of Frequency
To estimate the Median use:
Estimated Median = L + (n/2) − BG × w
where:
L is the lower class boundary of the group containing the median
n is the total number of data
B is the cumulative frequency of the groups before the median group
G is the frequency of the median group
w is the group width
To estimate the Mode use:
Estimated Mode = L + fm − fm-1(fm − fm-1) + (fm − fm+1) × w
where:
L is the lower class boundary of the modal group
fm-1 is the frequency of the group before the modal group
fm is the frequency of the modal group
fm+1 is the frequency of the group after the modal group
w is the group width