Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

sp5 1

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 20

Measure of Central Tendency and

Dispersion
Quartiles
Quartiles are the values that divide a list of numbers into quarters:
Put the list of numbers in order
Then cut the list into four equal parts
The Quartiles are at the "cuts"
• Range:
• It is defined as, it is the difference of highest and lowest observation in the
data.

• Example: Find the range from the following soil compaction data.
• 1, 4, 9, 12, 45, 89, 90, 40, 22, 79, 98, 103, 104, 99, 30, 87, , 9, 12, 45, 89, 90,
40, 22, 79, 98, 103, 104, 103, 104, 99, 30, 87, , 9, 12, 45, 89, 90.

• Solution.
• Highest value = 104
• Lowest value = 1
• Range = H.V – L.V = 104 – 1 = 103 ans.
Quartiles
The values that divide the data set into 4 equal parts after it has been arranged in ascending order are
called quartiles.

Example:

Split into quartes the following set of data:


213211231431

Step 1. Rearrange the value in ascending order:


11111222333

Step 2. Split into quartes:

This are the same data as before, but this tim eit's split into quartes:
• The "Interquartile Range" is from Q1 to Q3:

• To calculate it just subtract Quartile 1 from Quartile 3, like this:


• Example:

• The Interquartile Range is:
• Q3 − Q1 = 7 − 4 = 3
• Quartiles for ungrouped data
• Quartiles are the values of arranged data which divide whole data
into four equal parts. They are 3 in numbers namely Q1, Q2 and Q3.
Here Q1 is first quartile, Q2 is second quartile and Q3 is third quartile.
• The formula for ith quartile is
• Qi= Value of (i(n+1)4)th observation, i=1,2,3
• where n is the total number of observations.
• Example 1
• A random sample of 15 patients yielded the following data on the length of
stay (in days) in the hospital.
• 5, 6, 9, 10, 15, 10, 14, 12, 10, 13, 13, 9, 8, 10, 12.
• Find quartiles.
• Solution
• The formula for ith quartile is

• When n is the total n Arrange the data in ascending order


• 5, 6, 8, 9, 9, 10, 10, 10, 10, 12, 12, 13, 13, 14, 15
• First Quartile Q1
• First Quartile Q1
• The first quartile Q1 can be computed as follows:

•=9
• Thus, 25 % of the patients had length of stay in the hospital less than or equal
to 9 days.
• Second Quartile Q2
• The second quartile Q2 can be computed as follows:

• =10
• Thus, 50 % of the patients had length of stay in the hospital less than or equal
to 10 days.
• Third Quartile Q3
• The third quartile Q3 can be computed as follows:

• Thus, 75 % of the patients had length of stay in the hospital less than or equal
to 13 days.

• Example 2
• Blood sugar level (in mg/dl) of a sample of 20 patients admitted to the hospitals
are as follows:
• 75,89,72,78,87, 85, 73, 75, 97, 87, 84, 76,73,79,99,86,83,76,78,73.
• Find the value of Q1, Q2 and Q3.
• Mean Deviation: (action of doing something)
• It is the arithmetic mean of absolute values of all observations from their
mean. All observations are taken positive because it is the property of
arithmetic mean that the algebraic sum of the deviation from their mean is
zero.
• Suppose the observations x1, x2,…,xn have a mean then mean deviation is
defined as,

• For the grouped data, the mean Deviation is calculated as

• where
• Example: Find the mean deviation from the following data.
C – I (%) 88-91 92-95 96-99 100-103 104-107
Frequency 5 8 12 10 11

• Solution.
C- I (%) f Mid point x ()
88-91 5 89.5 9.217 46.08
92-95 8 93.5 5.217 41.73
96-99 12 97.5 1.217 14.60
100-103 10 101.5 2.783 27.83
104-107 11 105.5 6.783 74.61
Sum 46 204.85
• Standard Deviation
• The most satisfactory measure of dispersion is the standard deviation which is
always calculated from mean and represented by Greek letter ( sigma). In the
calculation of standard deviation the sign disappears as the number are squared.
Standard deviation is the positive square root of the sum of the squares of
deviations of the variables from their mean divided by total number of
observations.

• For grouped data.


• where
• For ungrouped data
• where
• Square of the S.D is called Variance.
• Example: Find the standard deviation from the following data.
C – I (%) 88-91 92-95 96-99 100-103 104-107
Frequency 5 8 12 10 11

• Solution.
C- I (%) f Mid point x ()
88-91 5 89.5 9.217 424.76
92-95 8 93.5 5.217 217.73
96-99 12 97.5 1.217 17.77
100-103 10 101.5 2.783 77.45
104-107 11 105.5 6.783 506.09
Sum 46 1243.8
• Coefficient of Variance (C.V)

• Coefficient of Variance is a relative measure of dispersion is independent of


units. It is expressed in percentage. It is used to compare the validity of two
groups having the same variables. Its was introduced by Karl Pearson. The
group which has the lowest value of CV is comparatively more reliable and
group which has higher value of CV has comparatively more fluctuations.

• Coefficient of Variance Is defined as,


• Example: in a survey of domestic consumption during a month, two samples of
100 residents in two cities were selected and the current consumed was as
follows:
Number of units Number of Number of
consumers consumers

0-49 18 7

50-99 12 13

100-149 30 31

150-199 35 41

200-249 15 8
• Solution.
• In which city the residence have low variation in their consumption of
electricity.
• Calculation for city A
C- I f x fx

0-49 8 24.5 196 -118.5 14042.25 112338

50-99 12 74.5 894 -68.5 4692.25 56307

100-149 30 124.5 3735 -18.5 342.25 10267.5

150-199 35 174.5 6107.5 31.5 992.25 24728.75

200-249 15 224.5 3367.5 81.5 6642.25 99633.75

Sum 100 14300 313275


• Solution.
• In which city the residence have low variation in their consumption of
electricity.
• Calculation for city B
C- I f x fx

0-49 7 24.5 171.5 -11.5 13225 92575

50-99 13 74.5 968.5 -6.5 4225 54925

100-149 31 124.5 3859.5 -15 225 6975

150-199 41 174.5 7154.5 35 1225 50225

200-249 8 224.5 1796 85 7225 57800

Sum 100 13950 262500


Conclusion:
Since the CV of city B is less than CV of city A, therefore resident of city B
have low variation in their consumption of electricity.

You might also like