graphical representation of data
graphical representation of data
Table 1.3
Using the data in the table 1.3, calculate the true limits and use to:
a) Draw a histogram
b) On the histogram draw a frequency polygon
c) Draw a cumulative frequency curve (Ogive)
The relationship between numbers of various magnitudes can usually be seen more
quickly and easily from a graph than from a table.
Figures are not always interesting, and as their size and number increase they become
confusing and uninteresting to such an extent that no one (unless he is specifically
interested) would care to study them.
Their study is a greater strain upon the mind without, in most cases, any scientific
result. The aim of statistical methods, inter alia, is to reduce the size of statistical data
and to render them easily intelligible.
To attain this objective the methods of classification, tabulation, averages and
percentages are generally used.
But the method of diagrammatic representation (visual aids) is probably simpler and
more easily understandable. It consists in presenting statistical material in geometric
figures, pictures, maps and lines or curves.
1. They have greater attraction than mere figures. They give delight to the eye and add a
spark of interest.
2. They help in deriving the required information in less time and without any mental strain.
4. They may reveal unsuspected patterns in a complex set of data and may suggest directions
in which changes are occurring. This warns us to take an immediate action.
5. They have greater memorising value than mere figures. This is so because the impression
left by the diagram is of a lasting nature.
3. It can give only an approximate idea and as such where greater accuracy is needed
diagrams will not be suitable.
Construction of graphs
The choice of the particular form among the different possibilities will depend on personal
choices and/or the type of the data.
• Bar charts and pie chart are commonly used for qualitative or quantitative discrete data.
2. Titles are usually placed below the graph and it should again question what ? Where?
When? How classified?
3. Legends or keys should be used to differentiate variables if more than one is shown.
4. The axes label should be placed to read from the left side and from the bottom.
6. The numerical scale representing frequency must start at zero or a break in the line should
be shown.
Examples of diagrams:
a) The horizontal axis is a continuous scale running from one extreme end of the
distribution to the other.
b) It should be labelled with the name of the variable and the units of measurement.
(i) its base on the horizontal axis extending from one class boundary of the class to
the other class boundary, there will never be any gap between the histogram
rectangles.
(ii) the bases of all rectangles will be determined by the width of the class intervals.
If a distribution with unequal class-interval is to be presented by means of a
histogram, it is necessary to make adjustment for varying magnitudes of the class
intervals.
2. FREQUENCY POLYGON:
If we join the midpoints of the tops of the adjacent rectangles of the histogram with line
segments a frequency polygon is obtained.
Note that it is not essential to draw histogram in order to obtain frequency polygon. It can be
drawn with out erecting rectangles of histogram as follows:
1) The scale should be marked in the numerical values of the mid- points of intervals.
2) Erect ordinates on the midpoints of the interval - the length or altitude of an ordinate
representing the frequency of the class on whose mid-point it is erected.
3) Join the tops of the ordinates and extend the connecting lines to the scale of sizes.
When the cumulative frequencies of a distribution are graphed the resulting curve is
Prepare a graph with the cumulative frequency on the vertical axis and the true upper
class limits (class boundaries) of the interval scaled along the X-axis (horizontal axis).
The true lower limit of the lowest class interval with lowest scores is included in the
X-axis scale; this is also the true upper limit of the next lower interval having a
cumulative frequency of 0.
The line graph is especially useful for the study of some variables according to the passage of
time. The time, in weeks, months or years is marked along the horizontal axis; and the value
of the quantity that is being studied is marked on the vertical axis. The distance of each
plotted point above the base-line indicates its numerical value. The line graph is suitable for
depicting a consecutive trend of a series over a long period.
Bar Chart
There are different types of bar diagrams, the most important ones are:
Bar Chart Bar diagrams are used to represent and compare the frequency
distribution of discrete variables and attributes or categorical series.
When we represent data using bar diagram, all the bars must have equal width
and the distance between bars must be equal.
There are different types of bar diagrams, the most important ones are:
Question 1.Table shows the overall immunization status of children in Adami Tullu Woreda,
Feb. 1995
Multiple bar chart: In this type of chart the component figures are
shown as separate bars adjoining each other. The height of each bar
represents the actual value of the component figure. It depicts
distributional pattern of more than one variable
Question 2.Table shows Tetanus Toxoid immunization by marital status of the women of
childbearing age, Assendabo town, Jimma Zone, 1996
Immunization Status
No. % No %
It is a circle divided into sectors so that the areas of the sectors are
proportional to the frequencies.
The pie chart represents data in a circular graph. The pieces of the
chart or graph show the data size. The various components are
represented by the pieces of a circle and the whole circle is the sum of
the values of all components.
The total of all data in a pie is equal to 360° and the total value of a
pie is always 100%.
Central angle for a component = (Value of the component / Sum of the values of all
components) * 360°
1. Enter the given data into a table to make the process easy for you.
3. Divide each value in the table by sum and multiply the result with 100
to get the percent.
4. To get the degrees of each value, substitute the values in the formula
i.e (Value of each component/sum) * 360
Advantages
Disadvantages
It can represent only one set of data. So, you need a series to compare
multiple sets.
If you have too many pieces of data, and even you add labels and
numbers may not help here, they themselves may become crowded and
hard to read.