Epre 412 Chapter 6 and 7
Epre 412 Chapter 6 and 7
Epre 412 Chapter 6 and 7
quantitative data
issues of interpretation and
quality
Chapter 6
And chapter 7
Analysing and presenting
quantitative data
• Focus on how “statistics “is used to present data and provide
evidence
• Refers in common usage to numerical data.
• What is statistics: procedures and rules for reducing large masses of
data to manageable proportions
• Refers to the methodology for the collection, presentation, analysis
and interpretation of data
• For allowing us to draw conclusions from those data.
• Page: 138 prescribed textbook
Two types of statistics
1.descriptive statistics
• Tool for organising, tabulation, depicting and describing, summarising and
reduction of mass of data.
• To transform or summarise a set of data into either a visual overview, such
as a table or a graph.
2. inferential statistics
• It builds upon descriptive statistics
• To predict or estimate or surmise the properties of a population from a
knowledge of the properties of only a sample of the population.
• It aims to make predictions or inferences about the similarity of a sample to
the population from which the sample is drawn.
Terminology page 139
1. Categorical data: tells how many observations there are in a particular
category
• Can be sorted into groups or categories
• Example: 23 girls and 17 boys)
2. Measurement or numerical data: they are the result of measurement
• Example: person is 173 cm tall
• Test score, weight, speed
3. Variables
• Is a property of an object or event or person that can take on different
values
Terminology cont…..
• Variable: is a characteristic which differ among the observational units
on which it is defined
• Variables can be numerical: travelled kilometres
• may be non-numerical, eg. Language preference, occupation
• Example: Test scores
• Activity 6.2 (textbooks)
Think of children in your class or in your school. What are some of the
things that make them different from one another. Write down these
variables
Organising and presenting data:
page 141
1. Tabulate data
2. Organise data: arrange scores in a descending order
3. Have a frequency distribution: presenting data as a number of
learners/people/ in a particular category
Examples of frequency distribution-textbook figures 6.1-6.4
Example: frequency distribution
black 56 25
coloured 70 45
Graphic representation of data
1. Histogram: bar graph
2. Frequency polygon: line graph
3. Pictograms: use a stick figure to represent a number of people or
any variable (page, 141 prescribed book)
4. Pie chart
histogram
Frequency polygon
Pie graph
Graphing relative frequencies
(percentages)
• Relative frequency says what the frequency of a category is relative to
the whole data set, or in everyday terms, what percentage that
category comprises of the whole
• Represent the percentage of the whole (the relative frequency) rather
than the number in each category (the frequency)
• Example
Total number No of Girls Total number Percentage Percentage
of learners in passed test of girls of girl who of girls who
grade 4 passed passed in
grade 4
130 30 60 50% ?
Measures of central tendency
• Also called averages (see prescribe book, page 153)
Measures of central tendency may summarise data by quoting a “typical” or
representative score for the whole set.
Three (3) measures of tendency
a) mode: is a score which occurs more frequently in a collection/distribution.
It is the most frequent score. When all the scores in a group occurs with the
same frequency= no mode.
When group of two scores has the same frequency=two modes/bimodal
Example: 7,9,10,20,6,2,6,17,6,3
Mode= 6
Measures of central tendency
cont…..
b) The mean
The mean of a set of observation /distribution is their sum divided by
the number of the observation
The results we would have if we could share the data evenly on
categories
Example: if all leaners were to share the total walking distance to
school, between them, each learner would walk approximately …….km.
Measures of central tendency cont
( c ) median: separates the top half of a data set from the bottom half
E.g. Half of learners scored less than…..and half scored more than…
If its even numbers add the 2 middle numbers and divide by 2
(d) Standard deviation: measure of how much the data deviate from the
mean or how far the data on average is from the mean.
One way of measuring the spread of data
If standard deviation is high, data are very spread out
(e)Range: is the difference between the highest and the lowest value in the
data.
Correlation: Linear relationship
• The relationship between two different sets of scores.
• E.g. Want to know whether high IQ scores are associated with high
scores in academic attainment
• Do not think that negative correlations is undesirable or that it
indicates lack of relationship
• What is correlation: means an association or variation
• Two variables are correlated if they tend to “go together”.
• A coefficient of correlation is a statistical summary of the degree of
relationship or association between two variables
Kinds of correlation
• Positive correlation
The concomitant variation is in the same direction
An increase in one variable is accompanied by an increase in the other variable.
Perfect positive correlation is +1
Example: increase in intelligence being accompanied by increase in scholastic
achievement.
Negative correlation
With some variables concomitant change or variation is in the opposite direction e.g.
fatness and speed
Increase in one variable is followed by decrease in another
Perfect negative correlation is - 1
correlation
• Scatter plots are used to graphically show or explore whether
correlations exist or not. Example of scatterplot page164
• The strength of a relationship is expressed by means of +1 and -1
• No relationship is expressed by 0
• Example: 0.77=fairly strong relationship
• Which one is stronger: -0.6; + 0,7; - 0.9
Answer is -0.9
Validity and reliability
• Validity: In terms of measurement procedure, validity is the ability of
an instrument to measure what it is designed to measure.
• Reliability: when a research instrument is able to provide similar
results when used repeatedly under similar conditions.
• Therefore reliability indicates accuracy, stability, and predictability of a
research instrument.
• The higher the reliability the higher the accuracy
Inferential statistics
• Used to make predictions about the similarity of the sample to the
population from which the sample is drawn
Test run to test prediction:
• Probability (p value)- mathematical way of stating the degree of
confidence we have in predicting something.
• Inferential statistics can tell us the probability that the results we
have obtained in a particular study occurred by chance or not. Page
167
Chapter 7