Math Majorship Statistics
Math Majorship Statistics
Math Majorship Statistics
Descriptive Statistics
Terminologies
Descriptive statistics - Numbers which are used to describe information or data or those techniques used to
calculate those numbers.
Variable (x) - A measurable characteristic. Individual measurements of a variable are called varieties,
observations, or cases.
Population (X) - All subjects or objects possessing some common specified characteristic. The population in a
statistical investigation is arbitrarily defined by naming its unique properties.
Parameter - A measurable characteristic of a population. A measurable quantity derived from a population, such as
population mean or standard deviation.
Sample - A smaller group of subjects or objects selected from a large group (population).
Bar graph or histogram - A frequency graph with number of blocks or length of bar representing the frequency of
occurrence.
Frequency polygon - A modification of the bar graph with lines connecting the midpoints of the highest point on
each bar.
Frequency curve - A modification of a frequency polygon with the sharp corners rounded. The area under the
connecting line of the bar graph, frequency polygon, and frequency curve are equivalent and represent frequency
of occurrence.
One characteristic that is particular interest is a measure of central tendency, which gives some idea of the
average or typical score in the distribution. This average is more precisely thought of as index of location of the
distribution on the score scale.
1. Arithmetic Mean or Mean (Arithmetic Average). The mean is the arithmetic average of a set of scores,
which is the most widely used and familiar average, the most reliable and the most stable of all measures of
central tendency. It is the point on the score scale obtained by finding the sum of all the scores and dividing by
their number. An important property of the mean is that every one of the scores of the group affects its value.
2. Median (Positional Average). The median is the point on the score scale which is so located that the same
number of scores are above it as are below it. This is a score that divides the distribution into two equal halves.
It is sometimes called counting average.
3. Mode (Counting Average). The mode is the score value that has a frequency which is large in relation to
other frequencies near it in the distribution. It is the score that most often in the distribution. It is also referred
to as crude or inspectional average
Measures of Variability
When describing a set of scores, it is often significant to report how variable the scores are, how much they
spread out high to low scores. How much they spread out from high to low scores. Then, a measure of this spread
is an important statistic for describing a group. Variability is the extent to which the scores of a group tend to scatter
(disperse/spread) above and below a central point in the distribution.
1. Range. A very simple measure of variability is the range of scores in the group. This is simply the difference
between the highest and the lowest scores.
2. Quartile Deviation (or Semi-interquartile range). One measure of variability that includes a specified part of
the total group, usually the middle 50 % (between the 25 th and the 75th percentile). It is actually the average
distance from the median to the 2 quartiles, that is, it tells how far the quartile points lie from the median, on
the average.
3. Standard Deviation. It is a measure of dispersion among all scores in the distribution. It is the square root of
the average of the squared deviations from the mean.
A. Mean
Procedure:
1. Mean of Ungrouped Data: used for few cases ( N<30 )
a. Get the sum of scores ( X)
b. Divide the sum by the number of cases (N)
Formula: =
Procedure Example:
There are several ways in the computation of median for ungrouped data. The process depends
on a case to case basis
Procedure Example:
C. Mode ( X̂ )
Procedure
when there are more than three modes, they are called polymodal or multimodal
a. Crude Mode - refers to the midpoint of the class limit with the highest frequency.
Procedure: Example
b. Refined Mode - refers to the mode obtained from an ordered arrangements or a class
frequency distribution
Formula: X̂ = Lmo + ( )c
Procedure: Example
1. Identify Class Limits f
the modal class with the 45-49 2
highest frequency occurs 40-44 0
2. Determi 35-39 12 d2 = 1
ne the lower boundary Lmo = 29.5 30-34 13
(Lmo) of the modal class 25-29 10 d1 = 3
3. Get the 20-24 5
difference between the 15-19 4
frequency of the modal 10-14 4
class and the frequency
just before it (d1)
4. Get the X̂ = Lmo +
difference between the
frequency of the modal
class and the frequency ( )c
just after it (d2)
5. Find
the size of the class = 29.5 +
interval (c) and the (5)
frequency of the modal
class
6. Then, = 29.5 +
apply the formula
= 29.5 + 3.75
= 33.25
The value that represents a set of data will be the basis in determining whether the group is performing
better or poorer than the other groups.
Types of Distribution
On the normal distribution curve, the Measures of Central Tendency (mean, median, and mode) are in the
same spot at the middle of the curve. The curve is symmetrical.
A skewed distribution represents a set of scores or numbers that is not equal on both sides. This results
from a few scores in a data set falling farther to one end or the other. This arrangement is quite common in
real world settings, especially when smaller numbers of scores are involved.
When a set of scores is skewed, the "averages" are not at the exact center and the curve is asymmetrical.
The mode remains at the highest part of the curve, the median is still the middle score, and the mean is
pulled in the direction of the outliers. This refers to a negatively skewed distribution, that is, the distribution
is said to be left-skewed or "skewed to the left". Thus, the mean is pulled toward the negative tail of the curve.
Hence, tail is stretched toward the left side of the baseline. This means that the left tail is longer; the mass of
the distribution is concentrated on the right of the figure. It has relatively few low values.
Conversely, when disproportionately high scores occur, the mean is pulled toward the positive tail of the
curve. In this case, tail is stretched toward the right side of the baseline. This refers to a positively skewed
distribution, that is, the distribution is said to be right-skewed or "skewed to the right". This means that
the right tail is longer; the mass of the distribution is concentrated on the left of the figure. It has relatively
few high values.
A. Range ( R )
1. For Ungrouped Data – the difference between the highest and lowest score
2. For Grouped Data – the difference between the highest limit of the highest class limit and the
lowest limit of the lowest class limit
d = 40
2
cases:
1. The results will help you determine if the group is homogeneous or not.
2. The results will also help you determine the number of students that fall below and above the average
performance
STANDARD SCORES
Indicate the pupil’s relative position by showing how far his raw score is above or below average
Express the pupil’s performance in terms of standard unit from the mean.
Represented by the normal probability curve or what is commonly called the normal curve
Used to have a common unit to compare raw scores from different tests
1. Z-SCORES
Formula: Where
X = individual’s raw score
XX
Z X = mean of the normative group
SD SD = standard deviation of the
normative group
Given: X 26 ; SD = 2
Joseph’s Score
X = 27
X X 27 26 1
Z = Z = 0.5
SD 2 2
John’s Score
X = 25
X X 25 26 1
Z Z = -0.5
2 2 2
Note:
Z – score is negative when X < X
Z – score is positive when X > X
2. T – SCORES
it refers to any set of normally distributed standard deviation score that has a mean of 50 and a
standard deviation of 10.
Computed after converting raw scores to z-scores to get rid of negative values
The equivalence of z-scores, t-scores and their relation to percentiles and to the
normal curve is shown below.
2% 2%
14% 34% 34% 14%
SD’s -4 -3 -2 -1 0 +1 +2 +3 +4
Z-Scores -4 -3 -2 -1 0 +1 +2 +3 +4
T-Scores 10 20 30 40 50 60 70 80 90
Percentiles 1 2 16 50 84 98 99.9