Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Math in The Modern World Stat Lecture

Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

Bicol University Polangui Campus

Name _____________________________________________
GEC 14 Math in the Modern World
1st Sem, SY 2023-2024
Measures of Central Tendency Course/Block __________________ Date ___________
Measures of Variation

MEASURES OF CENTRAL TENDENCY


Ungrouped Data – (or raw data) is data that has not been placed in any group or category after collection
Mean – arithmetic average
∑𝑥 ∑𝑥
𝑥̅ = n, sample size 𝜇= N, population size
𝑛 𝑁

Median – a value in an array of numerical data which separates the array into two equal parts, 50%
above the median and 50% below the median.

Arrange first the data points in ascending or descending order


When n is odd, the median is the [(𝑛 + 1)/2]𝑡ℎ observation.
When n is even, the median is the average of the (𝑛/2)𝑡ℎ and (𝑛/2 + 1)𝑡ℎ observations.

Mode – most frequent data point in the sample


∑ 𝑤𝑥
Weighted Mean - ∑𝑤

Grouped Data - the type of data which is classified into groups after collection. The raw data is categorized into
various groups and a table is created.
∑ 𝑓𝑖 𝑥𝑖 1
Mean = ∑ 𝑓𝑖
where 𝑥𝑖 is the class mark or the midpoint which is 2 (𝑙𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 + 𝑢𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡)

𝑛 *To find the median class, we have to find the


−𝑐𝑓
2
Median = 𝑙 + ( )𝑖 cumulative frequencies of all the classes and n/2.
𝑓
After that, locate the class whose cumulative
l = lower boundary of median class frequency is greater than (nearest to) n/2.
n = number of observations
f = frequency of the median class
i = class interval/width
cf = cumulative frequency of the class preceding the median class

1 0 𝑓 −𝑓 ∆1
Mode = 𝑙 + (2𝑓 −𝑓 )𝑖 or Mode = 𝑙 + (∆ )𝑖
−𝑓1 0 2 1 +∆2

l = lower class boundary of modal class


𝑓1 = frequency of modal class
𝑓0 = frequency of class preceding to modal class
𝑓2 = frequency of class succeeding to modal class
i = class interval/width
To get the
*The modal class is the class • lower boundary → lower limit of the class interval
with the highest frequency. minus 0.5
• class width → in a class interval, higher limit minus
lower limit plus 1
MEASURES OF VARIATION/DISPERSION
Dispersion is a degree to which the data are spread out.
1. Range R = largest data value – smallest data value
= maximum – minimum

The range does not tell us how the observations are distributed between the largest and the smallest
data. It ignores the way in which data are distributed and it is sensitive to outliers. As such, the range is
not a measure of dispersion of all the observations.

Variance and Standard Deviation for Ungrouped Data

2. Variance is defined as the average of all squared deviations of the observations about their mean.
3. Standard Deviation is the positive square root of the variance. It provides a measure of standard
distance from the mean.

∑(𝑥𝑖 −𝜇)2 ∑(𝑥𝑖 −𝜇)2


Population variance 𝜎2 = Population sd 𝜎 = √
𝑁 𝑁

∑(𝑥𝑖 −𝑥̅ )2 ∑(𝑥𝑖 −𝑥̅ )2


Sample variance 𝑠2 = Sample sd 𝑠=√
𝑛−1 𝑛−1

The variance and the standard deviation are the most important concepts in a course on elementary
statistics.

∑ |𝑥−𝑥̅ |
4. Mean Absolute Deviation MAD = 𝑁

Variance and Standard Deviation for Grouped Data

∑ 𝑓𝑖 (𝑥𝑖 −𝜇)2 ∑ 𝑓𝑖 (𝑥𝑖 −𝜇)2


Population variance 𝜎2 = Population sd 𝜎 = √
𝑁 𝑁

∑ 𝑓𝑖 (𝑥𝑖 −𝑥̅ )2 ∑ 𝑓𝑖 (𝑥𝑖 −𝑥̅ )2


Sample variance 𝑠2 = Sample sd 𝑠=√
𝑛−1 𝑛−1

∑ 𝑓𝑥 ∑ 𝑓𝑥
Recall that 𝜇 = and 𝑥̅ =
𝑁 𝑛

Measures of Variation
Summary Characteristics
1. The more the data are spread out, the greater the range, variance, and standard deviation.
2. The more the data are concentrated, the smaller the range, variance, and standard deviation.
3. If the values are all the same (no variation), all these measures will be zero.
4. None of these measures are ever negative.
MEASURES OF RELATIVE POSITION
- are used to locate relative position of an observation in a set of data
- are conversion of values, usually standardized test scores, to show where a given value stands in
relation to other values of the same grouping.

For ungrouped data:


Quartiles describe the division of observations into four defined intervals based on the values of the data and
how they compare to the entire set of observation. They are three values that split sorted data into four parts,
each with an equal number of observations.

𝑘(𝑁 + 1)
𝑄𝑘 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4
Q1 is the first quartile, Q2 is the second quartile, and Q3 is the third quartile.
Interquartile Range = Q3 – Q1
IQR is a measure of statistical dispersion or spread; represents the middle 50% of the data.
Outliers are observation points that are distant or “too far away” from other observations. To find the outliers,
use the formula 1.5∙ (IQR). Anything less that Q1 – 1.5(IQR) or anything greater than Q3 + 1.5(IQR) will be an
outlier.

Deciles are a form of statistical division used to divide a data set into 10 equal parts.

𝑘(𝑁 + 1)
𝐷𝑘 = 𝑡ℎ 𝑖𝑡𝑒𝑚
10

Percentiles indicate the percentage of scores that fall below a particular value. For example, a person with an IQ
of 120 is at the 91st percentile, which indicates that their IQ is higher than 91 percent of other scores.

𝑘(𝑁 + 1)
𝑃𝑘 = 𝑡ℎ 𝑖𝑡𝑒𝑚
100

In short, a quartile is a statistical measure that divides a distribution into four equal parts. A quintile divides a
distribution into five equal parts. A decile divides a distribution into ten equal parts. A percentile divides a
distribution into one hundred equal parts.

You might also like