Intro - Stat
Intro - Stat
• aid in summarization
• aid in “getting at what’s going on”
• aid in extracting “information” from the data
• aid in communication
Origin of statistics
6
What is/are Statistics?
In Singular sense
Statistics means the art or science which is concerned
with the collection, presentation, and analysis of
quantitative data so that intelligent judgment may be
formed upon them.
In Plural sense
Statistics means information useful to decision makers
such as statistics on employment and unemployment,
or vital statistics on births, deaths and marriages.
Classifications:
Descriptive Statistics
Gives numerical and graphic
procedures to summarize a
collection of data in a clear and
understandable way
• Inferential Statistics
Provides procedures to draw
inferences about a population
from a sample
Populations & Samples
Population –The entire set of individuals or objects of
interest or the measurements obtained from all
individuals or objects of interest
Example:
In a recent survey, 250 college students at Union College
were asked if they smoked cigarettes regularly. 35 of the
students said yes. Identify the population and the sample.
9
Populations & Samples
Responses of
students in survey
(sample)
A Good Sample
o The sample needs to be substantial enough to
accurately represent the entire population.
o Every unit in the population has an equal and
independent probability of being chosen;
individuals are chosen at random.
o If data is not collected randomly, it cannot be
used in any meaningful way to draw
conclusions. Random (or probability) sampling
lowers bias and sample error.
11
Statistical data
The collection of data that are relevant to the problem being
studied is commonly the most difficult, expensive, and time-
consuming part of the entire research project.
Statistical data are usually obtained by counting or
measuring items.
Primary data are collected specifically for the analysis
desired
Secondary data have already been compiled and are
available for statistical analysis
A variable is an item of interest that can take on many
different numerical values.
A constant has a fixed numerical value.
Data
Statistical data are usually obtained by counting or
measuring items. Most data can be put into the
following categories:
• Qualitative - data are measurements that each
fail into one of several categories. (hair color,
ethnic groups and other attributes of the
population)
• Quantitative - data are observations that are
measured on a numerical scale (distance
traveled to college, number of children in a
family, etc.)
Types of Variables
• Variables can be classified as discrete or
continuous.
• Discrete variables (such as class size)
consist of indivisible categories, and
continuous variables (such as time or
weight) are infinitely divisible into
whatever units a researcher may choose.
For example, time can be measured to
the nearest minute, second, half-second,
etc.
14
Basic Concepts and Definition
Variable: A characteristic about each individual element of a
population or sample.
Data (singular): The value of the variable associated with
one element of a population or sample. This value may be
a number, a word, or a symbol.
Data (plural): The set of values collected for the variable
from each of the elements belonging to the sample.
Experiment: A planned activity whose results yield a set of
data.
Parameter: A numerical value summarizing all the data of
an entire population.
Statistic: A numerical value summarizing the sample data.
Variable
A researcher may be interested in the
relation between class size (variable 1)
and academic performance (variable 2) for
the population of third-grade children.
16
Types of variables
Variables
Qualitative Quantitative
19
Research Question Independent variable(s) Dependent variable(s)
Do tomatoes grow fastest under •The type of light the •The rate of growth of
fluorescent, incandescent, or tomato plant is grown the tomato plant
natural light? under
What is the effect of diet and •The type of soda you •Your blood sugar levels
regular soda on blood sugar drink (diet or regular)
levels?
How does phone use before •The amount of phone •Number of hours of
bedtime affect sleep? use before bed sleep
•Quality of sleep
20
Measuring Variables
• To establish relationships between variables,
researchers must observe the variables and
record their observations. This requires that
the variables be measured.
• The process of measuring a variable requires a
set of categories called a scale of
measurement and a process that classifies
each individual into one category.
21
4 Types of Measurement Scales
1. A nominal scale is an unordered set of
categories identified only by name. Nominal
measurements only permit you to determine
whether two individuals are the same or
different.
2. An ordinal scale is an ordered set of
categories. Ordinal measurements tell you
the direction of difference between two
individuals.
22
4 Types of Measurement Scales
3. An interval scale is an ordered series of equal-sized
categories. Interval measurements identify the
direction and magnitude of a difference. The zero
point is located arbitrarily on an interval scale.
4. A ratio scale is an interval scale where a value of
zero indicates none of the variable. Ratio
measurements identify the direction and
magnitude of differences and allow ratio
comparisons of measurements.
23
Applications of Statistics
• describe the characteristic of the elements in the
population under study through the computation
or estimation of a parameter such as the
proportion, total, and average