0% found this document useful (0 votes)

20 views

Chapter 1 - Sampling and Experimental Design

This document provides an overview of key concepts in sampling and experimental design covered in sections 1.3-1.5 of Chapter 1, including: 1) Biased and random sampling methods, with random sampling being preferred since it avoids systematic differences between the sample and population. 2) Observational studies versus experiments, with experiments being preferred for determining causation since treatments are randomly assigned. 3) The difference between prospective and retrospective observational studies, with prospective preferred due to less potential for confounding variables and bias. 4) The definition of a confounding variable as one that is related to both the explanatory and response variables, obscuring the effect of the explanatory variable.

Uploaded by

Yassine Belhaje

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views

Chapter 1 - Sampling and Experimental Design

Uploaded by

Yassine Belhaje

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Chapter 1 - Sampling and Experimental Design

Read sections 1.3 - 1.5

Sampling
(1.3.3 and 1.4.2)

Sampling Plans: methods of selecting individuals from a population. We are interested in sampling
plans such that results from the sample can be used to make conclusions about the population.
Biased Samples: Bias occurs when the sample tends to diﬀer from the population in a systematic
way. When this happens, results from the sample can not be used to make conclusions about the
population of interest.

1. Convenience Sample - An “easily available” sample of individuals which was convenient for
the researcher to collect. This is a BAD sampling plan since the individuals in the convenience
sample may systematically diﬀer from the population and therefore may not represent the entire
population.

2. Voluntary Response Sample - A sample of individuals who volunteer to participate. This

is a BAD sampling plan since the individuals who volunteer may systematically diﬀer from the
population and therefore may not represent the entire population.

Types of Bias in Sampling:

• Selection Bias - The sampling plan excludes some part of the population from the selection
process. Those excluded from the selection process systematically diﬀer from those included.
EXAMPLES:
– Phone surveys exclude (1) households without a phone, (2) prisoners, and (3) homeless
people.
– Call-in polls on TV exclude (1) individuals without a TV, (2) individuals not watching the
program, and (3) individuals who do not care to participate

• Measurement/Response Bias - The method of observation tends to produce measurements

that diﬀer from the true value of the response.
EXAMPLES:
– uncalibrated scale
– untrained or ill-trained technician
– wording of a survey or interviewer influence

• Non-response Bias - Data is not obtained from all individuals in the sample. This bias occurs
when those who respond systematically diﬀer from those who do not respond.
EXAMPLES:
– Telephone and mail surveys

IMPORTANT POINTS to remember:

• A biased sample is a biased sample, regardless of its size! Collecting more data in a biased fashion
will not correct the problem.

1
• A biased sample still contains information about a population, but this population is not the one
that a researcher is interested in! Information can still be gleaned from biased samples, but one
must be wary of the interpretation.
EXAMPLE:
– Drug trials using human volunteers
– Studies on animals which have been specifically bred for experiments

QUESTION: What type of sample and bias?

A researcher is interested in the opinions of MSU students about updating gym
equipment. A surveyor stands at the gym entrance door and uses the next 50 people
who enter as a sample and asks each their opinion about updating gym equipment.

Random Sampling: A sample of individuals who have been chosen randomly from the population.
Random samples tend to represent the population from which they are chosen since randomization
does not systematically favor some individuals in the population over others.

Since random samples are representative of the population of interest, then inference is valid. In other
words, results from a random sample can be generalized to make conclusions about the population.

Random Sampling can be done:

• With replacement, which means that after an individual is selected to be the sample, that
individual can potentially be selected into the sample again. This method needs to be used when
the sample size n is more than 5% of the population size, 20n > N , where N is the population
size.

• Without replacement, which is much more commonly used, is where once an individual is
selected to be in the sample, that individual may not be selected again. Therefore, the sample
consists of n distinct individuals. This method is used when the population size is infinite; or
if the population size is N , and the sample size n is no more than 5% of the population size,
20n ≤ N .

QUESTION: With or without Replacement?

1. Randomly sampling a 5 card hand from a standard deck of 52 playing cards:

2. Randomly sampling 2000 Montanans:

2
Types of Random Samples:

1. Simple Random Sample (SRS) - Each possible sample of size n has an equal chance of being
selected from the population.

• How to Select a SRS:

– Put slips of paper in a hat, mix well, then choose n slips.
– Use a computer:
(a) Create a sampling frame, a numbered list of all individuals in the population.
(b) Use a random number generator to select individuals from the list.

2. Stratified Random Sample - Separate the population into non-overlapping homogeneous

groups, called strata. Take a SRS from each strata, then combine the SRSs to form the stratified
random sample.

• Stratifying is beneficial if the population consists of strata that diﬀer in regards to the
variable of interest.
• Usually, ni , the size of the SRS from each strata, is proportional to Ni , the size of the strata
within the population.

QUESTION:
Give an example of a study for which stratifying would be necessary.

3. Cluster Sample - If the population naturally consists of non-overlapping groups, called clusters,
where each cluster is heterogeneous (i.e. it represents and reflects the variability in the population)
then a SRS of clusters can be drawn. All individuals in the selected clusters form the cluster
sample.

3
QUESTION:
(a) There are about 20 sections of STAT 216 oﬀered at MSU each semester. How
would you use cluster sampling to choose a sample from all STAT216 students?

(b) What are the two main diﬀerences between a stratified random sample and a
cluster sample?

4. Systematic Sample - Select every k th individual from the numbered population list. This works
well only if:
• the variable of interest is not related to the order of the list or
• the variable of interest is related to the list’s order, but not in a cyclic manner.

QUESTION: Would Systematic Sampling Work Well?

(a) A Phonebook:

(b) Husband/Wife Listing:

1. husband 2. wife 3. husband 4. wife etc.

4
Observation and Experimentation
(1.3.5 and 1.4.1)

Observational Study: A study which observes individuals and measures variables, but does not
attempt to influence the responses.

• An observational study on individuals from a random sample allows one to generalize conclusions
about the sample to the population.

• An observational study cannot show cause-and-eﬀect relationships because there is the

possibility that the response is aﬀected by some variable(s) other than the ones being measured.
That is, confounding variables may be present. “It ain’t what you don’t know that gets you
into trouble. It’s what you know for sure that just ain’t so.” - Mark Twain

• In prospective observational studies, investigators choose a sample and collect new data
generated from that sample. That is, the investigators “look forward in time.”

• In retrospective observational studies, investigators “look backwards in time” and use data
that have already been collected. Retrospective studies are often criticized for having more
confounding and bias compared to prospective studies.

QUESTION: Prospective or Retrospective Observational Study?

1. A study that follows marijuana users in Colorado for 5 years.
2. A study of illegal immigrant activity last year in Arizona.

Experiment: A study in which treatment(s) are deliberately imposed on individuals in order to

observe their response.

• An experiment in which the treatments are randomly assigned to individuals can provide evidence
for a cause-and-eﬀect relationship. Furthermore, if the individuals are from a random sample,
then one can generalize conclusions from the experiment to the population.

To recognize the diﬀerence between an Observational Study and an Experiment, ask yourself, “Was
there a treatment imposed on the individuals?” In an experiment, the researcher determines (randomly)
which individuals receive which treatment. In an observational study, the individuals have already self-
chosen their groups.

QUESTION: Observational Study or Experiment?

1. A study of the birth weight of babies and the mother’s level of coﬀee consumption.
2. A study of lab mice whose spinal cords have been severed.
3. A study of gender versus salary.
4. A study of grizzly bear attacks.
5. A study of the number of 1’s rolled on a weighted die.

5
Confounding Variable: A variable that is related to the response variable and to the explanatory
variable in such a way that makes it impossible to distinguish the effects of the confounding variable
on the response from the effects of the explanatory variable on the response.
EXAMPLES:
• In a study of gender differences in salary, it was found that female nurses (in a certain hospital)
have higher salaries, on average, than do male nurses. It also was found that female nurses
have a greater number of years of experience than do male nurses. Years of experience is a
confounding variable. It may be that the data give no clue as to whether the salary difference is
due to gender discrimination or due to years of experience.

• In a study investigating the association between the occurrence of low birth weight babies and
the mother’s level of coffee consumption, it was found that an increase in the mother’s coffee
consumption is associated with an increase in the risk of having a low birth weight baby. It also
was found that moms who smoke also consume large amounts of coffee and moms who do not
smoke consume no or small amounts of coffee. Smoking is a confounding variable. Are the low
birth weights due to the smoking or the coffee? CAN’T TELL!

Principles of Experimental Design

(1.5)

Experimental Designs: methods of assigning treatments to individuals (units or cases)

Unit: an individual in an experiment
Subject: a human experimental unit
Factor: a categorical explanatory variable
Treatment: a combination of levels of factors
Extraneous Factor: a factor that is not of primary interest and yet affects the response variable. An
extraneous factor is called a confounding variable if its effect on the response cannot be distinguished
from the effect of another factor on the response.
The goal of an experiment is to determine the effects of factor(s) on the response while taking into
account extraneous factors that also affect the response.
Control Group: a group that receives no treatment (or a placebo). The response of the treatment
group is compared to the response of the control group to determine effectiveness of the treatment.

Placebo: a treatment that has no active ingredients (a fake treatment). A placebo is supposed
to resemble the real treatment as far as appearance, taste, and feel so that subjects believe they
are receiving the true treatment. Use of a placebo mitigates “the power of suggestion.” That is, a
treatment, when thought to be beneficial, tends to positively aﬀect responses (and a non-beneficial
treatment tends to negatively aﬀect responses).

Single-blind: the subjects do not know what treatment was received. A single-blind experiment
avoids the unconscious expectations of the subjects of one treatment over another.
EXAMPLE:
Give an example of an experiment which can not be made single-blind.

6
Double-blind: neither the subject nor the person recording the response know what treatment was
received. A double-blind experiment avoids the unconscious expectations of the subjects and of the
recorder of one treatment over another.

Four Basic Principles of Experimental Design:

1. Direct Control - Holding extraneous factors constant for all units so that the eﬀects of the
extraneous factors are not confounded with the factors of interest.

2. Random Assignment - Treatments are randomly assigned to units in order to create similar
experimental groups. In other words, the values of the extraneous variables will be similar, on
average, for each experimental group.

3. Replication - The experiment is replicated on many units for each treatment group to reduce
the role of random variation due to uncontrolled and “unblocked” extraneous variables.

• If there was only one unit in each of two treatment groups, then it could happen that these
two units are quite diﬀerent. But if we randomly assign several more units to each group,
then any diﬀerences will get “evened out”.

4. Blocking - Units are classified into subgroups or blocks so that the extraneous factors are held
constant for all units within a given block. Treatments are randomly assigned to units within
each block.
• “Block what you can, randomize what you cannot,” George Box.
• Blocks and strata are diﬀerent. Blocking refers to classifying experimental units into blocks
whereas stratification refers to classifying individuals of a population into strata.
• The samples from the strata in a stratified random sample can be the blocks in an
experiment.

7
Two Basic Experimental Designs:

1. Completely Randomized Design (CRD): Experimental units are randomly assigned to each
treatment (using principles 1-3).

EXAMPLE:
Consider an experiment to compare the effect of using two different neurons (olfactory and
motor) and two different antibiotics (amoxicillin and tetracycline) to repair severed spinal cords
in laboratory mice.

What are the experimental units?

What is the response?

Give one factor in the experiment.

Give a second factor.

What are the treatments?

What are some potential extraneous factors?

How can direct control be used so that these extraneous factors don’t obscure the eﬀects of the
treatments on the response?

8
2. Randomized Block Design (RBD): Units are classified into blocks that are similar with
respect to extraneous variable(s), then units are assigned to treatments independently within
each block (using principles 1-4).

EXAMPLE:
Consider an experiment to determine the eﬀect of diﬀerent wheat strains (called A, B, and C) on crop
yield (in bushels/acre).
What are the experimental units?

What is the response?

Give the factor in the experiment.

What are the treatments?

What are some potential extraneous factors?

How can direct control and blocking be used to account for these extraneous factors so they don’t
obscure the eﬀects of the treatments on the response?

Exercises
Sampling, p. 58: 1.9 - 1.15 odd
Observational studies, p. 60: 1.17 - 1.29 odd
Experiments, p. 63: 1.31 - 1.37 odd

Unit 3 - Sampling and Experimental Design New - Read-Only
No ratings yet
Unit 3 - Sampling and Experimental Design New - Read-Only
44 pages
Chapter 2
No ratings yet
Chapter 2
56 pages
Collecting Data Sensibly: Chapter Is VERY Important!
No ratings yet
Collecting Data Sensibly: Chapter Is VERY Important!
52 pages
Statistics Chapter 4 Notes Section 4.1 Designing Studies: Definition: Population and Sample
No ratings yet
Statistics Chapter 4 Notes Section 4.1 Designing Studies: Definition: Population and Sample
6 pages
AP Stats Module 3 Notes
No ratings yet
AP Stats Module 3 Notes
2 pages
Sample and Sampling Distribution
No ratings yet
Sample and Sampling Distribution
27 pages
Sampling Theory PPT Present
No ratings yet
Sampling Theory PPT Present
13 pages
Statistics Chapter 5.1
No ratings yet
Statistics Chapter 5.1
17 pages
Sampling Techniques Ali 2014
No ratings yet
Sampling Techniques Ali 2014
43 pages
BMS1042 Module 1
No ratings yet
BMS1042 Module 1
7 pages
Chapter 5: Selecting Research Participants
No ratings yet
Chapter 5: Selecting Research Participants
5 pages
Introduction
No ratings yet
Introduction
9 pages
LESSON 4 Sampling and Sampling Distribution
No ratings yet
LESSON 4 Sampling and Sampling Distribution
16 pages
Types of Samples: Probability Sampling (Representative Samples)
No ratings yet
Types of Samples: Probability Sampling (Representative Samples)
4 pages
Action Research Work 22
No ratings yet
Action Research Work 22
5 pages
Chapter Two: Data Collection and Sampling
No ratings yet
Chapter Two: Data Collection and Sampling
21 pages
Notes Chapter 1b Bluman (2023) Elementary Statistics 2
No ratings yet
Notes Chapter 1b Bluman (2023) Elementary Statistics 2
8 pages
Lesson 11 Sampling
No ratings yet
Lesson 11 Sampling
19 pages
lec 5. sampling method
No ratings yet
lec 5. sampling method
58 pages
Sampling Procedure
No ratings yet
Sampling Procedure
11 pages
Week 6 Sampling Methods
No ratings yet
Week 6 Sampling Methods
52 pages
Lesson 2.4 the Sample and Sampling Procedure Copy
No ratings yet
Lesson 2.4 the Sample and Sampling Procedure Copy
40 pages
3T2324 Module 2 - 3
No ratings yet
3T2324 Module 2 - 3
43 pages
SEU - DS510 - Module 2 Data Collection
No ratings yet
SEU - DS510 - Module 2 Data Collection
217 pages
B.SC (CS With AI) Unit - 1
No ratings yet
B.SC (CS With AI) Unit - 1
19 pages
Unit 3 Statistics Notes
No ratings yet
Unit 3 Statistics Notes
6 pages
Sampling Designs in Operational Health Research: Dr. Syed Irfan Ali
No ratings yet
Sampling Designs in Operational Health Research: Dr. Syed Irfan Ali
35 pages
Sampling - How To Design and Evaluate Research in Education - Jack - Fraenkel, - Norman - Wallen, - Helen - Hyun
No ratings yet
Sampling - How To Design and Evaluate Research in Education - Jack - Fraenkel, - Norman - Wallen, - Helen - Hyun
6 pages
Probability and Non Probability of Sampling
No ratings yet
Probability and Non Probability of Sampling
17 pages
Population and Sample
No ratings yet
Population and Sample
10 pages
Bstat Handouts - Descriptive Only Handouts 2
No ratings yet
Bstat Handouts - Descriptive Only Handouts 2
12 pages
Selecting A Sample
No ratings yet
Selecting A Sample
3 pages
Sampling Techniques
No ratings yet
Sampling Techniques
23 pages
Lecture 1 - Biostat Basic
No ratings yet
Lecture 1 - Biostat Basic
60 pages
Sampling Techniques for College Research Students
No ratings yet
Sampling Techniques for College Research Students
38 pages
Biostat
No ratings yet
Biostat
10 pages
Appendix
No ratings yet
Appendix
4 pages
Notes - Sampling Design - Mac 2023
No ratings yet
Notes - Sampling Design - Mac 2023
41 pages
Population and Sample
No ratings yet
Population and Sample
5 pages
7/10/15 SR - Muzaitul Akma Mustapa Kamal Basha: Biostatistics NUR 3163
No ratings yet
7/10/15 SR - Muzaitul Akma Mustapa Kamal Basha: Biostatistics NUR 3163
32 pages
Presentation 1
No ratings yet
Presentation 1
37 pages
Types of Non-Probability Sampling
No ratings yet
Types of Non-Probability Sampling
4 pages
T0b_sampling
No ratings yet
T0b_sampling
19 pages
Sampling and Sampling Distributions
No ratings yet
Sampling and Sampling Distributions
63 pages
Statistics Assignment: by Vuyyuri Sujith Varma REG - NO: 17010141138 Bba Sec (A) Sem-2
No ratings yet
Statistics Assignment: by Vuyyuri Sujith Varma REG - NO: 17010141138 Bba Sec (A) Sem-2
20 pages
Sampling Techniques TULIO JO GABRIEL
No ratings yet
Sampling Techniques TULIO JO GABRIEL
35 pages
GEDS 802 Note - Descriptive Stat - pt.2
No ratings yet
GEDS 802 Note - Descriptive Stat - pt.2
27 pages
Sampling Techniques
No ratings yet
Sampling Techniques
41 pages
Sampling Design: Dr. Eunice B. Custodio Philippines
No ratings yet
Sampling Design: Dr. Eunice B. Custodio Philippines
30 pages
Sampling
No ratings yet
Sampling
22 pages
MRM Mod 3
No ratings yet
MRM Mod 3
121 pages
9
No ratings yet
9
54 pages
Chapter 4 Slides
No ratings yet
Chapter 4 Slides
29 pages
Chapter Two
No ratings yet
Chapter Two
17 pages
Mmw Stat Lesson 7
No ratings yet
Mmw Stat Lesson 7
11 pages
2006 - Philosophy, Methodology and Action Research
No ratings yet
2006 - Philosophy, Methodology and Action Research
43 pages
Week 7.1 PR Ppt Corrected
No ratings yet
Week 7.1 PR Ppt Corrected
20 pages
Sampling Methods
No ratings yet
Sampling Methods
14 pages
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
No ratings yet
Research in Psychology
From Everand
Research in Psychology
Connor Whiteley
No ratings yet
Assignment 1: (Decision Support System & Expert System)
No ratings yet
Assignment 1: (Decision Support System & Expert System)
10 pages
Sister Callista Roy: Roy Adaptation Model
67% (3)
Sister Callista Roy: Roy Adaptation Model
29 pages
Fifth Grade Unit 3 Addition and Subtraction of Fractions
No ratings yet
Fifth Grade Unit 3 Addition and Subtraction of Fractions
11 pages
People V de La Cruz
100% (1)
People V de La Cruz
2 pages
METHODS OF PHILOSOPHIZING (PPT1112-Ic-2.1)
100% (10)
METHODS OF PHILOSOPHIZING (PPT1112-Ic-2.1)
51 pages
Why Believe in A Flat Earth?: One Minute Interview
No ratings yet
Why Believe in A Flat Earth?: One Minute Interview
1 page
Command Terms Overview
No ratings yet
Command Terms Overview
3 pages
AERO2379 & AERO2350 Lecture 5 Prezzi Style - 24 Mar 2020-1
No ratings yet
AERO2379 & AERO2350 Lecture 5 Prezzi Style - 24 Mar 2020-1
71 pages
Effects of Procrastination or Crammig o
100% (1)
Effects of Procrastination or Crammig o
36 pages
Idealism: Brief History of Idealism
No ratings yet
Idealism: Brief History of Idealism
2 pages
Test Bias
No ratings yet
Test Bias
5 pages
Understanding Desistance From Crime Laub and Sampson
No ratings yet
Understanding Desistance From Crime Laub and Sampson
70 pages
Aba v. de Guzman
100% (1)
Aba v. de Guzman
2 pages
Kant and Right Theory Ethics
0% (1)
Kant and Right Theory Ethics
5 pages
Evolution 350 Uv Vis Spectrophotometer Specifications PS52998
0% (1)
Evolution 350 Uv Vis Spectrophotometer Specifications PS52998
2 pages
Foreign Literature
83% (12)
Foreign Literature
4 pages
Blooms Taxonomy and 3 Domains of Learning
No ratings yet
Blooms Taxonomy and 3 Domains of Learning
5 pages
A - Three Step - Formula PDF
No ratings yet
A - Three Step - Formula PDF
3 pages
Law As...
No ratings yet
Law As...
41 pages
Generalized Measurement System PDF
No ratings yet
Generalized Measurement System PDF
24 pages
108 Legi Universale
100% (1)
108 Legi Universale
9 pages
Teaching Guide in General Mathematics 11
No ratings yet
Teaching Guide in General Mathematics 11
35 pages
Ba 03
No ratings yet
Ba 03
46 pages
STCT 10-2014 6slide Handout
No ratings yet
STCT 10-2014 6slide Handout
28 pages
Shankya Final
No ratings yet
Shankya Final
71 pages
Critical Discourse Analysis: Demystifying The Fuzziness: October 2015
No ratings yet
Critical Discourse Analysis: Demystifying The Fuzziness: October 2015
9 pages
NACA TN-3273 Compressibility Factor For Steam
No ratings yet
NACA TN-3273 Compressibility Factor For Steam
62 pages
Research Assignment Unit 1
No ratings yet
Research Assignment Unit 1
8 pages
6 - Predicates and Quantifiers
No ratings yet
6 - Predicates and Quantifiers
29 pages
Conclusion Reflectionws
No ratings yet
Conclusion Reflectionws
2 pages