Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

CHEM-205 Analytical Chemistry-I: Anova

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 20

ANOVA

CHEM-205 Analytical Chemistry-I

BS(Hons) Semester-4

Dr. Muhammad Zubair


Department of Chemistry, UOG
Hypothesis Testing and ANOVA
Key elements of hypothesis testing

Review of common of sample tests

Introduction to ANOVA
What is Hypothesis Testing?
The intent of hypothesis testing is formally is to examine
two opposing estimations, H0 and HA
 
These two hypotheses are mutually exclusive and exhaustive so that one
is true to the exclusion of the other

We collect and analyze sample information –


for the purpose of determining which of the two hypotheses is true and
which is false
Null and Alternative Hypothesis
 

Null hypothesis, (H0)


 States the assumption (numerical) to be tested
 Begin with the assumption that the null hypothesis is TRUE
 Always contains the ‘=’ sign
 

Alternative hypothesis(Ha)
 Is the opposite of the null hypothesis
 Challenges the status quo
 Never contains just the ‘=’ sign
 Is generally the hypothesis that is believed to be true by the researcher
P-Value
Calculate a test statistic in the sample data that is relevant to the tested hypothesis

After calculating a test statistic we convert this to a P-value by comparing its value
to distribution of test statistic’s under the null hypothesis
  
Measure of how likely the test statistic value is under the null hypothesis
 
P-value ≤ α ⇒ Reject H0 at level α P-value > α ⇒ Do not reject H0 at level α

α it may be 0.05at 95% CI


CONFIDENCE INTERVAL
 
 
 
 
 

Confidence interval: an interval of reasonable values for the


parameter being estimated, where degree of plausibility specified
by a “confidence level”

  Interpreting a 95% CI

We calculate a 95% CI for a hypothetical sample mean to be


between 20.6 and 35.4. Does this mean there is a 95%
probability the true population mean is between 20.6 and 35.4?
One-Way ANOVA
Simplest case is for One-Way (Single Factor) ANOVA

The outcome variable is the variable you’re comparing

The factor variable is the categorical variable being used to define the groups

We will assume k samples (groups)

The one-way is because each value is classified in exactly one way  

ANOVA easily generalizes to more factors


General ANOVA table
Source of df Sum of     MS   F  

 
Variation  
   
Squares  
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

      SSTG
 

 
 

 
SSTG  

 
 

k 1
     
 

SSTG
 

             

Group k-1  
 
 
 
k 1  
 
 
 
 
 
SSTE    

N k
 

                       

                           

   
SSTE          
SSTE
 

                 

Error N-k   N k            

                           

Total N-1 SST                      

                           
Cooking oil Mean±SD Predictor equation a+bx+E

K-1 140.63±91.29 33.48+0.64 (Temperature)+24.67

K-2 99.26±71.14 15.07+0.63 (Temperature)+12.34

K-3 117.52±68.54 35.80+0.48 (Temperature)+25.65

K-4 110.30±65.10 32.40+0.46 (Temperature)+23.53

K-5 155.05±107.34 29.95+0.74 (Temperature)+24.74

Source of Variation SS df MS F P-value F crit  

Between Groups 7958.38 4 1989.60 0.30 0.88 3.47  

Within Groups 70523.09 10 7052.31  

Total 78481.48 14  
Confidence intervals, limits, and levels
QUESTIONs
Could you explain the difference between confidence intervals, limits, and levels?
How are these statistics calculated?
When are they generally used?

ANSWERs
Once again, in preparing to answer this seemingly easy question, I discovered that the
answer is a bit more complex than I at first thought. To explain what I found, I will have to address
the following sub-questions:

1. What are standard errors?


2. How are these standard error statistics calculated?
3. What are confidence intervals, confidence limits, confidence levels, etc.?
What Are Standard Errors?
To understand these various confidence concepts, it is necessary to first understand that, when we
calculate any statistic based on a sample, it is an estimate of something else. Thus when we
calculate the sample mean (M), that statistic is an estimate of the population mean (μ); when we
calculate a reliability estimate for a set of test scores, it is an estimate of the proportion of true
score variance accounted for by those scores; and when we use regression to predict one student’s
score on Test Y from their score on Test X, it is simply an estimate of what their actual score might
be.
However, estimates are just that, estimates. Thus they are not 100% accurate. The issues of
standard errors and confidence are our statistical attempts to examine the inaccuracy of our
estimates; this inaccuracy is also known as error. All statistics are estimates and all statistics have
associated errors.
The mean of a sample on some measured variable is an estimate as are the standard deviation, th
variance, any correlations between that variable and others, means comparisons statistics (e.g., t-
test, F-ratio, etc.), frequency comparisons (e.g., chi-square), and so forth. We can estimate the
magnitude of the errors for any of these statistics by calculating the standard error for whatever
statistic is involved. We then interpret the standard error in probability terms, which is where
confidence intervals, limits, and levels come in.
What Are Confidence Intervals, Confidence Limits, Confidence Levels,

The confidence intervals, limits, and


levels that you asked about in your
question, all have to do with the next
step after you have the standard error
calculated.
confidence interval
Coming back to the terminology, a confidence
interval is the “range of values of a sample
statistic that is likely (at a given level of
probability, called a confidence level) to contain a
population parameter.
The interval that will include the population
parameter a certain percentage (confidence level)
of the time in the long run (over repeated
sampling)”
confidence level
In contrast, a confidence level is the degree of confidence, or certainty,
that the researcher wants to be able to place in the confidence interval.
Put another way, the confidence level is the probability that the
parameter being estimated by the statistic falls within the confidence
interval.

The confidence level is usually expressed as a percentage, but it can also


take the form of a proportion (which is also sometimes called a
confidence coefficient).
confidence limits
the confidence limits (also known as confidence
bounds), are simply the “The upper and lower values of
a confidence interval, that is, the values defining the
range of a confidence interval”
So in a case where the ±2 se confidence interval turns
out to be 47.98 to 54.02 for the 95% confidence level,
the confidence limits are 47.98 and 54.02.
EXAMPLE
The Raw Data
Here are the raw data from the three groups (6 values in Group 1,
and 5 each in Groups 2 and 3).
Group 1 Group 2 Group 3

3 4 9
1 3 7
3 5 8
2 5 11
4 4 9
3
SUM of Square
Summary of data set
Group 1 Group 2 Group 3 Totals

Sum of X 16 21 44 81

Sum of X2 48 91 396 535

n 6 5 5 16

Mean 2.67 4.20 8.80

SS 5.33 2.80 8.80


ANOVA Table

Source df SS MS F

Between 2 108.00 54.00 41.46

Within 13 16.93 0.30  

Total 15 124.94  

You might also like