Professional Documents
Culture Documents
Chapter 7
Chapter 7
STATISTICAL DATA
TREATMENT AND
EVALUATION
Most common applications of statistical tests to the
treatment of analytical results:
𝑧
𝑥−
z=
CI for = 𝑥ҧ (for N measurements)
𝑁
Determine the 80% and 95% confidence interval for (a) the first entry
(1108 mg/L glucose) and (b) the mean value (1100.3 mg/L) for
month 1 in the example. Assume that in each part, s = 19 is a good
estimate of .
How many replicate measurements in month 1 are needed to
decrease the 95% confidence interval to 1100.3 10.0 mg/L
of glucose?
It is essential to keep in mind at
all times that confidence intervals
apply only in the absence of bias
and only if we can assume that s
is a good approximation of
(s → )
for N measurements
A chemist obtained the following data for the alcohol content of a
sample of blood: % C2H4OH: 0.084, 0.089, and 0.079. Calculate
the 95% confidence interval for the mean assuming (a) the three
results obtained are the only indication of the precision of the
method and (b) from previous experience on hundreds of
samples, we know that the standard deviation of the method is
0.005% C2H4OH and is a good estimate of .
STATISTICAL AIDS TO HYPOTHESIS TESTING
H0 : = o
Ha : > o or < o or o
27.7−30.8
𝑧= 5.2 = −3.26
ൗ 30
Since, z < -1.96 (95% CL) and z < -2.85 (99% CL) we reject
the null hypothesis.
Conclusion: The student mean is actually different from the
literature value and not just the result of random error.
Small Sample t Test
𝑥ഥ −o 0.116−0.123
𝑡= 𝑠 = 0.0032 = −4.375
ൗ 𝑁 ൗ 4
tcrit (95% CL) = 3.18 and tcrit (99% CL) = 5.84
If it were confirmed
by further
experiments that the
method always gives
low results, we
would say that the
method had a
negative bias.
• Comparison of Two Experimental Means
H0 : 1 = 2
Ha : 1 2 (two-tailed test)
1 < 2 or 1 > 2 (one-tailed test)
number of degrees of
freedom for finding the
critical value of t is N1 +
N2 - 2.
If absolute value of t < tcritical, the null hypothesis is accepted
and no significant difference between the means has been
demonstrated.
If absolute value of t > tcritical , indicates a significant
difference between the means.
H0 : 1 = 2
Ha : 1 2
Degrees of freedom = 10-2 = 8
Paired Data
Ho: d = o where o is a specific value for the difference to
be tested, often zero
Ho: d o
d > o A new automated procedure for
d < o determining glucose in serum (Method
A) is to be compared with the
𝑑ഥ −o established method (Method B). Both
𝑡= 𝑠𝑑
ൗ 𝑁 methods are performed on serum from
the same six patients to eliminate
patient-to-patient variability. Do the
following results confirm a difference in
the two methods at the 95% confidence
level?
N=6
di = 16 + 9 + 25 + 5 + 22 + 11 = 88
di2 = 1592
𝑑ҧ = 14.67
882
1592 − 14.67
𝑠𝑑 =
6−1
6
= 7.76 𝑡= 7.76 = 4.628
6
tcrit (95% CL) = 2.57, degrees of freedom = 5
Since t > tcrit , therefore, the two methods give different
results (null hypothesis is rejected)
Errors in Hypothesis Testing
• Comparison of Precision
H 0 : 1 2 = 2 2
H a : 1 2 2 2 (two-tailed test)
12 > 22 or 12 < 22 (one-tailed test)
2
𝑠1 the larger variance always appears in
F= 2 the numerator
𝑠2
2 Fcrit = 2.30
𝑠𝑠𝑡𝑑 (0.21)2
𝐹1 = 2 = = 1.96 F1 < Fcrit , null hypothesis is
𝑠1 (0.15)2
2 accepted (no improvement in
𝑠𝑠𝑡𝑑 (0.21)2 precision)
𝐹2 = 2 = = 3.06
𝑠2 (0.12)2
F2 > Fcrit, null hypothesis is
rejected
whether the precision of the second modification is
significantly better than that of the first
2
𝑠1 (0.15)2 Fcrit = 2.69
𝐹2 = = = 1.56
𝑠2 2 (0.12)2
Since F < 2.69, we must accept Ho and conclude that
the two methods give equivalent precision.
Q-Test
ห𝑋𝑞 − 𝑋𝑛 ȁ
𝑄𝑒𝑥𝑝 = where Xq = questionable value
𝑤
Xn = nearest numerical value
w = range
ȁ56.23 − 56.08ȁ
𝑄𝑒𝑥𝑝 = = 0.54
56.23 − 55.95
Qt (95% CL) =0.71.
Since Qexp < Qt
we retain the outlier
at 95% confidence level.
Recommendations for Treating Outliers