Statistical Tests
Statistical Tests
Statistical Tests
use Which ?
For a person being from a non-statistical background the
most confusing aspect of statistics, are always the
fundamental statistical tests, and when to use which.
This blog post is an attempt to mark out the difference
between the most common tests, the use of null
value hypothesis in these tests and outlining the
conditions under which a particular test should be
used.
Critical Value
A critical value is a point (or points) on the scale of the
test statistic beyond which we reject the null hypothesis,
and, is derived from the level of significance α of the
test. Critical value can tell us, what is the
probability of two sample means belonging to the
same distribution. Higher, the critical value means
lower the probability of two samples belonging to
same distribution. The general critical value for a two-
tailed test is 1.96, which is based on the fact that 95%
of the area of a normal distribution is within 1.96
standard deviations of the mean.
Statistical Distributions
discrete values or whether the data is continuous; whether a new
pharmaceutical drug gets FDA approval or not is a…
people.stern.nyu.edu
Z-test
In a z-test, the sample is assumed to be normally
distributed. A z-score is calculated with population
parameters such as “population mean” and
“population standard deviation” and is used to
validate a hypothesis that the sample drawn
belongs to the same population.
z = (x — μ) / (σ / √n), where
x= sample mean
μ = population mean
T-test
A t-test is used to compare the mean of two given
samples. Like a z-test, a t-test also assumes a normal
distribution of the sample. A t-test is used when the
population parameters (mean and standard deviation)
are not known.
x1 = mean of sample 1
x2 = mean of sample 2
n1 = size of sample 1
n2 = size of sample 2
ANOVA
ANOVA, also known as analysis of variance, is used
to compare multiple (three or more) samples with a
single test. There are 2 major flavors of ANOVA
Null: All pairs of samples are same i.e. all sample means
are equal
m = number of restrictions
k = number of independent variables
Chi-Square Test
Chi-square test is used to compare categorical
variables. There are two type of chi-square test
Note: As one can see from the above examples, in all the
tests a statistic is being compared with a critical value to
accept or reject a hypothesis. However, the statistic and
way to calculate it differ depending on the type of
variable, the number of samples being analyzed and if
the population parameters are known. Thus depending
upon such factors a suitable test and null hypothesis is
chosen.
Disclaimer