0% found this document useful (0 votes)

21 views

Statistical Tests

statistical tests for each type of hypothesis

Uploaded by

luzviminda.dulay

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

Statistical Tests

statistical tests for each type of hypothesis

Uploaded by

luzviminda.dulay

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 11

Statistical Tests — When to

use Which ?
For a person being from a non-statistical background the
most confusing aspect of statistics, are always the
fundamental statistical tests, and when to use which.
This blog post is an attempt to mark out the difference
between the most common tests, the use of null
value hypothesis in these tests and outlining the
conditions under which a particular test should be
used.

Null Hypothesis and Testing

Before we venture on the difference between different
tests, we need to formulate a clear understanding of
what a null hypothesis is. A null hypothesis, proposes
that no significant difference exists in a set of given
observations. For the purpose of these tests in general

Null: Given two sample means are equal

Alternate: Given two sample means are not equal

For rejecting a null hypothesis, a test statistic is

calculated. This test-statistic is then compared with a
critical value and if it is found to be greater than the
critical value the hypothesis is rejected. “In the
theoretical underpinnings, hypothesis tests are
based on the notion of critical regions: the null
hypothesis is rejected if the test statistic falls in
the critical region. The critical values are the
boundaries of the critical region. If the test is one-sided
(like a χ2 test or a one-sided t-test) then there will be
just one critical value, but in other cases (like a two-
sided t-test) there will be two”.[1]

Critical Value
A critical value is a point (or points) on the scale of the
test statistic beyond which we reject the null hypothesis,
and, is derived from the level of significance α of the
test. Critical value can tell us, what is the
probability of two sample means belonging to the
same distribution. Higher, the critical value means
lower the probability of two samples belonging to
same distribution. The general critical value for a two-
tailed test is 1.96, which is based on the fact that 95%
of the area of a normal distribution is within 1.96
standard deviations of the mean.

Critical values can be used to do hypothesis testing in

following way

1. Calculate test statistic

2. Calculate critical values based on significance level

alpha

3. Compare test statistic with critical values.

If the test statistic is lower than the critical value,

accept the hypothesis or else reject the hypothesis.
For checking out how to calculate a critical value in
detail please do check

Before we move forward with different statistical tests it

is imperative to understand the difference between a
sample and a population.

In statistics “population” refers to the total set of

observations that can be made. For eg, if we want to
calculate average height of humans present on the earth,
“population” will be the “total number of people
actually present on the earth”.

A sample, on the other hand, is a set of data

collected/selected from a pre-defined procedure. For our
example above, it will be a small group of people
selected randomly from some parts of the earth.

To draw inferences from a sample by validating a

hypothesis it is necessary that the sample is
random.

For instance, in our example above if we select people

randomly from all regions(Asia, America, Europe, Africa
etc.)on earth, our estimate will be close to the actual
estimate and can be assumed as a sample mean,
whereas if we make selection let’s say only from the
United States, then our average height estimate will not
be accurate but would only represent the data of a
particular region (United States). Such a sample is then
called a biased sample and is not a representative of
“population”.

Another important aspect to understand in statistics is

“distribution”. When “population” is infinitely large it is
improbable to validate any hypothesis by calculating the
mean value or test parameters on the entire population.
In such cases, a population is assumed to be of some
type of a distribution.

The most common forms of distributions are Binomial,

Poisson and Discrete. However, there are many other
types which are mentioned in detail at

Statistical Distributions
discrete values or whether the data is continuous; whether a new
pharmaceutical drug gets FDA approval or not is a…
people.stern.nyu.edu

The determination of distribution type is necessary

to determine the critical value and test to be
chosen to validate any hypothesis

Now, when we are clear on population, sample, and

distribution we can move forward to understand
different kinds of test and the distribution types for
which they are used.
Relationship between p-value,
critical value and test statistic
As we know critical value is a point beyond which we
reject the null hypothesis. P-value on the other hand is
defined as the probability to the right of respective
statistic (Z, T or chi). The benefit of using p-value is that
it calculates a probability estimate, we can test at any
desired level of significance by comparing this
probability directly with the significance level.

For e.g., assume Z-value for a particular experiment

comes out to be 1.67 which is greater than the critical
value at 5% which is 1.64. Now to check for a different
significance level of 1% a new critical value is to be
calculated.

However, if we calculate p-value for 1.67 it comes to be

0.047. We can use this p-value to reject the hypothesis at
5% significance level since 0.047 < 0.05. But with a
more stringent significance level of 1% the hypothesis
will be accepted since 0.047 > 0.01. Important point to
note here is that there is no double calculation
required.

Z-test
In a z-test, the sample is assumed to be normally
distributed. A z-score is calculated with population
parameters such as “population mean” and
“population standard deviation” and is used to
validate a hypothesis that the sample drawn
belongs to the same population.

Null: Sample mean is same as the population mean

Alternate: Sample mean is not same as the population

mean

The statistics used for this hypothesis testing is called z-

statistic, the score for which is calculated as

z = (x — μ) / (σ / √n), where

x= sample mean

μ = population mean

σ / √n = population standard deviation

If the test statistic is lower than the critical value,

accept the hypothesis or else reject the hypothesis

T-test
A t-test is used to compare the mean of two given
samples. Like a z-test, a t-test also assumes a normal
distribution of the sample. A t-test is used when the
population parameters (mean and standard deviation)
are not known.

There are three versions of t-test

1. Independent samples t-test which compares mean for
two groups

2. Paired sample t-test which compares means from the

same group at different times

3. One sample t-test which tests the mean of a single

group against a known mean.

The statistic for this hypothesis testing is called t-

statistic, the score for which is calculated as

t = (x1 — x2) / (σ / √n1 + σ / √n2), where

x1 = mean of sample 1

x2 = mean of sample 2

n1 = size of sample 1

n2 = size of sample 2

There are multiple variations of t-test which are

explained in detail here

T Test (Student's T-Test): Definition and Examples

Contents: The t test (also called Student's T Test) compares two
averages ( means) and tells you if they are different…
www.statisticshowto.com

ANOVA
ANOVA, also known as analysis of variance, is used
to compare multiple (three or more) samples with a
single test. There are 2 major flavors of ANOVA

1. One-way ANOVA: It is used to compare the difference

between the three or more samples/groups of a single
independent variable.

2. MANOVA: MANOVA allows us to test the effect of one

or more independent variable on two or more dependent
variables. In addition, MANOVA can also detect the
difference in co-relation between dependent variables
given the groups of independent variables.

The hypothesis being tested in ANOVA is

Null: All pairs of samples are same i.e. all sample means
are equal

Alternate: At least one pair of samples is significantly

different

The statistics used to measure the significance, in this

case, is called F-statistics. The F value is calculated
using the formula

F= ((SSE1 — SSE2)/m)/ SSE2/n-k, where

SSE = residual sum of squares

m = number of restrictions
k = number of independent variables

There are multiple tools available such as SPSS, R

packages, Excel etc. to carry out ANOVA on a given
sample.

Chi-Square Test
Chi-square test is used to compare categorical
variables. There are two type of chi-square test

1. Goodness of fit test, which determines if a sample

matches the population.

2. A chi-square fit test for two independent variables is

used to compare two variables in a contingency table to
check if the data fits.

a. A small chi-square value means that data fits

b. A high chi-square value means that data doesn’t fit.

The hypothesis being tested for chi-square is

Null: Variable A and Variable B are independent

Alternate: Variable A and Variable B are not

independent.
The statistic used to measure significance, in this case, is
called chi-square statistic. The formula used for
calculating the statistic is

Χ2 = Σ [ (Or,c — Er,c)2 / Er,c ] where

Or,c = observed frequency count at level r of Variable A

and level c of Variable B

Er,c = expected frequency count at level r of Variable A

and level c of Variable B

Note: As one can see from the above examples, in all the
tests a statistic is being compared with a critical value to
accept or reject a hypothesis. However, the statistic and
way to calculate it differ depending on the type of
variable, the number of samples being analyzed and if
the population parameters are known. Thus depending
upon such factors a suitable test and null hypothesis is
chosen.

This is the most important point which I have

noted, in my efforts to learn about these tests and
find it instrumental in my understanding of these
basic statistical concepts.

Disclaimer

This post focuses heavily on normally distributed data. Z-

test and t-test can be used for data which is non-
normally distributed as well if the sample size is greater
than 20, however there are other preferable methods to
use in such a situation. Please visit
http://www.statisticshowto.com/probability-and-
statistics/non-normal-distributions/ for more info on tests
for non normal distributions.

Inferential Statistics
100% (4)
Inferential Statistics
28 pages
Chi Square Assignment MOHA 570
No ratings yet
Chi Square Assignment MOHA 570
3 pages
SPSS - Practice Questions For Exam
50% (2)
SPSS - Practice Questions For Exam
7 pages
Types of Statistical Hypothesis: Statistics
No ratings yet
Types of Statistical Hypothesis: Statistics
18 pages
Statistical Test
No ratings yet
Statistical Test
38 pages
Comparison of Means: Hypothesis Testing
No ratings yet
Comparison of Means: Hypothesis Testing
52 pages
CH 11 - Small Sample Test
No ratings yet
CH 11 - Small Sample Test
8 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
12 pages
2statistics Prac New
No ratings yet
2statistics Prac New
13 pages
Statistics - The Big Picture
No ratings yet
Statistics - The Big Picture
4 pages
Hypothesis Testing. BCApptx
No ratings yet
Hypothesis Testing. BCApptx
34 pages
7-9
No ratings yet
7-9
99 pages
UNIT 10
No ratings yet
UNIT 10
30 pages
Hypothesis Testing : Z-Test, T-Test, F-Test
No ratings yet
Hypothesis Testing : Z-Test, T-Test, F-Test
42 pages
5 Session 18-19 (Z-Test and T-Test)
No ratings yet
5 Session 18-19 (Z-Test and T-Test)
28 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
10 pages
Hypotheses Testing
No ratings yet
Hypotheses Testing
5 pages
Unit 5 Mba 1ST
No ratings yet
Unit 5 Mba 1ST
197 pages
Basic Statistical Analysis
No ratings yet
Basic Statistical Analysis
12 pages
Wk. 13 Ppt. - Quantitative Techniques in Business
No ratings yet
Wk. 13 Ppt. - Quantitative Techniques in Business
24 pages
Parametric Test
No ratings yet
Parametric Test
49 pages
Data Science Interview Preparation (30 Days of Interview Preparation)
No ratings yet
Data Science Interview Preparation (30 Days of Interview Preparation)
27 pages
Things To Know PDF
No ratings yet
Things To Know PDF
56 pages
StockWatson Econ CH 2
No ratings yet
StockWatson Econ CH 2
39 pages
Tests of Hypothesis
No ratings yet
Tests of Hypothesis
16 pages
CH 21
No ratings yet
CH 21
58 pages
90156hypothesis Testing
No ratings yet
90156hypothesis Testing
34 pages
Biostatistics Notes: Descriptive Statistics
No ratings yet
Biostatistics Notes: Descriptive Statistics
16 pages
Biostatistics Notes
No ratings yet
Biostatistics Notes
8 pages
Chapter 5
No ratings yet
Chapter 5
35 pages
Defining Hypothesis Testing
No ratings yet
Defining Hypothesis Testing
17 pages
LR22 Test Statistic
No ratings yet
LR22 Test Statistic
30 pages
PSAI Unit 5
No ratings yet
PSAI Unit 5
25 pages
Z-Test and T-Test
No ratings yet
Z-Test and T-Test
6 pages
Testing of Hypothesis Hypothesis
No ratings yet
Testing of Hypothesis Hypothesis
32 pages
Biostats 2
No ratings yet
Biostats 2
7 pages
Tests of Significance
No ratings yet
Tests of Significance
35 pages
Hypothesis Testing.pptx
No ratings yet
Hypothesis Testing.pptx
24 pages
ADA Revision Questions and Quick Reads
No ratings yet
ADA Revision Questions and Quick Reads
17 pages
L7-Hypothesis Testing
No ratings yet
L7-Hypothesis Testing
44 pages
D1UA401B Research Methodology-UNIT-4 Pazhanisamy-BBA IV Semester Section19
No ratings yet
D1UA401B Research Methodology-UNIT-4 Pazhanisamy-BBA IV Semester Section19
108 pages
Statistical Techniques - Bda
No ratings yet
Statistical Techniques - Bda
33 pages
Unit 3 Hypothesis
No ratings yet
Unit 3 Hypothesis
41 pages
Statistical Inferences
No ratings yet
Statistical Inferences
46 pages
Normal Distribution
No ratings yet
Normal Distribution
8 pages
Testing of Hypothesis
67% (3)
Testing of Hypothesis
37 pages
Lecture note 5
No ratings yet
Lecture note 5
8 pages
Inferential Statistics For Data Science
100% (1)
Inferential Statistics For Data Science
10 pages
Hypothesis Testting3
No ratings yet
Hypothesis Testting3
7 pages
T - Test
No ratings yet
T - Test
45 pages
PSM 201 Sampling Distributions and Hypothesis Testing
No ratings yet
PSM 201 Sampling Distributions and Hypothesis Testing
31 pages
STATISTICS AND PROBABILITY Unit 1-2
No ratings yet
STATISTICS AND PROBABILITY Unit 1-2
6 pages
Notes For RM Lab Viva
No ratings yet
Notes For RM Lab Viva
9 pages
Inferentialstatistics 210411214248
No ratings yet
Inferentialstatistics 210411214248
102 pages
What Is A Hypothesis
No ratings yet
What Is A Hypothesis
4 pages
Unit V Hypothesis Testing
No ratings yet
Unit V Hypothesis Testing
3 pages
Hypothesis Testing (1)
No ratings yet
Hypothesis Testing (1)
7 pages
Some Statistical Methods in Anachem
No ratings yet
Some Statistical Methods in Anachem
39 pages
Hypothesis Testing Sheet
No ratings yet
Hypothesis Testing Sheet
1 page
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Overview Of Bayesian Approach To Statistical Methods: Software
From Everand
Overview Of Bayesian Approach To Statistical Methods: Software
Vinaitheerthan Renganathan
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Lilliefors Test For The Exponential Distribution
No ratings yet
Lilliefors Test For The Exponential Distribution
2 pages
Minitab Workbook
No ratings yet
Minitab Workbook
28 pages
Biostat07 H
No ratings yet
Biostat07 H
17 pages
Frequency Table: Lampiran 10 Hasil Uji Chi-Square
No ratings yet
Frequency Table: Lampiran 10 Hasil Uji Chi-Square
3 pages
T Test
No ratings yet
T Test
20 pages
MVP 2
No ratings yet
MVP 2
2 pages
Chi Square
No ratings yet
Chi Square
20 pages
Two-Sample Tests and One-Way ANOVA: Chapter 10, Slide 1
No ratings yet
Two-Sample Tests and One-Way ANOVA: Chapter 10, Slide 1
69 pages
Z - TEST and T Test
No ratings yet
Z - TEST and T Test
45 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
93 pages
T-Test-Assignment 3-Act
No ratings yet
T-Test-Assignment 3-Act
4 pages
Chapter-4
No ratings yet
Chapter-4
8 pages
3.1 Hypothesis Testing (Critical Value Approach) : Statistics
No ratings yet
3.1 Hypothesis Testing (Critical Value Approach) : Statistics
3 pages
Free Assignments in PDF: Assignment No. 2
No ratings yet
Free Assignments in PDF: Assignment No. 2
15 pages
Sampling PDF
No ratings yet
Sampling PDF
117 pages
Hypothesis Testing Betsy Farber
No ratings yet
Hypothesis Testing Betsy Farber
13 pages
Sampling Techniques: Large and Small Sample Test
No ratings yet
Sampling Techniques: Large and Small Sample Test
80 pages
3.normality Test and Homogenity
No ratings yet
3.normality Test and Homogenity
4 pages
Nonparametric Tests in R
No ratings yet
Nonparametric Tests in R
5 pages
BoxPlot Levene
No ratings yet
BoxPlot Levene
21 pages
Lilliefors Test For Normality
No ratings yet
Lilliefors Test For Normality
2 pages
L18 Hypothesis Testing1
No ratings yet
L18 Hypothesis Testing1
62 pages
Chapter 3.2 WILCOXON RANK SUM TEST
No ratings yet
Chapter 3.2 WILCOXON RANK SUM TEST
16 pages
Pengaruh Model Sugesti-Imajinasi Terhadap Keterampilan Menulis Anekdot
No ratings yet
Pengaruh Model Sugesti-Imajinasi Terhadap Keterampilan Menulis Anekdot
8 pages
Type I and II Errors
No ratings yet
Type I and II Errors
11 pages
BRM 5th Unit
No ratings yet
BRM 5th Unit
16 pages
(M4) Posttask
No ratings yet
(M4) Posttask
4 pages