
Hypothesis Testing



• x̄ = (1/n) Σᵢ₌₁ⁿ xᵢ

• s₁² = 1/(n − 1) Σᵢ₌₁ⁿ (xᵢ − x̄)²

• s₂² = (1/n) Σᵢ₌₁ⁿ (xᵢ − x̄)²
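These three estimators (the sample mean and the two variance estimators, dividing by n − 1 and by n) can be sketched in Python with the standard library; the data values are hypothetical, chosen only for illustration:

```python
from statistics import mean, variance, pvariance

# Hypothetical sample (not from the source)
x = [2, 4, 4, 4, 5, 5, 7, 9]

x_bar = mean(x)        # sample mean x̄
s1_sq = variance(x)    # divides by n - 1 (the unbiased estimator s₁²)
s2_sq = pvariance(x)   # divides by n (the biased estimator s₂²)
print(x_bar, round(s1_sq, 3), s2_sq)
```

Note that `statistics.variance` implements the n − 1 divisor, while `statistics.pvariance` implements the n divisor.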
• If E(statistic) = parameter

• then the statistic is said to be an Unbiased Estimate of the parameter.

• Sample mean is an unbiased estimate of the population mean.

• This means that the average of all sample means equals the population mean.

• E(x̄) = μ

• Also, E(s₁²) = σ², and E(s₂²) ≠ σ².

• Unknown parameters are estimated using sample observations.

• Parameter values are fixed.

• Values of statistics vary from sample to sample.

• Each sample has some probability of being chosen.

• Each value of a statistic is associated with probability.

• Thus, a statistic is a random variable.

• Distribution of a statistic is called a sampling distribution.

• Distribution of a statistic may not be the same as the distribution of the population.

• We saw in the previous example, E(x̄) = μ and Var(x̄) = σ²/n.

• This is always true and can be proved as below:

• E(x̄) = E((1/n) Σᵢ₌₁ⁿ xᵢ) = (1/n) Σᵢ₌₁ⁿ E(xᵢ) = (1/n) Σᵢ₌₁ⁿ μ = μ

• Var(x̄) = Var((1/n) Σᵢ₌₁ⁿ xᵢ) = (1/n²) Σᵢ₌₁ⁿ Var(xᵢ) = (1/n²) Σᵢ₌₁ⁿ σ² = (1/n²)·nσ² = σ²/n
• The square root of the variance is generally called the standard deviation.

• Here we shall call it Standard Error.

• Different samples of the same size from the same population yield different sample means.

• Standard Error of x is a measure of the variability in different values of sample mean.
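The two facts above, E(x̄) = μ and Var(x̄) = σ²/n, can be checked with a small simulation; a minimal Python sketch, with the population parameters (μ = 8, σ = 3, n = 36) borrowed from the example below:

```python
import random
from statistics import mean, pvariance

random.seed(42)
mu, sigma, n = 8, 3, 36

# Draw many samples of size n and record each sample mean
means = [mean(random.gauss(mu, sigma) for _ in range(n)) for _ in range(5000)]

# The sample means cluster around mu, with variance close to sigma^2 / n
print(round(mean(means), 2))       # close to mu = 8
print(round(pvariance(means), 2))  # close to sigma^2 / n = 9/36 = 0.25
```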

Central Limit Theorem


• When the population distribution is N(μ, σ),

• then x̄ ~ N(μ, σ/√n).

• When the population distribution is not normal,

• then also x̄ ~ N(μ, σ/√n), provided n → ∞.

• Practically, this result is true for n ≥ 30.

• The result may also be written as

• (x̄ − μ)/(σ/√n) ~ N(0, 1).

• Clearly, this result is valid when

• the sample comes from a normal population, or

• the sample size is large (n ≥ 30).

• Suppose a population has mean μ = 8 and standard deviation σ = 3.


• Suppose a random sample of size n = 36 is selected.
• What is the probability that the sample mean is between 7.75 and 8.25?
• P(7.75 < x̄ < 8.25) = ?

• x̄ ~ N(μ, σ/√n), i.e., x̄ ~ N(8, 0.5), since σ/√n = 3/√36 = 0.5.

• Using Excel,

• P(7.75 < x̄ < 8.25) = NORM.DIST(8.25,8,0.5,1) − NORM.DIST(7.75,8,0.5,1)
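The same probability can also be computed outside Excel; a minimal Python sketch using only the standard library, with the normal CDF built from math.erf:

```python
from math import erf, sqrt

def norm_cdf(x, mu, sigma):
    # Normal CDF via the error function: Phi((x - mu) / sigma)
    return 0.5 * (1 + erf((x - mu) / (sigma * sqrt(2))))

mu, sigma, n = 8, 3, 36
se = sigma / sqrt(n)  # standard error = 0.5
p = norm_cdf(8.25, mu, se) - norm_cdf(7.75, mu, se)
print(round(p, 4))  # ≈ 0.3829
```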

POPULATION & SAMPLE PROPORTIONS

• μ and π are population parameters.

• x̄ and p are sample statistics.

• p provides an estimate of π.
• Note that x ~ B(n, π), where x is the number of successes.
• E(x) = nπ
• Var(x) = nπ(1 − π)
• This implies that
• E(p) = E(x/n) = π
• Var(p) = Var(x/n) = nπ(1 − π)/n² = π(1 − π)/n
• Standard Error(p) = √[Var(p)] = √[π(1 − π)/n]

• When the sample size n is large enough, binomial distribution approaches normal distribution.
• So, for large n ,
• (p − π)/√[π(1 − π)/n] ~ N(0, 1).

• This is a particular case of the central limit theorem.
• Practically, this result is true for n ≥ 30, or when nπ ≥ 5 as well as n(1 − π) ≥ 5.
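This normal approximation for a sample proportion can be sketched in Python; the numbers (55 successes in n = 100 trials, H0: π = 0.5) are hypothetical:

```python
from math import erf, sqrt

def phi(z):
    # Standard normal CDF
    return 0.5 * (1 + erf(z / sqrt(2)))

# Hypothetical example: p = 0.55 observed, H0: pi = 0.5, n = 100
p, pi0, n = 0.55, 0.5, 100
se = sqrt(pi0 * (1 - pi0) / n)  # standard error under H0 = 0.05
z = (p - pi0) / se              # = 1.0
p_value = 2 * (1 - phi(z))      # two-tailed
print(round(z, 2), round(p_value, 4))
```

Here nπ = 50 ≥ 5 and n(1 − π) = 50 ≥ 5, so the approximation applies.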

• We have seen the following 2 results:


• (x̄ − μ)/(σ/√n) ~ N(0, 1)
• This result is valid:
• when the sample size is 30 or more, or
• when the parent population has a normal distribution.

• (p − π)/√[π(1 − π)/n] ~ N(0, 1)
• This result is valid:
• when the sample size is 30 or more, or
• when nπ ≥ 5 as well as n(1 − π) ≥ 5.

• Two types of error:


• Type I Error: Reject H0 when it is true
• Size of Type I Error = P(Type I Error)
• = P(Reject H0 when it is true)
• = α (also called Producer's risk)
• Type II Error: Accept H0 when it is false
• Size of Type II Error = P(Type II Error)
• = P(Accept H0 when it is false)
• = β (also called Consumer's risk)
• Size of Type I Error (α) is called the Level of Significance.
• α is set by the researcher in advance.

• Critical value divides the whole area under the probability curve into two regions:
• Critical (Rejection) region
• When the statistical outcome falls into this region, H0 is rejected.
• Size of this region is α.
• Acceptance Region
• When the statistical outcome falls into this region, H0 is accepted.
• Size of this region is (1 − α).
Testing of Statistical Hypothesis
(One-Sample Tests)

Testing of Hypothesis for µ (z-test)


• Conditions/ Assumptions:
• Population is normal or n ≥ 30
• σ is known or n ≥ 30
• Test Statistic: Zc = (x̄ − μ)/(σ/√n)
1. Obtain the Critical Values using Excel or the Statistical Table
• Excel Formula
• For TTT (two-tailed test): NORM.S.INV(α/2) and NORM.S.INV(1 − α/2)
• For RTT (right-tailed test): NORM.S.INV(1 − α)
• For LTT (left-tailed test): NORM.S.INV(α)
• p-value Approach
• Let Zc be the computed value of the test statistic, where Z ~ N(0, 1).
• Then the p-value is given by the following probability:
• For two-tailed tests: 2P(Z > |Zc|)
• Excel Formula: 2*(1-NORM.S.DIST(ABS(Zc),1))
• For right-tailed tests: P(Z > Zc)
• Excel Formula: 1-NORM.S.DIST(Zc,1)
• For left-tailed tests: P(Z < Zc)
• Excel Formula: NORM.S.DIST(Zc,1)
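A worked sketch of this z-test in Python (the observed sample mean 9.2 is an invented value, not from the source; μ₀ = 8, σ = 3, n = 36 follow the earlier example):

```python
from math import erf, sqrt

def phi(z):
    # Standard normal CDF
    return 0.5 * (1 + erf(z / sqrt(2)))

# Hypothetical data: H0: mu = 8 vs H1: mu != 8, sigma known
x_bar, mu0, sigma, n, alpha = 9.2, 8, 3, 36, 0.05
z_c = (x_bar - mu0) / (sigma / sqrt(n))  # = 2.4
p_value = 2 * (1 - phi(abs(z_c)))        # two-tailed p-value
print(round(z_c, 2), round(p_value, 4), p_value < alpha)
```

Since the p-value falls below α = 0.05, H0 would be rejected here.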

Testing of Hypothesis for µ (t-test)


• Conditions/ Assumptions:
• n<30; Population is normal; σ is unknown
• Test Statistic: Tc = (x̄ − μ)/(s/√n)
1. Obtain the Critical Values using the t distribution with (n − 1) degrees of freedom, t(n−1).
• Excel Formula
• For TTT: T.INV(α/2,n-1) and T.INV(1-α/2,n-1)
• For RTT: T.INV(1-α,n-1)
• For LTT: T.INV(α,n-1)
2. p-value Approach in t-test
• Let Tc be the computed value of the test statistic, where T ~ t(n−1).
• Then the p-value is given by the following probability:
• For two-tailed tests: 2P(T > |Tc|)
• Excel Formula: 2*(1-T.DIST(ABS(Tc),n-1,1))
• For right-tailed tests: P(T > Tc)
• Excel Formula: 1-T.DIST(Tc,n-1,1)
• For left-tailed tests: P(T < Tc)
• Excel Formula: T.DIST(Tc,n-1,1)
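A rough Python sketch of this one-sample t-test using the critical-value approach; the sample data are hypothetical, and the critical value 2.262 is t(0.975, df = 9) from the t table:

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical sample (not from the source), H0: mu = 8.0
sample = [9.1, 8.5, 7.9, 8.8, 9.4, 8.2, 8.9, 9.0, 8.6, 8.4]
mu0 = 8.0
n = len(sample)

# Test statistic Tc = (x̄ - mu0) / (s / sqrt(n)), with s the sample std dev
t_c = (mean(sample) - mu0) / (stdev(sample) / sqrt(n))

# Two-tailed critical value t(0.975, df = 9), alpha = 0.05
t_crit = 2.262
reject = abs(t_c) > t_crit
print(round(t_c, 3), reject)
```

Since |Tc| exceeds the critical value, H0: μ = 8 would be rejected for this (invented) sample.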

Testing of Statistical Hypothesis


(Two-Sample Tests)
• Z test for two independent samples (σ₁, σ₂ known):

  Zc = (x̄₁ − x̄₂)/√(σ₁²/n₁ + σ₂²/n₂) ~ N(0, 1)

• Z test for two independent samples (σ₁, σ₂ unknown, large samples):

  Zc = (x̄₁ − x̄₂)/√(s₁²/n₁ + s₂²/n₂) ~ N(0, 1)

• t test for two independent samples assuming equal variances:

  Tc = (x̄₁ − x̄₂)/[S√(1/n₁ + 1/n₂)] ~ t(n₁ + n₂ − 2), where S² = [(n₁ − 1)s₁² + (n₂ − 1)s₂²]/(n₁ + n₂ − 2)

• Use the t(n₁ + n₂ − 2) distribution for the critical value/p-value.
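The pooled (equal-variance) t test can be sketched from summary statistics alone; the numbers here are hypothetical, and the critical value 2.145 is t(0.975, df = 14) from the t table:

```python
from math import sqrt

# Hypothetical summary statistics for two independent samples (equal variances)
n1, x1_bar, s1_sq = 8, 10.0, 4.0
n2, x2_bar, s2_sq = 8, 8.0, 4.0

# Pooled variance S^2 = [(n1-1)s1^2 + (n2-1)s2^2] / (n1 + n2 - 2)
S_sq = ((n1 - 1) * s1_sq + (n2 - 1) * s2_sq) / (n1 + n2 - 2)
t_c = (x1_bar - x2_bar) / (sqrt(S_sq) * sqrt(1 / n1 + 1 / n2))  # = 2.0

# Two-tailed critical value t(0.975, df = n1 + n2 - 2 = 14), alpha = 0.05
t_crit = 2.145
print(round(t_c, 3), abs(t_c) > t_crit)
```

Here |Tc| = 2.0 does not exceed 2.145, so H0: μ₁ = μ₂ would not be rejected for these (invented) numbers.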

• t test for two independent samples assuming unequal variances


  Tc = (x̄₁ − x̄₂)/√(s₁²/n₁ + s₂²/n₂) ~ t(f),

  where f = (s₁²/n₁ + s₂²/n₂)² / [ (s₁²/n₁)²/(n₁ − 1) + (s₂²/n₂)²/(n₂ − 1) ]
• Paired t test: Tc = d̄/(s_d/√n) ~ t(n − 1)
• Testing the Hypothesis for Difference of Proportions
  Zc = (p₁ − p₂)/√[π̂(1 − π̂)(1/n₁ + 1/n₂)] ~ N(0, 1), where π̂ = (n₁p₁ + n₂p₂)/(n₁ + n₂)

• Thus, (p₁ − p₂)/√[π(1 − π)(1/n₁ + 1/n₂)] ~ N(0, 1).
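A sketch of the two-proportion z test with the pooled estimate π̂; the counts are hypothetical:

```python
from math import erf, sqrt

def phi(z):
    # Standard normal CDF
    return 0.5 * (1 + erf(z / sqrt(2)))

# Hypothetical counts: x1 successes out of n1 trials, x2 out of n2
n1, x1 = 100, 60
n2, x2 = 100, 50
p1, p2 = x1 / n1, x2 / n2

# Pooled estimate of the common proportion under H0: pi1 = pi2
pi_hat = (x1 + x2) / (n1 + n2)  # = 0.55
se = sqrt(pi_hat * (1 - pi_hat) * (1 / n1 + 1 / n2))
z_c = (p1 - p2) / se
p_value = 2 * (1 - phi(abs(z_c)))  # two-tailed
print(round(z_c, 3), round(p_value, 4))
```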
• In the given example, we have three populations.
• We wish to test
• H0: π1 = π2 = π3 (All the proportions are the same)
• H1: Not all π1, π2, π3 are equal
• The table of data shown in the example is called the Contingency Table.
• Contingency Tables are used to classify sample observations according to two or more
characteristics.
• Contingency Table is useful in situations involving multiple population proportions.
• Let a contingency table have r rows and c columns.
• Then it will have r × c cells.
Chi square tests are always right tailed.

• We always have Σfₒ = Σfₑ (the totals of observed and expected frequencies agree).

• If we approximate some expected frequency, we must make sure that above condition is
satisfied.
• In these problems, the data are of discrete type.
• The Chi-Square distribution is a continuous distribution.
• The approximation loses its validity if any expected frequency is less than FIVE.
• In such a case, the expected frequency is pooled with the preceding or succeeding frequency.
• The d.f. is reduced by one for each such pooling.
• We do not make any assumption about the distribution of parent population.
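The contingency-table chi-square test can be sketched as follows; the 2×3 table is hypothetical, and the critical value 5.991 is the chi-square table value for α = 0.05 with df = 2:

```python
# Hypothetical 2x3 contingency table (rows: success/failure, columns: 3 populations)
obs = [[30, 45, 25],
       [70, 55, 75]]

row_tot = [sum(r) for r in obs]
col_tot = [sum(c) for c in zip(*obs)]
grand = sum(row_tot)

# Expected frequency for cell (i, j) = (row i total) * (column j total) / grand total
chi2 = 0.0
for i, row in enumerate(obs):
    for j, o in enumerate(row):
        e = row_tot[i] * col_tot[j] / grand
        chi2 += (o - e) ** 2 / e

df = (len(obs) - 1) * (len(obs[0]) - 1)  # (r - 1)(c - 1) = 2
crit = 5.991                             # chi-square table, alpha = 0.05, df = 2
print(round(chi2, 3), chi2 > crit)       # right-tailed test
```

All expected frequencies here exceed five, so no pooling is needed.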
• The difference between two means can be examined using a t-test or Z-test.
• If we have more than 2 samples,
• we wish to test the hypothesis that
• all the samples are drawn from populations having the same mean,
• or all population means are the same.
• We use ANOVA.

• ANOVA is essentially a procedure for testing the difference among various groups of data for homogeneity.
• At its simplest, ANOVA tests the following hypotheses:
 H0: The means of all the groups are equal.
 H1: Not all the means are equal.
• H1 doesn't say how or which ones differ.
• Can follow up with "multiple comparisons".

ANOVA IS ALWAYS RIGHT TAILED TOO
• If the observations are large, you can shift their origin and scale.
• This will not change the result.
• Shifting origin means adding or subtracting some constant.
• Shifting of scale means multiplying or dividing by some constant.
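The one-way ANOVA F statistic can be sketched from first principles; the three groups are hypothetical, and the critical value 3.885 is F(0.05; 2, 12) from the F table:

```python
from statistics import mean

# Hypothetical data: three groups of five observations each
groups = [[1, 2, 3, 4, 5],
          [2, 3, 4, 5, 6],
          [3, 4, 5, 6, 7]]

k = len(groups)
n_total = sum(len(g) for g in groups)
grand_mean = mean(x for g in groups for x in g)

# Between-group and within-group sums of squares
ssb = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
ssw = sum((x - mean(g)) ** 2 for g in groups for x in g)

msb = ssb / (k - 1)        # df_between = k - 1 = 2
msw = ssw / (n_total - k)  # df_within = 15 - 3 = 12
F = msb / msw              # = 2.0

# Right-tailed critical value F(0.05; 2, 12) from the F table
F_crit = 3.885
print(round(F, 3), F > F_crit)
```

Shifting the origin of every observation (e.g. adding 100 to all values) leaves ssb, ssw, and F unchanged, which illustrates the bullet above.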
• Two-way analysis of variance is an extension of one-way analysis of variance.
• The variation is controlled by two factors.
• The values of the random variable X are affected by different levels of two factors.
• Assumptions
 The populations are normally distributed.
 The samples are independent.
 The variances of the populations are equal.

• HA0: All levels of Factor A have the same effect
• HA1: Not all levels of Factor A have the same effect
• HB0: All levels of Factor B have the same effect
• HB1: Not all levels of Factor B have the same effect
• HAB0: There is no interaction effect
• HAB1: There is an interaction effect
