Lecture 8 Hypothesis Testing

Hypothesis testing is a statistical method used to evaluate two mutually exclusive population statements through experimental data, involving a null hypothesis (H0) and an alternative hypothesis (H1). The document outlines the processes for conducting Z-tests and T-tests, including calculating test statistics and determining significance levels. It also discusses the Chi-square test for assessing discrepancies between observed and expected frequencies, and its application in feature selection for machine learning.

Uploaded by

saibole2003

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Lecture 8 Hypothesis Testing

Uploaded by

saibole2003

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 44

Hypothesis Testing

Hypothesis testing is a statistical method that is used in making a

statistical decision using experimental data.

Hypothesis testing evaluates two mutually exclusive population statements to

determine which statement is most supported by sample data.
Parameters of hypothesis testing

•Null hypothesis(H0): It is a basic assumption based on the

problem knowledge.

•Alternative hypothesis(H1): The alternative hypothesis is the

hypothesis used in hypothesis testing that is contrary to the null
hypothesis.

Null Hypothesis : A company production is equal to 50 unit/per day

Alternate Hypothesis: : A company production is not equal to 50
unit/per day
H0 : amount of lead in Maggie noodles does not exceed the maximum limit i.e., 2.5ppm
H1: amount of lead in Maggie noodles exceed the maximum limit i.e., 2.5ppm
Outcome 1: We reject the null hypothesis when in reality it is false.
Outcome 2: We reject the null hypothesis when in reality it is true.
(Type 1 Error)
Outcome 3: We failed to reject the null hypothesis when in reality it is false.
(Type 2 Error)
Outcome 4: We failed to reject the null hypothesis when in reality it is true.

We say “We failed to reject the null hypothesis” instead of “we accept the null hypothesis”.
• P-value
The P value is the probability for the null hypothesis to be true.

• Level of significance
The level of significance is the probability of rejecting the null hypothesis when
it is true.

If the p-value is less than α, then the null hypothesis is rejected, and the
alternative hypothesis is accepted. If the p-value is greater than α, then the null
hypothesis is not rejected.
Z - Test

When to Use Z-test:

•Samples should be drawn at random from the population.
•The sample size should be greater than 30.
•The standard deviation of the population should be known.
Steps to perform Z-test:
• First, identify the null and alternate hypotheses.
• Determine the level of significance (∝).
• Calculate the z-test statistics. Below is the formula for calculating the z-test
statistics.

where,
: mean of the sample.
: mean of the population.
: Standard deviation of the population.
n: sample size.
• Find p value using z statistics.
• Now compare with the hypothesis and decide whether to reject or not to reject
the null hypothesis
Suppose the arousal of hot cats has a population that is normally distributed with a
standard deviation of 6. Tomorrow you sample 49 hot cats from this population and
obtain a mean arousal of 46.44 and a standard deviation of 5.6968. Using an alpha
value of α = 0.01, is this observed mean significantly less than an expected arousal of
47?
Problem: A school claimed that the student’s study is more intelligent than the average
school. On calculating the IQ scores of 50 students, the average turns out to be 110. The
mean of the population IQ is 100 and the standard deviation is 15. State whether the claim of
principal is right or not at a 5% significance level.
A teacher claims that the mean score of students in his class is
greater than 82 with a standard deviation of 20. If a sample of 81
students was selected with a mean score of 90 then check if there is
enough evidence to support this claim at a 0.05 significance level.
Suppose the width of makeshift personalities has a population that is normally
distributed with a standard deviation of 7. You want to sample 22 makeshift
personalities from this population and obtain a mean width of 87.19 and a standard
deviation of 7.257. Using an alpha value of α = 0.01, is this observed mean significantly
less than an expected width of 89?
Z – Test (two – tailed)
Suppose the jewelry of exams has a population that is normally distributed with a
standard deviation of 5. You are walking down the street and sample 9 exams from
this population and obtain a mean jewelry of 28.95 and a standard deviation of
6.3802. Using an alpha value of α = 0.01, is this observed mean significantly different
than an expected jewelry of 27?
Suppose the life expectancy of Seattleites has a population that is normally distributed
with a standard deviation of 1. You go out and sample 45 Seattleites from this
population and obtain a mean life expectancy of 88.51 and a standard deviation of
1.0815. Using an alpha value of α = 0.05, is this observed mean significantly different
than an expected life expectancy of 89?
Suppose the width of bus riders has a population that is normally distributed with a
standard deviation of 10. Suppose that before graduation, your first job was to sample
98 bus riders from this population and obtain a mean width of 49.98 and a standard
deviation of 10.3386. Using an alpha value of α = 0.01, is this observed mean
significantly different than an expected width of 52?
T - Test

A t-test is a statistical test that compares the means of two samples. It is used in
hypothesis testing, with a null hypothesis that the difference in group means is
zero and an alternate hypothesis that the difference in group means is different
from zero.
There are three main types of t-test:

• A One sample t-test tests the mean of a single group against a known mean.
• An Independent Samples t-test compares the means for two groups.
• A Paired sample t-test compares means from the same group at different times
(say, one year apart).
Steps to perform T-test:
• First, identify the null and alternate hypotheses.
• Determine the level of significance (∝).
• Calculate the degree of freedom df = n-1
• Find the critical value of t in the t-test using t- table.
• Calculate the t-test statistics. Below is the formula for calculating the t-test
statistics.

where,
: mean of the sample.
: mean of the population.
: Standard deviation of the sample.
n: sample size.
• Now compare with the hypothesis and decide whether to reject or not to reject
the null hypothesis
Problem: A school claimed that the students’ study that is more intelligent than the average
school. On calculating the IQ scores of 30 students, the average turns out to be 140 and
standard deviation is 20. The mean of the population IQ is 100 . State whether the claim of
principal is right or not at a 5% significance level.
Suppose we are interested in determining whether the average weight of a certain
breed of dog is significantly different from a target weight of 25 pounds. We randomly
select a sample of 20 dogs from this breed and weigh them and get the mean 24
pounds and standard deviation is 0.7. State whether the claim we made is right or not
at a 5% significance level.
There are three main types of t-test:

Q2
Chi- Square Test

It is a powerful test for testing the significance of the discrepancy between theory and
experiment.
(OR)
The Chi-square (χ2 ) test represents a useful method of comparing experimentally obtained
results with those to be expected theoretically on some hypothesis.
The value of chi-square is very big it indicates that the divergence between expected
and observed frequencies is large.
If the value of chi-square is very small it indicates that the divergence between
actual and expected frequencies is very little.
The following steps are followed for the above said purpose:
i. A null and alternative hypothesis related to the enquiry
ii. expected or theoretical frequencies are derived through probability.
iii. A level of significance is chosen for rejection of the null hypothesis.
iv. Chi Square value

v. The observed frequencies are compared with the expected or theoretical

frequencies.

If the calculated value of is less than the table value, failed to reject the null
hypothesis. On the other hand, if the calculated value of is greater than the table
value, we will reject the null hypothesis.
Problem Ninety-six subjects are asked to express their attitude towards the
proposition “Should AIDS education be integrated in the curriculum of Higher
secondary stage” by marking F (favorable), I (indifferent) or U (unfavorable).
Observed(fo) 48 24 24

Expected (fe) 32 32 32

Test the hypothesis that “there is no difference between preferences in the group”.
Two hundred bolts were selected at random from the output of each of the five machines.
The number of defective bolts found were 5, 9, 13, 7 and 6 . Is there a significant
difference among the machines? Use 5% level of significance.
Chi- Square Test for
feature selection

Feature selection is selecting best and optimal features for Machine learning model.

In this we remove irrelevant or partially relevant features from the data.

(i) Minimizes the cost of computation.

(ii) Reduces the curse of dimensionality
(iii) Helps in achieving good accuracy.
Chi-square Test for Feature Extraction:

We calculate Chi-square between each feature and the target and select the desired
number of features with best Chi-square scores.
The higher the value of , the more dependent the output label is on the feature and
higher the importance the feature has on determining the output.

It determines if the association between two categorical variables of the sample would
reflect their real association in the population.
Consider the following table:-
The contingency table for the feature “Outlook” is constructed as below:-
The contingency table for the feature “Wind” is constructed as below:-
Thank
you

Abstract Reasoning Practice Test
100% (1)
Abstract Reasoning Practice Test
13 pages
C 17
No ratings yet
C 17
20 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
44 pages
Hypothesis Testing Assignment
No ratings yet
Hypothesis Testing Assignment
12 pages
R.C. Combined Footings For Two R.C. Columns Subjected To Vertical Load & Moments by Working Stress Method As Per Is:456-2000
No ratings yet
R.C. Combined Footings For Two R.C. Columns Subjected To Vertical Load & Moments by Working Stress Method As Per Is:456-2000
4 pages
QT Session 16 - 22 Hypothesis Testing
No ratings yet
QT Session 16 - 22 Hypothesis Testing
58 pages
Hypothesis Testing in Machine Learning Using Python - by Yogesh Agrawal - 151413
No ratings yet
Hypothesis Testing in Machine Learning Using Python - by Yogesh Agrawal - 151413
15 pages
Testing Technique in Data Science
No ratings yet
Testing Technique in Data Science
65 pages
Stat
67% (3)
Stat
70 pages
Eda Research
No ratings yet
Eda Research
11 pages
MTH 4th Grading Notes
No ratings yet
MTH 4th Grading Notes
19 pages
Chapter 5
No ratings yet
Chapter 5
35 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
3 pages
Testing Hypothesis
No ratings yet
Testing Hypothesis
9 pages
Test On Variables: in Surveys, The Foolish Ask Questions, Wise Cannot Answers
No ratings yet
Test On Variables: in Surveys, The Foolish Ask Questions, Wise Cannot Answers
24 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
20 pages
Hypothesis Lecture
No ratings yet
Hypothesis Lecture
7 pages
Engineering Mathematics 2
No ratings yet
Engineering Mathematics 2
29 pages
Hypothesis Tesing
No ratings yet
Hypothesis Tesing
30 pages
Prostat Tperf
No ratings yet
Prostat Tperf
5 pages
Lesson 3 Hypothesis Testing
No ratings yet
Lesson 3 Hypothesis Testing
23 pages
Lesson-4-Quantitative-Methods-with-Modeling-and-Simulation-Hypothesis-Testing
No ratings yet
Lesson-4-Quantitative-Methods-with-Modeling-and-Simulation-Hypothesis-Testing
38 pages
Hypothesis Testing 15pages
No ratings yet
Hypothesis Testing 15pages
15 pages
Statistics Full Notes
No ratings yet
Statistics Full Notes
14 pages
Hypothesis Test
No ratings yet
Hypothesis Test
20 pages
LESSON 2.1: Z-Test and T-Test
No ratings yet
LESSON 2.1: Z-Test and T-Test
4 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
29 pages
Hypothesis Testing - Class 2
No ratings yet
Hypothesis Testing - Class 2
30 pages
PT Module5
No ratings yet
PT Module5
30 pages
Stat - Hypothesis Testing
No ratings yet
Stat - Hypothesis Testing
34 pages
Handout#3 - Statistical Inference, Z and T Test
100% (1)
Handout#3 - Statistical Inference, Z and T Test
3 pages
Statistics & Probability Q4 - Week 5-6
No ratings yet
Statistics & Probability Q4 - Week 5-6
13 pages
90156hypothesis Testing
No ratings yet
90156hypothesis Testing
34 pages
Hypothesis Testing- Z Test
No ratings yet
Hypothesis Testing- Z Test
19 pages
Statistical Analysis (T-Test)
No ratings yet
Statistical Analysis (T-Test)
61 pages
lab5
No ratings yet
lab5
7 pages
Day 3
No ratings yet
Day 3
88 pages
C22 P09 Chi Square Test
No ratings yet
C22 P09 Chi Square Test
33 pages
Q4 Stat Las-Week 5
No ratings yet
Q4 Stat Las-Week 5
12 pages
Chapter No. 08 Fundamental Sampling Distributions and Data Descriptions - 02 (Presentation)
No ratings yet
Chapter No. 08 Fundamental Sampling Distributions and Data Descriptions - 02 (Presentation)
91 pages
Hypothesis Testing
100% (1)
Hypothesis Testing
56 pages
Testing Hypothesis
No ratings yet
Testing Hypothesis
51 pages
STAT Q4 Week 2 Enhanced.v1
No ratings yet
STAT Q4 Week 2 Enhanced.v1
11 pages
Lecture 6 Hypothesis Testing and z Test
No ratings yet
Lecture 6 Hypothesis Testing and z Test
47 pages
Lecture Z Test
No ratings yet
Lecture Z Test
31 pages
PSNM - Ch. 3
No ratings yet
PSNM - Ch. 3
32 pages
Probability and Statistics - Asynch A.1
No ratings yet
Probability and Statistics - Asynch A.1
4 pages
Handout#3 - Statistical Inference, z and t Test
No ratings yet
Handout#3 - Statistical Inference, z and t Test
3 pages
Statsprob -Reviewer q2
No ratings yet
Statsprob -Reviewer q2
24 pages
Topic 18 Identifying The Appropriate Test Statistics Involving Population Mean
No ratings yet
Topic 18 Identifying The Appropriate Test Statistics Involving Population Mean
6 pages
4th Lesson 1
No ratings yet
4th Lesson 1
58 pages
Local Media6288925927020885212
No ratings yet
Local Media6288925927020885212
12 pages
Hypothesis Testing 1
No ratings yet
Hypothesis Testing 1
27 pages
stats_final_review
No ratings yet
stats_final_review
11 pages
226lec11 JDA
No ratings yet
226lec11 JDA
54 pages
Chisquare
No ratings yet
Chisquare
10 pages
Chapter IX Hypothesis Testing
No ratings yet
Chapter IX Hypothesis Testing
31 pages
Unit 3 (Hypothesis Testing)
No ratings yet
Unit 3 (Hypothesis Testing)
40 pages
CH 21
No ratings yet
CH 21
58 pages
The Central Limit Theorem and Hypothesis Testing Final
100% (1)
The Central Limit Theorem and Hypothesis Testing Final
29 pages
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet
Lecture 10
No ratings yet
Lecture 10
25 pages
Aarti_Drugs
No ratings yet
Aarti_Drugs
14 pages
Essen Speciality
No ratings yet
Essen Speciality
8 pages
Aadhar_Hsg__Fin_
No ratings yet
Aadhar_Hsg__Fin_
8 pages
5Paisa_Capital
No ratings yet
5Paisa_Capital
10 pages
A_B_B
No ratings yet
A_B_B
9 pages
3B_Blackbio
No ratings yet
3B_Blackbio
14 pages
Centum_Electron
No ratings yet
Centum_Electron
14 pages
merged_companies
No ratings yet
merged_companies
516 pages
Aditya_Vision
No ratings yet
Aditya_Vision
7 pages
Embassy_Off_REIT
No ratings yet
Embassy_Off_REIT
8 pages
Real-Time_Portfolio_Management_System_Utilizing_Machine_Learning_Techniques
No ratings yet
Real-Time_Portfolio_Management_System_Utilizing_Machine_Learning_Techniques
14 pages
Numerical solution of ODE(up to Improved Euler's method)
No ratings yet
Numerical solution of ODE(up to Improved Euler's method)
9 pages
Non-Isothermal Kinetic and Thermodynamic Studies of The Dehydroxylation Process of Synthetic Calcium Hydroxide Ca (OH) 2
No ratings yet
Non-Isothermal Kinetic and Thermodynamic Studies of The Dehydroxylation Process of Synthetic Calcium Hydroxide Ca (OH) 2
11 pages
APPLIED THERMODYNAMICS 18ME42 Module 04 Question No 7a-7b
No ratings yet
APPLIED THERMODYNAMICS 18ME42 Module 04 Question No 7a-7b
27 pages
1 - PHYS 204 - Course Outline - Fall 2022-Section - 01
No ratings yet
1 - PHYS 204 - Course Outline - Fall 2022-Section - 01
14 pages
Computer Graphics: Unit 3 Overview of Transformations
No ratings yet
Computer Graphics: Unit 3 Overview of Transformations
85 pages
Practical Task Edm Djj40142 Sesi 1 2021-2022
No ratings yet
Practical Task Edm Djj40142 Sesi 1 2021-2022
12 pages
EPA Map of RSR Lead Smelter Site
No ratings yet
EPA Map of RSR Lead Smelter Site
1 page
Maths 2A Quadratic Expressions Important Questions
No ratings yet
Maths 2A Quadratic Expressions Important Questions
6 pages
Suspended Systems: Lighting and Security
No ratings yet
Suspended Systems: Lighting and Security
8 pages
Mayr11 20 Exam
No ratings yet
Mayr11 20 Exam
19 pages
Sci Comm I Presentation
No ratings yet
Sci Comm I Presentation
19 pages
Bolted Connections (Sheet 4)
No ratings yet
Bolted Connections (Sheet 4)
16 pages
Bruno & Randolph 1999
No ratings yet
Bruno & Randolph 1999
11 pages
Junk Basket Details
No ratings yet
Junk Basket Details
8 pages
Step 2. - Jorge - Mendieta - Muñoz
No ratings yet
Step 2. - Jorge - Mendieta - Muñoz
20 pages
Mock Test Class-4 Review
No ratings yet
Mock Test Class-4 Review
6 pages
Approximate Traveling Wave Solution of Avian Flu
No ratings yet
Approximate Traveling Wave Solution of Avian Flu
6 pages
Microsoft PowerPoint - 06-Gas Well Testing - PPT (Read-Only)
No ratings yet
Microsoft PowerPoint - 06-Gas Well Testing - PPT (Read-Only)
14 pages
Lec 003 Math 3 Fall 2020
No ratings yet
Lec 003 Math 3 Fall 2020
42 pages
School Physics Experiments. 2022-2023
No ratings yet
School Physics Experiments. 2022-2023
7 pages
22 - Relative Permeability Effects On The Miscible CO2 WAG Injection Schemes
No ratings yet
22 - Relative Permeability Effects On The Miscible CO2 WAG Injection Schemes
9 pages
PRESSURE LAB WORK INSTRUCTION
No ratings yet
PRESSURE LAB WORK INSTRUCTION
15 pages
Paper 1
No ratings yet
Paper 1
21 pages
Analysis of Reinforced Concrete Beams
67% (3)
Analysis of Reinforced Concrete Beams
87 pages
Eee-Lab Manual
No ratings yet
Eee-Lab Manual
14 pages
Webinar 34 - Photovoltaics Revisited
No ratings yet
Webinar 34 - Photovoltaics Revisited
32 pages
Ferroresonance in Voltage Transformers Analysis An
No ratings yet
Ferroresonance in Voltage Transformers Analysis An
8 pages
structures-ii-notes
No ratings yet
structures-ii-notes
53 pages