Assignment 4 - Statistical Inference
Assignment 4 - Statistical Inference
Hypothesis testing is a common statistical procedure used to make decisions about population
parameters based on sample data. The process involves setting up a null hypothesis (H0) and an
alternative hypothesis (H1). The significance level, often denoted as α, determines the
probability of making a Type I error, which is the rejection of a true null hypothesis.
Null Hypothesis (H0): The null hypothesis represents the default assumption, often stating that
there is no significant difference or relationship between variables.
Alternative Hypothesis (H1): The alternative hypothesis presents an alternative claim to the null
hypothesis, suggesting that there is a significant difference or relationship between variables.
Significance Level (α): The significance level determines the threshold at which we reject the
null hypothesis. Commonly used significance levels are 0.05 (5%) or 0.01 (1%).
Test Statistic: A test statistic is a value calculated from the sample data that is used to assess the
likelihood of observing the data under the null hypothesis.
P-value: The p-value is the probability of obtaining the observed sample data, or more extreme
data, assuming the null hypothesis is true. It is used to make decisions about rejecting or failing
to reject the null hypothesis.
Confidence intervals provide a range of plausible values for a population parameter. They give
an indication of the precision or uncertainty associated with an estimated value. Key points
regarding confidence intervals include:
Confidence Level: The confidence level, often denoted as (1 - α), represents the probability that
the calculated confidence interval will capture the true population parameter. Commonly used
confidence levels are 95% and 99%.
Point Estimate: A point estimate is a single value calculated from the sample data that serves as
an estimate of the population parameter.
Margin of Error: The margin of error is the maximum amount by which the point estimate is
likely to differ from the true population parameter.
Calculation of Confidence Interval: Confidence intervals are typically calculated as the point
estimate plus or minus the margin of error.
Type I Error (False Positive): A Type I error occurs when we reject the null hypothesis (H0)
when it is true. It represents the incorrect rejection of a true claim or the presence of a significant
effect when there is none. The probability of committing a Type I error is equal to the
significance level (α).
Type II Error (False Negative): A Type II error occurs when we fail to reject the null hypothesis
(H0) when it is false. It represents the failure to detect a true effect or difference between
variables. The probability of committing a Type II error is denoted as β.
The relationship between Type I and Type II errors is generally inversely related. As the
significance level (α) decreases, the probability of committing a Type I error decreases but the
probability of committing a Type II error increases. This trade-off is important to consider when
designing hypothesis tests.