Business Statistics 4
Business Statistics 4
Business Statistics 4
Ikram-E-Khuda
Agenda Items
• Calculation of Z scores using area/ probability values
• Critical Z values
• Empirical rule
• Central Limit Theorem
• confidence intervals and interval estimation
• Hypothesis testing
• Hypothesis testing of mean value
Critical Z Values..
• What are the critical z values taken symmetrical from the middle
position corresponding to :
• Central limit theorem says that the probability of getting a given value of random variable
or all the values of the random variable in a series of trials (identically independent) will
approach a normal curve ( distributed according to a normal distribution).
• Plot probability distribution of a binomial random experiment with N=10, n=4, r=0,1,2,3
and 4. What distribution do you get?
Central Limit Theorem- Statement 3
• If a random sample of n observations is selected from any population, then,
when the sample size is sufficiently large (n >30) the sampling distribution
of the mean tends to approximate the normal distribution. The larger the
sample size, n, the better will be the normal approximation to the sampling
distribution of the mean.
• In general, it can be shown that the mean of the sample means is the same
as the population mean, and the standard error of the mean is smaller than
the population standard deviation.
The real advantage of the central limit theorem is that sample data drawn
from populations not normally distributed or from populations of unknown
shape also can be analyzed by using the normal distribution, because the
sample means are normally distributed for sample sizes of n>30.
• Note that the distribution of the sample means begins to approximate the normal
curve as the sample size, n, gets larger.
Central Limit Theorem- Statement 3
• The sample mean will be approximately normally distributed for large
sample sizes regardless of the distribution from which we are sampling
• Suppose we are sampling from a population with mean μ and standard deviation σ
• Let 𝑿 be the random variable representing the sample mean of n independently drawn
observations
• The mean of the sampling distribution of the sample mean is equal to the population mean,
i.e. 𝑥𝑋 = 𝜇
• The standard deviation of the sampling distribution of the sample mean 𝑿 is equal to the
𝜎
population mean, i.e. s𝑋 = 𝑛
Central Limit Theorem- Statement 3
• Since the central limit theorem states that sample means are normally distributed regardless of the shape of
the population for large samples and for any sample size with normally distributed population, sample
means can be analyzed by using Z scores.
• Mathematically this means that the 𝑧 score for random variable sample mean, 𝑿 can be can be written as
follows (provided mean and variance of population are finite values):
𝑥−𝜇
𝑧=𝜎 → ℕ 0,1 𝑎𝑠 𝑛 → ∞
ൗ 𝑛
Here 𝑧 is the 𝑧 score value
𝑥 is the sample mean
𝜇 is the population mean
𝜎 is the population standard deviation
n is the sample size
Central Limit Theorem- Statement 3
• The random variable 𝑿 can be considered approximately normally distributed if the sample size is
at least 30.
• This implies that we need sample size n ≥ 𝟑𝟎 to be considered for normal distribution
• Note: When sample size n < 𝟑𝟎 , we obtain a distribution which is called t distribution.
Student’s t Distribution
• df is short of degree of freedom. It shows the total sample size available for analysis.
• df = n in a normal distribution
• df < n for a t distribution
Example Problem
• Suppose salaries at a very large corporation have a mean of PKR
50,000/= and a standard deviation of PKR 10,000/=.
CORRECT
• Now using the standard normal table we have this probability equal to 0.00272
Central Limit Theorem in Statistics
• We normally do not have the information of the population distribution
• Thanks to CLT !
Confidence Intervals and Interval Estimation
Form of Confidence Intervals (CI)
A confidence interval estimates are intervals within which the
parameter is expected to fall, with a certain degree of confidence.
• For example for a 95% CI, we say “we are 95% confident that the true
population parameter is between the lower and upper calculated values”.
• A high confidence level means that there is only a very small probability that
the result (e.g. correlation or independence) happened purely by chance
(given in terms of α value).
Interval Estimation of Mean
• Uses central limit theorem (CLT)
• If population standard deviation is known , then interval estimation is called a z test and it is calculated as follows:
𝜎
𝑥±𝑧 𝑛
• If population standard deviation is unknown then sample variance is also estimated and mean interval estimation is called a t test. This is
calculated as follows:
𝑠
𝑥±𝑡
𝑛
• The 𝑡 score formula is:
𝑋−𝜇
𝑡= 𝑠
ൗ 𝑛
Here 𝑡 is the 𝑡 score value
𝑥 is the sample mean
𝜇 is the population mean
𝑠 is the sample standard deviation
n is the sample size
Practice Problems