Important Statistics Formulas
Important Statistics Formulas
Important Statistics Formulas
This web page presents statistics formulas described in the Stat Trek tutorials. Each formula links to
a web page that explains how to use the formula.
Parameters
Population mean = = ( Xi ) / N
Population variance = 2 = ( Xi - )2 / N
Standardized score = Z = (X - ) /
Statistics
Sample mean = x = ( xi ) / n
Sample variance = s2 = ( xi - x )2 / ( n - 1 )
Pooled sample standard deviation = sp = sqrt [ (n1 - 1) * s12 + (n2 - 1) * s22 ] / (n1 + n2 - 2) ]
Correlation
Standard error of regression slope = sb1 = sqrt [ (yi - i)2 / (n - 2) ] / sqrt [ (xi - x)2 ]
Counting
Probability
Random Variables
In the following formulas, X and Y are random variables, and a and b are constants.
Chi-square statistic = 2 = [ ( n - 1 ) * s2 ] / 2
Variance of the difference between independent random variables = Var(X - Y) = Var(X) + Var(Y)
Sampling Distributions
Standard deviation of difference of sample means = d = sqrt[ (12 / n1) + (22 / n2) ]
Standard deviation of difference of sample proportions = d = sqrt{ [P1(1 - P1) / n1] + [P2(1 - P2) /
n2] }
Standard Error
Standard error of difference of sample means = SEd = sd = sqrt[ (s12 / n1) + (s22 / n2) ]
Standard error of difference of paired sample means = SEd = sd = { sqrt [ ((di - d)2 / (n - 1) ] } /
sqrt(n)
Pooled sample standard error = spooled = sqrt [ (n1 - 1) * s12 + (n2 - 1) * s22 ] / (n1 + n2 - 2) ]
Standard error of difference of sample proportions = sd = sqrt{ [p1(1 - p1) / n1] + [p2(1 - p2) / n2] }
Linear Transformations
For the following formulas, assume that Y is a linear transformation of the random variable X,
defined by the equation: Y = aX + b.
Standardized score = z = (x - x) / x.
t statistic = t = (x - x) / [ s/sqrt(n) ].
Estimation
Hypothesis Testing
Standardized test statistic = (Statistic - Parameter) / (Standard deviation of statistic)
Degrees of Freedom
The correct formula for degrees of freedom (DF) depends on the situation (the nature of the test
statistic, the number of samples, underlying assumptions, etc.).
One-sample t-test: DF = n - 1
Two-sample t-test: DF = (s12/n1 + s22/n2)2 / { [ (s12 / n1)2 / (n1 - 1) ] + [ (s22 / n2)2 / (n2 - 1) ] }
Sample Size
Below, the first two formulas find the smallest sample sizes required to achieve a fixed margin of
error, using simple random sampling. The third formula assigns sample to strata, based on a
proportionate design. The fourth formula, Neyman allocation, uses stratified sampling to minimize
variance, given a fixed sample size. And the last formula, optimum allocation, uses stratified
sampling to minimize variance, given a fixed budget.
Statistics Tutorial
Descriptive Statistics
Quantitative measures
Variables
Central tendency
Variability
Measures of position
Patterns in data
Dotplots
Histograms
Stemplots
Boxplots
Cumulative plots
Scatterplots
Comparing plots
Tabular displays
One-way tables
Two-way tables
Probability
Probability basics
Stat experiments
Probability laws
What is probability
Probability problems
Rules of probability
Bayes' rule
Random variables
Types of variables
Distributions
Independence
Combining
Transforming
Sampling theory
Random sampling
Central tendency
Variability
Sampling distribution
Distributions
Distribution basics
Probability dist
Discrete/continuous
Discrete
Binomial distribution
Negative binomial
Hypergeometric
Multinomial
Poisson
Continuous
Normal distribution
Standard normal
Student's t
Chi-square
F distribution
Estimation
Estimation theory
Estimation overview
Standard error
Margin of error
Confidence intervals
Proportions
Estimate proportion
Small samples
Mean scores
Estimate mean
Matched pairs
Hypothesis Testing
Foundations of testing
Hypothesis tests
How to test
Mean scores
Proportions
Small samples
Power
Region of acceptance
Power of a test
Chi-square tests
Goodness of fit
Homogeneity
Independence
Survey Sampling
Sampling methods
Data collection
Sampling methods
Survey sampling bias
Survey sampling
SRS analysis
Stratified samples
Stratified sampling
Stratified analysis
Cluster samples
Cluster sampling
CLS analysis
Sample planning
Linear regression
Measurement scales
Linear correlation
Linear regression
Regression example
Regression tests
Residual analysis
Transformations
Influential points
Slope estimate
Slope significance
Experiments
Experiment intro
Experimental design
Simulations
Appendices
Notation
Statistics Formulas