Baron & Kenny
Baron & Kenny
Baron & Kenny
Paths a and b are called direct effects. The mediational effect, in which X leads to Y through M, is
called the indirect effect. The indirect effect represents the portion of the relationship between X and Y
that is mediated by M.
Testing for mediation
Baron and Kenny (1986) proposed a four step approach in which several regression analyses are
conducted and significance of the coefficients is examined at each step. Take a look at the diagram
below to follow the description (note that c' could also be called a direct effect).
c
X
Step 1
Analysis
Conduct a simple regression analysis with X predicting Y to test
for path c alone, Y =B0 + B1 X + e
Visual Depiction
c
X
Step 2
Step 3
Step 4
Y
a
Y
c
The purpose of Steps 1-3 is to establish that zero-order relationships among the variables exist. If one
or more of these relationships are nonsignificant, researchers usually conclude that mediation is not
possible or likely (although this is not always true; see MacKinnon, Fairchild, & Fritz, 2007).
Assuming there are significant relationships from Steps 1 through 3, one proceeds to Step 4. In the
Step 4 model, some form of mediation is supported if the effect of M (path b) remains significant after
controlling for X. If X is no longer significant when M is controlled, the finding supports full mediation.
If X is still significant (i.e., both X and M both significantly predict Y), the finding supports partial
mediation.
Calculating the indirect effect
The above four-step approach is the general approach many researchers use. There are potential
problems with this approach, however. One problem is that we do not ever really test the significance
of the indirect pathwaythat X affects Y through the compound pathway of a and b. A second
problem is that the Barron and Kenny approach tends to miss some true mediation effects (Type II
errors; MacKinnon et al., 2007). An alternative, and preferable approach, is to calculate the indirect
effect and test it for significance. The regression coefficient for the indirect effect represents the
change in Y for every unit change in X that is mediated by M. There are two ways to estimate the
Newsom
USP 654 Data analysis II
Fall 2015
indirect coefficient. Judd and Kenny (1981) suggested computing the difference between two
regression coefficients. To do this, two regressions are required.
Analysis
Model 1
Y =B0 + B1 X + B2 M + e
c
X
Model 2
Y =B0 + BX + e
c
X
The approach involves subtracting the partial regression coefficient obtained in Model 1, B1 from the
simple regression coefficient obtained from Model 2, B. Note that both represent the effect of X on Y
but that B is the zero-order coefficient from the simple regression and B1 is the partial regression
coefficient from a multiple regression. The indirect effect is the difference between these two
coefficients:
Bindirect= B B1 .
An equivalent approach calculates the indirect effect by multiplying two regression coefficients (Sobel,
1982). The two coefficients are obtained from two regression models.
Sobel Product of Coefficients Approach
Analysis
Model 1
Visual Depiction
Y =B0 + B1 X + B2 M + e
c
X
Model 2
M =B0 + BX + e
Notice that Model 2 is a different model from the one used in the difference approach. In the Sobel
approach, Model 2 involves the relationship between X and M. A product is formed by multiplying two
coefficients together, the partial regression effect for M predicting Y, B2, and the simple coefficient for
X predicting M, B:
Bindirect = ( B2 )( B )
As it turns out, the Kenny and Judd difference of coefficients approach and the Sobel product of
coefficients approach yield identical values for the indirect effect (MacKinnon, Warsi, & Dwyer, 1995). 1
Statistical tests of the indirect effect
Once the regression coefficient for the indirect effect is calculated, it needs to be tested for
significance. There has been considerable controversy about the best way to estimate the standard
error used in the significance test, however. There are quite a few approaches to calculation of
standard errors.
There are two general approaches to testing significance of the indirect effect currentlybootstrap
methods (sometimes called "nonparametric resampling") and the Monte Carlo method (sometimes
called "parametric resampling"). For the bootstrap method, software for testing indirect effects
generally offer two options. One, referred to as "percentile" bootstrap, involves confidence intervals
1
Note: regardless of the approach you use (i.e., difference or product) be sure to use unstandardized coefficients if you do the
computations yourself.
Newsom
USP 654 Data analysis II
Fall 2015
using usual sampling distribution cutoffs without explicit bias corrections. The accelerated biascorrected bootstrap estimates correct for a bias in the average estimate and the standard deviation
across potential values of the indirect coefficient. The Monte Carlo approach involves computation of
the indirect effect and the standard error estimates for the separate coefficients for the full sample.
Resampling is then used to estimate the standard errors for the indirect effects using these values.
The bias-corrected bootstrap method may result in Type I error rates that are slightly higher than the
percentile bootstrap method (Biesanz, Falk, & Savalei, 2010; Fritz, Taylor, & MacKinnon, 2012).
Tofighi and MacKinnon (2015) find that both the percentile bootstrap confidence intervals and the
Monte Carlo method provide good tests with good Type I error rates and statistical power but that the
Monte Carlo approach had somewhat better power. Standardized coefficients can be computed,
though they may not be reported by the software program, and other methods such as the ratio of
indirect to total effect can also be computed (see Preacher & Kelley, 2011, for a review).
Structural equation modeling (SEM, also called covariance structure analysis) is designed, in part, to
test these more complicated models in a single analysis instead of testing separate regression
analyses. Most SEM software packages now offer indirect effect tests using one of the above
approaches for determining significance. In addition, the SEM analysis approach provides model fit
information that provides information about consistency of the hypothesized mediational model to the
data. Measurement error is a potential concern in mediation testing because of attenuation of
relationships and the SEM approach can address this problem by removing measurement error from
the estimation of the relationships among the variables. I will save more detail on this topic for another
course, however.
Online resources
David Kenny also has a webpage on mediation: http://davidakenny.net/cm/mediate.htm
Andrew Hayes' site Preacher and Hays bootstrap, Monte Carlo, and Sobel macros: http://www.afhayes.com/spss-sas-andmplus-macros-and-code.html
Mediation package for R: https://cran.r-project.org/web/packages/mediation/mediation.pdf
Kristopher Preacher's site with software approaches for the Monte Carlo method among other related material:
http://www.quantpsy.org/medmc/medmc.htm
Newsom
USP 654 Data analysis II
Fall 2015
Tofighi, D., & MacKinnon, D. P. (2015). Monte Carlo Confidence Intervals for Complex Functions of Indirect Effects. Structural
Equation Modeling: A Multidisciplinary Journal, 1-12.