Specification Error OR Misspecification in Statistical Models
Specification Error OR Misspecification in Statistical Models
Models?
When we think about model assumptions, we tend to focus on assumptions
like independence, normality, and constant variance. The other big
assumption, which is harder to see or test, is that there is no specification
error. The assumption of linearity is part of this, but it’s actually a bigger
assumption.
The basic idea is that when you choose a final model, you want to choose
There are a few common ways of specifying a linear model inaccurately.
Other ways to find it include residual graphs and, if they make theoretical
sense, adding transformations of X to the model and assessing model fit.
Including that interaction in the model will accurately represent the real
relationship between X and Y. Failing to include it means mis-specification
of X’s real effect.
(And there are certainly models whose job is not to represent all predictors
of an outcome. Rather it’s to test the relationship with specific predictors).
There are many ways to build a model. The goal of all of them is to find the
best model. The best model is one that includes all the important predictors
in the right form, but not any unimportant ones.
Conventional test of hypothesis Invalid and faulty inferences Valid though erroneous
and confidence region