8multiple Linear Regression
8multiple Linear Regression
LEARNING OUTCOMES
When you have completed this chapter, you will be able to:
Regression analysis is widely used for prediction and forecasting, where its use
has substantial overlap with the field of machine learning. Regression analysis is also
used to understand which among the independent variables are related to the
dependent variable, and to explore the forms of these relationships. In restricted
circumstances, regression analysis can be used to infer causal relationships between
the independent and dependent variables. However, this can lead to illusions or false
relationships, so caution is advisable; For example, correlation does not imply
causation.
Multiple Linear Regression
In SPSS
Multiple Linear Regression
A model fits the data well if the differences between the
observed values and the model’s predicted values are
small (check residual plots).
R-squared (between 0-100%) reveals how close the data
to the filled regression line. The higher the R-squared,
the better the model fits your data but it cannot
determine whether coefficient estimates and predictions
are biased and does not indicate whether a regression
model is adequate.
Partial correlations accounts the influence of the control
variables on both dependent and independent variables
Part correlations takes into account the influence of
control variables on the independent variable
Collinearity diagnostics implies that 2 predictor variables
are near perfect linear combinations of one another
12-22
In SPSS
Multiple Linear Regression
In SPSS
In SPSS
Adjusted R square takes into account the sample size. Durbin-Watson which
test the hypothesis that there might be a serial correlation in the data. As a
rule of thumb, if the value is not within 1.5-2.5, then there is no meaningful
serial correlation.
12-22
In SPSS
The most important here is row 5 (model 5) as it takes into account the 5 IV
with p<0.001, F(5,21)=611.59
12-22
In SPSS
A coefficient for areasizeper000families (model 1)means that for one unit increase in
areasizeper1000families, we would expect a 35.635 increase in
annualnetsalesper1000usd. Also, from standardized coefficients beta, a one sd increase
in areasizeper1000families leads to .954 sd increase in annualnetsalesper1000usd
12-22
In SPSS
In SPSS
In SPSS
In SPSS
12-22
In SPSS