
Multiple Regression

Prof. Andy Field


Aims
• Understand when to use multiple regression.
• Understand the multiple regression equation and what the betas represent.
• Understand different methods of regression:
– Hierarchical
– Stepwise
– Forced entry
• Understand how to do a multiple regression in IBM SPSS.
• Understand how to interpret multiple regression.
• Understand the assumptions of multiple regression and how to test them.

Slide 2
What is Multiple Regression?
• Linear Regression is a model to predict
the value of one variable from another.
• Multiple Regression is a natural extension
of this model:
– We use it to predict values of an outcome
from several predictors.
– It is a hypothetical model of the relationship
between several variables.

Slide 3
Regression: An Example
• A record company boss was interested in
predicting album sales from advertising.
• Data
– 200 different album releases
• Outcome variable:
– Sales (CDs and Downloads) in the week after release
• Predictor variables
– The amount (in £s) spent promoting the album
before release (see last lecture)
– Number of plays on the radio (new variable)
Slide 5
Multiple Regression as an Equation

• With multiple regression the relationship is described using a variation of the equation of a straight line.

y = b0 + b1X1 + b2X2 + … + bnXn + εi

Slide 6
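As an aside (not part of the original slides), the equation above can be illustrated with a short Python sketch using statsmodels. The data here are simulated stand-ins for the album-sales example; the names adverts, airplay and sales are assumptions, not the real dataset:

    import numpy as np
    import statsmodels.api as sm

    # Simulated stand-in for the album-sales data (200 releases); not the real dataset
    rng = np.random.default_rng(42)
    adverts = rng.uniform(0, 2_000_000, 200)              # advertising budget in £
    airplay = rng.integers(0, 60, 200).astype(float)      # radio plays before release
    sales = 41124 + 0.087 * adverts + 3589 * airplay + rng.normal(0, 50_000, 200)

    # y = b0 + b1*X1 + b2*X2 + error
    X = sm.add_constant(np.column_stack([adverts, airplay]))
    fit = sm.OLS(sales, X).fit()
    print(fit.params)          # b0 (intercept), b1 (adverts), b2 (airplay)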
b0

• b0 is the intercept.
• The intercept is the value of the Y
variable when all Xs = 0.
• This is the point at which the regression plane crosses the Y-axis (vertical).
Slide 7
Beta Values

• b1 is the regression coefficient for variable 1.
• b2 is the regression coefficient for variable 2.
• bn is the regression coefficient for the nth variable.
Slide 8
The Model with Two Predictors

Slide 9
Methods of Regression
• Hierarchical:
– Experimenter decides the order in which
variables are entered into the model.
• Forced Entry:
– All predictors are entered simultaneously.
• Stepwise:
– Predictors are selected using their semi-
partial correlation with the outcome.

Slide 10
Hierarchical Regression

• Known predictors (based on past research) are entered into the regression model first.
• New predictors are then entered in a separate step/block.
• Experimenter makes the decisions.

Slide 12
Hierarchical Regression
• It is the best method:
– Based on theory testing.
– You can see the unique predictive
influence of a new variable on the
outcome because known predictors are
held constant in the model.
• Bad Point:
– Relies on the experimenter knowing
what they’re doing!

Slide 13
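A minimal sketch of the hierarchical approach described above, assuming a pandas DataFrame df with columns sales, adverts and airplay (hypothetical names for the album data): the known predictor goes in block 1, the new predictor in block 2, and the change in R² shows its unique contribution.

    import pandas as pd
    import statsmodels.formula.api as smf
    from statsmodels.stats.anova import anova_lm

    def hierarchical_blocks(df: pd.DataFrame) -> None:
        # Block 1: the known predictor from past research
        block1 = smf.ols("sales ~ adverts", data=df).fit()
        # Block 2: the new predictor entered in a separate step
        block2 = smf.ols("sales ~ adverts + airplay", data=df).fit()

        r2_change = block2.rsquared - block1.rsquared
        print(f"R2 block 1 = {block1.rsquared:.3f}")
        print(f"R2 block 2 = {block2.rsquared:.3f} (change = {r2_change:.3f})")
        # F-test of whether block 2 improves significantly on block 1
        print(anova_lm(block1, block2))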
Forced Entry Regression
• All variables are entered into the
model simultaneously.
• The results obtained depend on the
variables entered into the model.
– It is important, therefore, to have good
theoretical reasons for including a
particular variable.

Slide 14
Stepwise Regression I

• Variables are entered into the model based on mathematical criteria.
• Computer selects variables in steps.
• Step 1:
– SPSS looks for the predictor that can explain the most variance in the outcome variable.

Slide 15
Stepwise Regression II
• Step 2:
– Having selected the 1st predictor, a
second one is chosen from the
remaining predictors.
– The semi-partial correlation is
used as a criterion for selection.

Slide 16
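The following is a rough sketch of this stepwise idea, not SPSS's exact algorithm: at each step the remaining predictor that adds the most R² given the variables already in the model (equivalently, the one with the largest squared semi-partial correlation) is entered. The function name and arguments are illustrative assumptions.

    import numpy as np
    import statsmodels.api as sm

    def forward_stepwise(y, X_all, names, n_steps=2):
        """Enter predictors one at a time, choosing at each step the one that
        adds the most R-squared to the current model (equivalent to picking
        the largest squared semi-partial correlation with the outcome)."""
        chosen = []
        for _ in range(n_steps):
            base_cols = [names.index(c) for c in chosen]
            base_r2 = (sm.OLS(y, sm.add_constant(X_all[:, base_cols])).fit().rsquared
                       if chosen else 0.0)
            best_name, best_gain = None, -np.inf
            for name in names:
                if name in chosen:
                    continue
                cols = base_cols + [names.index(name)]
                r2 = sm.OLS(y, sm.add_constant(X_all[:, cols])).fit().rsquared
                if r2 - base_r2 > best_gain:
                    best_name, best_gain = name, r2 - base_r2
            chosen.append(best_name)
            print(f"Step {len(chosen)}: entered {best_name} (R-squared gain {best_gain:.3f})")
        return chosen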
Problems with Stepwise Methods

• They rely on a mathematical criterion.
– Variable selection may depend upon only slight differences in the semi-partial correlation.
– These slight numerical differences can lead to major theoretical differences.
• Stepwise methods should be used only for exploration.

Slide 17
• The backward method is the opposite of the forward method: the computer begins by placing all predictors in the model and then calculates the contribution of each one by looking at the significance value of its t-test. If a predictor meets the removal criterion it is removed from the model, the model is re-estimated for the remaining predictors, and their contributions are then reassessed (see the sketch below).
• It carries less risk of Type II errors than the forward method.
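A minimal sketch of that backward logic, assuming a pandas DataFrame with the named columns and a removal criterion of p > 0.10 (an illustrative threshold, not necessarily SPSS's default):

    import statsmodels.formula.api as smf

    def backward_elimination(df, outcome, predictors, p_remove=0.10):
        """Start with every predictor in the model, then repeatedly drop the
        least significant one (largest t-test p-value) while it fails the
        removal criterion, refitting the model after each removal."""
        remaining = list(predictors)
        while remaining:
            formula = f"{outcome} ~ " + " + ".join(remaining)
            fit = smf.ols(formula, data=df).fit()
            pvals = fit.pvalues.drop("Intercept")       # p-values of the t-tests
            worst = pvals.idxmax()
            if pvals[worst] <= p_remove:
                break                                    # all remaining predictors survive
            remaining.remove(worst)
            print(f"Removed {worst} (p = {pvals[worst]:.3f}); refitting")
        return remaining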
Doing Multiple Regression

Slide 19
Doing Multiple Regression

Slide 20
Regression Statistics
Regression Diagnostics
Output: Model Summary

Slide 23
R and R²
• R
– The correlation between the observed values of the outcome and the values predicted by the model.
• R²
– The proportion of variance accounted for by the model.
• Adj. R²
– An estimate of R² in the population (shrinkage).

Slide 24
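A short sketch of how these three statistics relate, assuming y and X are NumPy arrays with X holding the predictors only; the adjusted R² uses the usual shrinkage formula 1 − (1 − R²)(n − 1)/(n − k − 1):

    import numpy as np
    import statsmodels.api as sm

    def model_summary(y, X):
        """Print R, R-squared and adjusted R-squared for an OLS model."""
        fit = sm.OLS(y, sm.add_constant(X)).fit()
        n, k = X.shape
        r2 = fit.rsquared
        r = np.sqrt(r2)                                   # correlation of observed with predicted
        adj_r2 = 1 - (1 - r2) * (n - 1) / (n - k - 1)     # shrinkage-corrected estimate
        assert np.isclose(adj_r2, fit.rsquared_adj)       # same formula statsmodels uses
        print(f"R = {r:.3f}, R2 = {r2:.3f}, adjusted R2 = {adj_r2:.3f}")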
Output: ANOVA

Slide 25
Analysis of Variance: ANOVA
• The F-test
– looks at whether the variance
explained by the model (SSM) is
significantly greater than the error
within the model (SSR).
– It tells us whether using the regression
model is significantly better at
predicting values of the outcome than
using the mean.

Slide 26
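A sketch of the same F-test computed directly from the sums of squares (y and X assumed to be NumPy arrays); the result should match the fvalue and f_pvalue that statsmodels reports for the fitted model.

    import numpy as np
    import statsmodels.api as sm
    from scipy import stats

    def regression_anova(y, X):
        """F-test: is the variance explained by the model (SSM) significantly
        greater than the error within the model (SSR)?"""
        fit = sm.OLS(y, sm.add_constant(X)).fit()
        n, k = X.shape
        ss_total = np.sum((y - np.mean(y)) ** 2)          # SST: error when using the mean
        ss_resid = np.sum(fit.resid ** 2)                 # SSR: error left after using the model
        ss_model = ss_total - ss_resid                    # SSM: improvement over the mean
        f = (ss_model / k) / (ss_resid / (n - k - 1))     # ratio of mean squares
        p = stats.f.sf(f, k, n - k - 1)
        print(f"F({k}, {n - k - 1}) = {f:.2f}, p = {p:.4f}")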
Output: betas

Slide 27
How to Interpret Beta Values

• Beta values:
– the change in the outcome associated with a unit change in the predictor.
• Standardised beta values:
– tell us the same thing, but expressed in standard deviations (see the sketch below).

Slide 28
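A sketch of the link between the two kinds of beta: the standardised beta is the unstandardised b multiplied by SD(predictor)/SD(outcome), which is the same as refitting the model on z-scored variables. NumPy array inputs y and X are assumed.

    import numpy as np
    import statsmodels.api as sm

    def standardised_betas(y, X):
        """Standardised betas: unstandardised b scaled by SD(predictor)/SD(outcome),
        i.e. the slopes obtained after z-scoring the outcome and the predictors."""
        fit = sm.OLS(y, sm.add_constant(X)).fit()
        b = fit.params[1:]                                     # unstandardised slopes
        betas = b * X.std(axis=0, ddof=1) / y.std(ddof=1)

        # Same answer from z-scored variables (no constant needed after centring)
        zX = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
        zy = (y - y.mean()) / y.std(ddof=1)
        assert np.allclose(betas, sm.OLS(zy, zX).fit().params)
        return betas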
Beta Values
• b1 = 0.087.
– So, as advertising increases by £1, album sales increase by 0.087 units.
• b2 = 3,589.
– So, for each additional play on the radio per week, album sales increase by 3,589 units.

Slide 29
Constructing a Model
y  b0  b1 X 1  b2 X 2
Sales  41124  0.087Adverts  3589plays

£1 Million Advertising, 15 plays


Sales  41124  0.087  1,000,000  3589 15
 41124  87000 53835
 181959

Slide 30
Standardised Beta Values
• 1= 0.523
– As advertising increases by 1 standard
deviation, album sales increase by 0.523 of a
standard deviation.
• 2= 0.546
– When the number of plays on the radio
increases by 1 SD its sales increase by 0.546
standard deviations.

Slide 31
Interpreting Standardised Betas

• As advertising increases by £485,655 (1 standard deviation), album sales increase by 0.523 × 80,699 = 42,206.
• If the number of plays on the radio per week increases by 12 (1 standard deviation), album sales increase by 0.546 × 80,699 = 44,062.

Slide 32
Generalization
• When we run regression, we hope to be
able to generalize the sample model to the
entire population.
• To do this, several assumptions must be
met.
• Violating these assumptions stops us
generalizing conclusions to our target
population.

Slide 33
Straightforward Assumptions
• Variable Type:
– Outcome must be continuous
– Predictors can be continuous or dichotomous.
• Non-Zero Variance:
– Predictors must not have zero variance.
• Linearity:
– The relationship we model is, in reality, linear.
• Independence:
– Each value of the outcome should come from a
different person.

Slide 34
The More Tricky Assumptions
• No Multicollinearity:
– Predictors must not be highly correlated.
• Homoscedasticity:
– For each value of the predictors the variance of the
error term should be constant.
• Independent Errors:
– For any pair of observations, the error terms should
be uncorrelated.
• Normally-distributed Errors

Slide 35
• In designs in which you test several groups of participants, this assumption means that each of these samples comes from populations with the same variance.
• In correlational designs, this assumption means that the variance of the outcome variable should be stable at all levels of the predictor variable.
Plots of standardized residuals against predicted values
Checking Assumptions about Errors
• Homoscedasticity/independence of errors:
– Plot ZRESID against ZPRED.
• Normality of errors:
– Normal probability plot.
– It is assumed that the residuals in the model are random, normally distributed variables with a mean of 0. This assumption simply means that the differences between the model and the observed data are most frequently zero or very close to zero.

Slide 39
Regression Plots
Homoscedasticity: ZRESID vs. ZPRED
Normality of Errors: Histograms
and P-P plots
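A sketch of these checks in Python using matplotlib and SciPy (an analogue of the SPSS plots, not the SPSS output itself). Note that scipy's probplot produces a Q-Q style probability plot rather than SPSS's P-P plot, but it is read the same way: points near the line suggest normal errors.

    import matplotlib.pyplot as plt
    import statsmodels.api as sm
    from scipy import stats

    def residual_plots(y, X):
        fit = sm.OLS(y, sm.add_constant(X)).fit()
        zpred = stats.zscore(fit.fittedvalues)        # standardised predicted values (ZPRED)
        zresid = stats.zscore(fit.resid)              # standardised residuals (ZRESID)

        fig, axes = plt.subplots(1, 3, figsize=(12, 4))
        axes[0].scatter(zpred, zresid, s=10)          # should look like a random, even cloud
        axes[0].axhline(0, color="grey")
        axes[0].set(xlabel="ZPRED", ylabel="ZRESID", title="Homoscedasticity check")

        axes[1].hist(zresid, bins=20)                 # roughly bell-shaped if errors are normal
        axes[1].set(title="Histogram of residuals")

        stats.probplot(zresid, dist="norm", plot=axes[2])   # points should hug the line
        axes[2].set(title="Normal probability plot")
        plt.tight_layout()
        plt.show()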
Multicollinearity

• Multicollinearity exists if predictors are highly correlated.
• This assumption can be checked with collinearity diagnostics.

Slide 43
• Tolerance should be more than 0.2 (Menard, 1995).
• VIF should be less than 10 (Myers, 1990).
• Independent errors: for any two observations the residual terms should be uncorrelated (i.e., independent).
• This assumption can be tested with the Durbin–Watson test. The test statistic can vary between 0 and 4, with a value of 2 meaning that the residuals are uncorrelated (see the sketch below).
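A sketch of both diagnostics in Python with statsmodels; the function name and arguments are placeholders, with y and X as NumPy arrays and names listing the predictor labels.

    import statsmodels.api as sm
    from statsmodels.stats.outliers_influence import variance_inflation_factor
    from statsmodels.stats.stattools import durbin_watson

    def collinearity_and_error_checks(y, X, names):
        """Tolerance/VIF for each predictor and the Durbin-Watson statistic."""
        Xc = sm.add_constant(X)
        for i, name in enumerate(names, start=1):     # column 0 is the constant
            vif = variance_inflation_factor(Xc, i)
            # Rules of thumb: VIF < 10 (Myers, 1990); tolerance > 0.2 (Menard, 1995)
            print(f"{name}: VIF = {vif:.2f}, tolerance = {1 / vif:.2f}")

        resid = sm.OLS(y, Xc).fit().resid
        print(f"Durbin-Watson = {durbin_watson(resid):.2f}")  # near 2 -> uncorrelated errors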
