Psy 234 Investigating Relationships Week 11
Psy 234 Investigating Relationships Week 11
Psy 234 Investigating Relationships Week 11
relationships
Week 11
Two categorical
variables
Are boys more likely to prefer maths and science than girls?
Variables:
• Favourite subject (Nominal)
• Gender (Binary/ Nominal)
ØPresence of outliers
ØStatistic used:
r = correlation coefficient
Linear
Correlation Coefficient r
} Measures strength of a relationship between two
continuous variables -1 ≤ r ≤ 1
r = 0.9
r = 0.01
r = -0.9
Correlation Interpretation
An interpretation of the size of the coefficient has been
described by Cohen (1992) as:
r = 0.791
http://www.nejm.org/doi/full/10.1056/NEJMon1211064
Chocolate and serial killers
} What else is related to chocolate consumption?
r = 0.52
http://www.replicatedtypo.com/chocolate-consumption-traffic-
accidents-and-serial-killers/5718.html
www.statstutor.ac.uk
Hypothesis tests for r
Tests the null hypothesis that the population
correlation r = 0 NOT that there is a strong
relationship!
www.statstutor.ac.uk
Exercise - solution
Relationship Correlation Interpretation
Chocolate Number of
consumption Nobel winners
GDP (wealth)
Temperature
Dataset for today
• Factors affecting birth weight of babies
Mother smokes
=1
r = 0.706
Residuals
Baby heavier Baby lighter
than predicted than expected
Regression
line Baby the same
y=a+bx as predicted
y = a + bx
Dependent variable variable
Intercept Slope
www.statstutor.ac.uk
Exercise
• Investigate whether mothers pre-pregnancy weight and birth
weight are associated using a scatterplot, correlation and simple
regression.
Exercise - scatterplot
• Describe the relationship using the scatterplot and
correlation coefficient
r = 0.39
Regression question
www.statstutor.ac.uk
Correlation
• Pearson’s correlation = 0.39
• Interpretation:
• There is a significant relationship between a mothers’ pre-
pregnancy weight and the weight of her baby (p = 0.011). Pre-
pregnancy weight has a positive affect on a baby’s weight with
an increase of 0.03 lbs for each extra pound a mother weighs.
www.statstutor.ac.uk
Checking assumptions
• Linear relationship
• Histogram roughly peaks in the middle
• No patterns in residuals
Multiple regression
} Multiple regression has several binary or Scale
independent variables
y = a + b1 x1 + b 2 x2 + b 3 x3
www.statstutor.ac.uk
Exercise
• Which variables are most strongly related?
Exercise - Solution
• Which variables are most strongly related?
• Gestation and birth weight (0.709)