Chapter 6-Simple Linear Regression and Correlation

Uploaded by

Fatin

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views

Chapter 6-Simple Linear Regression and Correlation

Uploaded by

Fatin

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

MATH 2330

COMPUTATIONAL
METHOD AND
STATISTICS
CHAPTER 6- SIMPLE
LINEAR REGRESSION
AND CORRELATION

PREPARED BY :
DR MAZIATI AKMAL BT
MOHD HATTA
Simple Linear Regression and Correlation
• Scientist are frequently interested in studying the functional relationship between two variables
(y and x1) or more than two variables (y and x1, x2,…, xn). For example, in a chemical process, the
yield of the product is related to the process-operating temperature level, pressure,
concentration of reactants and some other factors/variable.
Simple Linear Regression and Correlation
• The most commonly used techniques for investigating the relationship between two
quantitative variables are correlation and linear regression. Correlation quantifies the strength
of the linear relationship between a pair of variables, whereas regression expresses the
relationship in the form of an equation. Let say, we are interested to investigate the
relationship between the yield of the product (y) and only with the process operating
temperature (x1), then the regression is called as simple regression model. The improved of the
simple regression model by adding one or more explanatory variables is known as multiple
regression model.
① Correlation
• The correlation coefficient computed from the sample data measures the strength and direction of a linear
relationship between two quantitative variables. The symbol for the sample correlation coefficient is r. The
symbol for the population correlation coefficient is ρ. The range of the correlation coefficient is from -1 to
+1.
1 , perfect positive correlation
𝑟=ቐ0 , no correlation (the values doen′ t seem to linked at all)
−1 , perfect negative correlation
① Correlation
The formula for correlation Pearson's correlation coefficient is

Guideline of Pearson’s correlation coefficient interpretation:

0 < |r| < 0 .3 weak correlation
0.3 < |r| < 0.7 moderate correlation
|r| > 0.7 strong correlation
① Correlation
Eg 1: The local ice cream shop keeps track of how much ice cream they sell versus the temperature on that day,
here are their figures for the last 4 days. Compute the value of the correlation coefficient for the data given
below.
① Correlation
Sol (Eg 1):
① Correlation
① Correlation
② Simple Linear Regression
• A simple regression includes only two variables:

i) one independent variable, x - use to explained the variation in the dependent variable (regressor

variable/ predictor variable)

ii) one dependent variable, y - is the one being explained (response variable).
② Simple Linear Regression
• As an illustration, consider the data in Table 11-1 below.

y: purity of oxygen produced in chemical distillation process.

x: percentage of hydrocarbons that are present in the main condenser of the distillation unit.
② Simple Linear Regression
• By looking at the scatter diagram, we can observe that there exists a strong linear relationship
between purity and hydrocarbon level. If a straight line is drawn through the points, the points will be
scattered closely around the line. Then, our simple linear regression model is written as
𝐸(𝑌ȁ𝑥) = 𝛽𝑜 + 𝛽1 𝑥
• where the y-intercept βo and the slope β1 are unknown regression coefficients. Now the data points
do not fall exactly on a straight line, so the above equation need to be modified to account for this. Let
the difference between the observed value of y and the straight line (𝛽𝑜 +𝛽1 𝑥) be an error ϵ. It is
convenient to think of ϵ as a statistical error; that is, it is a random variable that accounts for the failure
of the model to fit the data exactly. Thus, the complete regression model is written as 𝐸 𝑌ȁ𝑥 = 𝛽𝑜 +
𝛽1 𝑥 + 𝜖. The difference between the actual value of y and the predicted value of y for a given x value
is also known as residual.
② Simple Linear Regression
• The difference between the actual value of y and the predicted value of y for a given x value is also
known as residual.
• Assumptions:
i) mean zero and unknown variance σ2
ii) errors are uncorrelated
② Simple Linear Regression
• However, a large numbers of straight lines can be drawn through the scatter diagram. The question
now, which line will give the best fit to the data? Or what is the estimation of βo and β1 should result
in a line that is a “best fit” to the data?

• In regression analysis we try to find a line that best fits the points in the scatter diagram. Such a line
provides the best possible description of the relationship between the dependent and independent
variables. This can be done via the least squares regression.
Least square estimates
• The least squares estimates of the intercept and slope in the simple linear regression model are

𝑆𝑥𝑦
=
𝑆𝑥𝑥

𝒏 𝒏
𝟏 𝟏
ഥ=
𝐰𝐡𝐞𝐫𝐞 𝒚 ഥ=
෍ 𝒚𝒊 𝐚𝐧𝐝 𝒙 ෍ 𝒙𝒊 .
𝒏 𝒏
𝒊=𝟏 𝒊=𝟏
Estimating σ2
• Notice that, there is unknown parameter in our regression model, σ2 (the variance of the error term 𝜖).
The unbiased estimator of σ2 is given by
Eg 2: Consider

(a) calculate the least squares estimates of the slope and intercept. Estimates 𝜎̂2
(b) predict the oxygen purity 𝑦̂ when the hydrocarbon level is 𝑥̂ = 1.00%.
(c) suppose the hydrocarbon level is 0.99. Calculate the fitted value of 𝑦̂ and the corresponding residual.
(d) what change in mean purity is expected when the hydrocarbon level changes by 2%?
Sol (Eg 2):
By using minitab
By using minitab
Eg 3: As machines are used over long periods of time, the output product can get off target. Below is the average
value of how much off target a product is getting manufactured as a function of machine use.

a) Assuming that a simple linear regression model is appropriate,

i) fit the regression model relating the target (y) to the hours of machine use (x) [Ans: 𝒚̂ = 0.56050+0.019179]
ii) what is the estimate of 𝜎̂2? [Ans: 𝟎. 𝟎𝟎𝟎𝟔𝟏𝟒]
b) What is the estimate of expected hours when the average is 2 mm off target. [Ans: 75.056 hours]
c) Suppose the hours used is 39. Calculate the fitted value of 𝑦̂ and the corresponding residual.[Ans: −𝟎. 𝟎𝟎𝟖𝟒𝟖𝟏]
d) What change in mm off target is expected when the duration of machine used changes by 4hours?
[Ans: 𝟎. 𝟎𝟕𝟔𝟕𝐦𝐦]
By using minitab
Sol (Eg 3):
Coefficient of Determination
The quantity

2
𝑆𝑆𝐸 𝛽መ1 𝑆𝑥𝑦
𝑟 =1− =
𝑆𝑆𝑇 𝑆𝑆𝑇

is called the coefficient of determination and is often used to judge the adequacy of a regression model.
r2 is the square of the correlation coefficient between X and Y in which we often refer loosely to r2 as the
amount of variability in the data explained or accounted for by the regression model.

Eg 4: For the oxygen purity regression model in Eg 2, calculate the r2

Thank you and happy reading☺

The Practice of Health Program Evaluation. ISBN 9781483376370, 978-1483376370
100% (11)
The Practice of Health Program Evaluation. ISBN 9781483376370, 978-1483376370
23 pages
Regression Analysis
No ratings yet
Regression Analysis
12 pages
ICT583 Data Science Applications - Final Assignment - Individual - UPDATED!!! - Explanation
0% (1)
ICT583 Data Science Applications - Final Assignment - Individual - UPDATED!!! - Explanation
5 pages
Econometrics Chapter 3
No ratings yet
Econometrics Chapter 3
24 pages
Linear regression case study
No ratings yet
Linear regression case study
6 pages
Regression and Correlation
No ratings yet
Regression and Correlation
19 pages
Pearson's Correlation Coefficient
No ratings yet
Pearson's Correlation Coefficient
7 pages
Module 4
No ratings yet
Module 4
27 pages
Lecture+8+ +Linear+Regression
No ratings yet
Lecture+8+ +Linear+Regression
45 pages
Lecture 12 Simple Linear Regression Analysis
No ratings yet
Lecture 12 Simple Linear Regression Analysis
22 pages
CH 6
No ratings yet
CH 6
42 pages
ASS#1-FINALS Doromal
No ratings yet
ASS#1-FINALS Doromal
8 pages
ANALYTICAL TECHNIQUES LU4 Lecture Notes
No ratings yet
ANALYTICAL TECHNIQUES LU4 Lecture Notes
25 pages
Topic_13_Correlation_and_Simple_Linear_Regression
No ratings yet
Topic_13_Correlation_and_Simple_Linear_Regression
17 pages
Regression Analysis
No ratings yet
Regression Analysis
34 pages
Lecture 3.1.9 (REGRESSION)
No ratings yet
Lecture 3.1.9 (REGRESSION)
9 pages
Statistical Analysis (SM 901B) Unit 2 - Regression: Goonjan Jain Department of Applied Mathematics DTU
No ratings yet
Statistical Analysis (SM 901B) Unit 2 - Regression: Goonjan Jain Department of Applied Mathematics DTU
19 pages
Regression Primer
No ratings yet
Regression Primer
4 pages
Regression&Corr&Annova
No ratings yet
Regression&Corr&Annova
71 pages
Lecture 8-Association Between Variables
No ratings yet
Lecture 8-Association Between Variables
28 pages
Correlation and Regression Analysis
No ratings yet
Correlation and Regression Analysis
71 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
19 pages
Chapter 10 Regression Analysis
No ratings yet
Chapter 10 Regression Analysis
3 pages
Regression Analysis
100% (1)
Regression Analysis
43 pages
ML Unit-2
No ratings yet
ML Unit-2
123 pages
Unit 21 StatsProbab
No ratings yet
Unit 21 StatsProbab
79 pages
Define Correlation & How It Is Converted Into Regression: Topic
No ratings yet
Define Correlation & How It Is Converted Into Regression: Topic
9 pages
da 4
No ratings yet
da 4
30 pages
Simple Linear Regression
100% (1)
Simple Linear Regression
50 pages
Class Note II_044242
No ratings yet
Class Note II_044242
19 pages
r23 p & s Unit 2 Material
No ratings yet
r23 p & s Unit 2 Material
14 pages
Module 6A Estimating Relationships
No ratings yet
Module 6A Estimating Relationships
104 pages
MachineLearning_Unit-II
No ratings yet
MachineLearning_Unit-II
45 pages
Ecotrix Ecotrix: B.A. Economics (Hons.) (University of Delhi) B.A. Economics (Hons.) (University of Delhi)
No ratings yet
Ecotrix Ecotrix: B.A. Economics (Hons.) (University of Delhi) B.A. Economics (Hons.) (University of Delhi)
18 pages
Regression Modelling With Actuarial and Financial Applications - Key Notes
No ratings yet
Regression Modelling With Actuarial and Financial Applications - Key Notes
3 pages
FM Project REPORT - Group3
No ratings yet
FM Project REPORT - Group3
24 pages
Normal Distribution and Regression Notes
No ratings yet
Normal Distribution and Regression Notes
71 pages
Chapter 3 - Classical Simple Linear Regression
No ratings yet
Chapter 3 - Classical Simple Linear Regression
52 pages
L7 Correlation
No ratings yet
L7 Correlation
40 pages
Theme 3 Multivariante Regression Model
No ratings yet
Theme 3 Multivariante Regression Model
8 pages
Chapter 06-Regression Analysis
No ratings yet
Chapter 06-Regression Analysis
41 pages
Lesson 1-Correlation
100% (1)
Lesson 1-Correlation
12 pages
Bivariate Data Analysis
100% (1)
Bivariate Data Analysis
34 pages
06 Simple Linear Regression Part1
No ratings yet
06 Simple Linear Regression Part1
8 pages
CH 08
No ratings yet
CH 08
13 pages
Linear Regression Assignment Questions and Answer
No ratings yet
Linear Regression Assignment Questions and Answer
7 pages
Regression and Correlation
No ratings yet
Regression and Correlation
37 pages
Chapter 2
No ratings yet
Chapter 2
41 pages
Topic 1: Investigating relationships between two numerical variables
No ratings yet
Topic 1: Investigating relationships between two numerical variables
8 pages
Regression Analysis
No ratings yet
Regression Analysis
52 pages
Econometrics for Finace Lecture II-Session Three
No ratings yet
Econometrics for Finace Lecture II-Session Three
32 pages
Chapter 3 - Multiple Linear Regression Models
No ratings yet
Chapter 3 - Multiple Linear Regression Models
29 pages
Regression Analysis in Machine Learning
No ratings yet
Regression Analysis in Machine Learning
13 pages
Business Stat 10 12 .PDF
No ratings yet
Business Stat 10 12 .PDF
144 pages
Part A Assignment - No - 4
No ratings yet
Part A Assignment - No - 4
14 pages
Correlation Analysis Notes-2
No ratings yet
Correlation Analysis Notes-2
5 pages
Data Management - Part 3
No ratings yet
Data Management - Part 3
39 pages
Regcorr 5
No ratings yet
Regcorr 5
20 pages
Correlation and Regression: Six Sigma Thinking, #8
From Everand
Correlation and Regression: Six Sigma Thinking, #8
Sumeet Savant
5/5 (1)
Calculus: Maths of the Gods
From Everand
Calculus: Maths of the Gods
Bill Todorovich
No ratings yet
Student Solutions Manual for Mathematics for Economics, fourth edition
From Everand
Student Solutions Manual for Mathematics for Economics, fourth edition
Michael Hoy
No ratings yet
Introduction to Logarithms and Exponentials
From Everand
Introduction to Logarithms and Exponentials
Simone Malacrida
No ratings yet
박사논문 천종기 PDF
100% (1)
박사논문 천종기 PDF
147 pages
Odemil Uyan BSIT Presentation ITC 101
No ratings yet
Odemil Uyan BSIT Presentation ITC 101
11 pages
Lindawati Fah
No ratings yet
Lindawati Fah
0 pages
Six Sigma: A New Practice For Reducing Water Consumption Within Coca Cola Industry
No ratings yet
Six Sigma: A New Practice For Reducing Water Consumption Within Coca Cola Industry
25 pages
MIS and Credit Monitoring Project
No ratings yet
MIS and Credit Monitoring Project
48 pages
Forecasting-Seasonal Models
No ratings yet
Forecasting-Seasonal Models
35 pages
Physics 103/105 Lab Manual: Princeton University Physics Department
No ratings yet
Physics 103/105 Lab Manual: Princeton University Physics Department
126 pages
Analysis of Impact of Celebrity Endorsements On Consumer Buying Behaviour
100% (1)
Analysis of Impact of Celebrity Endorsements On Consumer Buying Behaviour
20 pages
1 Econometrics and Economic Data
No ratings yet
1 Econometrics and Economic Data
20 pages
Sd108 - Quantitative Impacts of Project Change
67% (3)
Sd108 - Quantitative Impacts of Project Change
160 pages
Machine Learning Basics Infographic With Algorithm Examples PDF
No ratings yet
Machine Learning Basics Infographic With Algorithm Examples PDF
1 page
(Statistics - 101) : Chapter One
No ratings yet
(Statistics - 101) : Chapter One
55 pages
UNIT 1: Introduction To Business Intelligence
No ratings yet
UNIT 1: Introduction To Business Intelligence
37 pages
Customer Relationship Management Websites Analysis of The Top Ten Consumer Goods Companies
No ratings yet
Customer Relationship Management Websites Analysis of The Top Ten Consumer Goods Companies
20 pages
Baron 2003
No ratings yet
Baron 2003
20 pages
Dicky Pramana Agung
No ratings yet
Dicky Pramana Agung
31 pages
BA-III Sem Syllabus
No ratings yet
BA-III Sem Syllabus
15 pages
MSM MBA DMT Individual Assignment September 2021 A05600aede79bf
No ratings yet
MSM MBA DMT Individual Assignment September 2021 A05600aede79bf
46 pages
PavanMathiResume (1) 1 Compressed
No ratings yet
PavanMathiResume (1) 1 Compressed
2 pages
Research 1-WPS Office
No ratings yet
Research 1-WPS Office
6 pages
executive summary
No ratings yet
executive summary
9 pages
The Efficacy of Resistance Training in Addition To Usual Care For
No ratings yet
The Efficacy of Resistance Training in Addition To Usual Care For
34 pages
BI practical 1,2,3
No ratings yet
BI practical 1,2,3
10 pages
MBA Syllabus
No ratings yet
MBA Syllabus
152 pages
Gender Differences in The Academic Performance of Students in Senior Secondary School Mathematics
No ratings yet
Gender Differences in The Academic Performance of Students in Senior Secondary School Mathematics
7 pages
Telco Customer Churn Prediction Project Report
No ratings yet
Telco Customer Churn Prediction Project Report
40 pages
Indicators: Holistic Rubric For The Research Data-Collection Procedure
No ratings yet
Indicators: Holistic Rubric For The Research Data-Collection Procedure
2 pages
ML3 Some Supervised
No ratings yet
ML3 Some Supervised
17 pages