Simple Linear Regression

This document discusses key concepts in simple linear regression analysis including: 1) Residuals represent the distance between observed data points and the regression line, and are used to calculate sums of squares. 2) ANOVA tables are used to partition total variability into explained and unexplained components to assess model fit. Measures like the standard error of the estimate and R-squared indicate how well data fits the regression line. 3) Hypothesis testing and confidence intervals can be used to make inferences about the slope and intercept coefficients in the simple linear regression model.


Residual = the vertical distance between a data point and the best-fitted line

When the correlation is not zero, we can use x to estimate the value of y

ANOVA table → partitions the total variability of the regression model
Each deviation is measured between a data point and the mean of the data (yᵢ − ȳ)

SST (total sum of squares) → Σ(yᵢ − ȳ)^2
SSR (regression sum of squares, explained) → Σ(ŷᵢ − ȳ)^2
SSE (sum of squared errors, unexplained) → Σ(yᵢ − ŷᵢ)^2

SST = SSR + SSE (total sum of squares = regression sum of squares + sum of squared errors)
The least squares method finds the line that minimises SSE
ANOVA table → how well the regression model fits our observed data
Measures of fit for our model:
Approach 1: standard error of the estimate (SEE; also the SD of the errors, s)
s = SEE = sqrt(MSE) = sqrt(SSE/(n − 2))

Approach 2: coefficient of determination (R^2)

R^2 = 1 − SSE/Syy = (Syy − SSE)/Syy
= reduction in the sum of squared errors due to x / sum of squared errors when using ŷ = ȳ

F stat = MSR/MSE
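
As an illustration, the sketch below (made-up numbers, not from the notes) computes the quantities above — SST, SSR, SSE, SEE, R^2, and the F statistic — for a least-squares fit:

```python
# Illustrative sketch only: toy data, assumed for this example.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 9.9, 12.3])
n = len(x)

# Least squares estimates: b1 = Sxy/Sxx, b0 = ybar - b1*xbar
xbar, ybar = x.mean(), y.mean()
Sxx = np.sum((x - xbar) ** 2)
Sxy = np.sum((x - xbar) * (y - ybar))
b1 = Sxy / Sxx
b0 = ybar - b1 * xbar
yhat = b0 + b1 * x

SST = np.sum((y - ybar) ** 2)      # total sum of squares
SSR = np.sum((yhat - ybar) ** 2)   # regression (explained) sum of squares
SSE = np.sum((y - yhat) ** 2)      # error (unexplained) sum of squares

SEE = np.sqrt(SSE / (n - 2))       # standard error of the estimate, s
R2 = 1 - SSE / SST                 # coefficient of determination
F = (SSR / 1) / (SSE / (n - 2))    # F stat = MSR/MSE (1 df for the slope)

print(b0, b1, SST, SSR, SSE, SEE, R2, F)
```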

Indications
If the observed data are close to the regression line, SSE is low and SEE is also low.
If R^2 is high and SEE is low → a good indication that the model is a good fit (high confidence in the estimate).

If the observed data are far from the regression line, SEE is high and R^2 is low → the regression is a poor fit.

When R^2 = 1, SSE must equal 0, i.e. all the points fall exactly on a straight line.

Simple linear regression model and its properties


Estimate of slope: b1 = r·(sy/sx) = Sxy/Sxx, where r = Cov(x,y)/(sx·sy) and Cov(x,y) = Σ(xᵢ − x̄)(yᵢ − ȳ)/(n − 1)

F stat = MSR/MSE → test for the population coefficient of correlation (H0: ρ = 0)
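
A quick numeric check (toy data, assumed for this sketch) that the two slope formulas above agree, b1 = r·(sy/sx) and b1 = Sxy/Sxx:

```python
# Illustrative check only; the data are made up.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([1.8, 4.1, 5.9, 8.2, 9.8])

sx, sy = x.std(ddof=1), y.std(ddof=1)   # sample SDs (n-1 divisor)
cov_xy = np.sum((x - x.mean()) * (y - y.mean())) / (len(x) - 1)
r = cov_xy / (sx * sy)                   # sample correlation

b1_via_r = r * sy / sx
b1_via_S = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)

print(b1_via_r, b1_via_S)  # identical up to floating-point error
```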

Residuals

The statistical model for SLR assumes that, for each value of x, the value of y is normally distributed with a mean that depends linearly on x, and a SD that does not depend on x. The SD is constant and the same for all values of x.
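
A minimal simulation sketch of this assumption (all parameter values assumed for illustration): for each x, y is drawn from a normal distribution whose mean depends linearly on x while the SD stays constant:

```python
# Simulating the SLR model: y ~ Normal(b0 + b1*x, sigma), same sigma at every x.
import numpy as np

rng = np.random.default_rng(0)
b0, b1, sigma = 1.0, 2.0, 0.5                          # assumed true parameters
x = np.linspace(0, 10, 50)
y = b0 + b1 * x + rng.normal(0.0, sigma, size=x.size)  # mean depends on x, SD does not
```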
Inference on the slope coefficient (hypothesis testing)

SE(b1) = s/sqrt(Sxx) (estimated standard error of b1; given in the summary output as SE Coef)

se = s = standard error of the estimate = sqrt(MSE) = sqrt(SSE/(n − 2)) (standard error of the regression)

ii) Hypothesis test for the slope b1 and the intercept b0

- 2-sided test: H0: bi = b* vs Ha: bi ≠ b*, for any hypothesized value b*
→ Observed test statistic (t-stat): t = (bi − b*)/SE(bi) (~ Tn−2 under H0)
→ Reject H0 if |t| > tα/2,n−2 or p-value = 2P(Tn−2 ≥ |t|) < α


- 1-sided test:
→ Reject H0 if t > tα,n−2 or P(Tn−2 ≥ t) < α for Ha: bi > b*; reject H0 if t < −tα,n−2 or P(Tn−2 ≤ t) < α for Ha: bi < b*
Inferences in SLR: reject the claim (i.e., H0 below) that a parameter (b0 or b1) in SLR equals a value b*, with a 5% chance of committing a Type I error:
H0: bi = b* vs Ha: bi ≠ b*
Reject the null hypothesis (here, the researcher's claim) if any of the following holds:
1) The absolute value of the t-statistic is larger than t0.025,n−2 ≈ 2;
2) The p-value computed from the t-statistic is less than 0.05; or
3) b* lies outside the 95% CI for the parameter bi
Even if the errors are not normally distributed, we can still apply these hypothesis tests when n > 30 (central limit theorem).
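
A sketch of the two-sided slope test above (toy data; b* = 0 is an assumed hypothesized value):

```python
# Illustrative only: t = (b1 - b*) / SE(b1), compared against T(n-2).
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0])
y = np.array([2.2, 3.8, 6.1, 7.9, 10.2, 11.8, 14.1])
n = len(x)

Sxx = np.sum((x - x.mean()) ** 2)
b1 = np.sum((x - x.mean()) * (y - y.mean())) / Sxx
b0 = y.mean() - b1 * x.mean()
resid = y - (b0 + b1 * x)

s = np.sqrt(np.sum(resid ** 2) / (n - 2))  # standard error of the estimate
se_b1 = s / np.sqrt(Sxx)                   # SE(b1) = s / sqrt(Sxx)

b_star = 0.0                               # hypothesized value b* (assumed)
t = (b1 - b_star) / se_b1
p = 2 * stats.t.sf(abs(t), df=n - 2)       # p-value = 2P(T(n-2) >= |t|)
print(t, p)                                # reject H0 if p < 0.05
```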
Inference with confidence intervals
sx = sample SD of x; s = se = sqrt(MSE) = sqrt(SSE/(n − 2)) (standard error of the regression)

Coefficient of determination (R^2)


R^2 = [corr(X, Y)]^2 = SSR/(SSR + SSE) = (Syy − SSE)/Syy = 1 − SSE/SST

(Syy = Σ(yᵢ − ȳ)^2; Syy = sy^2·(n − 1))

95% C.I. for the parameter bi in SLR: bi ± t0.025,n−2 · SE(bi)
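
An illustrative sketch (made-up data) of computing this 95% CI for the slope:

```python
# Sketch only: 95% CI for b1 = b1 ± t(0.025, n-2) * SE(b1).
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([1.9, 4.2, 5.8, 8.1, 9.7, 12.2])
n = len(x)

Sxx = np.sum((x - x.mean()) ** 2)
b1 = np.sum((x - x.mean()) * (y - y.mean())) / Sxx
resid = y - (y.mean() + b1 * (x - x.mean()))     # fitted values in centered form
s = np.sqrt(np.sum(resid ** 2) / (n - 2))
se_b1 = s / np.sqrt(Sxx)

t_crit = stats.t.ppf(0.975, df=n - 2)            # t(0.025, n-2)
print(b1 - t_crit * se_b1, b1 + t_crit * se_b1)  # 95% CI for b1
```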



How to explain whether a prediction is reliable

1. R^2 is 30%, meaning that 70% of the variation in y remains unexplained.
2. The standard error of the estimate (SEE, SD of the errors, s): s = SEE = sqrt(MSE) = sqrt(SSE/(n − 2))

Prediction interval → a CI for a new observation (wider than the CI for the mean response)
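
A sketch of a 95% prediction interval at a new point x0 (data and x0 assumed for illustration), using the standard SLR prediction-interval formula:

```python
# Sketch only: yhat0 ± t(0.025, n-2) * s * sqrt(1 + 1/n + (x0-xbar)^2/Sxx).
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.0, 4.1, 5.9, 8.2, 9.8, 12.1])
n = len(x)

xbar = x.mean()
Sxx = np.sum((x - xbar) ** 2)
b1 = np.sum((x - xbar) * (y - y.mean())) / Sxx
b0 = y.mean() - b1 * xbar
s = np.sqrt(np.sum((y - (b0 + b1 * x)) ** 2) / (n - 2))

x0 = 3.5                                  # new x value (assumed)
yhat0 = b0 + b1 * x0
half = stats.t.ppf(0.975, n - 2) * s * np.sqrt(1 + 1/n + (x0 - xbar) ** 2 / Sxx)
print(yhat0 - half, yhat0 + half)         # 95% prediction interval for a new y at x0
```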

CH 5 Confidence intervals

T distribution: mound shaped, symmetric about 0, fatter tails than the standard normal; larger df → closer to the standard normal.
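
A quick numeric check of this df behaviour (illustrative):

```python
# As df grows, t critical values approach the standard-normal value.
from scipy import stats

for df in (5, 30, 1000):
    print(df, stats.t.ppf(0.975, df))  # shrinks toward 1.96 as df increases
print(stats.norm.ppf(0.975))           # standard normal: about 1.9600
```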

A point estimator → a single value that estimates an unknown population parameter.

The empirical rule requires the stated percentages to hold as well, not only the bell shape.
