0% found this document useful (0 votes)

51 views

Simple Regression Model: Conference Paper

The document discusses simple linear regression analysis. It provides examples of scatter plots that show different relationships between two variables: [1] a strong direct relationship, [2] a weaker direct relationship, [3] no relationship, and [4] a strong inverse relationship. It then explains how to calculate the Pearson correlation coefficient to quantify the relationship. Finally, it describes how to calculate the slope and y-intercept of the regression line to model the relationship between the variables.

Uploaded by

Vigneshwari Mahamuni

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

51 views

Simple Regression Model: Conference Paper

Uploaded by

Vigneshwari Mahamuni

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/329611627

Simple regression model

Conference Paper · May 2014

CITATIONS READS

0 576

2 authors:

Mercedes Orús-Lacort Christophe Jouis

Independent researcher. Université de la Sorbonne Nouvelle Paris 3 & EHESS & CAMS-CNRS
228 PUBLICATIONS 6 CITATIONS 226 PUBLICATIONS 185 CITATIONS

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

The story of my preprint (future article) tittled “Fermat Last Theorem Revisited” View project

Multidisciplinary researches and articles View project

All content following this page was uploaded by Mercedes Orús-Lacort on 13 December 2018.

The user has requested enhancement of the downloaded file.

Simple linear regression

1. What is the purpose of the simple linear regression?

Occasionally, we have two quantitative variables that may be related, and what we
intend to study is: can we predict the value of one of them from the known values of the
other?.

To study it, the steps that we follow are:

 Draw a graph where appear each variable data, this graph is called "Scatter
plot".

 Calculate the correlation coefficient of Pearson.

 Calculate a formula which will allow us to predict the value of one of these
variables from the another, this formula "Regression line" is called.

 We studied if we can consider the regression line as valid. For do it, we resolve
hypothesis test, and we calculate a ratio called "Adjustment coefficient of
goodness" (or also called R-squared, or coefficient of determination).

Let's see then what are the scatterplots.

Suppose we want to provide the Benefits of a company from Spending on Advertising.

We will call Y to the variable Benefits (which I expected) and X to the variable
Advertising.
The variable Y is called dependent variable and the variable X is called independent
variable.

The values of the two variables that we are studying are represented in this diagram.
And we may find with situations like that you will see below:

First situation:

In this case you may observe that:

- The points are close together: This means that there is a strong relationship between
the two variables.
- Also you may observe they are right-oriented: This means that both variables
are related directly proportional, i.e. when it increases spending on Advertising,
also increase the Benefits.

Second situation:

In this case you may observe that:

- The points are not very close together: This means that there is not a strong relation
between the two variables, but if we calculate the regression line, this will not adjust
very well.

- Also you may observe the right-oriented: This means that both variables are related
directly proportional, i.e. when it increases spending on Advertising, also increase the
Benefits.
Third situation:

In this case you may observe that:

- The points are very dispersed: This means that there is no relation between the two
variables, and that it wouldn't make any sense calculate a regression model.

Fourth situation:

In this case you may observe that:

- The points are close together: This means that there is a strong relationship between
the two variables.

- Also you may observe they are left-oriented: This means that both variables are related
inversely proportional, i.e. when it increases spending on Advertising, then decrease the
Benefits.

2. Calculation of the correlation Pearson coefficient

If we have data from two random variables that we think that they may be related, the
mode to confirm if that relationship exists or not, is to calculate the correlation
coefficient of Pearson rxy. The value of this coefficient is always between - 1 and 1.

To calculate it, we use the following formula:

1
S n1
 (xi  x)(yi  y)
rxy  XY  
SX S Y 1 1
n1
 (xi  x) n  1  (yi  y)
2 2

1
n1
 (xi  x)(yi  y)
 
1 1
n1
 (xi  x) n  1  (yi  y)
2 2

1
n1
 (xi  x)(yi  y)  (xi  x)(yi  y) 
 
1
n1
 (x i  x)2  (yi  y)2  (x i  x)2  (yi  y)2


 xiyi  y xi  x  yi nxy
 
 xi2  2x  xi  nx  yi2  2y yi  ny
2 2


If rxy is close to 1  X and Y correlated directly proportional.

If rxy is close to - 1  X and Y correlated inversely proportional.

If rxy is close to 0  X and Y not correlated.

Important: The sign (positive or negative) of this coefficient, depends on how it

came out focused our scatter diagram: If it came out to the right-oriented, then
the sign of the coefficient is positive, while if it came out the left-oriented then
the sign of the coefficient is negative, and if the diagram was dispersed, this
coefficient will have a value close to 0. That is to say:

First situation:

In this case, rxy will have positive sign, and its value would be close to 1, e.g. rxy = 0976.
Second situation:

In this case, rxy will have positive sign, and its value would be not more close to 1, e.g.
rxy = 0,676.

Third situation:

In this case, rxy will have positive or negative sign and its value would be more close to
0 than 1, e.g. rxy = 0.215 or rxy = - 0.215.

Fourth situation:

In this case, rxy will have a negative sign, and its value would be close to - 1, for
example rxy = - 0,915.

3. Calculation of the simple linear regression model

It makes sense compute it when the correlation coefficient is close to 1 or – 1.

Using the regression line we can predict the value of one of the variables from the
other.
To the variable which we are going to predict its value (say it is Y), is called dependent
variable, and the other variable (say it is X) is called independent variable.

We intend, therefore, to find a formula of the type Y = a + b·X that will allow us to
predict the value of Y from the value of the X, so that, it fits the maximum
possible cloud dispersion plot points.

For example, and according to the 4 situations we have seen above, we could
have:

First situation:

Second situation:

Third situation:

Fourth situation:
Calculation of the values of "a" and "b"

"b" is called a slope of a line, and its formula to calculate it is:

1
SXY n  1  (xi  x)(yi  y)  (x  x)(y  y) 
i i
b 2  
SX 1
 i(x  x) 2  (x  x)
i
2

n1


 x y  y x  x y nxy
i i i i

 x  2x x  nx 
i
2
i
2

And if we know the rxy value, we can calculate it as follows:

SY
b  rxy
SX

Once calculated the "b", "a" called y-intercept, it’s calculated as follows:

a  y  bx

4.- Hypothesis tests for the slope

To know if we can give valid regression model, we must resolve the following
hypotheses test:

Ho: β = 0
Ha: β ≠ 0

Where β represents the slop of the regression line.

To resolve this test, we calculate the statistic test which is a Student's t with
n - 2 degrees of freedom, by the following formula:
b b
t 
Sb 1 n

 (y  a  bxi )2
n  2 i1 i
n

 (x
i 1
i  x)2

where :

 b is the slope of regression line.

 Sb is the standard error estándar of the slope.

Let us note, that if give us the total values of the sums, and I do not know the values of
each value of the variable X and the Y, then, we will calculate the standard error as
shown below:

1 n
 (y  a  bxi )2
n  2 i1 i
Sb  
n

 (x
i 1
i  x) 2

1  n 2 n n n n

 
n  2  i1
y i  n·a 2
 b 2
 x i
2
 2a  y i  2b  x y
i i  2ab  xi 
i 1 i 1 i 1 i 1 

 x i
2
 2x  x i  nx
2


Then we take a decision:

 Through areas of acceptance and rejection of the null hypothesis:

We seek in the table statistics critics tn-2, α/2 and - tn-2, α/2, being α level of
significance.

 Calculate P Value:
P Value = 2·P (tn-2 > |t test|)

Therefore:

P Value > α  Accept Ho

P Value < α  Reject Ho and accept alternative
 Calculating the confidence interval for the slope of the regression line:

  tn2, /2·Standard Error of the slope

So, if 0 falls within the interval, the null hypothesis is accepted.

5.- Calculation coefficient R2

Another way to see if the model "fit well or not", is by calculating the coefficient R
square, or also called coefficient of determination or coefficient of goodness of fit. To
calculate it, we use the following formula:

R2 = rxy2

This ratio takes values between 0 and 1, so that:

If R2 is close to 0  the model doesn’t fit well

If R2 is close to 1  the model fits well

View publication stats

Omer G G10 Summative Assessment - Criterion B Linear Equation
100% (1)
Omer G G10 Summative Assessment - Criterion B Linear Equation
6 pages
Regression
No ratings yet
Regression
60 pages
Module-4 (Correlation & Regression)
No ratings yet
Module-4 (Correlation & Regression)
30 pages
Regression Analysis NEW-1
No ratings yet
Regression Analysis NEW-1
60 pages
Regression Analysis MCQ
No ratings yet
Regression Analysis MCQ
15 pages
J. K.Shah Classes Regression Analysis
No ratings yet
J. K.Shah Classes Regression Analysis
15 pages
J. K.Shah Classes Regression Analysis
No ratings yet
J. K.Shah Classes Regression Analysis
21 pages
Regression Analysis MCQ
No ratings yet
Regression Analysis MCQ
15 pages
6
No ratings yet
6
108 pages
Correlation and Regression Bi-Variate Data: Let (X
No ratings yet
Correlation and Regression Bi-Variate Data: Let (X
11 pages
Correlation & Regression
No ratings yet
Correlation & Regression
24 pages
Regression: Regression. But Quite Often The Values of A Particular Phenomenon May Be Affected by Multiplicity of
No ratings yet
Regression: Regression. But Quite Often The Values of A Particular Phenomenon May Be Affected by Multiplicity of
8 pages
Regression - Definition, Formula, Derivation, Application - Embibe
No ratings yet
Regression - Definition, Formula, Derivation, Application - Embibe
10 pages
Correlation and Regression-1
No ratings yet
Correlation and Regression-1
32 pages
Correlation and Regression: by Tushar Bhatt
100% (1)
Correlation and Regression: by Tushar Bhatt
66 pages
SimpleMultipleLinearRegression_FoundationalMathofAI_S24
No ratings yet
SimpleMultipleLinearRegression_FoundationalMathofAI_S24
6 pages
Correlation Ansd Simple Regression
No ratings yet
Correlation Ansd Simple Regression
27 pages
Chapter 4 (Regression part)
No ratings yet
Chapter 4 (Regression part)
13 pages
Regression Analysis 1 2020
No ratings yet
Regression Analysis 1 2020
40 pages
Correlation: We Take Two Measurements, of Two Different Physical Properties Are They Related?
No ratings yet
Correlation: We Take Two Measurements, of Two Different Physical Properties Are They Related?
27 pages
Week9 PDF
No ratings yet
Week9 PDF
34 pages
07 - Correlation and Regression Analysis-1
No ratings yet
07 - Correlation and Regression Analysis-1
13 pages
Unit 3 Simple Correlation and Regression Analysis1
No ratings yet
Unit 3 Simple Correlation and Regression Analysis1
16 pages
Scatter Plot/Diagram Simple Linear Regression Model
No ratings yet
Scatter Plot/Diagram Simple Linear Regression Model
43 pages
Chapter 8 - MULTIPLE REGRESSION MODEL
No ratings yet
Chapter 8 - MULTIPLE REGRESSION MODEL
7 pages
Module III (Part II)(Regression and Time Series)
No ratings yet
Module III (Part II)(Regression and Time Series)
118 pages
Regression 1.2 Regression Analysis 1.2.1 Introduction To Regression Analysis
No ratings yet
Regression 1.2 Regression Analysis 1.2.1 Introduction To Regression Analysis
9 pages
Chapter 4 Regression
No ratings yet
Chapter 4 Regression
38 pages
Simple Regression
No ratings yet
Simple Regression
8 pages
Statistical model for Agriculture(Cost and Yield Pridiction)
No ratings yet
Statistical model for Agriculture(Cost and Yield Pridiction)
14 pages
Corr and Regress
No ratings yet
Corr and Regress
61 pages
Correlation
No ratings yet
Correlation
19 pages
AP X Maths-Holiday Worksheet-Level 1 20240506 075923
No ratings yet
AP X Maths-Holiday Worksheet-Level 1 20240506 075923
50 pages
Course Notes For Unit 6 of The Udacity Course ST101 Introduction To Statistics PDF
No ratings yet
Course Notes For Unit 6 of The Udacity Course ST101 Introduction To Statistics PDF
23 pages
Multiple Regression Model - Matrix Form
No ratings yet
Multiple Regression Model - Matrix Form
22 pages
Linear Regression Analysis: Module - Ii
No ratings yet
Linear Regression Analysis: Module - Ii
11 pages
17 Regression Analysis
No ratings yet
17 Regression Analysis
10 pages
Statistics
No ratings yet
Statistics
17 pages
Week 9
No ratings yet
Week 9
23 pages
LinearRegression_FoundationalMathofAI_S24
No ratings yet
LinearRegression_FoundationalMathofAI_S24
4 pages
Regression Analysis
No ratings yet
Regression Analysis
52 pages
Correlation and Regression Analyses
No ratings yet
Correlation and Regression Analyses
8 pages
WEEK 7 Modular
No ratings yet
WEEK 7 Modular
10 pages
Syilfi, Dwi Ispriyanti, Diah Safitri: Analisis Regresi Linier Piecewise Dua Segmen
No ratings yet
Syilfi, Dwi Ispriyanti, Diah Safitri: Analisis Regresi Linier Piecewise Dua Segmen
11 pages
Simple LR Lecture
No ratings yet
Simple LR Lecture
60 pages
Linear Regression
No ratings yet
Linear Regression
34 pages
Simple Linear Regression and Correlation Analysis: Chapter Five
No ratings yet
Simple Linear Regression and Correlation Analysis: Chapter Five
5 pages
CE304-Unit 5-Lect2-Jumah 2018
No ratings yet
CE304-Unit 5-Lect2-Jumah 2018
14 pages
Stat 473-573 Notes
No ratings yet
Stat 473-573 Notes
139 pages
Chapter2 Econometrics MultipleLinearRegressionModel 1 1
No ratings yet
Chapter2 Econometrics MultipleLinearRegressionModel 1 1
34 pages
Case Study
No ratings yet
Case Study
87 pages
Derivation of The Ordinary Least Squares Estimator Simple Linear Regression Case
No ratings yet
Derivation of The Ordinary Least Squares Estimator Simple Linear Regression Case
17 pages
Linear Models
No ratings yet
Linear Models
92 pages
Lyapunov Stability Theory:: Problem of Motion Stability, Includes Two Methods For Stability Analysis (The So
No ratings yet
Lyapunov Stability Theory:: Problem of Motion Stability, Includes Two Methods For Stability Analysis (The So
25 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
29 pages
Statistics and Probability: Quarter 4 - (Week 6)
No ratings yet
Statistics and Probability: Quarter 4 - (Week 6)
8 pages
Oversikt ECN402
No ratings yet
Oversikt ECN402
40 pages
Nonlinear Regression: Major: Civil Engineering
No ratings yet
Nonlinear Regression: Major: Civil Engineering
41 pages
ALGEBRA SIMPLIFIED EQUATIONS WORKBOOK WITH ANSWERS: Linear Equations, Quadratic Equations, Systems of Equations
From Everand
ALGEBRA SIMPLIFIED EQUATIONS WORKBOOK WITH ANSWERS: Linear Equations, Quadratic Equations, Systems of Equations
Luke Aneke
No ratings yet
Exercises of Advanced Statistics
From Everand
Exercises of Advanced Statistics
Simone Malacrida
No ratings yet
Simple Linear Regression and Correlation: Abrasion Loss vs. Hardness
No ratings yet
Simple Linear Regression and Correlation: Abrasion Loss vs. Hardness
23 pages
1 s2.0 S2772397621000101 Main
No ratings yet
1 s2.0 S2772397621000101 Main
12 pages
Create Graphs With Excel
No ratings yet
Create Graphs With Excel
11 pages
Experimental and Theoretical Study of Earth-Moist Concrete: G. Hüsken H.J.H. Brouwers
No ratings yet
Experimental and Theoretical Study of Earth-Moist Concrete: G. Hüsken H.J.H. Brouwers
10 pages
Hussein, Luaay
No ratings yet
Hussein, Luaay
242 pages
Agricultural Solid Waste Management: Basic Strategies: National Dairy Research Institute
No ratings yet
Agricultural Solid Waste Management: Basic Strategies: National Dairy Research Institute
14 pages
Particle Packing Theory - Fennis
No ratings yet
Particle Packing Theory - Fennis
3 pages
Corrosion Limit For Cuso4 Corrosion Limit For Agcl Corrosion Limit For Calomel
No ratings yet
Corrosion Limit For Cuso4 Corrosion Limit For Agcl Corrosion Limit For Calomel
1 page
First Steps in Understanding Engineering Students' Growth of Conceptual and Procedural Knowledge in An Interactive Learning Context
No ratings yet
First Steps in Understanding Engineering Students' Growth of Conceptual and Procedural Knowledge in An Interactive Learning Context
12 pages
Application of Transformative Learning Theory in Engineering Education
No ratings yet
Application of Transformative Learning Theory in Engineering Education
6 pages
Connections and Tension Member Design
No ratings yet
Connections and Tension Member Design
9 pages
Members in Tension - IV
No ratings yet
Members in Tension - IV
53 pages
10 On Partial Differential Equations
No ratings yet
10 On Partial Differential Equations
4 pages
Convergence of Fourier Series
No ratings yet
Convergence of Fourier Series
5 pages
FuncEq Intro
No ratings yet
FuncEq Intro
11 pages
Basic Concepts of FMEA and FMECA
No ratings yet
Basic Concepts of FMEA and FMECA
5 pages
Introduction To Numerical Analysis II: Finite Element Method
No ratings yet
Introduction To Numerical Analysis II: Finite Element Method
16 pages
g7m3l10 - Properties of Inequalities
No ratings yet
g7m3l10 - Properties of Inequalities
5 pages
SAP2000 Nonlinear Dynamic Analysis
No ratings yet
SAP2000 Nonlinear Dynamic Analysis
12 pages
Titration and PH Measurement Mullen Jennings Roy
No ratings yet
Titration and PH Measurement Mullen Jennings Roy
5 pages
Mathematical Methods For Physcists Webber/Arfken Selected Solutions Ch. 6 & 7
100% (1)
Mathematical Methods For Physcists Webber/Arfken Selected Solutions Ch. 6 & 7
6 pages
Statistics
No ratings yet
Statistics
48 pages
MCV4U Homework Guide
No ratings yet
MCV4U Homework Guide
8 pages
The Generalized Uncertainty Principle: Jun-Li Li and Cong-Feng Qiao
No ratings yet
The Generalized Uncertainty Principle: Jun-Li Li and Cong-Feng Qiao
18 pages
Advanced Educational Statistics - Edu 901C
No ratings yet
Advanced Educational Statistics - Edu 901C
12 pages
The Qualitative Descriptive Approach
No ratings yet
The Qualitative Descriptive Approach
4 pages
Teoria Ergodica: 1 N N 1 2 N 1/n
No ratings yet
Teoria Ergodica: 1 N N 1 2 N 1/n
2 pages
Experiments With A Single Factor (1) : Design of Experiment
No ratings yet
Experiments With A Single Factor (1) : Design of Experiment
28 pages
Cambridge Books Online
No ratings yet
Cambridge Books Online
20 pages
Solution Manual for Quantitative Analysis for Management, 12/E Barry Render, Ralph M. Stair, Michael E. Hanna, Trevor S. Hale 2024 scribd download full chapters
100% (9)
Solution Manual for Quantitative Analysis for Management, 12/E Barry Render, Ralph M. Stair, Michael E. Hanna, Trevor S. Hale 2024 scribd download full chapters
43 pages
Full Chapter An Introduction To Performance Analysis of Sport 2Nd Edition Adam Cullinane PDF
100% (6)
Full Chapter An Introduction To Performance Analysis of Sport 2Nd Edition Adam Cullinane PDF
53 pages
Support Vector Regression
No ratings yet
Support Vector Regression
15 pages
Anova
0% (1)
Anova
5 pages
2.2 Concavity PDF
No ratings yet
2.2 Concavity PDF
12 pages
All Calculus SL
100% (1)
All Calculus SL
123 pages
T Test For Dependent Samples
100% (2)
T Test For Dependent Samples
11 pages
Mathematic M Sem 1 Coursework (Introduction)
No ratings yet
Mathematic M Sem 1 Coursework (Introduction)
2 pages
18.445 Introduction To Stochastic Processes
No ratings yet
18.445 Introduction To Stochastic Processes
10 pages
Aod q-1
No ratings yet
Aod q-1
7 pages
Spatial Correlation New
No ratings yet
Spatial Correlation New
14 pages
Least-Upper-Bound Property: Completeness Properties
No ratings yet
Least-Upper-Bound Property: Completeness Properties
3 pages