Unit 2 Regression Analysis

Uploaded by

sbsbxbxahsbxx

1. Calculate the regression coefficients and obtain the lines of regression for the following data.

Solution:

Regression coefficient of X on Y

(i) Regression equation of X on Y

(ii) Regression coefficient of Y on X

(iii) Regression equation of Y on X

Y = 0.929X – 3.716 + 11
Y = 0.929X + 7.284
The regression equation of Y on X is Y= 0.929X + 7.284

Calculate the two regression equations of X on Y and Y on X from the data given below, taking
deviations from the actual means of X and Y.

Estimate the likely demand when the price is Rs.20.

Solution:
Calculation of Regression equation

(i) Regression equation of X on Y

(ii) Regression Equation of Y on X

When X is 20, Y will be

= –0.25 (20)+44.25

= –5+44.25

= 39.25 (when the price is Rs. 20, the likely demand is 39.25)
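The deviation method used in these problems can be sketched in Python. The original data tables did not survive, so the data below is hypothetical, and the helper name `regression_lines` is an illustrative assumption rather than anything from the source.

```python
def regression_lines(xs, ys):
    """Return (b_xy, b_yx, mean_x, mean_y) using deviations from the actual means."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    dx = [x - mean_x for x in xs]  # deviations of X from its mean
    dy = [y - mean_y for y in ys]  # deviations of Y from its mean
    sum_dxdy = sum(a * b for a, b in zip(dx, dy))
    b_xy = sum_dxdy / sum(b * b for b in dy)  # regression coefficient of X on Y
    b_yx = sum_dxdy / sum(a * a for a in dx)  # regression coefficient of Y on X
    return b_xy, b_yx, mean_x, mean_y

# Hypothetical data, chosen only to keep the arithmetic easy to follow.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
b_xy, b_yx, mean_x, mean_y = regression_lines(xs, ys)

# Y on X:  Y - mean_y = b_yx (X - mean_x)  ->  Y = b_yx * X + intercept_yx
intercept_yx = mean_y - b_yx * mean_x
# X on Y:  X - mean_x = b_xy (Y - mean_y)  ->  X = b_xy * Y + intercept_xy
intercept_xy = mean_x - b_xy * mean_y

print(f"Y on X: Y = {b_yx:.3f}X {intercept_yx:+.3f}")
print(f"X on Y: X = {b_xy:.3f}Y {intercept_xy:+.3f}")
```

Once the Y on X line is known, an estimate such as the demand at a given price is obtained by substituting that X value into it.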

Obtain regression equation of Y on X and estimate Y when X=55 from the following

Solution:

(i) Regression coefficients of Y on X

(ii) Regression equation of Y on X

Y–51.57 = 0.942(X–48.29 )

Y = 0.942X – 45.49 + 51.57

Y = 0.942X+6.08

The regression equation of Y on X is Y = 0.942X + 6.08

Estimation of Y when X = 55

Y= 0.942(55)+6.08=57.89
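The arithmetic of this example can be checked with a few lines of Python. The means and the coefficient are the values quoted in the solution; the function name `predict_y` is only an illustrative choice.

```python
# Values taken from the worked solution.
mean_x, mean_y = 48.29, 51.57
b_yx = 0.942  # regression coefficient of Y on X

# Point-slope form  Y - mean_y = b_yx (X - mean_x)
# rearranged to     Y = b_yx * X + intercept
intercept = mean_y - b_yx * mean_x

def predict_y(x):
    """Estimate Y from the regression equation of Y on X."""
    return b_yx * x + intercept

print(round(intercept, 2))      # 6.08
print(round(predict_y(55), 2))  # 57.89
```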

Example 9.12

Find the means of X and Y variables and the coefficient of correlation between them from the
following two regression equations:

2Y–X–50 = 0

3Y–2X–10 = 0.

Solution:

We are given

2Y–X–50 = 0 ... (1)

3Y–2X–10 = 0 ... (2)

Solving equation (1) and (2)

We get Y = 90
Putting the value of Y in equation (1)

We get X = 130

Calculating correlation coefficient

Let us assume equation (1) be the regression equation of Y on X

2Y = X+50
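As a sketch of how this example can be carried through numerically: both regression lines pass through the point of means, and r² equals the product of the two regression coefficients. The code below takes equation (1) as the regression of Y on X, as assumed above, and therefore treats equation (2) as the regression of X on Y; the variable names are illustrative.

```python
from math import sqrt

# Equations from the example, written as a*X + b*Y = c:
#   2Y - X - 50 = 0   ->  -1*X + 2*Y = 50
#   3Y - 2X - 10 = 0  ->  -2*X + 3*Y = 10
a1, b1, c1 = -1, 2, 50
a2, b2, c2 = -2, 3, 10

# Cramer's rule: the intersection of the two lines is (mean of X, mean of Y).
det = a1 * b2 - a2 * b1
mean_x = (c1 * b2 - c2 * b1) / det
mean_y = (a1 * c2 - a2 * c1) / det

# Equation (1) as Y on X:  Y = (1/2) X + 25  ->  b_yx = 1/2
# Equation (2) as X on Y:  X = (3/2) Y - 5   ->  b_xy = 3/2
b_yx = -a1 / b1
b_xy = -b2 / a2

# r^2 = b_yx * b_xy, and r carries the common sign of the two coefficients.
product = b_yx * b_xy
assert 0 <= product <= 1, "the assumed roles of the two equations are inconsistent"
r = sqrt(product) if b_yx > 0 else -sqrt(product)

print(mean_x, mean_y, round(r, 3))  # 130.0 90.0 0.866
```

If the roles of the two equations were swapped, the product of the coefficients would be 2 × 2/3 = 4/3, which exceeds 1, so the assumption made in the text is the only consistent one.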

Linear Regression

The linear regression algorithm models a linear relationship between a dependent variable (y) and one or
more independent variables (x), hence the name linear regression. Because the relationship is linear,
the model describes how the value of the dependent variable changes with the value of the independent
variable.

The linear regression model provides a sloped straight line representing the relationship between the
variables.

Mathematically, we can represent a linear regression as:

y = a0 + a1x + ε

Here,

y = dependent variable (target variable)

x = independent variable (predictor variable)
a0 = intercept of the line (gives an additional degree of freedom)
a1 = linear regression coefficient (scale factor applied to each input value)
ε = random error

The observed x and y values form the training dataset from which the linear regression model is estimated.
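A minimal sketch of this model in Python, assuming made-up values for a0 and a1 and normally distributed noise for ε (none of these numbers come from the source). It simulates data from the equation and then recovers the two parameters with the usual least-squares formulas.

```python
import random

random.seed(0)  # fixed seed so the sketch is reproducible

a0_true, a1_true = 5.0, 2.0        # hypothetical intercept and slope
xs = [i * 0.2 for i in range(50)]  # 50 evenly spaced x values
# y = a0 + a1*x + epsilon, with epsilon drawn from N(0, 0.1^2)
ys = [a0_true + a1_true * x + random.gauss(0, 0.1) for x in xs]

# Ordinary least squares for a single predictor.
n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n
s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
s_xx = sum((x - mean_x) ** 2 for x in xs)
a1_hat = s_xy / s_xx
a0_hat = mean_y - a1_hat * mean_x

# Residuals are the fitted model's estimates of the random error term.
residuals = [y - (a0_hat + a1_hat * x) for x, y in zip(xs, ys)]
print(f"a0 is about {a0_hat:.2f}, a1 is about {a1_hat:.2f}")
```

The estimates land close to, but not exactly on, the values used to generate the data, which is the role the error term ε plays in the equation.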

Types of Linear Regression

Linear regression can be further divided into two types:

o Simple Linear Regression:

If a single independent variable is used to predict the value of a numerical dependent
variable, then such a Linear Regression algorithm is called Simple Linear Regression.

o Multiple Linear regression:

If more than one independent variable is used to predict the value of a numerical dependent
variable, then such a Linear Regression algorithm is called Multiple Linear Regression.

Linear Regression Line

A straight line showing the relationship between the dependent and independent variables is called
a regression line. A regression line can show two types of relationship:

o Positive Linear Relationship:

If the dependent variable increases on the Y-axis as the independent variable increases on the
X-axis, then such a relationship is termed a positive linear relationship.

o Negative Linear Relationship:

If the dependent variable decreases on the Y-axis and independent variable increases on the
X-axis, then such a relationship is called a negative linear relationship.

Implementation of Simple Linear Regression Algorithm using Python

Step-1: Data Pre-processing

import numpy as nm
import matplotlib.pyplot as mtp
import pandas as pd

o Next, we will load the dataset into our code:

data_set = pd.read_csv('Salary_Data.csv')

o After that, we need to extract the dependent and independent variables from the given
dataset. The independent variable is years of experience, and the dependent variable is
salary. Below is the code for it:

x = data_set.iloc[:, :-1].values
y = data_set.iloc[:, 1].values
After this step, the X (independent) variable and the Y (dependent) variable have been extracted
from the given dataset.

o Next, we will split both variables into a test set and a training set. We have 30 observations,
so we will take 20 observations for the training set and 10 observations for the test set. We
split the dataset so that we can train the model on the training set and then test it on the
test set. The code for this is given below:

# Splitting the dataset into training and test set.
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=1/3, random_state=0)

Executing the above code produces the x_test, x_train, y_test, and y_train datasets.
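To make the 20/10 split concrete, here is a simplified pure-Python sketch of a shuffled hold-out split. It is not scikit-learn's implementation and will not select the same rows as `train_test_split`; the helper name `split_train_test` and the stand-in data are assumptions for illustration.

```python
import random

def split_train_test(x, y, test_size=1/3, seed=0):
    """Shuffle the row indices and hold out a fraction of them for testing."""
    indices = list(range(len(x)))
    random.Random(seed).shuffle(indices)  # seeded, so the split is repeatable
    n_test = round(len(x) * test_size)
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    x_train = [x[i] for i in train_idx]
    x_test = [x[i] for i in test_idx]
    y_train = [y[i] for i in train_idx]
    y_test = [y[i] for i in test_idx]
    return x_train, x_test, y_train, y_test

# Hypothetical stand-in for the 30-row salary dataset.
x = [[1.0 + 0.3 * i] for i in range(30)]  # years of experience (one column)
y = [30000 + 9000 * row[0] for row in x]  # salary

x_train, x_test, y_train, y_test = split_train_test(x, y)
print(len(x_train), len(x_test))  # 20 10
```

Fixing the seed plays the same role as `random_state=0` in the tutorial code: rerunning the script reproduces the same split.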
Step-2: Fitting the Simple Linear Regression to the Training Set:

The second step is to fit our model to the training dataset. To do so, we will import the
LinearRegression class from the linear_model module of scikit-learn. After importing the class,
we create an object of the class named regressor. The code for this is given below:

#Fitting the Simple Linear Regression model to the training dataset
from sklearn.linear_model import LinearRegression
regressor = LinearRegression()
regressor.fit(x_train, y_train)

Output (the parameters shown depend on the installed scikit-learn version):

Out[7]: LinearRegression(copy_X=True, fit_intercept=True, n_jobs=None, normalize=False)

Step: 3. Prediction of test set result:

The model has now been fitted to the relationship between the dependent variable (salary) and the
independent variable (experience), so it is ready to predict the output for new observations. In this
step, we provide the test dataset (new observations) to the model to check whether it predicts the
correct output.

We will create two prediction vectors: y_pred, which contains the predictions for the test dataset,
and x_pred, which contains the predictions for the training set. Despite its name, x_pred holds
predicted salaries, not x values.

#Prediction of Test and Training set result
y_pred = regressor.predict(x_test)
x_pred = regressor.predict(x_train)

You can inspect these variables in the variable explorer of the IDE and compare the values in
y_pred with those in y_test. Comparing these values shows how well the model is performing.
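Comparing y_pred with y_test by eye can be backed up with summary metrics. The sketch below writes out two common ones by hand, the mean absolute error and the coefficient of determination R²; scikit-learn's `sklearn.metrics` module offers functions with the same names. The salary values are hypothetical and are not taken from the tutorial's dataset.

```python
def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def r2_score(y_true, y_pred):
    """Share of the variance in y_true explained by the predictions."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)          # total sum of squares
    return 1 - ss_res / ss_tot

# Hypothetical salaries, for illustration only.
y_test = [40000, 55000, 60000, 80000]
y_pred = [42000, 53000, 63000, 78000]

print(mean_absolute_error(y_test, y_pred))  # 2250.0
print(round(r2_score(y_test, y_pred), 3))   # 0.974
```

An R² close to 1 and a mean absolute error that is small relative to the salaries both indicate that the predictions track the test values closely.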

Step: 4. visualizing the Training set results:

mtp.scatter(x_train, y_train, color="green")  # observed training points
mtp.plot(x_train, x_pred, color="red")        # fitted regression line
mtp.title("Salary vs Experience (Training Dataset)")
mtp.xlabel("Years of Experience")
mtp.ylabel("Salary(In Rupees)")
mtp.show()

Output:

Executing the above code plots the training observations as green points and the regression line in red.

Step: 5. visualizing the Test set results:

In the previous step, we have visualized the performance of our model on the training set. Now, we
will do the same for the Test set. The complete code will remain the same as the above code, except
in this, we will use x_test, and y_test instead of x_train and y_train.

Here we are also changing the color of observations and regression line to differentiate between the
two plots, but it is optional.

#visualizing the Test set results
mtp.scatter(x_test, y_test, color="blue")  # observed test points
mtp.plot(x_train, x_pred, color="red")     # same regression line as in the training plot
mtp.title("Salary vs Experience (Test Dataset)")
mtp.xlabel("Years of Experience")
mtp.ylabel("Salary(In Rupees)")
mtp.show()

Output:

Executing the above code plots the test observations as blue points against the same red regression line.

Multiple Linear Regression

Example: Multiple Linear Regression by Hand

Suppose we have the following dataset with one response variable y and two predictor variables
X1 and X2:

Use the following steps to fit a multiple linear regression model to this dataset.

Step 1: Calculate X1², X2², X1y, X2y and X1X2.

Step 2: Calculate Regression Sums.

Next, make the following regression sum calculations:

o Σx1² = ΣX1² – (ΣX1)² / n = 38,767 – (555)² / 8 = 263.875

o Σx2² = ΣX2² – (ΣX2)² / n = 2,823 – (145)² / 8 = 194.875

o Σx1y = ΣX1y – (ΣX1 Σy) / n = 101,895 – (555 × 1,452) / 8 = 1,162.5

o Σx2y = ΣX2y – (ΣX2 Σy) / n = 25,364 – (145 × 1,452) / 8 = -953.5

o Σx1x2 = ΣX1X2 – (ΣX1 ΣX2) / n = 9,859 – (555 × 145) / 8 = -200.375
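The regression sums can be reproduced in Python from the column totals quoted in the calculations above. The variable names are illustrative; the raw data table itself is not needed for this step.

```python
n = 8  # number of observations

# Column totals quoted in the calculations above.
sum_x1, sum_x2, sum_y = 555, 145, 1452
sum_x1_sq, sum_x2_sq = 38767, 2823
sum_x1y, sum_x2y, sum_x1x2 = 101895, 25364, 9859

# Each regression sum is a raw total corrected for the means.
s_x1x1 = sum_x1_sq - sum_x1 ** 2 / n
s_x2x2 = sum_x2_sq - sum_x2 ** 2 / n
s_x1y = sum_x1y - sum_x1 * sum_y / n
s_x2y = sum_x2y - sum_x2 * sum_y / n
s_x1x2 = sum_x1x2 - sum_x1 * sum_x2 / n

print(s_x1x1, s_x2x2, s_x1y, s_x2y, s_x1x2)
# 263.875 194.875 1162.5 -953.5 -200.375
```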

Step 3: Calculate b0, b1, and b2.

The formula to calculate b1 is: [(Σx2²)(Σx1y) – (Σx1x2)(Σx2y)] / [(Σx1²)(Σx2²) – (Σx1x2)²]

Thus, b1 = [(194.875)(1162.5) – (-200.375)(-953.5)] / [(263.875)(194.875) – (-200.375)²] = 3.148

The formula to calculate b2 is: [(Σx1²)(Σx2y) – (Σx1x2)(Σx1y)] / [(Σx1²)(Σx2²) – (Σx1x2)²]

Thus, b2 = [(263.875)(-953.5) – (-200.375)(1162.5)] / [(263.875)(194.875) – (-200.375)²] = -1.656

The formula to calculate b0 is: ȳ – b1X̄1 – b2X̄2, where ȳ, X̄1 and X̄2 are the means of y, X1 and X2.

Thus, b0 = 181.5 – 3.148(69.375) – (-1.656)(18.125) = -6.867 (using the unrounded values of b1 and b2;
the rounded values shown give about -6.878)
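A short Python check of Step 3, starting from the regression sums and means quoted above. The variable names are illustrative.

```python
# Regression sums from Step 2.
s11, s22 = 263.875, 194.875                   # sum of x1^2, sum of x2^2 (mean-corrected)
s1y, s2y, s12 = 1162.5, -953.5, -200.375      # sum of x1*y, x2*y, x1*x2 (mean-corrected)

# Means of y, X1 and X2: 1452/8, 555/8 and 145/8.
mean_y, mean_x1, mean_x2 = 181.5, 69.375, 18.125

denom = s11 * s22 - s12 ** 2  # shared denominator of b1 and b2
b1 = (s22 * s1y - s12 * s2y) / denom
b2 = (s11 * s2y - s12 * s1y) / denom
b0 = mean_y - b1 * mean_x1 - b2 * mean_x2  # uses unrounded b1 and b2

print(round(b1, 3), round(b2, 3), round(b0, 3))  # 3.148 -1.656 -6.867
```

Keeping b1 and b2 unrounded until the end is what yields b0 = -6.867; substituting the three-decimal values instead shifts the intercept by about 0.01.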

Step 4: Place b0, b1, and b2 in the estimated linear regression equation.

The estimated linear regression equation is: ŷ = b0 + b1x1 + b2x2

In our example, it is ŷ = -6.867 + 3.148x1 – 1.656x2

How to Interpret a Multiple Linear Regression Equation

Here is how to interpret this estimated linear regression equation: ŷ = -6.867 + 3.148x1 – 1.656x2

b0 = -6.867. When both predictor variables are equal to zero, the mean value for y is -6.867.

b1 = 3.148. A one unit increase in x1 is associated with a 3.148 unit increase in y, on average, assuming
x2 is held constant.

b2 = -1.656. A one unit increase in x2 is associated with a 1.656 unit decrease in y, on average,
assuming x1 is held constant.
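These interpretations can be confirmed by evaluating the fitted equation directly. In the sketch below, the function name `predict` and the predictor values 70 and 18 are hypothetical choices for illustration; the coefficients are the ones estimated above.

```python
def predict(x1, x2, b0=-6.867, b1=3.148, b2=-1.656):
    """Evaluate the estimated equation y_hat = b0 + b1*x1 + b2*x2."""
    return b0 + b1 * x1 + b2 * x2

base = predict(70, 18)  # hypothetical predictor values

# Raising x1 by one unit with x2 fixed changes y_hat by b1.
print(round(predict(71, 18) - base, 3))  # 3.148
# Raising x2 by one unit with x1 fixed changes y_hat by b2.
print(round(predict(70, 19) - base, 3))  # -1.656
# With both predictors at zero, y_hat equals the intercept b0.
print(round(predict(0, 0), 3))           # -6.867
```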
