Simple Linear Regression

The document demonstrates using linear regression to predict home prices and Canadian per capita income based on area and year respectively. It shows loading and exploring data, plotting scatter plots, fitting linear regression models, making predictions, and adding prediction columns.

Uploaded by

jaymehta1444

3/8/24, 4:48 PM MlYt1.ipynb - Colaboratory

Question in the video

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn import linear_model

df = pd.read_csv("homeprices.csv")
df

   area   price
0  2600  550000
1  3000  565000
2  3200  610000
3  3600  680000
4  4000  725000

Plot a scatterplot for the data available.

%matplotlib inline
plt.xlabel("area (sqr ft)") # adds labels on the x-axis
plt.ylabel("price (US $)")
plt.scatter(df.area, df.price, color="red", marker="+")

<matplotlib.collections.PathCollection at 0x7bbdf44ec1c0>

We fit the available data into a Linear Regression model.

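The features are passed as a 2D structure, selected with double brackets ( df[['area']] ), even when there is only one feature.

The target variable is passed as a single 1D column, df.<column> (here, df.price).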
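
As a small illustration of that shape difference, here is a sketch that assumes the five rows of homeprices.csv shown above are typed in by hand. The names X and y are introduced only for this example and do not appear in the notebook.

```python
import pandas as pd

# The five rows of homeprices.csv, entered by hand for illustration.
df = pd.DataFrame({
    "area": [2600, 3000, 3200, 3600, 4000],
    "price": [550000, 565000, 610000, 680000, 725000],
})

X = df[["area"]]  # double brackets -> DataFrame, 2D: (n_samples, n_features)
y = df["price"]   # single brackets (same as df.price) -> Series, 1D: (n_samples,)

print(type(X).__name__, X.shape)  # DataFrame (5, 1)
print(type(y).__name__, y.shape)  # Series (5,)
```

Passing df['area'] (a 1D Series) as the features would make fit() raise an error, because scikit-learn expects one row per sample and one column per feature.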

reg = linear_model.LinearRegression()
reg.fit(df[['area']], df.price)
print(reg.coef_) # gives the coefficients (slope) of the linear equation
print(reg.intercept_) # gives the y-intercept of the equation

[135.78767123]
180616.43835616432
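
The two numbers above can be cross-checked with the textbook least-squares formulas for a single feature. The sketch below assumes the five data rows shown earlier and uses NumPy directly; it does not call scikit-learn and is not necessarily how LinearRegression computes the fit internally.

```python
import numpy as np

# The five rows of homeprices.csv, entered by hand for illustration.
area = np.array([2600, 3000, 3200, 3600, 4000], dtype=float)
price = np.array([550000, 565000, 610000, 680000, 725000], dtype=float)

# Deviations from the means.
dx = area - area.mean()
dy = price - price.mean()

# slope = sum(dx * dy) / sum(dx^2); intercept = mean(y) - slope * mean(x)
slope = (dx * dy).sum() / (dx ** 2).sum()
intercept = price.mean() - slope * area.mean()

print(slope)      # approximately 135.7877
print(intercept)  # approximately 180616.44
```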

%matplotlib inline
plt.xlabel("area (sqr ft)")
plt.ylabel("price (US $)")
plt.scatter(df.area, df.price, color="red", marker="+")
plt.plot(df.area, reg.predict(df[['area']]), color='blue') # plots the Linear Regression Line.

https://colab.research.google.com/drive/19T8cNCsKWIzrDmgmVNTbrb_LOBxMtsp8#scrollTo=XJelj1L0xusa&printMode=true 1/4

[<matplotlib.lines.Line2D at 0x7bbdf4506aa0>]

print(reg.predict([[3300]])) # predicts the value of the given input.

print(reg.predict([[5000]]))

[628715.75342466]
[859554.79452055]
/usr/local/lib/python3.10/dist-packages/sklearn/base.py:439: UserWarning: X does not have valid feature names, but LinearRegression
warnings.warn(
/usr/local/lib/python3.10/dist-packages/sklearn/base.py:439: UserWarning: X does not have valid feature names, but LinearRegression
warnings.warn(
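
The UserWarning appears because the model was fitted on a DataFrame, which carries the column name area, but is then asked to predict on a plain nested list with no column names. One way to avoid it, sketched here with the five data rows typed in by hand and a hypothetical variable new_homes, is to pass a DataFrame with the same column name. The last lines also check the first prediction against the line equation price = coef * area + intercept.

```python
import pandas as pd
from sklearn import linear_model

# The five rows of homeprices.csv, entered by hand for illustration.
df = pd.DataFrame({
    "area": [2600, 3000, 3200, 3600, 4000],
    "price": [550000, 565000, 610000, 680000, 725000],
})

reg = linear_model.LinearRegression()
reg.fit(df[["area"]], df["price"])

# Hypothetical input frame: same column name as the training features.
new_homes = pd.DataFrame({"area": [3300, 5000]})
pred = reg.predict(new_homes)
print(pred)  # approximately [628715.75 859554.79]

# The same value by hand: slope * area + intercept.
manual = reg.coef_[0] * 3300 + reg.intercept_
print(manual)  # approximately 628715.75
```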

 

d = pd.read_csv("areas.csv")
d.head(3)

   area
0  1000
1  1500
2  2300

p = reg.predict(d) # predicts a price for every row of the input DataFrame
p

array([ 316404.10958904,  384297.94520548,  492928.08219178,
        661304.79452055,  740061.64383562,  799808.21917808,
        926090.75342466,  650441.78082192,  825607.87671233,
        492928.08219178, 1402705.47945205, 1348390.4109589 ,
       1144708.90410959])

d['prices'] = p # adds a column to the DataFrame for the predicted values
d

    area        prices
0   1000  3.164041e+05
1   1500  3.842979e+05
2   2300  4.929281e+05
3   3540  6.613048e+05
4   4120  7.400616e+05
5   4560  7.998082e+05
6   5490  9.260908e+05
7   3460  6.504418e+05
8   4750  8.256079e+05
9   2300  4.929281e+05
10  9000  1.402705e+06
11  8600  1.348390e+06
12  7100  1.144709e+06

d.to_csv("prediction.csv", index = False) # exporting the csv file with no index column
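
To see what index = False changes, the sketch below builds a two-row frame with rounded values from the table above and compares the header line written with and without the index. Calling to_csv() without a path returns the CSV text as a string, so nothing is written to disk here.

```python
import pandas as pd

# Two rows with rounded values from the prediction table, for illustration only.
d = pd.DataFrame({"area": [1000, 1500], "prices": [316404.11, 384297.95]})

with_index = d.to_csv()                # default: index=True
without_index = d.to_csv(index=False)  # what the notebook uses

print(with_index.splitlines()[0])     # ,area,prices  (leading unnamed index column)
print(without_index.splitlines()[0])  # area,prices
```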

Exercise

df1=pd.read_csv("/content/canada_per_capita_income.csv")
df1.head()

   year          pci
0  1970  3399.299037
1  1971  3768.297935
2  1972  4251.175484
3  1973  4804.463248
4  1974  5576.514583

plt.xlabel("Year")
plt.ylabel("Per Capita Income")
plt.scatter(df1.year, df1.pci, color='red', marker='+')

<matplotlib.collections.PathCollection at 0x7bbdf4705960>

reg1 = linear_model.LinearRegression()
reg1.fit(df1[['year']], df1.pci)
print(reg1.coef_)
print(reg1.intercept_)

[828.46507522]
-1632210.7578554575

plt.xlabel("Year")
plt.ylabel("Per Capita Income")
plt.scatter(df1.year, df1.pci, color='red', marker='+')
plt.plot(df1.year, reg1.predict(df1[['year']]), color='blue')

[<matplotlib.lines.Line2D at 0x7bbdf4798ca0>]

reg1.predict([[2020]])

/usr/local/lib/python3.10/dist-packages/sklearn/base.py:439: UserWarning: X does not have valid feature names, but LinearRegression
warnings.warn(
array([41288.69409442])
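
The 2020 figure can be reproduced by hand from the printed coefficient and intercept, since the fitted model is the line pci = coef * year + intercept. The sketch below copies the two printed values; because the coefficient is printed rounded to eight decimals, the result agrees with predict() only to within a small rounding error.

```python
# Values copied from the printed output of reg1.coef_ and reg1.intercept_.
coef = 828.46507522
intercept = -1632210.7578554575

year = 2020
pci_2020 = coef * year + intercept  # evaluate the fitted line at year = 2020

print(round(pci_2020, 2))  # 41288.69
```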

 
