
Assignment AI/ML

NAME: SHELLY SHARMA


CLASS: BTECH IT B
BATCH: B1
ROLL NO.: 2016820
ID: BTBTI20249

Submitted to: Dr Urvashi Prakash Shukla


The Boston Housing Dataset
Question: A linear regression model needs to be designed to predict the MEDV value for each record in the
Boston Housing dataset.

The Boston Housing Dataset is derived from information collected by the U.S. Census Service concerning
housing in the area of Boston, MA. The following describes the dataset columns:

● CRIM - per capita crime rate by town


● ZN - proportion of residential land zoned for lots over 25,000 sq.ft.
● INDUS - proportion of non-retail business acres per town.
● CHAS - Charles River dummy variable (1 if tract bounds river; 0 otherwise)
● NOX - nitric oxides concentration (parts per 10 million)
● RM - average number of rooms per dwelling
● AGE - proportion of owner-occupied units built prior to 1940
● DIS - weighted distances to five Boston employment centres
● RAD - index of accessibility to radial highways
● TAX - full-value property-tax rate per $10,000
● PTRATIO - pupil-teacher ratio by town
● LSTAT - % lower status of the population
● MEDV - Median value of owner-occupied homes in $1000's
ANSWER

LOGIC EXPLANATION:

Step-1: We have imported libraries such as pandas and NumPy to use their functionality in this
assignment. We have also imported LinearRegression, which uses the relationship between the data
points to draw a straight line through them; this line can then be used to predict future values.
Matplotlib is a cross-platform data visualization and graphical plotting library for Python and its
numerical extension NumPy. As such, it offers a viable open-source alternative to MATLAB.
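
A minimal sketch of the imports this step refers to (the exact import list of the original notebook is assumed):

# Libraries for data handling, numerics, plotting and linear regression
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression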

Step-2: We have defined the column names as per the dataset provided.

Then we have loaded the dataset into the variable df.
We have also taken a variable x to store the column MEDV.
Step-3: The isnull() method returns a DataFrame object where every value is replaced with a Boolean
value: True for NULL values and False otherwise.
Similarly, the isna() method returns a DataFrame object where every value is replaced with a Boolean
value: True for NA (not-a-number) values and False otherwise.
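
A sketch of Steps 2 and 3; the file name housing.csv and the read parameters are assumptions, as the report does not show them:

# Column names as per the dataset description
columns = ['CRIM', 'ZN', 'INDUS', 'CHAS', 'NOX', 'RM', 'AGE',
           'DIS', 'RAD', 'TAX', 'PTRATIO', 'LSTAT', 'MEDV']

# Load the dataset into the variable df (file name assumed)
df = pd.read_csv('housing.csv', header=None, names=columns)

# Store the MEDV column in x, as described in Step-2
x = df['MEDV']

# Check for missing values; isnull() and isna() return Boolean DataFrames
print(df.isnull().sum())
print(df.isna().sum())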

OUTPUT:
LOGIC EXPLANATION:

Step-4: We have normalized our dataset.


normalize is a function in the sklearn.preprocessing package. Normalization rescales each sample of
the input data set so that it has unit norm. At last, we have printed our normalized data.
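
The exact scaling call is not shown in the report; the following sketch assumes sklearn.preprocessing.normalize was used (which scales each row to unit norm):

from sklearn import preprocessing

# normalize() returns an array in which every sample (row) has unit norm
normalized = preprocessing.normalize(df)
normalized_df = pd.DataFrame(normalized, columns=columns)
print(normalized_df)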

OUTPUT:
LOGIC EXPLANATION:
Step-5: We have calculated the correlation of the column ['MEDV'] with all the other columns.
The corr() method calculates the relationship between each pair of columns in the data set. The result
of the corr() method is a table of numbers that represent how strong the relationship between two
columns is; each number varies from -1 to 1.
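
A sketch of this step: selecting the MEDV column of the correlation matrix gives the correlation of every feature with MEDV.

# Correlation of each column with MEDV, sorted for easier comparison
corr_with_medv = df.corr()['MEDV'].sort_values(ascending=False)
print(corr_with_medv)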

OUTPUT:
After comparing the correlation of each column with MEDV, we have concluded that LSTAT has the
strongest correlation with MEDV. That is why we will work further with the columns LSTAT and MEDV to
build the prediction model for our dataset.
LOGIC EXPLANATION:

Step-6: We have defined a variable y which stores the column values of LSTAT.
Further, we plotted a graph between MEDV and LSTAT, where the X-axis denotes the MEDV values and the
Y-axis denotes the LSTAT values.
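
A sketch of the plot in Step-6, following the report's axis assignment (MEDV on the X-axis, LSTAT on the Y-axis):

# Store the LSTAT column in y, as described in Step-6
y = df['LSTAT']

# Scatter plot of MEDV against LSTAT
plt.scatter(x, y)
plt.xlabel('MEDV')
plt.ylabel('LSTAT')
plt.show()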

OUTPUT:
LOGIC EXPLANATION :

Step-7: We have taken x as the MEDV values and y as the LSTAT values, and using these two variables we
have calculated the mean of each (x_mean and y_mean).

Now we have calculated the slope as b1 and the intercept as b0 and then printed the result. We have
stored the values of the resulting linear equation in the variable y_pred. After this we have used the
plt.scatter() function to plot the data points and overlaid the line given by y_pred.
From the graph we can infer that the scattered points lie close to the plotted line, indicating that the
fit has high accuracy.
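
A sketch of the closed-form least-squares fit in this step, keeping the report's variable roles (x holds MEDV, y holds LSTAT):

# Means of the two variables
x_mean = x.mean()
y_mean = y.mean()

# Slope b1 and intercept b0 of the least-squares line
b1 = ((x - x_mean) * (y - y_mean)).sum() / ((x - x_mean) ** 2).sum()
b0 = y_mean - b1 * x_mean
print('slope b1 =', b1, 'intercept b0 =', b0)

# Predicted values from the fitted line
y_pred = b0 + b1 * x

# Scatter plot of the data with the regression line overlaid
plt.scatter(x, y)
plt.plot(x, y_pred, color='red')
plt.xlabel('MEDV')
plt.ylabel('LSTAT')
plt.show()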
OUTPUT:
LOGIC EXPLANATION :

Step-8: We have imported mean_absolute_error from sklearn.metrics to calculate the Mean Absolute
Error. The Mean Absolute Error is the average absolute difference between the predicted values and the
actual values.

Step-9: We have imported mean_squared_error from sklearn.metrics to calculate the Mean Squared
Error. The Mean Squared Error of an estimator measures the average of the squared errors, i.e. the
average squared difference between the estimated values and the true values.

Step-10: We calculated the Root Mean Squared Error.

RMSE is the square root of the value obtained from the Mean Squared Error function. It expresses the
typical difference between the estimated and actual values of the target, in the same units as the target.

Step-11: We have imported r2_score from sklearn.metrics to calculate the R-squared value for our data. It
is used to evaluate the performance of a linear regression model.

Step-12: We have calculated the Adjusted R-squared of our data.


The Adjusted R-squared takes into account the number of independent variables used for predicting the
target variable.
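
A sketch of the metric calculations in Steps 8-12, computed on the predictions y_pred from Step-7; n is the number of observations and p the number of predictors (1 here):

from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

mae = mean_absolute_error(y, y_pred)    # Step-8: Mean Absolute Error
mse = mean_squared_error(y, y_pred)     # Step-9: Mean Squared Error
rmse = np.sqrt(mse)                     # Step-10: Root Mean Squared Error
r2 = r2_score(y, y_pred)                # Step-11: R-squared

# Step-12: Adjusted R-squared with n observations and p = 1 predictor
n, p = len(y), 1
adj_r2 = 1 - (1 - r2) * (n - 1) / (n - p - 1)

print('MAE =', mae, 'MSE =', mse, 'RMSE =', rmse)
print('R-squared =', r2, 'Adjusted R-squared =', adj_r2)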

OUTPUT:
LOGIC EXPLANATION:

Step-13: We have used the reshape() function, which allows us to reshape an array in Python. Reshaping
basically means changing the shape of an array, and the shape of an array is determined by the number of
elements in each dimension. Reshaping allows us to add or remove dimensions of an array.

LinearRegression fits a linear model with coefficients chosen to minimize the residual sum of squares
between the observed targets in the dataset and the targets predicted by the linear approximation.
Regression analysis is a form of predictive modelling technique which investigates the relationship
between a dependent (target) variable and an independent (predictor) variable. This technique is used for
forecasting, time series modelling and finding the cause-and-effect relationship between variables.
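
A sketch of Step-13; the reshape gives scikit-learn the 2-D input it expects, and the prediction variable name m_pred is taken from the conclusion (its exact definition in the report is assumed):

# scikit-learn expects a 2-D array of shape (n_samples, n_features)
X = x.values.reshape(-1, 1)

# Fit the linear model and predict on the same data
model = LinearRegression()
model.fit(X, y)
m_pred = model.predict(X)

print('coefficient =', model.coef_, 'intercept =', model.intercept_)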

OUTPUT:
CONCLUSION:

After processing the whole dataset, we came to the conclusion that LSTAT has the strongest correlation
with MEDV. The plotted graph conveys the same conclusion about the accuracy of the fit.
Also, at last we have shown that the m_pred and m_predicted values came out to be the same, and hence we
have designed a single linear regression model to predict the MEDV value.
