Assignment B 1 LinearRegression

Uploaded by

Mahesh Kadam


B.E. (COMP) Sinhgad Institute of Technology, Lonavala LP_III

Name of the Student: ________________________________ Roll No: _

CLASS: - B. E. [COMP] Division: A, B, C Course: LP-III
Machine Learning
Assignment No. 01
UBER RIDE FARE PREDICTION
Marks: /10

Date of Performance: / /2023

2024 Sign with Date:

Title : Uber ride fare prediction using regression algorithms

Objectives:
• To analyse the Uber ride dataset to predict the fare of a ride.
• To compare the performance of different regressors.

Outcomes:
• Predict the fare of an Uber ride.

PEOs, POs, PSOs and COs satisfied

PEOs: I, III POs: 1, 2, 3, 4, 5 PSOs: 1, 2 COs: 1

Problem Statement:
Predict the price of the Uber ride from a given pickup point to the agreed drop-off location.
Perform the following tasks:
1. Pre-process the dataset.
2. Identify outliers.
3. Check the correlation.
4. Implement linear regression and random forest regression models.
5. Evaluate the models and compare their respective scores, such as R2 and RMSE.

Dataset link: https://www.kaggle.com/datasets/yasserh/uber-fares-dataset
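Tasks 1 to 3 above could be approached roughly as follows. This is only a sketch on synthetic data: the column names distance_km and fare_amount are hypothetical stand-ins, and the interquartile-range (IQR) rule is one common outlier test, not necessarily the one expected for this assignment.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for the ride data; the column names are hypothetical.
rng = np.random.default_rng(0)
distance_km = rng.uniform(0.5, 20.0, size=200)
fare = 2.5 + 1.8 * distance_km + rng.normal(0.0, 1.0, size=200)
df = pd.DataFrame({"distance_km": distance_km, "fare_amount": fare})

# Inject a missing value and an implausible fare to mimic dirty data.
df.loc[0, "fare_amount"] = np.nan
df.loc[1, "fare_amount"] = 500.0

# 1. Pre-process: drop rows with missing values.
clean = df.dropna()

# 2. Identify outliers with the IQR rule (values beyond 1.5 * IQR from the quartiles).
q1 = clean["fare_amount"].quantile(0.25)
q3 = clean["fare_amount"].quantile(0.75)
iqr = q3 - q1
is_outlier = (clean["fare_amount"] < q1 - 1.5 * iqr) | (clean["fare_amount"] > q3 + 1.5 * iqr)
outliers = clean[is_outlier]
filtered = clean[~is_outlier]

# 3. Check the correlation between the feature and the target.
correlation = filtered["distance_km"].corr(filtered["fare_amount"])
print(len(outliers), round(correlation, 3))
```

On the real dataset the same pattern would apply after loading the CSV file with pandas and choosing the relevant columns.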

Theory:

Linear Regression
In statistics, linear regression is a linear approach to modeling the relationship between a
scalar response (or dependent variable) and one or more explanatory variables (or independent
variables). The case of one explanatory variable is called simple linear regression. For more
than one explanatory variable, the process is called multiple linear regression.

Linear Regression Equation:

Simple linear regression models the relationship between two variables. You might also
recognize the equation as the slope-intercept form of a straight line. The equation has the
form Y = a + bX, where Y is the dependent variable (the variable plotted on the Y axis), X is
the independent variable (plotted on the X axis), b is the slope of the line and a is the
y-intercept.
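As an illustration of how a and b can be obtained, the sketch below applies the standard least-squares formulas to a handful of made-up points. The data values are hypothetical and were chosen to lie exactly on Y = 1 + 2X, so the fitted coefficients are easy to check by hand.

```python
import numpy as np

# Hypothetical data points lying exactly on Y = 1 + 2X.
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([3.0, 5.0, 7.0, 9.0, 11.0])

x_mean, y_mean = X.mean(), Y.mean()

# Slope b: sum of cross-deviations divided by sum of squared deviations of X.
b = np.sum((X - x_mean) * (Y - y_mean)) / np.sum((X - x_mean) ** 2)

# Intercept a: the fitted line passes through the point of means.
a = y_mean - b * x_mean

print(a, b)  # 1.0 2.0
```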

Simple Linear Regression

Simple or single-variate linear regression is the simplest case of linear regression with a single
independent variable, 𝐱 = 𝑥.
The following figure illustrates simple linear regression:

Simple Linear Regression With scikit-learn

1. Import the packages and classes you need.
2. Provide data to work with and, if needed, apply appropriate transformations.
3. Create a regression model and fit it with existing data.
4. Check the results of model fitting to know whether the model is satisfactory.
5. Apply the model for predictions.
These steps are more or less general for most of the regression approaches and
implementations.
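The five steps can be sketched as below. The data values are made up for illustration; only the scikit-learn calls themselves (LinearRegression, fit, score, predict) come from the library.

```python
# Step 1: import the packages and classes.
import numpy as np
from sklearn.linear_model import LinearRegression

# Step 2: provide data; scikit-learn expects a 2-D feature array,
# so the single feature is reshaped into one column.
x = np.array([5, 15, 25, 35, 45, 55]).reshape(-1, 1)
y = np.array([5, 20, 14, 32, 22, 38])

# Step 3: create a regression model and fit it to the data.
model = LinearRegression().fit(x, y)

# Step 4: check the fit (coefficient of determination, intercept, slope).
r_sq = model.score(x, y)
print(r_sq, model.intercept_, model.coef_)

# Step 5: apply the model to a new input.
y_new = model.predict(np.array([[60]]))
print(y_new)
```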

Linear Regression – Implementation using scikit-learn

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error

# X and Y are assumed to be 1-D NumPy arrays of equal length.
# scikit-learn expects a 2-D feature matrix, so reshape X to (m, 1)
X = X.reshape(-1, 1)
# Create the model
reg = LinearRegression()
# Fit the model to the training data
reg = reg.fit(X, Y)
# Predict Y
Y_pred = reg.predict(X)

# Calculate RMSE and the R2 score
rmse = np.sqrt(mean_squared_error(Y, Y_pred))
r2 = reg.score(X, Y)
print(rmse)
print(r2)


Decision Tree
A decision tree builds regression or classification models in the form of a tree structure. It
breaks down a dataset into smaller and smaller subsets while, at the same time, an associated
decision tree is incrementally developed. The final result is a tree with decision nodes and
leaf nodes. A decision node (e.g., Outlook) has two or more branches (e.g., Sunny, Overcast and
Rainy), each representing a value of the attribute tested. A leaf node (e.g., Hours Played)
represents a decision on the numerical target. The topmost decision node in a tree, which
corresponds to the best predictor, is called the root node. Decision trees can handle both
categorical and numerical data.
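To make the idea of a root node and leaf nodes concrete, the sketch below fits a depth-1 regression tree to six made-up points that form two clear groups. The data is hypothetical; with max_depth=1 the tree has a single decision node (the root) and two leaves, each predicting the mean target of its group.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Hypothetical data: low inputs have target 1, high inputs have target 5.
X = np.array([[1], [2], [3], [10], [11], [12]])
y = np.array([1.0, 1.0, 1.0, 5.0, 5.0, 5.0])

# A depth-1 tree: one decision node (the root) and two leaf nodes.
tree = DecisionTreeRegressor(max_depth=1, random_state=0).fit(X, y)

# The root splits halfway between the two groups (between 3 and 10).
root_threshold = tree.tree_.threshold[0]

# Each leaf predicts the mean target of the training samples it contains.
predictions = tree.predict(np.array([[2], [11]]))
print(root_threshold, predictions)
```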

Decision Tree Regression – Implementation using scikit-learn

# Import the regressor
from sklearn.tree import DecisionTreeRegressor

# Create a regressor object
regressor = DecisionTreeRegressor(random_state=0)

# Fit the regressor with X and y data
# (X is assumed to be a 2-D array with one feature column, y a 1-D array)
regressor.fit(X, y)

# Predict a new value; predict() expects a 2-D array of shape (n_samples, n_features)
# Test the output by changing the value, e.g. 3750
y_pred = regressor.predict([[3750]])

# Print the predicted price
print("Predicted price: %d" % y_pred[0])

Random Forest Regression


Random Forest Regression is a supervised learning algorithm that uses the ensemble learning
method for regression. Ensemble learning is a technique that combines predictions from
multiple machine learning models to make a more accurate prediction than a single model.

The diagram above shows the structure of a Random Forest. The trees run in parallel with no
interaction amongst them. A Random Forest operates by constructing several decision trees
during training and, for regression, outputting the mean of the individual trees' predictions
as its prediction.
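The averaging behaviour of a random forest regressor can be checked directly in scikit-learn. The sketch below uses synthetic data (an assumption for illustration) and compares the forest's prediction with the mean of the predictions of its individual trees, which are available through the estimators_ attribute.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Synthetic one-feature regression data (hypothetical).
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 1))
y = np.sin(X[:, 0]) + rng.normal(0, 0.1, size=100)

forest = RandomForestRegressor(n_estimators=10, random_state=0).fit(X, y)

X_new = np.array([[2.5], [7.0]])

# Predict with every tree separately, then average across trees.
per_tree = np.array([t.predict(X_new) for t in forest.estimators_])
manual_mean = per_tree.mean(axis=0)

# The forest's own prediction should match the manual average.
forest_pred = forest.predict(X_new)
print(np.allclose(manual_mean, forest_pred))
```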

Random Forest Regression – Implementation using scikit-learn

# Import NumPy and the regressor
import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Create the regressor object
regressor = RandomForestRegressor(n_estimators=100, random_state=0)

# Fit the regressor with x and y data
# (x is assumed to be a 2-D array with one feature column, y a 1-D array)
regressor.fit(x, y)

# Predict a new value; test the output by changing the value
Y_pred = regressor.predict(np.array([6.5]).reshape(1, 1))

# Print the predicted price
print(Y_pred)
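The evaluation task from the problem statement (comparing R2 and RMSE for linear regression and random forest regression) might look like the sketch below. It runs on synthetic data, not the Uber dataset, and the 80/20 train/test split is an assumed choice; the scores it prints say nothing about how the models behave on real fares.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

# Synthetic data: one feature with a roughly linear relation to the target.
rng = np.random.default_rng(42)
X = rng.uniform(0.5, 20.0, size=(500, 1))
y = 2.5 + 1.8 * X[:, 0] + rng.normal(0.0, 1.0, size=500)

# Hold out 20% of the rows so both models are scored on unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

models = {
    "LinearRegression": LinearRegression(),
    "RandomForestRegressor": RandomForestRegressor(n_estimators=100, random_state=0),
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    rmse = np.sqrt(mean_squared_error(y_test, y_pred))
    results[name] = {"R2": r2_score(y_test, y_pred), "RMSE": rmse}
    print(name, results[name])
```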


Conclusion:
Thus we implemented and compared different regressors using the Python scikit-learn
library.

A. Write short answers to the following questions:

1. What is linear regression?
2. What is pruning in a decision tree?
3. What is ensemble learning?
4. What are entropy and information gain in the decision tree algorithm?
5. What is a Random Forest? How does it work?
