0% found this document useful (0 votes)

27 views

ML Assignment1 Linear Regression

The document shows analysis of a dataset containing employees' years of experience and salary. It loads and inspects the data, plots a scatter plot of experience vs. salary, fits a linear regression model to predict salary from experience, and plots the training and test results. Key steps include splitting the data into train and test sets, fitting a linear regression model to the training set, and using the model to make predictions on both training and test sets.

Uploaded by

Dishant kumar yadav mhakhariya

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views

ML Assignment1 Linear Regression

Uploaded by

Dishant kumar yadav mhakhariya

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

keyboard_arrow_down DISHANT KUMAR YADAV 2021BCS0136

#DISHANT KUMAR YADAV

import numpy as np
import pandas as pd

df = pd.read_csv('/content/sample_data/Salary_Data.csv')
df
#DISHANT KUMAR YADAV

YearsExperience Salary

0 1.1 39343.0

1 1.3 46205.0

2 1.5 37731.0

3 2.0 43525.0

4 2.2 39891.0

5 2.9 56642.0

6 3.0 60150.0

7 3.2 54445.0

8 3.2 64445.0

9 3.7 57189.0

10 3.9 63218.0

11 4.0 55794.0

12 4.0 56957.0

13 4.1 57081.0

14 4.5 61111.0

15 4.9 67938.0

16 5.1 66029.0

17 5.3 83088.0

18 5.9 81363.0

19 6.0 93940.0

20 6.8 91738.0

21 7.1 98273.0

22 7.9 101302.0

23 8.2 113812.0

24 8.7 109431.0

25 9.0 105582.0

26 9.5 116969.0

27 9.6 112635.0

28 10.3 122391.0

29 10.5 121872.0

#DISHANT KUMAR YADAV

import matplotlib.pyplot as plt

exp = df['YearsExperience']
sal = df['Salary']

plt.scatter(exp,sal)
plt.xlabel('Experience')
plt.ylabel('Salary')
#DISHANT KUMAR YADAV
Text(0, 0.5, 'Salary')

#DISHANT KUMAR YADAV

exp_np = exp.to_numpy()
sal_np = sal.to_numpy()

exp_np.shape, sal_np.shape
#DISHANT KUMAR YADAV

((30,), (30,))

#DISHANT KUMAR YADAV

from sklearn.linear_model import LinearRegression

sklearn_model = LinearRegression().fit(exp_np.reshape((30,1)), sal_np)

sklearn_sal_predictions = sklearn_model.predict(exp_np.reshape((30,1)))
sklearn_sal_predictions.shape
#DISHANT KUMAR YADAV

(30,)

#DISHANT KUMAR YADAV

exp = df['YearsExperience']
sal = df['Salary']

plt.scatter(exp,sal)
plt.xlabel('Experience')
plt.ylabel('Salary')

plt.scatter(exp,sklearn_sal_predictions )
#DISHANT KUMAR YADAV

output <matplotlib.collections.PathCollection at 0x7c7e2822d360>

#DISHANT KUMAR YADAV
predictions_df = pd.DataFrame({'YearsExperience': exp, 'Salary':sal, 'Sklearn salary prediction':sklearn_sal_predictions})

predictions_df
#DISHANT KUMAR YADAV

YearsExperience Salary Sklearn salary prediction

0 1.1 39343.0 36187.158752

1 1.3 46205.0 38077.151217

2 1.5 37731.0 39967.143681

3 2.0 43525.0 44692.124842

4 2.2 39891.0 46582.117306

5 2.9 56642.0 53197.090931

6 3.0 60150.0 54142.087163

7 3.2 54445.0 56032.079627

8 3.2 64445.0 56032.079627

9 3.7 57189.0 60757.060788

10 3.9 63218.0 62647.053252

11 4.0 55794.0 63592.049484

12 4.0 56957.0 63592.049484

13 4.1 57081.0 64537.045717

14 4.5 61111.0 68317.030645

15 4.9 67938.0 72097.015574

16 5.1 66029.0 73987.008038

17 5.3 83088.0 75877.000502

18 5.9 81363.0 81546.977895

19 6.0 93940.0 82491.974127

20 6.8 91738.0 90051.943985

21 7.1 98273.0 92886.932681

22 7.9 101302.0 100446.902538

23 8.2 113812.0 103281.891235

24 8.7 109431.0 108006.872395

25 9.0 105582.0 110841.861092

26 9.5 116969.0 115566.842252

27 9.6 112635.0 116511.838485

28 10.3 122391.0 123126.812110

29 10.5 121872.0 125016.804574

keyboard_arrow_down DISHANT KUMAR YADAV 2021BCS0136
# Step 1: Import the required python packages
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression

# Step 2: Load the dataset

df = pd.read_csv('/content/sample_data/Salary_Data.csv')

# Step 3: Data analysis - distribution plot shows the variation in the data distribution.
exp = df['YearsExperience']
sal = df['Salary']

plt.scatter(exp, sal)
plt.xlabel('Experience')
plt.ylabel('Salary')
plt.title('Distribution of Experience vs. Salary')
plt.show()

output

# Step 4: Split the dataset into dependent/independent variables

X = df[['YearsExperience']]
y = df['Salary']

# Step 5: Split data into Train/Test sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Step 6: Train the regression model

regression_model = LinearRegression()
regression_model.fit(X_train, y_train)

▾ LinearRegression
LinearRegression()
# Step 7: Plot the training results
plt.scatter(X_train, y_train, color='blue')
plt.plot(X_train, regression_model.predict(X_train), color='red')
plt.xlabel('Experience')
plt.ylabel('Salary')
plt.title('Training Results: Experience vs. Salary')
plt.show()

# Step 7: Plot the test results

plt.scatter(X_test, y_test, color='blue')
plt.plot(X_train, regression_model.predict(X_train), color='red') # Same line as training for comparison
plt.xlabel('Experience')
plt.ylabel('Salary')
plt.title('Test Results: Experience vs. Salary')
plt.show()

Salary Prediction LinearRegression
100% (1)
Salary Prediction LinearRegression
7 pages
Linear Regression 2
No ratings yet
Linear Regression 2
3 pages
Data Preprocessing & Visualization1
No ratings yet
Data Preprocessing & Visualization1
2 pages
Regression Demo
No ratings yet
Regression Demo
8 pages
Linear Regression 1
No ratings yet
Linear Regression 1
2 pages
Linear - Regression - Ipynb - Colaboratory
No ratings yet
Linear - Regression - Ipynb - Colaboratory
4 pages
C: Users Dell Downloads Salary - Data - CSV
No ratings yet
C: Users Dell Downloads Salary - Data - CSV
2 pages
DSBDA3 - Jupyter Notebook
No ratings yet
DSBDA3 - Jupyter Notebook
12 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
4 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
4 pages
EXP 2 ML
No ratings yet
EXP 2 ML
4 pages
Kunj Project 1
No ratings yet
Kunj Project 1
34 pages
Salary Prediction Project
No ratings yet
Salary Prediction Project
6 pages
EXP-4 DMusingPYTHON
No ratings yet
EXP-4 DMusingPYTHON
7 pages
Pps Ui22cs57lab 10
No ratings yet
Pps Ui22cs57lab 10
17 pages
Kunj Project 1
No ratings yet
Kunj Project 1
34 pages
Parth IP Employee Management Project (1)
No ratings yet
Parth IP Employee Management Project (1)
32 pages
Kunj 3
No ratings yet
Kunj 3
34 pages
Data Scientist Salaries 1686594662
No ratings yet
Data Scientist Salaries 1686594662
29 pages
python 1
No ratings yet
python 1
3 pages
Employee Management System
No ratings yet
Employee Management System
33 pages
Appendix B: Source Code
No ratings yet
Appendix B: Source Code
5 pages
Viksit Ip Project File
No ratings yet
Viksit Ip Project File
33 pages
Linear Regression2
No ratings yet
Linear Regression2
9 pages
employee management-Ghanim,Rudra
No ratings yet
employee management-Ghanim,Rudra
25 pages
New Microsoft Word Document
No ratings yet
New Microsoft Word Document
11 pages
Salary Prediction
No ratings yet
Salary Prediction
32 pages
Ali Bhai's IP Project
No ratings yet
Ali Bhai's IP Project
31 pages
Employee Info
No ratings yet
Employee Info
2 pages
Ip Project File
No ratings yet
Ip Project File
46 pages
Social Network Analysis: Cheruvu Nvss Suhas 21BCE8374
No ratings yet
Social Network Analysis: Cheruvu Nvss Suhas 21BCE8374
10 pages
Aastha IP Employee Project
No ratings yet
Aastha IP Employee Project
32 pages
Assignment 03
No ratings yet
Assignment 03
6 pages
Solution To Task 1
No ratings yet
Solution To Task 1
2 pages
Emp Project
No ratings yet
Emp Project
40 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
7 pages
etl_and_stats_code
No ratings yet
etl_and_stats_code
2 pages
New Final Ip Project
No ratings yet
New Final Ip Project
33 pages
Unit5 - Linear Regression
No ratings yet
Unit5 - Linear Regression
4 pages
2 Linear Regression
No ratings yet
2 Linear Regression
5 pages
AIDS - DM Using Python - Lab Programs
No ratings yet
AIDS - DM Using Python - Lab Programs
19 pages
Easy Sudoku Puzzle Book (Printable Version)
From Everand
Easy Sudoku Puzzle Book (Printable Version)
Sheba Blake
No ratings yet
Maxbox Starter139 Top5 Data Diagram Types
No ratings yet
Maxbox Starter139 Top5 Data Diagram Types
4 pages
Group 24 Miniproject
No ratings yet
Group 24 Miniproject
33 pages
Medium Sudoku Puzzle Book (Printable Version)
From Everand
Medium Sudoku Puzzle Book (Printable Version)
Sheba Blake
No ratings yet
Project Advanced Statistics UMESHHASIJA SEP2021 Jupyter File
100% (1)
Project Advanced Statistics UMESHHASIJA SEP2021 Jupyter File
25 pages
Rajdeep 2023PGDM1197
No ratings yet
Rajdeep 2023PGDM1197
5 pages
Ml Projects
No ratings yet
Ml Projects
22 pages
Pattern Recognition
No ratings yet
Pattern Recognition
26 pages
Task1
No ratings yet
Task1
5 pages
Python Module 5
No ratings yet
Python Module 5
19 pages
Source Code55
No ratings yet
Source Code55
18 pages
Panda Merged
No ratings yet
Panda Merged
19 pages
Student Notebook HR Analysis
No ratings yet
Student Notebook HR Analysis
11 pages
Logistic Binary Classification
No ratings yet
Logistic Binary Classification
3 pages
Student - Linear Regression Example - Colaboratory
No ratings yet
Student - Linear Regression Example - Colaboratory
6 pages
Salaries for San Francisco Employee _ ML _ FA _ DA projects
No ratings yet
Salaries for San Francisco Employee _ ML _ FA _ DA projects
33 pages
SVM Practical4 ML4
No ratings yet
SVM Practical4 ML4
3 pages
Practical 3
No ratings yet
Practical 3
8 pages
Data (3)
No ratings yet
Data (3)
17 pages
Federated Learning With Non-IID Data: Yue Zhao Meng Li Liangzhen Lai
No ratings yet
Federated Learning With Non-IID Data: Yue Zhao Meng Li Liangzhen Lai
12 pages
Class Assignment
No ratings yet
Class Assignment
8 pages
Lecture 2 - Barriers To Communication
No ratings yet
Lecture 2 - Barriers To Communication
11 pages
Decision Tree
No ratings yet
Decision Tree
4 pages
Lecture 13
No ratings yet
Lecture 13
17 pages
Kalasalingam Academy of Research and Education (Deemed To Be University) Anand Nagar, Krishnankoil - 626126
No ratings yet
Kalasalingam Academy of Research and Education (Deemed To Be University) Anand Nagar, Krishnankoil - 626126
60 pages