0% found this document useful (0 votes)

2 views

Assignment no 2 _ML_output

Uploaded by

Akkimsd

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Assignment no 2 _ML_output

Uploaded by

Akkimsd

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Assignment no 2 _ML

October 8, 2024

[36]: #ASSIGNMENT NO 2
#Use K-Nearest Neighbors and Support Vector Machine for classification. Analyze␣
↪their performance.

#Dataset link: The emails.csv dataset on the Kaggle https://www.kaggle.com/

↪datasets/balaka18/email-spam-classification-dataset-csv

[37]: import pandas as pd

import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt

[38]: df=pd.read_csv('/home/pc13/Documents/Email/emails.csv')

[51]: df.head() #it returns the first five rows of the DataFrame df

[51]: Email No. the to ect and for of a you hou … connevey jay \
0 Email 1 0 0 1 0 0 0 2 0 0 … 0 0
1 Email 2 8 13 24 6 6 2 102 1 27 … 0 0
2 Email 3 0 0 1 0 0 0 8 0 0 … 0 0
3 Email 4 0 5 22 0 5 1 51 2 10 … 0 0
4 Email 5 7 6 17 1 5 2 57 0 9 … 0 0

valued lay infrastructure military allowing ff dry Prediction

0 0 0 0 0 0 0 0 0
1 0 0 0 0 0 1 0 0
2 0 0 0 0 0 0 0 0
3 0 0 0 0 0 0 0 0
4 0 0 0 0 0 1 0 0

[5 rows x 3002 columns]

[52]: df.info() #df.info() function in pandas provides a concise summary of a␣

↪DataFrame

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 5172 entries, 0 to 5171
Columns: 3002 entries, Email No. to Prediction

1
dtypes: int64(3001), object(1)
memory usage: 118.5+ MB

[53]: df.isnull().sum() #The df.isnull().sum() function in pandas is used to check␣

↪for missing (null) values in a DataFrame.

[53]: Email No. 0

the 0
to 0
ect 0
and 0
..
military 0
allowing 0
ff 0
dry 0
Prediction 0
Length: 3002, dtype: int64

[54]: X = df.iloc[:, 1:-1].values

y = df.iloc[:, -1].values
#X and y are being created from a pandas DataFrame df using the iloc method,␣
↪which is used for integer-location based indexing

#X typically represents the feature set (input data) used for training a␣
↪machine learning model.

#y usually represents the target variable (output data) that the model aims to␣
↪predict.

[55]: from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.30,␣
↪random_state=101)

#the train_test_split function from the sklearn.model_selection module is used␣

↪to split the dataset into training and testing sets

[56]: from sklearn.preprocessing import StandardScaler

sc_X = StandardScaler()
X_train = sc_X.fit_transform(X_train)
X_test = sc_X.transform(X_test)
#The StandardScaler from the sklearn.preprocessing module is used to␣
↪standardize the feature set

[57]: from sklearn.neighbors import KNeighborsClassifier

classifier = KNeighborsClassifier(n_neighbors=5)
classifier.fit(X_train, y_train)
#using the KNeighborsClassifier from the sklearn.neighbors module to create and␣
↪train a K-Nearest Neighbors (KNN) classifier.

2
[57]: KNeighborsClassifier()

[58]: #KNeighborsClassifier() is a class in the sklearn.neighbors module of the␣

↪scikit-learn library, which implements the K-Nearest Neighbors (KNN)␣

↪algorithm for classification tasks

[59]: y_pred = classifier.predict(X_test)

#In this line of code, y_pred = classifier.predict(X_test), you're using the␣
↪trained K-Nearest Neighbors classifier to make predictions on the test set

[60]: from sklearn.metrics import confusion_matrix, accuracy_score

cm = confusion_matrix(y_test, y_pred)
#using functions from sklearn.metrics to evaluate the performance of your␣
↪K-Nearest Neighbors classifier by generating a confusion matrix.

[61]: cm
#The variable cm contains the confusion matrix generated by the␣
↪confusion_matrix function.

[61]: array([[866, 248],

[ 16, 422]])

[49]: from sklearn.metrics import classification_report

cl_report=classification_report(y_test,y_pred)
print(cl_report)
#Generating a classification report using the classification_report function␣
↪from the sklearn.metrics module

precision recall f1-score support

0 0.98 0.78 0.87 1114

1 0.63 0.96 0.76 438

accuracy 0.83 1552

macro avg 0.81 0.87 0.81 1552
weighted avg 0.88 0.83 0.84 1552

[50]: print("Accuracy Score for KNN : ", accuracy_score(y_pred,y_test))

Accuracy Score for KNN : 0.8298969072164949

[62]: from sklearn.svm import SVC

from sklearn.metrics import accuracy_score
#importing the Support Vector Classifier (SVC) from the sklearn.svm module and␣
↪the accuracy_score function from sklearn.metrics

3
[69]: svc = SVC(C=1.0,kernel='rbf',gamma='auto')
svc.fit(X_train,y_train)
y_pred2 = svc.predict(X_test)
#you're creating and training a Support Vector Classifier (SVC) using the␣
↪Radial Basis Function (RBF) kernel

[64]: from sklearn.metrics import confusion_matrix, accuracy_score

#generating a confusion matrix for the predictions made by the Support Vector␣
↪Classifier (SVC

cm = confusion_matrix(y_test, y_pred2)
#Creating the Confusion Matrix

[70]: cm
#The variable cm contains the confusion matrix generated from your SVC model's␣
↪predictions

[70]: array([[1106, 8],

[ 95, 343]])

[67]: print("Accuracy Score for SVC : ", accuracy_score(y_pred2,y_test))

Accuracy Score for SVC : 0.9336340206185567

[71]: from sklearn.metrics import classification_report

cl_report=classification_report(y_test,y_pred2)
print(cl_report)
#generating a classification report for the predictions made by your Support␣
↪Vector Classifier (SVC)

precision recall f1-score support

0 0.92 0.99 0.96 1114

1 0.98 0.78 0.87 438

accuracy 0.93 1552

macro avg 0.95 0.89 0.91 1552
weighted avg 0.94 0.93 0.93 1552

[ ]:

Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
UNIT1
No ratings yet
UNIT1
12 pages
02 - Email - Spam - Ipynb - Colab
No ratings yet
02 - Email - Spam - Ipynb - Colab
11 pages
ML 2 16
No ratings yet
ML 2 16
6 pages
ML 2
No ratings yet
ML 2
1 page
ML2 (1)
No ratings yet
ML2 (1)
2 pages
ML 5
No ratings yet
ML 5
3 pages
Case Study - Classifier
No ratings yet
Case Study - Classifier
5 pages
ML Practical 2
No ratings yet
ML Practical 2
6 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
33 pages
P2) Code Email Spam Detection
No ratings yet
P2) Code Email Spam Detection
3 pages
ML Lab2 pgm
No ratings yet
ML Lab2 pgm
3 pages
Email spam detection
No ratings yet
Email spam detection
3 pages
Machine Learning Assignment 3
No ratings yet
Machine Learning Assignment 3
7 pages
9,12,19,68 - ML Assignment-2
No ratings yet
9,12,19,68 - ML Assignment-2
5 pages
Name: Mussab Bin Shahid Sap-Id: 2024 Assignment: Machine-Learning
No ratings yet
Name: Mussab Bin Shahid Sap-Id: 2024 Assignment: Machine-Learning
5 pages
AIML_ECE304_Assign-2_kartikeya_Kandpal_Ajitesh_S.ipynb - Colab
No ratings yet
AIML_ECE304_Assign-2_kartikeya_Kandpal_Ajitesh_S.ipynb - Colab
4 pages
Risss ML Record 6
No ratings yet
Risss ML Record 6
6 pages
Act8
No ratings yet
Act8
20 pages
Project-4 (KNN CLASSIFICATION) (2) PRANAB
No ratings yet
Project-4 (KNN CLASSIFICATION) (2) PRANAB
2 pages
Machine Learning With Python - Machine Learning Algorithms - KNN
No ratings yet
Machine Learning With Python - Machine Learning Algorithms - KNN
15 pages
ML Practical 2D
No ratings yet
ML Practical 2D
6 pages
Assignment B 2 EmailClassification
No ratings yet
Assignment B 2 EmailClassification
6 pages
KNN SVM
No ratings yet
KNN SVM
2 pages
KNN Lab
No ratings yet
KNN Lab
4 pages
AI PROJECT FILE
No ratings yet
AI PROJECT FILE
11 pages
Practical - 5 - 52
No ratings yet
Practical - 5 - 52
4 pages
Week10 KNN Practical
No ratings yet
Week10 KNN Practical
4 pages
Ml-Exp-2 - Jupyter Notebook
No ratings yet
Ml-Exp-2 - Jupyter Notebook
2 pages
Assignment 02
No ratings yet
Assignment 02
5 pages
Practical 7
No ratings yet
Practical 7
6 pages
ML Lab Programs (1-13)
No ratings yet
ML Lab Programs (1-13)
44 pages
Machine Learning Assignment (1)
No ratings yet
Machine Learning Assignment (1)
8 pages
KnnClassifier - Jupyter Notebook
No ratings yet
KnnClassifier - Jupyter Notebook
2 pages
Activity 01: Python Set/s of Source Code Use in The Activity (Paste Below)
No ratings yet
Activity 01: Python Set/s of Source Code Use in The Activity (Paste Below)
2 pages
ML practical Kiranjot 6-10
No ratings yet
ML practical Kiranjot 6-10
10 pages
Lab 8
No ratings yet
Lab 8
7 pages
Total Listing Machine Learning
100% (1)
Total Listing Machine Learning
114 pages
FAQ's - Supervised Learning
No ratings yet
FAQ's - Supervised Learning
4 pages
Unit2 ML Programs
No ratings yet
Unit2 ML Programs
7 pages
Scikit-Learn Cheat Sheet
No ratings yet
Scikit-Learn Cheat Sheet
1 page
ML Week10.1
No ratings yet
ML Week10.1
5 pages
Scikit-Learn Cheat Sheet
No ratings yet
Scikit-Learn Cheat Sheet
1 page
ML Lab6
No ratings yet
ML Lab6
4 pages
Naive bayes gaussian table tennis - Jupyter Notebook
No ratings yet
Naive bayes gaussian table tennis - Jupyter Notebook
6 pages
K Nearest Neighbors
No ratings yet
K Nearest Neighbors
5 pages
Scikit-Learn Cheat Sheet Python For Data Science: Preprocessing The Data Evaluate Your Model's Performance
100% (1)
Scikit-Learn Cheat Sheet Python For Data Science: Preprocessing The Data Evaluate Your Model's Performance
1 page
It - S All About Neighbors - Completed
No ratings yet
It - S All About Neighbors - Completed
14 pages
B-56 Sanket Jambhulkar MLA-7
No ratings yet
B-56 Sanket Jambhulkar MLA-7
9 pages
pratham ML
No ratings yet
pratham ML
14 pages
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
100% (1)
Scikit-Learn: Scikit-Learn Is An Open Source Python Library That
1 page
Act10
No ratings yet
Act10
4 pages
Solution 1
No ratings yet
Solution 1
6 pages
ML Assignment 02
No ratings yet
ML Assignment 02
8 pages
SPPUML5
No ratings yet
SPPUML5
4 pages
ML practical Lovepreet 6-10
No ratings yet
ML practical Lovepreet 6-10
10 pages
mnbnmnbnnmbbhhuyrgh
No ratings yet
mnbnmnbnnmbbhhuyrgh
3 pages
Scikit Learn Cheat Sheet Python
No ratings yet
Scikit Learn Cheat Sheet Python
1 page
Microsoft Visual Basic Interview Questions: Microsoft VB Certification Review
From Everand
Microsoft Visual Basic Interview Questions: Microsoft VB Certification Review
Equity Press
No ratings yet
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
From Everand
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
Vladimir Kiselev
No ratings yet
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
Bahan Ajar Asking and Giving Opinion
No ratings yet
Bahan Ajar Asking and Giving Opinion
3 pages
Instructional Planning Models
100% (1)
Instructional Planning Models
31 pages
Postmdernism Theory: Abdulazim Ali N.Elaati 25-5-2016
No ratings yet
Postmdernism Theory: Abdulazim Ali N.Elaati 25-5-2016
7 pages
Sainik Awasiya Mahavidyalaya Chitwan: Syllabus For Grade XI Entrance Examination (Science) - 2080
No ratings yet
Sainik Awasiya Mahavidyalaya Chitwan: Syllabus For Grade XI Entrance Examination (Science) - 2080
2 pages
MSMU Lesson Plan Format Grade/Class/Subject:: Ccss - Ela-Literacy.L.2.1
No ratings yet
MSMU Lesson Plan Format Grade/Class/Subject:: Ccss - Ela-Literacy.L.2.1
10 pages
Zos Admin Laterals New
No ratings yet
Zos Admin Laterals New
6 pages
Organizing Ideas by Time and Space
No ratings yet
Organizing Ideas by Time and Space
8 pages
Resource Pack
No ratings yet
Resource Pack
23 pages
Pemodelan Sistem 1 Rev
100% (1)
Pemodelan Sistem 1 Rev
185 pages
Step by Step Guide To Create A Simple BPM Process - SAP Blogs
No ratings yet
Step by Step Guide To Create A Simple BPM Process - SAP Blogs
44 pages
Memory Management Notes - Operating System
No ratings yet
Memory Management Notes - Operating System
4 pages
Mastery Teaching and Mastery Learning
No ratings yet
Mastery Teaching and Mastery Learning
14 pages
Python Basics Notes
No ratings yet
Python Basics Notes
32 pages
MIS All Units 1-8
No ratings yet
MIS All Units 1-8
89 pages
Enhancing Spelling Proficiency in English Among Grade Seven Learners Through the Implementation of the Cover-Copy-Compare (CCC) Strategy
No ratings yet
Enhancing Spelling Proficiency in English Among Grade Seven Learners Through the Implementation of the Cover-Copy-Compare (CCC) Strategy
11 pages
Calculo de Presion de Diseno
No ratings yet
Calculo de Presion de Diseno
12 pages
C - Session 14
No ratings yet
C - Session 14
19 pages
Wisdom of Veda
No ratings yet
Wisdom of Veda
104 pages
Delete Create TRX BTS Nokia
No ratings yet
Delete Create TRX BTS Nokia
17 pages
How To Choose A Dissertation Title
100% (2)
How To Choose A Dissertation Title
6 pages
OpenSAP s4h35 Week 2 All Slides
No ratings yet
OpenSAP s4h35 Week 2 All Slides
112 pages
6º Primaria (Online Exercises)
No ratings yet
6º Primaria (Online Exercises)
2 pages
Log GDC
No ratings yet
Log GDC
4 pages
Viernes Santos
No ratings yet
Viernes Santos
6 pages
Conventional Encryption
No ratings yet
Conventional Encryption
14 pages
Importance of Technical Writing
No ratings yet
Importance of Technical Writing
2 pages
2.3.9 Practice - Written Assignment - The Wonderful World of Food (Practice)
100% (1)
2.3.9 Practice - Written Assignment - The Wonderful World of Food (Practice)
3 pages
DLL Hijacking Basics
No ratings yet
DLL Hijacking Basics
34 pages
SA English Ws
No ratings yet
SA English Ws
5 pages