Wart Treatment Using Machine Learning Support Vector Algorithm

This document discusses using machine learning algorithms like support vector machines, decision trees, and random forests to predict the effectiveness of wart treatment using immunotherapy. It compares the performance of these algorithms on a dataset containing 90 instances with 8 attributes related to patient information and treatment results. The random forest algorithm achieved the highest accuracy of 86.6% according to 10-fold cross-validation experiments.

Uploaded by

Lij Bereket Tena

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

77 views

Wart Treatment Using Machine Learning Support Vector Algorithm

Uploaded by

Lij Bereket Tena

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Title wart treatment using machine learning support vector algorithm

Abstract support vector machine (SVM) in data optimization has becomes powerful tools of problem
solving in machine learning. SVM algorithm can be used for Face detection, image classification, text
categorization, etc. In this paper, we present wart treatment of patients using immunotherapy by using
support vector machine algorithm model. Immunotherapy is a new class of cancer treatment that works
to harness the innate powers of our own immune system to fight cancer. We run various kinds of
algorithms and compared the performance of each algorithm with the other in terms of the wart
treatment result of patients using immunotherapy. The different types of algorithms are incorporated
step by step. The treatment of patient’s wart using immunotherapy has been considered as a
classification problem and it is evaluated using various types of machine learning algorithms. The
evaluations have been performed on diverse feature sets and the different classification methods. The
comparison of the results is also presented and the evaluation show that for the wart treatment using
immunotherapy. The immunotherapy data set has 90 instances and 8 attributes of type integer and real
type. For these, data set the data mining tool used was sklearn.

1. Introduction

SVM is a supervised machine learning algorithm which can be used for classification or regression
problems. It uses a technique called the kernel trick to transform your data and then based on these
transformations it finds an optimal boundary between the possible outputs. Simply put, it does some
extremely complex data transformations, then figures out how to separate our data based on the labels
or outputs we have defined. The goal of the SVM algorithm is to create the best line or decision
boundary that can segregate n-dimensional space into classes so that we can easily put the new data
point in the correct category in the future. This best decision boundary is called a hyperplane. SVM
chooses the extreme points/vectors that help in creating the hyperplane. These extreme cases are called
as support vectors, and hence algorithm is termed as Support Vector Machine. Consider the below
diagram in which there are two different categories that are classified using a decision boundary or hype

Fig 1.1 svm model

Hyperplane: There can be multiple lines/decision boundaries to segregate the classes in n-
dimensional space, but we need to find out the best decision boundary that helps to classify the
data points. This best boundary is known as the hyperplane of SVM. The dimensions of the
hyperplane depend on the features present in the dataset, which means if there are 2 features (as
shown in fig 1.1), then hyperplane will be a straight line. And if there are 3 features, then
hyperplane will be a 2-dimension plane. We always create a hyperplane that has a maximum
margin, which means the maximum distance between the data points.
Support vectors: The data points or vectors that are the closest to the hyperplane and which affect
the position of the hyperplane are termed as Support Vector. Since these vectors support the
hyperplane, hence called a Support vector.
1. Methods
SVM can be understood with the example that we have used in the KNN classifier. Suppose we
see a wart treated with immunotherapy with 1 and not treated result 0, so if we want a model that
can accurately identify whether wart is treated or not , so such a model can be created by using
the SVM algorithm. We will first train our model with lots of data with their features,is that
treated or not. so that it can learn about different features of treated and not t, and then we test it
with new test data. So as support vector creates a decision boundary between these two data
(treated or not ) and choose extreme cases (support vectors), it will see the extreme case. On the
basis of the support vectors, it will classify it as a treated or not.
2. Model development
Machine learning model is the process for making your models available in production environments,
where they can provide predictions to other data. It is only once models are deployed to production that
they start adding value, making deployment simple. All of the available data is split into two categories.
In the training phase, we use 90% of the data in training the model. The remaining 10% of the data is
used in the testing phase to validate the accuracy of the model built. In the prediction phase, the model
is deployed in production and we use actual live data in predicting the outcome. I am going to use a 10-
fold cross validation to split the data into training and testing on scikitlearn and sample data to develop
our model using the Classification algorithms. These algorithms are Support vector machine, Naive
Bayes and Decision Tree.
Fig 3.1 supervised learning model
3. Related Works

One of the most common and leading cause of cancer death in human beings is lung cancer. The
advanced observation of cancer takes the main role to inflate a patient’s probability for survival of the
disease. Other researchers show the accomplishment of support vector machine (SVM) and logistic
regression (LR) algorithms in predicting the survival rate of lung cancer patients and compares the
effectiveness of these two algorithms through accuracy, precision, recall, F1 score and confusion matrix.
These techniques have been applied to detect the survival possibilities of lung cancer victims and help
the physicians to take decisions on the forecast of the disease.

Support vector machine

Support vector machine is a representation of the training data as points in space separated into
categories by a clear gap that is as wide as possible. New examples are then mapped into that same
space and predicted to belong to a category based on which side of the gap they fall. SVM it capable of
doing both classification and regression. In this post I'll focus on using SVM for classification. In particular
I'll be focusing on non-linear SVM, or SVM using a non-linear kernel. Non-linear SVM means that the
boundary that the algorithm calculates doesn't have to be a straight line. The benefit is that you can
capture much more complex relationships between your datapoints without having to perform difficult
transformations on your own. The downside is that the training time is much longer as it's much more
computationally intensive.

3.1. Decision Tree

A decision tree is a tree-like graph with nodes representing the place where we pick an attribute and ask
a question; edges represent the answers to the question; and the leaves represent the actual output or
class label. They are used in non-linear decision making with simple linear decision surface.

Fig 4.2.1 random forest model

3.2. Random Forest

Random forest is solid choice for nearly any prediction problem (even non-linear ones). It's a
relatively new machine learning strategy (it came out of Bell Labs in the 90s) and it can be used
for just about anything. It belongs to a larger class of machine learning algorithms called
ensemble methods. the algorithm to induce a random forest will create a bunch of random
decision trees automatically. Since the trees are generated at random, most won't be all that
meaningful to learning our classification/regression problem (maybe 99.9% of trees).

Fig 4.3.2 random forest model

4. Dataset
Wart treatment using immunotherapy dataset have 8 features and 90 instances. I collect the data from
UCI dataset website. Attribute Information: <class 'pandas.core.frame.DataFrame'>

RangeIndex: 90 entries, 0 to 89

Data columns (total 8 columns):

# Column Non-Null Count Dtype

--- ------ -------------- -----

0 sex 90 non-null int64

1 age 90 non-null int64

2 Time 90 non-null float64

3 Number_of_Warts 90 non-null int64

4 Type 90 non-null int64

5 Area 90 non-null int64

6 induration_diameter 90 non-null int64

7 Result_of_Treatment 90 non-null int64

dtypes: float64(1), int64(7)

memory usage: 5.8 KB Experimental result

5. Experimental Results

In this paper I am trying to compare three different classification algorithms on the same dataset to see
the performance or accuracy of each model and provided a comparison results in terms of accuracy,
confusion matrix and classification report using the experiment result.

Result 1: Accuracy, confusion matrix and classification report for Decision Tree

Result 2 Accuracy, confusion matrix and classification report for SVM

Result 3 Accuracy, confusion matrix and classification report for random forest
So as shown in the below bar chart I have conducted comparison on different classification technique
and provided a basis among them in terms of accuracy, confusion matrix and classification report by
applying 10- fold cross validation, and random forest can achieve around 86.6 % of acuuracy.

6. Conclusion

In this paper, I am trying to compare Support vector machine, random forest and decision for
immunotherapy dataset. I use 10-fold cross validation (90% for training and 10% for testing) for all
algorithms. Based on these classifications the accuracy decision tree 83.33%, accuracy of random forest
is 86.66% and accuracy of SVM 78.88%. From these accuracy result, we can say that with the same
dataset the accuracy of different algorithms becomes different. So, we must be selecting the efficient
one by comparing them.

7. Reference
8.
https://www.researchgate.net/publication/319870836_Predicting_Lung_Cancer_Survivability_using
_SVM_and_Logistic_Regression_Algorithms
https://archive.ics.uci.edu/ml/datasets/Immunotherapy+Dataset
https://www.analyticsvidhya.com/blog/2017/09/understaing-support-vector-machine-example-
code/

ETC1010 S12015 Solution Part2
No ratings yet
ETC1010 S12015 Solution Part2
7 pages
Presented By: M. Saqib Iqbal Gull Muhammad Presented To: Mr. Imran Ali Khan Artificial Intelligence National College of Bussiness Administration & Economics Multan
No ratings yet
Presented By: M. Saqib Iqbal Gull Muhammad Presented To: Mr. Imran Ali Khan Artificial Intelligence National College of Bussiness Administration & Economics Multan
11 pages
UNIT 3 AAM
No ratings yet
UNIT 3 AAM
30 pages
Machine Learning Algorithms For Breast Cancer Prediction
No ratings yet
Machine Learning Algorithms For Breast Cancer Prediction
8 pages
13 PracticalMachineLearning
100% (1)
13 PracticalMachineLearning
84 pages
SVM, Neural Network and Random Forest in R
No ratings yet
SVM, Neural Network and Random Forest in R
45 pages
ML Unit 3 V1
No ratings yet
ML Unit 3 V1
25 pages
Machine learning algorithms laiki
No ratings yet
Machine learning algorithms laiki
123 pages
Experiment 2.3 SVM Classifier
No ratings yet
Experiment 2.3 SVM Classifier
3 pages
3.unit 3 ML Part-1 Q&A
No ratings yet
3.unit 3 ML Part-1 Q&A
39 pages
Breast Cancer Classifier Using Machine Learning
No ratings yet
Breast Cancer Classifier Using Machine Learning
7 pages
Support Vector Machine: Fundamentals and Applications
From Everand
Support Vector Machine: Fundamentals and Applications
Fouad Sabry
No ratings yet
Report On Prediction Model
No ratings yet
Report On Prediction Model
17 pages
SVM Unit 2
No ratings yet
SVM Unit 2
12 pages
AI Chapter 3 Part 3
No ratings yet
AI Chapter 3 Part 3
49 pages
AP for NLP-LO2
No ratings yet
AP for NLP-LO2
38 pages
DL PPR3
No ratings yet
DL PPR3
57 pages
Ai ML Research Paper-219311275
No ratings yet
Ai ML Research Paper-219311275
6 pages
Algorithm of Neural Network M4
No ratings yet
Algorithm of Neural Network M4
25 pages
DSUP_Exp6[1]
No ratings yet
DSUP_Exp6[1]
5 pages
Support Vector Machine in R Paper
No ratings yet
Support Vector Machine in R Paper
28 pages
5-(9-12) SVM & DT Classifiers
No ratings yet
5-(9-12) SVM & DT Classifiers
41 pages
ML Unit-3
No ratings yet
ML Unit-3
16 pages
Introduction of Machine Learning
No ratings yet
Introduction of Machine Learning
9 pages
SVM&Decision Tree
No ratings yet
SVM&Decision Tree
10 pages
Testsdfhakjfhadks
No ratings yet
Testsdfhakjfhadks
6 pages
SUpport Vector Machine
No ratings yet
SUpport Vector Machine
28 pages
Multi-Disease Prediction With Machine Learning
No ratings yet
Multi-Disease Prediction With Machine Learning
7 pages
Project Title and Abstract
No ratings yet
Project Title and Abstract
17 pages
Machine Learning Section4 Ebook v03
No ratings yet
Machine Learning Section4 Ebook v03
20 pages
Machine Learning 1707965934
No ratings yet
Machine Learning 1707965934
15 pages
support_vector_machines
No ratings yet
support_vector_machines
12 pages
SVM Unit3
No ratings yet
SVM Unit3
23 pages
A Systematic Review of Supervised Learning Algorithms in Disease Diagnosis
No ratings yet
A Systematic Review of Supervised Learning Algorithms in Disease Diagnosis
10 pages
Understanding Machine Learning Algorithms - in Depth
No ratings yet
Understanding Machine Learning Algorithms - in Depth
167 pages
Basic of SVM Algorithm
No ratings yet
Basic of SVM Algorithm
10 pages
Lab 6 Dsa
No ratings yet
Lab 6 Dsa
15 pages
ML Unit-3
No ratings yet
ML Unit-3
28 pages
PDSeasonableSchool ML4PD
No ratings yet
PDSeasonableSchool ML4PD
135 pages
U21amg05 Aif and ML Unit 04 Notes
No ratings yet
U21amg05 Aif and ML Unit 04 Notes
42 pages
CSL0777 L23
No ratings yet
CSL0777 L23
39 pages
11 Most Common Machine Learning Algorithms Explained in A Nutshell by Soner Yıldırım Towards Data Science
No ratings yet
11 Most Common Machine Learning Algorithms Explained in A Nutshell by Soner Yıldırım Towards Data Science
16 pages
ML Unit 2
No ratings yet
ML Unit 2
37 pages
Lecture 9_Classification_Part 2_ec0c64efddca717f99b726e6fd37c459
No ratings yet
Lecture 9_Classification_Part 2_ec0c64efddca717f99b726e6fd37c459
26 pages
B24 ML Exp-3
No ratings yet
B24 ML Exp-3
10 pages
Breast Cancer Classification
100% (2)
Breast Cancer Classification
16 pages
Module 3
No ratings yet
Module 3
79 pages
Report of Comparing 5 Classification Algorithms of Machine Learning PDF
No ratings yet
Report of Comparing 5 Classification Algorithms of Machine Learning PDF
4 pages
Heart Disease Prediction
No ratings yet
Heart Disease Prediction
27 pages
Analysis of Common Supervised Learning Algorithms Through Application
No ratings yet
Analysis of Common Supervised Learning Algorithms Through Application
20 pages
A Comprehensive Survey On Support Vector Machine Classification Applications, Challenges and Trends - 2019
No ratings yet
A Comprehensive Survey On Support Vector Machine Classification Applications, Challenges and Trends - 2019
8 pages
Prediction On Iris
No ratings yet
Prediction On Iris
14 pages
Machine Learning Algorithms 1728923216
No ratings yet
Machine Learning Algorithms 1728923216
12 pages
ML Unit 3 Part B Material
No ratings yet
ML Unit 3 Part B Material
15 pages
A Comprehensive Survey On Support Vector Machine Classification - Applications, Challenges and Trends
No ratings yet
A Comprehensive Survey On Support Vector Machine Classification - Applications, Challenges and Trends
27 pages
Title: Implement Support Vector Machine Classifier: Department of Computer Science and Engineering
No ratings yet
Title: Implement Support Vector Machine Classifier: Department of Computer Science and Engineering
5 pages
5 markd
No ratings yet
5 markd
24 pages
Machine Learning Algorithms - A Review - ART20203995
No ratings yet
Machine Learning Algorithms - A Review - ART20203995
6 pages
Ijet V7i2 8 10557
No ratings yet
Ijet V7i2 8 10557
4 pages
Kernel Methods: Fundamentals and Applications
From Everand
Kernel Methods: Fundamentals and Applications
Fouad Sabry
No ratings yet
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Classification and Prediction
No ratings yet
Classification and Prediction
41 pages
Predicting Heart Disease at Early Stages Using Machine Learning: A Survey
No ratings yet
Predicting Heart Disease at Early Stages Using Machine Learning: A Survey
4 pages
HW1 Final
No ratings yet
HW1 Final
4 pages
Hronsky - Session 4 - Mineral Exploration Tactics
No ratings yet
Hronsky - Session 4 - Mineral Exploration Tactics
44 pages
Time Series Analysis of Cross-Listed Stocks
No ratings yet
Time Series Analysis of Cross-Listed Stocks
9 pages
UMBC CMSC 471 Final Exam,: 1. True/False (20 Points)
No ratings yet
UMBC CMSC 471 Final Exam,: 1. True/False (20 Points)
6 pages
T10-R73-P2-Varian-802-902-v5.1 - Practice Questions
No ratings yet
T10-R73-P2-Varian-802-902-v5.1 - Practice Questions
11 pages
Classification Algorithm in Data Mining: An
No ratings yet
Classification Algorithm in Data Mining: An
6 pages
Rathore 2024 Machine Learning Applications in Human Resource Ma
No ratings yet
Rathore 2024 Machine Learning Applications in Human Resource Ma
12 pages
Automated Root Cause Analysis of No
No ratings yet
Automated Root Cause Analysis of No
13 pages
Solomatine 2004
No ratings yet
Solomatine 2004
11 pages
Ch.3 Data Preprocessing
No ratings yet
Ch.3 Data Preprocessing
16 pages
Crime Data Mediante Machine Learning
No ratings yet
Crime Data Mediante Machine Learning
6 pages
Reducing Power Consumption of Digital Predistortion For RF Power Amplifiers Using Real-Time Model Switching
No ratings yet
Reducing Power Consumption of Digital Predistortion For RF Power Amplifiers Using Real-Time Model Switching
9 pages
Lab-Practice-I(ML)-Lab Manual-Vaishali
No ratings yet
Lab-Practice-I(ML)-Lab Manual-Vaishali
57 pages
Instant ebooks textbook Qlik Sense Advanced Data Visualization for Your Organization 1 edition Edition Ferran Pagans download all chapters
100% (4)
Instant ebooks textbook Qlik Sense Advanced Data Visualization for Your Organization 1 edition Edition Ferran Pagans download all chapters
55 pages
DDT: Distributed Decision Tree
No ratings yet
DDT: Distributed Decision Tree
54 pages
Unit 3 DVA
No ratings yet
Unit 3 DVA
50 pages
Data Analysis and Price Prediction of Black Friday Sales Using Machine Learning Techniques IJERTV10IS070271
No ratings yet
Data Analysis and Price Prediction of Black Friday Sales Using Machine Learning Techniques IJERTV10IS070271
8 pages
Machine Learning Functionalities
No ratings yet
Machine Learning Functionalities
58 pages
Smai A1 PDF
No ratings yet
Smai A1 PDF
3 pages
ML Labs
No ratings yet
ML Labs
46 pages
Fundamentals of Machine Learning For Predictive Data Analytics
No ratings yet
Fundamentals of Machine Learning For Predictive Data Analytics
52 pages
IR Question Bank
100% (2)
IR Question Bank
29 pages
3806 Disease Prediction by Using Machine Learning PDF
No ratings yet
3806 Disease Prediction by Using Machine Learning PDF
6 pages
8 Decision Trees Option
No ratings yet
8 Decision Trees Option
59 pages
Data Mining For Business in Python Deck
No ratings yet
Data Mining For Business in Python Deck
93 pages
Parallel 29 - Laura Bradier
No ratings yet
Parallel 29 - Laura Bradier
33 pages
Visualizing Trees and Forests
No ratings yet
Visualizing Trees and Forests
24 pages