Comparative Analysis of Breast Cancer Detection Using Cutting-Edge Machine Learning Algorithms (MLAs)
Comparative Analysis of Breast Cancer Detection Using Cutting-Edge Machine Learning Algorithms (MLAs)
Article QR
Author (s): Tanzeel Sultan Rana1*, Imran Saleem2, Rabia Naseer Rao1, Maryam Shabbir2,
Laiba Wahid Chaudary1
Affiliation (s): 1
University of Management and Technology Lahore, Pakistan
2
Bahria University, Lahore, Pakistan
3
Octans Digital Pvt Ltd. 92-cc,Ex-Park View, Lahore ,Pakistan
DOI: https://doi.org/10.32350.icr.31.01
Received: April 12, 2023, Revised: April 29, 2023, Accepted: May 29, 2023, Published:
History: June 24, 2023
Citation: T. Sultan, R. I. Saleem, R. N. Rao, M. Shabbir, and L. W. Chaudary,
“Comparative analysis of breast cancer detection using cutting-edge machine
learning algorithms (MLAs),” Innov. Comput. Rev., vol. 3, no. 1, pp. 01–15,
2023, doi: https://doi.org/10.32350.icr.31.01
A publication of
School of Systems and Technology
University of Management and Technology, Lahore, Pakistan
Comparative Analysis of Breast Cancer Detection using Cutting-edge
Machine Learning Algorithms (MLAs)
Tanzeel Sultan Rana1 ∗, Imran Saleem2, Rabia Naseer Rao1, Maryam Shabbir2,
0F
ABSTRACT Recently, machine learning techniques have gained popularity for the
medical diagnosis. Medical professionals use this approach to learn and detect the
abnormalities of life-threatening chronic diseases. The increasing use of ML approaches
may be due in part to better disease diagnosis enabled through improved symptom
detection. The current study deployed different machine learning algorithms, including
Decision Trees (DT), K-Nearest Neighbors (KNN), classifiers Multilayer Perceptron (MP),
Support Vector Machines (SVM), and Random Forest (RF) for early predictions and
symptoms of the disease. These models were capable of differentiating between benign
and harmful cancer cells Benign tumours, which were non-cancerous and in most cases,
non-lethal were mostly confined to the area from where they originated, however, it was
observed that malignant cancer can start with abnormal cell growth in the human body.
This abnormal cell growth can quickly spread to nearby tissues, which can cause
infiltration of adjacent cells, resulting in a potentially fatal condition. Thereby, it was
observed that Multilayer Perceptron (MLP) model provided the highest accuracy
percentage of 86% when compared with all the other techniques in association with the
accuracy rate of the models
INDEX TERMS Decision Tree (DT), K-Nearest Neighbor (KNN), Logistic Regression
(LR), Support Vector Machine (SVM), Machine Learning (ML), Multilayer Perceptron
(MP), Random Forest (RF)
I. INTRODUCTION tissue. The current study focused on
breast cancer tumours along with
Cancer is a complex disease, which is
Machine Learning Algorithms (MLA).
characterized by uncontrolled growth of
Primarily, the breast composed of two
abnormal tissue in the entire body.
main types of tissue: glandular tissue and
Normally, old or damaged cells are
connective tissue. Glandular tissues are
replaced by new and healthy cells to
responsible for producing milk, whereas
maintain a healthy functioning of body .
connective tissues provide structural
Contrastingly, some damaged tissue
support and shape the breast. Glandular
incessantly grows and become a mass of
tissues may convert into malignant
tissues known as a tumour in the human
tumours with the passage of time. Most
body. There are two types of tumours,
breast cancers emanate in the cells of the
for instance, malignant and benign
lobules, the anatomical structures that
∗
Corresponding Author: tanzeelsultanrana1@gmail.com
Innovative Computing Review
2
Volume 3 Issue 1, Spring 2023
Rana et al.
TABLE I
SUMMARY OF LITERATURE REVIEW
Reference Model Method
Support Vector Machine
(SVM), Decision Tree (C4.5), SVM improves breast cancer
[8]
Naive Bayes (NB), and k diagnosis and treatment strategies.
Nearest Neighbors (k-NN)
Haifeng's ensemble model
demonstrated improved diagnosis
Support vector machine
[9] accuracy, but its long training time
ensemble Algorithm
and computational expense raise
concerns.
Haifeng's ensemble model
demonstrated improved diagnosis
Deep learning and machine
[10] accuracy, but its long training time
learning algorithms
and computational expense raise
concerns.
Neural networks can identify
cancerous cells but require
Artificial Neural Network
[11] observation over time and significant
(ANN)
processing power processing of image
processing.
Support Vector Machine
(SVM), Random Forest, SVM is the best-supervised machine
[12] Logistic Regression, Decision learning algorithm for breast cancer
tree (C4.5), and K-Nearest classification.
Neighbours (KNN)
Random Forest, Naïve Bayes,
Support Vector Machines SVM is the best classifier with 97.9%
[13]
SVM, and K-Nearest accuracy.
Neighbors K-NN
Support Vector Machine Clustering and SVM combined to
[14]
(SVM) predict WBCD with 99.10% accuracy.
Pulse-Coupled Neural A new approach uses LDA to reduce
Networks (PCNN) and Deep feature dimensionality and applies
[15]
Convolutional Neural five ML algorithms to the BCWD
Networks (CNN) dataset.
Support Vector Machine
(SVM), Random Forest,
Support vector machines had the
[16] Logistic Regression, Decision
highest accuracy (97.2%).
tree (C4.5), and K-Nearest
Neighbours (KNN)
For technique, the confusion matrix was testing, whereas 25% of the data was used.
calculated. For the dataset with 831 The confusion matrix of these machine
instances, 75% of the data instances were learning algorithms was shown, which
employed for training models and for provided the results for SVM, DT RF, K-
Innovative Computing Review
12
Volume 3 Issue 1, Spring 2023
Rana et al.
NN, MLP, and LR, respectively. The on some machine learning techniques
confusion matrix was seen in its in breast cancer classification,” in 2020
combination in the table above. IEEE-EMBS Conference on
Biomedical Engineering and Sciences
However, It can forecast the greatest
(IECBES), IEEE, 2021, pp. 499–504,
number of positives when any of the six
doi:
strategies is noticed to be true. Logistics
https://doi.org/10.1109/IECBES48179
Regression Models (LRM) may forecast
.2021.9398837
the least amount of positives when they are
false positives in addition to predicting the [5] S. Saeed, N. Z. Jhanjhi, M. Naqvi, M.
greatest number of true positives. Logistic Humyun, M. Ahmad, and L. Gaur,
Regression (LR) predicted the lowest false- “Optimized breast cancer premature
positive rate, whereas the false-positives detection method with computational
highest value was achieved by LR and segmentation: A systematic review
MLP. mapping,” Approaches Appl. Deep
Learn. Virtual Med. Care, pp. 24–51,
The decision Tree (DT) provided the
2022, doi: https://doi.org/10.4018/978-
highest rate where false negatives were
1-7998-8929-8.ch002
concerned, with the lowest rate being
achieved by MLP. The F1 score for all the [6] W. Yue, Z. Wang, H. Chen, A. Payne,
techniques is almost 97%, which was and X. Liu, “Machine learning with
significantly better. LR predicted the applications in breast cancer diagnosis
highest value of True-Negative, whereas and prognosis,” Designs, vol. 2, no. 2,
MLP provided the lowest value. Art. no. 13, May 2018, doi:
https://doi.org/10.3390/designs202001
REFERENCES
3
[1] D. Lazaro-Pacheco, A. M. Shaaban, S.
[7] L. Galluzzi et al., “Autophagy in
Rehman, and I. Rehman, “Raman
malignant transformation and cancer
spectroscopy of breast cancer,” Appl.
progression,” EMBO J., vol. 34, no. 7,
Spectrosc. Rev., vol. 55, no. 6, pp. 439–
pp. 856–880, 2015, doi:
475, 2020, doi:
https://doi.org/10.15252/embj.201490
https://doi.org/10.1080/05704928.201
784
9.1601105
[8] H. Asri, H. Mousannif, H. Al
[2] M. Arnold et al., “Current and future
Moatassime, and T. Noel, “Using
burden of breast cancer: Global
machine learning algorithms for breast
statistics for 2020 and 2040,” The
cancer risk prediction and diagnosis,”
Breast, vol. 66, pp. 15–23, 2022, doi:
Procedia Comput. Sci., vol. 83, pp.
https://doi.org/10.1016/j.breast.2022.0
1064–1069, 2016, doi:
8.010
https://doi.org/10.1016/j.procs.2016.0
[3] F. Gorunescu, “Fighting breast cancer 4.224
with the aid of artificial intelligence: A
[9] H. Wang, B. Zheng, S. W. Yoon, and
big challenge,” J. Cancer Clin. Res.,
H. S. Ko, “A support vector machine-
vol. 3, no. 1, Art. no. 1069.
based ensemble algorithm for breast
[4] N. A. Mashudi, S. A. Rossli, N. cancer diagnosis,” Eur. J. Oper. Res.,
Ahmad, and N. M. Noor, “Comparison vol. 267, no. 2, pp. 687–699, 2018, doi: