PUBLICATION
PUBLICATION
PUBLICATION
PHASE-I
ABSTRACT: The need for precise and effective medical diagnosis has grown in importance in the healthcare industry in
today's fast-paced world.. The core concept driving clinical diagnosis is to minimize human error in medical settings, a
principle that extends beyond healthcare to other domains such as earth observation via satellites and comprehending
activities in outer space. The primary motivation behind the development of our project lies in providing doctors with a
reliable tool to predict and address potential health issues through a user-friendly web platform. Additionally, we aim to
enhance the overall user experience by implementing features that manage user history and preferences. The technologies at
the heart of our solution involve cutting-edge convolutional neural networks (CNN) and the powerful EfficientNet B3 for
image processing, combined with the versatility of React.js for crafting an interactive web front-end. Our project is firmly
grounded in image data, addressing common challenges in image processing, including overfitting, hyperparameter
sensitivity, and time consumption. By tackling these issues head-on, we aim to empower medical professionals with rapid and
accurate diagnostic results, aligning with their need for swift decision-making and optimal patient care.
Technologies used in this project are CNN and In A Novel Attention-Based CNN-RNN Hybrid
EfficientNet. The next column explains the technologies we Architecture for sEMG-Based Gesture Recognition
use. (2018)[3], the proposed architecture combines
convolutional neural networks (CNNs) and recurrent neural
Convolutional Neural Network (CNN) greatly helps with networks (RNNs) with an attentional mechanism that
image processing and pattern recognition. It uses recognizes gestures based on surface electromyography
convolutional layers to automatically learn hierarchical signals (sEMG). CNNs extract features from images, while
features of data. CNNs perform the most accurate and RNNs, especially Long Short-Term Memory (LSTM) units,
adaptive computer vision tasks. model temporal dependencies in data. An attention
mechanism is introduced to highlight relevant information
EfficientNet is a family of convolutional neural networks in sEMG signals. The architecture uses a combination of
(CNN) optimized for computer vision. It uses a loss functions, including attention loss and target
combination of various techniques to balance depth, width replication loss.
and resolution, making the model effective with accuracy.
EfficientNet is well-known in the deep learning community Challenges in this approach include the vanishing and
for its ability to deliver high performance while using fewer exploding gradient problem, which can hinder training and
computational resources. make it difficult to effectively capture long-range
dependencies in sequences. RNNs also face limitations
This article aims to provide an overview of the system by related to their short-term memory, excelling at tasks with
touching on its architecture, functionality and benefits. By relatively short sequences but struggling to remember
exploring its potential and practical applications, we aim to information from distant past events.
demonstrate the transformative power of this technology,
which integrates the highest level of intelligence with
humans to edit medical images. Through this research, we Wei Fang et.al[4] proposed a Deep Convolutional
aim to demonstrate the potential of this system to transform Generative Adversarial Network (DCGAN) including a
customer loyalty and error-free operation in healthcare, discriminative network and a generative network, both
ushering in a new era of efficiency and consumer analysis. consisting of four layers. The generator transforms noise
vectors into realistic samples, while the discriminator
distinguishes between real and generated data. The training
2. RELATED WORKS process involves a non-convex game, and an adaptive
learning rate reduction strategy is used to increase the
Tsung-Yu Lin et.al[1], discussed the architecture bilinear training efficiency. However, this approach encounters
pooling to combine the outer product of two features several problems, including training instability, regime
through global averaging, resulting in a covariance matrix collapse (where the generator produces limited diversity),
that captures pairwise interactions. When these two features hyperparameter sensitivity, and the need for significant
computational resources due to its intensive nature.
Abien et.al[5] explores an experimental approach that In Improved Classification of MRI Images for Cervical
combines SVM and CNN for image classification. This Cancer Using CONVOLUTIONAL NEURAL
hybrid architecture uses SVM for binary classification, NETWORKS [9] the paper presents an approach for the
which determines the optimal hyperplane for separating automatic identification of hepatocellular carcinoma. It
two classes. The study considers both L1-SVM and uses an ensemble learning strategy with seven machine
L2-SVM, using the hinge loss as a standard loss function learning algorithms and uses collaborative representation
for SVM. However, this paper highlights problems classification with a boosting technique to categorize
including a lack of comprehensive understanding, potential hyperspectral images. However, this method faces
bias and fairness issues, absence of moral or ethical challenges, including a lack of domain knowledge and
considerations, and long-term memory-related limitations potential limitations in portability to different domains.
in the classification process.
Yiwen Liu, et.al[10] presented the paper that outlines the
An Image Classification Algorithm using AlexNet [6], the methodology for identifying crop diseases and insect pests.
authors present a method for image classification using an It uses transfer learning with VGG16 and
improved version of the AlexNet convolutional neural Inception-ResNet-v2 (CNN) convolutional neural
network (CNN). This approach solves the problem of networks. However, this approach faces challenges,
traditional CNNs with a high number of parameters, including resource intensiveness, the need for model
especially in fully connected layers, which can lead to updates, and complexity associated with fine-tuning.
problems such as overfitting and increased computational
complexity. The proposed enhancement introduces Sota Kato, et.al[11] proposed a study involving 50 cell
deconvolution layers into the AlexNet architecture, images with ground truth provided by Kyoto University.
effectively reducing the number of parameters, especially These images are obtained using a fluorescent marker on
in the fully connected layer. However, it is important to the cell membrane and nucleus of mouse liver and are
note that this improvement comes with increased 256×256 pixels in size. Among these images, 40 were used
computational complexity, the risk of overfitting due to for training, five for validation, and the remaining five for
model complexity, and the requirement for a significant testing. However, it should be noted that the proposed
amount of labeled training data. Achieving a balance method did not significantly improve the cell membrane
between these factors is essential for successful segmentation accuracy.
implementation of the improved AlexNet-based image
classification algorithm. Image Processing for Malaria Screening Using a Deep
Learning Approach [12] by Fetulhak Abdurahm, et.al. The
Yanan Sun, et.al, presented a method that uses a genetic study aims to improve the detection of P. falciparum in
algorithm (GA) to automatically optimize the architecture thick-smear microscopic images using an advanced deep
of convolutional neural networks (CNN). The process starts learning object . detection models. The aim is to address the
by initializing a population of CNN architectures, encoding challenges that were previously identified in this context.
them using skip and pooling layers and omitting fully To achieve this, the paper proposes a tile-based image
connected layers. Although this approach appears processing approach that enables better detection of small
promising, it faces challenges, including reproducibility objects in high-resolution images that exceed the input
issues, domain knowledge requirement, cross-domain resolution limits of the network. However, it should be
transferability difficulties, generalizability limitations, and noted that this method may not achieve a very high level of
data and time resource requirements. detection performance, especially when resizing the
high-resolution input image for detection.
Shengyi Zhao, et.al[8] presented an attention module to
improve model performance with minimal computational Md. Ashiq Mahmood and Tamal Joyti Roy[13], conducted
cost. This module uses the Squeeze-and-Excitation an experiment involving image analysis to establish an
Networks (SENet) method, which automatically assigns assessment process for identifying COVID-19-related lung
weight coefficients to feature channels based on their infection. This approach first involves analyzing the images
importance. This recalibration process reinforces relevant without converting them into binary segments.
feature channels while suppressing irrelevant ones. Subsequently, the authors convert normal images into
However, this approach comes with problems, including binary and collect the relevant values. The experiment also
computational complexity, reduced interpretability, and the uses parameter-based neural networks. However, one of the
need for careful hyperparameter tuning. challenges facing this study is the availability of
insufficient datasets for comprehensive analysis. Image
division is controlled by local segmentation.
3.3.1 EfficientNet B3
Augmenting dataset during training by applying After the image is submitted, the image is handed over to
transformations like rotation, scaling, and flipping we can the deep learning model, EfficientNetB3, for the actual
get a wider variety of data for training which will help in diagnosis. The model analyzes the image to generate a
generalizing the model and prevent overfitting. Data prediction for the medical condition and produces a
augmentation helps the model become robust to variations confidence score to assess the certainty of the diagnosis.
in input data. This step is the core of the application, where the machine
learning model leverages its capabilities to provide
valuable medical insights.
3.3.5 Regularization:
3.4.5. RESULT PRESENTATION:
To prevent overfitting, we have added regularization
techniques such as dropout and weight decay to the model.
EfficientNet B3 has a significant number of parameters, The diagnosis results are presented to the user in a
and regularization can help improve generalization. user-friendly and understandable format. This presentation
includes the predicted medical condition, the associated
confidence score.
3.4. SYSTEM WORKFLOW
3.4.6. DATA SAFETY AND PRIVACY:
3.4.1 USER LOGIN:
Throughout the workflow, robust data safety and privacy
Upon accessing the application, users are presented with measures are upheld. This includes secure data
the login screen. Here, users input their credentials, transmission protocols, temporary data storage during the
typically consisting of an email address and password, user's session, and user authentication. The application
which are securely verified for authentication. Once prioritizes data protection to safeguard sensitive patient
successfully logged in, users gain access to the application's information and medical images, following industry
features. standards and regulations to ensure user data remains
confidential and secure.
3.4.2 HOMEPAGE AND IMAGE UPLOAD :
3.4.7 LOGOUT AND USER MANAGEMENT:
Upon successful login, users are directed to the
application's homepage, which serves as the central hub for Users have the option to log out of their accounts when
initiating the diagnosis process. The homepage is designed they have completed the diagnosis. Additionally, the
to be intuitive and user-friendly, featuring a conspicuous application offers user management features that empower
Upload Image button or area where users can easily select users to maintain their accounts, including the ability to
and submit their medical images for analysis. This step is reset passwords and configure account settings. This
crucial for initiating the diagnosis process and streamlining provides a seamless and user-centric experience while
the user experience. allowing users to have control over their accounts.
4. CONCLUSION [7] Yanan Sun, Bing Xue, Mengjie Zhang, Gary and
Jiancheng Lv, “Automatically Designing CNN
In the culmination of this project, we have successfully Architectures Using Genetic Algorithms for Image
developed a user-centric medical image diagnosis web Classification”. arXiv:1808.03818v3, [cs.NE], 27 march
application. By removing the individual user registration 2020
process we have made sure that there is no unauthorized
access to the tool. This streamlined workflow, coupled with [8] Shengyi Zhao, Yun Peng, Jizhan Liu * and Shuo Wu.
efficient image processing and clear result presentation, “Tomato Leaf Disease Diagnosis Based on Improved
facilitates prompt and comprehensive medical image Convolution Neural Network by Attention Module”.
diagnoses. The integration of the EfficientNetB3 deep Agriculture 2021, 11, 651. 11 July 2021
learning model enhances the accuracy and reliability of the
results, contributing to informed healthcare decisions. As [9] S. Gowri, Judith Justin and R. Vanithamani. S. Gowri,
we move forward, ongoing monitoring and maintenance Judith Justin and R. Vanithamani. “An Improved
will be essential to uphold the application's continued Classification of Mr Images for Cervical Cancer Using
reliability and accuracy. This project exemplifies the Convolutional Neural Networks”. ICTACT journal on
delicate balance between user-friendliness and data image and video processing, volume: 12, issue: 02,
security, underlining its potential as a valuable tool in the november 2021
realm of healthcare, benefiting both medical professionals
and patients.
[10] Yiwen Liu , Xian Zhang, Yanxia Gao,Taiguo Qu, and
Yuanquan Shi. “Improved CNN Method for Crop Pest
Identification Based on Transfer Learning”. Hindawi
Computational Intelligence and Neuroscience Volume
REFERENCE: 2022, Article ID 9709648, Published 16 March 2022
[1] Tsung-Yu LIN, Subhransu Maji, Improved Bilinear [11] Sota Kato Kazuhiro Hottal, “Cell Segmentation by
Pooling with CNNs, arXiv:1707.06772v1 [cs.CV], 21 july Image-to-Image Translation using Multiple Different
2017 Discriminators''.DOI: 10.5220/0009170103300335, Volume
4: BIOSIGNALS, published in 2022
[4] Wei Fang, Feihong Zhang1, Victor S. Sheng3 and [14] Sota Kato Kazuhiro Hotta. Cell Segmentation by
Yewen Ding1. “A Method for Improving CNN-Based Image-to-Image Translation using Multiple Different
Image Recognition Using DCGAN”. CMC, vol.57, no.1, Discriminators. DOI: 10.5220/0009170103300335, Volume
pp.167-178, 2018 4: BIOSIGNALS, published in 2022
[5] Abien Fred M. Agarap. “An Architecture Combining [15] Luyuan Wang Yankui Sun Image classification using
Convolutional Neural Network(CNN) and Support Vector convolutional neural network with wavelet domain inputs.
Machine (SVM) for Image Classification.” IET Image Process, DOI: 10.1049/ipr2.12466, 21 February
arXiv:1712.03541v2, 7 Feb 2019 2022
[6] Shaojuan Li, Lizhi Wang, Jia Li and Yuan Yao. “Image [16] Ahmad Yahya Dawod Aniwat Phaphuangwittayakul
Classification Algorithm Based on Improved AlexNet”. Salita Angkurawaranon. A hybrid method for traumatic
Shaojuan Li et al 2021 J. Phys.: Conf. Ser. 1813 012051 brain injury lesion segmentation. International Journal of
Electrical and Computer Engineering (IJECE) Vol.12,
No.2, April 2022, pp.1437~1448
PLAGIARISM REPORT FOR PHASE 1 PAPER
Medical Image Diagnosis In Healthcare
Utilizing A Web Application
Abstract:
In today's fast-paced world, the need for accurate and effective diagnoses in the healthcare industry has become increasingly
important. The main idea behind diagnostics is to reduce human error in medicine; This is a principle that extends medicine
to other fields, such as satellite Earth observation and understanding operations in space. The main purpose in developing
our work is to provide doctors with reliable tools to predict and solve customers' health problems on the online platform. We
also aim to improve the overall user experience by implementing functions to manage user history and preferences. The core
technologies of our solution include the powerful EfficientNet B3 for neural networks (CNN) and image processing, as well
as extensive use of React.js to create an interactive web front end. Our work focuses on image data and solves problems such
as overhead, hyperparameter sensitivity, and time consumption in image processing. By addressing these issues, our goal is to
provide physicians with fast, accurate results that meet their rapid decision-making and care needs, good patients.
Related Work overuse, limited use for certain situations, and problems
Tsung-Yu Lin et al. [1] discussed the design of bilinear with the use of defined roles.
pooling, which combines the products of two faces with
global competition to form a different matrix leading to the Sing CNN for Cancer Detection (2018) [2] Ishani Dabral et
interaction effect. Although these two features are similar, al. proposed a neural network (CNN) architecture for
they lead to the parallelism of B-CNN, reminiscent of cancer diagnosis targeting the Invasive Ductal Cancer
second-order (O2P), a technique used in semantic (IDC) dataset. This innovation combines convolution,
classification. However, this approach suffers from several output and deconvolution techniques with custom data
limitations, such as sensitivity to hyperparameters, risk of priors. However, it faces problems such as the need for
large data sets, complex models, excessive risk, and A method using the image classification algorithm AlexNet
difficulty in implementing the transformation process. [6], the author proposed a method using the development of
AlexNet convolutional neural network (CNN) for image
A tracking system that recognizes gestures based on gesture classification. This method solves many inconsistency
recognition and surface electromyographic signals (sEMG) problems of traditional CNN, especially in the fully
based on the CNN-RNN hybrid architecture (2018). While connected process, which will cause excessive interference
CNNs extract features from images, RNNs, especially and increase the complexity of the process. This
short-term (LSTM) units, model the body of the object. development introduced the deconvolution process into the
Tracking techniques have been introduced to highlight AlexNet architecture, specifically reducing the number of
important information in sEMG signals. The architecture parameters in all layers. However, it is worth noting that
uses a combination of redundancy, including shock and this development also brings with it an increase in
target redundancy. computer complexity, the risk of overfitting due to model
complexity, and the need for large amounts of training data.
Challenges with this approach include fading and breakage Achieving a balance of these factors is critical for the
gradient issues, which can disrupt training and make successful development of the AlexNet-based image
long-term progress difficult to make. RNNs also face classification algorithm.
limitations regarding short time intervals; They are good at
working with short segments but have difficulty Yanan Sun et al. proposed a method to optimize the
remembering information about events in the distant past. performance of convolutional neural networks (CNN) using
genetic algorithms (GA). The process starts by initializing a
set of CNN architectures, coding them layer by layer and
Wei Fang et al. The generator transforms noise vectors into removing entire layers. Although this approach seems
real patterns, while the operator distinguishes between real promising, it faces challenges such as reproducibility
and generated data. Training methods include non-convex issues, information requirements, flexible adaptations,
games and adaptive learning reduction strategies are used overall limits, material and time requirements.
to improve learning performance. However, this approach
suffers from several problems such as training instability, Shengyi Zhu et al. This model uses the Compression and
poor performance (the generator is diverse), excessive Excitation Network (SENet) method to assign weight
hyperparameter sensitivity, and requires significant coefficients to channels based on their importance. This
computer services due to its severe nature. recalibration process improves the performance of the film
without any particular problems. However, this approach
Abien et al[5] discussed an experimental method still suffers from many problems, such as computational
combining SVM and CNN for image classification. This complexity, reduced interpretability, and the need for
hybrid architecture uses SVM for binary classification to careful tuning of hyperparameters.
determine the best hyperplane separating two classes. This
work considers both L1-SVM and L2-SVM by using hinge In improving the classification of Cervical Cancer MRI
loss as the loss model for SVM. However, this article Images Using Convolutional Neural Networks [9] A report
highlights issues such as lack of understanding of the on a method for identifying Hepatocellular carcinoma. It
distribution process, issues of injustice and justice, ethics or uses a learning concept combined with seven machine
lack thereof, and limitations with long-term memory. learning algorithms and uses collaborative representation
classification and optimization techniques to classify
hyperspectral images. However, this approach faces
challenges such as lack of information and the ability to Kumar, et.al[14] proposed a facial recognition system using
limit movement between different locations. Local Binary Pattern algorithm; it focuses on local patterns,
comparing each pixel to its neighbors and encoding the
Yiwen Liu et al. It uses VGG16 and Inception-ResNet-v2 results into a binary format. This method might be sensitive
(CNN) convolutional neural networks for transfer learning. to change in the lighting and the change in poses. While
However, this approach faces challenges such as resource efficientNetB3 being a deep learning model can learn the
consumption, need for updated models, and difficulties hierarchical features in an efficient way, and can handle the
with optimization. change in pose, expressions and lighting.
Medical imaging often requires manual guidance that By improving the data during training using
includes identifying regions of interest (ROIs) and transformations such as rotation, scaling, and translation,
abnormalities. This work is usually done by an electrician we can obtain more training data, which will help make the
or trained scribe to ensure the quality and accuracy of the model larger and avoid collisions. Data augmentation helps
recorded data. Establishing a ground truth with accurate the model make changes to the input data.
descriptions and diagnoses for each image is important
because this forms the basis for training the AI model and
3.3.5 Regularization:
allows it to make future diagnoses earlier.
Using advanced learning models such as EfficientNet B3 is The image loading and processing phase is the main work
one method in deep learning. Pre-trained weight models to ensure that the system can analyze the medical image
can be used as a starting point for many types of image well. Tell users to click the "Upload Image" button and
analysis on large data sets. This can increase training speed select a medical image from their device. Once uploaded,
and potentially improve performance, especially if you the system continues to process images for further analysis.
have limited clinical experience. This preprocessing may include transforming and
normalizing to ensure that the image is in the appropriate
3.3.3 Fine-Tuning: format and size required by the machine learning model.
3.4.4 DIAGNOSIS PROCESS:
Diagnostic results are presented to users in a user-friendly After that, the user will have to select the diseases they are
and easy-to-understand format. This display includes willing to diagnose. There will be a list available form
predicted treatments and confidence scores. which the user can select.
[6] Shaojuan Li, Lizhi Wang, Jia Li and Yuan Yao. “Image
Classification Algorithm Based on Improved AlexNet”.
Shaojuan Li et al 2021 J. Phys.: Conf. Ser. 1813 012051
5. CONCLUSION
[8] Shengyi Zhao, Yun Peng, Jizhan Liu * and Shuo Wu.
“Tomato Leaf Disease Diagnosis Based on Improved
With the completion of this project, we completed the Convolution Neural Network by Attention Module”.
development of the client's web analytics application. By Agriculture 2021, 11, 651. 11 July 2021
removing the user registration process, we ensure that there
is no unauthorized access to the device. This simple
[9] S. Gowri, Judith Justin and R. Vanithamani. S. Gowri,
process, combined with image processing and presentation
Judith Justin and R. Vanithamani. “An Improved
of clear results, facilitates timely and successful diagnoses.
Classification of Mr Images for Cervical Cancer Using
Integration of the EfficientNetB3 deep learning model
Convolutional Neural Networks”. ICTACT journal on
helps make clinical decisions by increasing the accuracy
image and video processing, volume: 12, issue: 02,
and reliability of results. As we move forward, ongoing
november 2021
monitoring and maintenance is critical to maintain the
reliability and accuracy of the application. By
demonstrating the balance between user-friendliness and [10] Yiwen Liu , Xian Zhang, Yanxia Gao,Taiguo Qu, and
information security, the project demonstrates its potential Yuanquan Shi. “Improved CNN Method for Crop Pest
to be a useful, important tool in medicine for doctors and Identification Based on Transfer Learning”. Hindawi
patients. Computational Intelligence and Neuroscience Volume
2022, Article ID 9709648, Published 16 March 2022
REFERENCE:
[11] Sota Kato Kazuhiro Hottal, “Cell Segmentation by 2023,pp.1-6, doi: 10.1109/ViTECoN58111.2023.10157843.
Image-to-Image Translation using Multiple Different
Discriminators''.DOI: 10.5220/0009170103300335, Volume [15] Luyuan Wang Yankui Sun Image classification using
4: BIOSIGNALS, published in 2022 convolutional neural network with wavelet domain inputs.
IET Image Process, DOI: 10.1049/ipr2.12466, 21 February
[12] Fetulhak Abdurahm Kinde Anlay Fante. “Tile-based 2022
microscopic image processing for malaria screening using
deep learning approach”. April 19th, 2022 [16] Ahmad Yahya Dawod Aniwat Phaphuangwittayakul
Salita Angkurawaranon. A hybrid method for traumatic
[13] Md. Ashiq Mahmood Tamal Joyti Roy. “A Hybrid brain injury lesion segmentation. International Journal of
Approach to Find COVID-19 Related Lung Infection Electrical and Computer Engineering (IJECE) Vol.12,
Utilizing 2 Bit Image Processing, 19 February 2022. No.2, April 2022, pp.1437~1448
3. Students will attain a hands-on level in changing a small novel idea / method
right into an operating model / prototype related to multidisciplinary abilities and /
or understanding and operating in a team.
4. Students will be able to interpret the outcome of their project. Students will take
on the challenges of teamwork, prepare a presentation in a professional manner,
and document all aspects of design work.
CO1: On completion the students are capable of executing the proposed plan and
become aware of and overcome the bottlenecks throughout every stage.
CO2: On completion of the project work students could be in a role to take on any
difficult sensible issues and locate answers through formulating the right
methodology.
CO3: Students will attain a hands-on level in changing a small novel idea / method
right into an operating model / prototype related to multi- disciplinary abilities and
/ or understanding and operating in a team.
CO4: Students will be able to interpret the outcome of their project. Students will
take on the challenges of teamwork, prepare a presentation in a professional
manner, and document all aspects of design work.