SSRN Id3909350
SSRN Id3909350
SSRN Id3909350
com/abstract=3909350
FACE MASK DETECTION USING DEEP LEARNING
Mohammed Rizwan s
(Jamia Hamdard University)
INTRODUCTION
The pattern of wearing face mask increase because of the COVID-19. Before
Covid-19, People used to wear faces mask to protect themselves from air pollution. While
some people wear a face mask to protect their face skin. COVID badly affected lifestyles of
the people all over the world. According to researchers, a Face mask can help us to protect
ourselves from the virus. Face mask blocks the bacteria’s outside during respiration and
deals with blocking the transmission. It has been noted that Mask had played a major
role before COVID to stop the spreading of other diseases, such as Tuberculosis, etc.
COVID badly affected the economy of many countries.
The government of all the countries is taking necessary steps to stop the spreading.
AI and Machine Learning can assist with battling Covid-19 in many aspects. Here, we
present the solution for the identification of the Cover face mask. This project model is
prepared using the technology of Computer Vision, Deep Learning, and Machine
Learning. The model is trained to identify whether the individual is covering his/her face
with a mask or not. The model can be used with cameras placed in public places like
markets, malls, parks, metro stations, etc. The model is incorporation between DEEP
learning and traditional AI strategies with OpenCV, tensor stream, and Keras. This
project can be used for real-time detection in cameras.
1. SOFTWARES DESCRIPTION
1.1 SOFTWARE AND TEXT EDITOR
Jupyter notebook is open-source IDE that is used to create jupyter document that can
be created and shared with live. It is an open-source web application that allow data scientist
to create and share documents that integrate live code,equation, computational
output,visualization and multimedia resource, along with explanatory text in a single
document.The Jupyter notebook can support various languages that are popular in data
science such as Python,Julia ,Scala,R, etc.
i. Make prediction
The benefits of machine learning are that machines can discover and learn more effectively by
humans by using complex computational methods.
Supervised learning: In Supervised Learning, we have the labeled data that we know to
know the target before our objective. Algorithm of dependent variable which is to be
predicted to a given set of independent variables. Ex: Regression ,Classifications
Unsupervised learning: Algorithm in which we don’t have any target to predict. Here
we use un-labeled data. Ex: Clustering.
Applications:
1. Robotics :
2. Security Systems
3. Medical Surgery
Deep Learning is a subfield of machine learning which is used for a large and complex
dataset. Here we use more advance and efficient algorithms to tackle the program effectively.
Deep Learning makes the process of interpreting a large amount of data much faster and easier.
1.2.4 OpenCV
Its applications include detecting a specific object, analyzing the video and performing feature
detection, Street view image, and analysis, Robot car navigation.
1.2.5 TENSORFLOW
Applications
1. Digit Classification
2. Image Recognization
3. Text Summarization
1.2.6 KERAS
Features:
2.2.7. MobileNetV2
3. HARDWARE DESCRIPTION
Webcam:-
Fig.4. Webcam
4. Methodology
4.1 Dataset
The dataset which we have used comprises 3550 total pictures out of which 1805 are of
the picture are with facemask and 1786 pictures without a facemask. All the pictures in this
dataset are real pictures. The pictures are taken from different angles so that our model should
identity from them clearly. The dataset for this project is taken from Kaggle one of the best
platforms for data science and machine learning. From every one of the three sources, the
extent of the pictures is equivalent.
We need to divide our dataset into three different sections: preparing dataset, test
dataset, and approval dataset. Preparing dataset is used to train our model and a test dataset is
used to test our model. The training model uses the different images and training itself
accordingly. Training. After the completion of the Training model, we use a test dataset to
check our model whether it's working well or not. Datasets are divided into different ratios.
Here, In this project, the dataset is divided into 4:1, which means 80% of the data is used as
preparing dataset or training dataset and the remaining 20% is used as a test dataset. The reason
to use 80% data as a training dataset is “More the training dataset more accurate will be our
model”. The split proportion of the dataset is 0.8:0.2.
At the planning time, for each pixel, we consider the default bounding boxes having
different sizes and point of view extents with ground truth boxes finally use the Intersection
over Union (IoU) technique to pick the best organizing with box. IoU evaluates the number of
bits of our expected box that matches with the ground reality. The characteristics range from 0
to 1 and growing potential gains of IoU choose the exactness in the figure; the best worth being
the most vital worth of IoU. The condition and pictorial depiction of IoU are given as follow:
We took a stab at utilizing three distinctive base models for identifying 'cover' or 'no
veil'. The activity was done to track down the best fit model in our situation. The assessment
interaction comprises of first taking a gander at the grouping report which gives us knowledge
towards exactness, review, and F1 score. The conditions of these three measurements are as per
the following:
Positives + Negatives
Using these three metrics, we can conclude which model is performing most efficiently. The
I implemented this model on different images containing one and more faces. I also
implemented it on live streams video using my laptop camera in different situations
1. Wearing a mask
2. Without wearing the mask
The results in both situations are taken as Screenshots.
6. Limitation
Though technology is bringing revolution in the world .There are many benefits like
automation of cars, Robots etc. On the other hand it has many drawbacks. The first drawback of
this project is its security. This project is working like detective , capturing the data of people.
If anyone get access to this data he/she can use it in wrong way.Secondly, the resolution power
is less. In public places like Malls, Market and the other crowded area this project might fail. It
require storage like hard disk to store the data which make it costly. And the cost of
maintenance is high.
Custom alerts can be sent to the person with or without a face mask or the one whose
face is unrecognizable in the admin system. No need to install any hardware as the system can
be connected with your existing surveillance system only. The can implemented easily with
any camera or hardware like surveillance cameras. The system restricts access for those not
wearing the masks and notifies the authorities. You can customize the face mask detection
system based on your business requirements. You can check the analytics based on the system
generated reports.Easy to access and control the movements from any device through face
mask detection applications. Partially occluded faces either with mask or hair or hand, can be
easily detected.
8. Conclusion