YoloMask: An Enhanced YOLO Model for Detection of Face Mask Wearing Normality, Irregularity and Spoofing

Cao, Zhicheng; Li, Wenlong; Zhao, Heng; Pang, Liaojun

doi:10.1007/978-3-031-20233-9_21

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13628)

Included in the following conference series:

Chinese Conference on Biometric Recognition

Abstract

Wearing surgical face masks has become the new norm of daily life during the COVID-19 pandemic. In many public places it is necessary to check or monitor whether a face mask is worn properly. Manual inspection of mask wearing not only wastes manpower but also cannot provide round-the-clock, real-time monitoring, which creates an urgent need for automatic mask-wearing detection technology. Earlier automatic methods use a successive approach in which the face is detected first and the mask is then located and judged. More recent methods follow the end-to-end paradigm, using successful and well-known CNN models from the field of object detection. However, these methods fail to consider the diversity of face mask wearing, such as different kinds of irregularity and spoofing. In this study, we therefore introduce a comprehensive mask-wearing detection dataset, named Diverse Masked Faces, that distinguishes a total of five different classes of mask wearing. We then adapt the YOLOX model to this task and further improve it with a new composite loss that merges the CIoU and alpha-IoU losses and inherits the advantages of both. The improved model is referred to as YoloMask. The proposed method was tested on the new dataset and was shown to significantly outperform other state-of-the-art (SOTA) methods in the literature, both successive and end-to-end.
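The abstract does not give the exact form of the composite loss, so the following is only an illustrative sketch of one common way to merge the two ideas: the alpha-CIoU variant described in the alpha-IoU literature, which raises each CIoU term to a power alpha. The function name alpha_ciou_loss, the (x1, y1, x2, y2) box layout, the default alpha = 3 and the eps stabiliser are assumptions made for this example and are not taken from the paper.

```python
import math


def alpha_ciou_loss(box_p, box_g, alpha=3.0, eps=1e-9):
    """Illustrative alpha-CIoU loss for two axis-aligned boxes.

    Boxes are (x1, y1, x2, y2) tuples. This is an assumed formulation,
    not necessarily the composite loss used in YoloMask.
    """
    px1, py1, px2, py2 = box_p
    gx1, gy1, gx2, gy2 = box_g

    # Intersection over union of the predicted and ground-truth boxes.
    iw = max(0.0, min(px2, gx2) - max(px1, gx1))
    ih = max(0.0, min(py2, gy2) - max(py1, gy1))
    inter = iw * ih
    area_p = (px2 - px1) * (py2 - py1)
    area_g = (gx2 - gx1) * (gy2 - gy1)
    iou = inter / (area_p + area_g - inter + eps)

    # Squared diagonal of the smallest box enclosing both boxes.
    cw = max(px2, gx2) - min(px1, gx1)
    ch = max(py2, gy2) - min(py1, gy1)
    c2 = cw * cw + ch * ch + eps

    # Squared distance between the two box centres.
    rho2 = ((px1 + px2) / 2 - (gx1 + gx2) / 2) ** 2 \
         + ((py1 + py2) / 2 - (gy1 + gy2) / 2) ** 2

    # Aspect-ratio consistency term v and its trade-off weight beta (from CIoU).
    v = (4 / math.pi ** 2) * (
        math.atan((gx2 - gx1) / (gy2 - gy1 + eps))
        - math.atan((px2 - px1) / (py2 - py1 + eps))
    ) ** 2
    beta = v / ((1 - iou) + v + eps)

    # Each CIoU term is raised to the power alpha; alpha = 1 gives plain CIoU.
    return 1 - iou ** alpha + (rho2 / c2) ** alpha + (beta * v) ** alpha
```

Under these assumptions, calling alpha_ciou_loss(pred, target, alpha=1.0) reduces to the ordinary CIoU loss, while larger values of alpha change how strongly the IoU, centre-distance and aspect-ratio terms each contribute to the loss.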

Acknowledgments

We gratefully acknowledge the financial support of the Natural Science Foundation of China (NSFC No. 61906149), the Natural Science Basic Research Program of Shaanxi (Program No. 2021JM-136), the Natural Science Foundation of Chongqing (cstc2021jcyj-msxmX1068), the Xi’an Science and Technology Program (No. 21RGSF0011) and the Fundamental Research Funds for the Central Universities (No. QTZX22072).

Author information

Authors and Affiliations

School of Life Science and Technology, Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, Xidian University, Xi’an, 710126, Shaanxi, China
Zhicheng Cao, Wenlong Li, Heng Zhao & Liaojun Pang

Corresponding author

Correspondence to Liaojun Pang.

Editor information

Editors and Affiliations

Beijing University of Posts and Telecommunications, Beijing, China
Weihong Deng
Tsinghua University, Beijing, China
Jianjiang Feng
Beihang University, Beijing, China
Di Huang
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Meina Kan
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Tsinghua University, Beijing, China
Fang Zheng
China Electronics Standardization Institute, Beijing, China
Wenfeng Wang
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Zhaofeng He

About this paper

Cite this paper

Cao, Z., Li, W., Zhao, H., Pang, L. (2022). YoloMask: An Enhanced YOLO Model for Detection of Face Mask Wearing Normality, Irregularity and Spoofing. In: Deng, W., et al. Biometric Recognition. CCBR 2022. Lecture Notes in Computer Science, vol 13628. Springer, Cham. https://doi.org/10.1007/978-3-031-20233-9_21

DOI: https://doi.org/10.1007/978-3-031-20233-9_21
Published: 03 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20232-2
Online ISBN: 978-3-031-20233-9
eBook Packages: Computer Science, Computer Science (R0)
