Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
review-article

Face Mask-Wearing Detection Model Based on Loss Function and Attention Mechanism

Published: 01 January 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Face mask-wearing detection is of great significance for safety protection during the epidemic. Aiming at the problem of low detection accuracy due to the problems of occlusion, complex illumination, and density in mask-wearing detection, this paper proposes a neural network model based on the loss function and attention mechanism for mask-wearing detection in complex environments. Based on YOLOv5s, we first introduce an attention mechanism in the feature fusion process to improve feature utilization, study the effect of different attention mechanisms (CBAM, SE, and CA) on improving deep network models, and then explore the influence of different bounding box loss functions (GIoU, CIoU, and DIoU) on mask-wearing recognition. CIoU is used as the frame regression loss function to improve the positioning accuracy. By collecting 7,958 mask-wearing images and a large number of images of people without masks as a dataset and using YOLOv5s as the benchmark model, the mAP of the model proposed in the paper reached 90.96% on the validation set, which is significantly better than the traditional deep learning method. Mask-wearing detection is carried out in a real environment, and the experimental results of the proposed method can meet the daily detection requirements.

    References

    [1]
    C. W. Yang, T. H. Phung, H. H. Shuai, and W. H. Cheng, “Mask or non-mask? robust face mask detector via triplet-consistency representation learning,” ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 18, no. 1s, pp. 1–20, 2022.
    [2]
    B. Wang, Y. Zhao, and C. L. P. Chen, “Hybrid transfer learning and broad learning system for wearing mask detection in the covid-19 era,” IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1–12, 2021.
    [3]
    H. Goyal, K. Sidana, C. Singh, A. Jain, and S. Jindal, “A real time face mask detection system using convolutional neural network,” Multimedia Tools and Applications, vol. 81, pp. 1–17, 2022.
    [4]
    J. Zhang, F. Han, and Y. Chun, “A novel detection framework about conditions of wearing face mask for helping control the spread of covid-19,” IEEE Access, vol. 9, pp. 42975–42984, 2021.
    [5]
    A. Kumar, A. Kalia, A. Sharma, and M. Kaushal, “A hybrid tiny YOLO v4-SPP module based improved face mask detection vision system,” Journal of Ambient Intelligence and Humanized Computing, pp. 1–14, 2021.
    [6]
    Z. Zhao, K. Hao, X. Ma, X. Liu, T. Zheng, J. Xu, and S. Cui, “SAI-YOLO: a lightweight network for real-time detection of driver mask-wearing specification on resource-constrained devices,” Computational Intelligence and Neuroscience, vol. 2021, 15 pages, 2021.
    [7]
    R. Girshick, J. Donahue, and T. Darrell, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587, Columbus, OH, USA, June 2014.
    [8]
    R. Girshick, “Fast R-CNN,” in Proceedings of the IEEE international conference on computer vision, pp. 1440–1448, Santiago, CI, USA, December 2015.
    [9]
    S. Ren, K. He, and R. Girshick, “FasterR. Cnn: towards real-time object detection with region proposal networks,” in Proceedings of the Advances in Neural Information Processing Systems, pp. 91–99, Grenada, Spain, June 2015.
    [10]
    J. Dai, Y. Li, and K. He, “R-fcn: object detection via region-based fully convolutional networks,” in Proceedings of theAdvances in Neural Information Processing Systems, New York, NY, USA, December 2016.
    [11]
    Z. Cai, N. Vasconcelos, and R.-C. N. N. Cascade, “Delving into high quality object detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6154–6162, Salt Lake City, UT, USA, June 2018.
    [12]
    L. Xie, Y. Liu, L. Jin, and Z. Xie, “DeRPN: taking a further step toward more general object detection,” in Proceedings of the AAAI Conference on Artificial Intelligence, pp. 9046–9053, Honolulu, HI, USA, February 2019.
    [13]
    W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, and A. C. Berg, “Ssd: Single Shot Multibox detector,” in Proceedings of the European Conference on Computer Vision, pp. 21–37, Munich, Germany, October 2016.
    [14]
    J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: unified, real-time object detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788, Las Vegas, NV, USA, June 2016.
    [15]
    J. Redmon and A. Farhadi, “YOLO9000 better faster stronger,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 7263–7271, Vegas, NV, USA, June 2016.
    [16]
    T. Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, “Focal loss for dense object detection,” in Proceedings of the IEEE International Conference On Computer Vision, vol. 42, pp. 2980–2988, Chongqing, China, July 2020.
    [17]
    S. Zhang, L. Wen, and X. Bian, “Single-shot refinement neural network for object detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4203–4212, Lake City, UT, USA, June 2018.
    [18]
    X. Fan and M. Jiang, “A single stage face mask detector for assisting control of the COVID-19 pandemic,” in Proceedings of the 2021 IEEE international conference on systems, man, and cybernetics (SMC), IEEE, pp. 832–837, Melbourne, Australia, October 2021.
    [19]
    G. Jignesh Chowdary, N. S. Punn, S. K. Sonbhadra, and S. Agarwal, “Face mask detection using transfer learning of inceptionv3,” in Proceedings of the International Conference on Big Data Analytics, pp. 81–90, Ahmedabad, India, December 2020.
    [20]
    M. M. Rahman, M. M. H. Manik, M. M. Islam, S. Mahmud, and J. H. Kim, “An Automated System to Limit COVID-19 Using Facial Mask Detection in Smart City network,” in Proceedings of the 2020 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS), IEEE, Vancouver, Canada, September 2020.
    [21]
    M. Loey, G. Manogaran, M. H. N. Taha, and N. E. M. Khalifa, “A hybrid deep transfer learning model with machine learning methods for face mask detection in the era of the COVID-19 pandemic,” Measurement, vol. 167, 2021.
    [22]
    X. Ren and X. Liu, “Mask wearing detection based on YOLOv3[C]//Journal of physics: conference series,” IOP Publishing, vol. 1678, no. 1, 2020.
    [23]
    M. H. Guo, T. X. Xu, J. J. Liu, Z. N. Liu, P. T. Jiang, T. J. Mu, S. H. Zhang, R. R. Martin, M. M. Cheng, and S. M. Hu, “Attention mechanisms in computer vision: a survey,” Computational Visual Media, vol. 8, no. 3, pp. 331–368, 2022.
    [24]
    V. Mnih, N. Heess, and A. Graves, “Recurrent models of visual attention,” in Proceedings of the Advances in Neural Information Processing Systems, pp. 2204–2212, New York, NY, USA, December 2014.
    [25]
    M. Jaderberg, K. Simonyan, and A. Zisserman, “Spatial transformer networks,” in Proceedings of the Advances in Neural Information Processing Systems, pp. 2017–2025, New York, NY, USA, December 2015.
    [26]
    X. Zhu, H. Hu, S. Lin, and J. Dai, “Deformable convnets v2: more deformable, better results,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Beach, CA, USA, June 2019.
    [27]
    J. Hu, L. Shen, S. Albanie et al., “Squeeze-and-Excitation networks,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 8, pp. 2011–2023.
    [28]
    S. Woo, J. Park, J. Y. Lee, and I. S Kweon, “Cbam: Convolutional Block Attention module,” European Conference on Computer Vision (ECCV), pp. 3–19, 2018.
    [29]
    Q. Hou, D. Zhou, and J. Feng, “Coordinate attention for efficient mobile network design,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722, Nashville, TN, USA, June 2021.
    [30]
    X. Wang, R. Girshick, A. Gupta, and K. He, “Non-local Neural networks,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7794–7803, Lake City, UT, USA, June 2018.
    [31]
    A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, and M. Dehghani, “An image is worth 16x16 words: transformers for image recognition at scale,” in Proceedings of the International conference on learning representations, pp. 1–21, Vienna, Austria, May 2021.
    [32]
    H. Rezatofighi, N. Tsoi, and J. Y. Gwak, “Generalized Intersection over union: A Metric and a Loss for Bounding Box regression,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Beach, CA, USA, June 2019.
    [33]
    Z. Zheng, P. Wang, and W. Liu, “Distance-IoU loss: faster and better learning for bounding box regression,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 7, pp. 12993–13000, April 2020.
    [34]
    J. He, S. Erfani, X. Ma, J. Bailey, Y. Chi, and X. S. Hua, “Alpha-IoU: a family of power intersection over union losses for bounding box regression,” Advances in Neural Information Processing Systems, pp. 1–13, 2021.
    [35]
    H. Zhang, Y. Wang, F. Dayoub, and N. Sunderhauf, “Varifocalnet: an iou-aware dense object detector,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Ashville, TN, USA, June 2021.
    [36]
    H. A. H. Mahmoud, N. S. Alghamdi, and A. H. Alharbi, “Real time feature extraction deep-CNN for mask detection,” Intelligent Automation and Soft Computing, vol. 31, pp. 1423–1434, 2022.
    [37]
    S. Xu, “An improved lightweight yolov5 model based on attention mechanism for face mask detection,” 2022, https://arxiv.org/abs/2203.16506.
    [38]
    F. Mercaldo and A. Santone, “Transfer learning for mobile real-time face mask detection and localization,” Journal of the American Medical Informatics Association, vol. 28, no. 7, pp. 1548–1554, 2021.
    [39]
    X. Su, M. Gao, J. Ren, Y. Li, M. Dong, and X. Liu, “Face mask detection and classification via deep transfer learning,” Multimedia Tools and Applications, vol. 81, no. 3, pp. 4475–4494, 2022.
    [40]
    M. F. S. Sabir, I. Mehmood, W. A. Alsaggaf, E. F. Khairullah, S. Alhuraiji, A. S. Alghamdi, and A. El-Latif, “An automated real-time face mask detection system using transfer learning with faster-rcnn in the era of the covid-19 pandemic,” Computers, Materials & Continua, vol. 71, pp. 4151–4166, 2022.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Computational Intelligence and Neuroscience
    Computational Intelligence and Neuroscience  Volume 2022, Issue
    2022
    32229 pages
    ISSN:1687-5265
    EISSN:1687-5273
    Issue’s Table of Contents
    This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

    Publisher

    Hindawi Limited

    London, United Kingdom

    Publication History

    Published: 01 January 2022

    Qualifiers

    • Review-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 0
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media