Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3532213.3532298acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaiConference Proceedingsconference-collections
research-article

Prohibited items detection on X-ray images with multi-task learning

Published: 13 July 2022 Publication History

Abstract

Inspection by X-ray has a wide range of applicability in airports, subways, railway stations, bus stations, and shipping ports. However, using X-ray images during security inspections is challenging because the X-ray image containing prohibited items (positive samples) in the real-world security inspection scenario is much smaller than those that do not contain prohibited items (negative samples). This results in low detection efficiency of the auto security inspection and many false detections. Considering this problem, this paper proposes detecting and recognizing prohibited items on X-ray images in real-world security inspections based on multi-task learning (MTL). This method uses an image classification network to filter out most X-ray images that do not contain prohibited items, thus reducing false detections and improving detection efficiency. MTL simultaneously trains the image classification network and the detection network of prohibited items. In addition, the backbone of the MTL network adopts a model sharing method that can reduce model parameters and improve the convergence speed of the model. Based on a dataset of security inspection X-ray images where the ratio of negative samples to positive samples is 100 (SIXray100), we performed data augmentation, estimated the target anchor of the training set, and used the concept of multi-scale training to complete the model training. Compared with the class-balanced hierarchical refinement method (CHR) that proposed a benchmark in detection, our method shows an improvement of 2% to 5%. Moreover, as the number of test images increases, our approach will obtain results that significantly exceed those from previous research. At the same time, this paper’s strategy for detecting prohibited items and its shared neural network framework have significant application value in engineering.

References

[1]
Trapit Bansal, David Belanger, and Andrew McCallum. 2016. Ask the gru: Multi-task learning for deep text recommendations. In proceedings of the 10th ACM Conference on Recommender Systems. 107–114.
[2]
Muhammet Baştan, Mohammad Reza Yousefi, and Thomas M Breuel. 2011. Visual words on baggage X-ray images. In International Conference on Computer Analysis of Images and Patterns. Springer, 360–368.
[3]
Zhaowei Cai and Nuno Vasconcelos. 2018. Cascade r-cnn: Delving into high quality object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 6154–6162.
[4]
Junkun Chen, Xipeng Qiu, Pengfei Liu, and Xuanjing Huang. 2018. Meta multi-task learning for sequence modeling. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
[5]
Daxiang Dong, Hua Wu, Wei He, Dianhai Yu, and Haifeng Wang. 2015. Multi-task learning for multiple language translation. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 1723–1732.
[6]
Mark Everingham, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. 2010. The pascal visual object classes (voc) challenge. International journal of computer vision 88, 2 (2010), 303–338.
[7]
Yona Falinie A Gaus, Neelanjan Bhowmik, Samet Akcay, and Toby Breckon. 2019. Evaluating the transferability and adversarial discrimination of convolutional neural networks for threat object detection and classification within x-ray security imagery. In 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA). IEEE, 420–425.
[8]
Lei Han and Yu Zhang. 2015. Learning multi-level task groups in multi-task learning. In Twenty-Ninth AAAI Conference on Artificial Intelligence.
[9]
Taimur Hassan, Salman H Khan, Samet Akcay, Mohammed Bennamoun, and Naoufel Werghi. 2019. Deep CMST Framework for the Autonomous Recognition of Heavily Occluded and Cluttered Baggage Items from Multivendor Security Radiographs. CoRR, dec 14(2019), 17.
[10]
Taimur Hassan, Muhammad Shafay, Samet Akçay, Salman Khan, Mohammed Bennamoun, Ernesto Damiani, and Naoufel Werghi. 2020. Meta-Transfer Learning Driven Tensor-Shot Detector for the Autonomous Localization and Recognition of Concealed Baggage Threats. Sensors 20, 22 (2020), 6450.
[11]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
[12]
Geremy Heitz and Gal Chechik. 2010. Object separation in X-ray image sets. In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, 2093–2100.
[13]
Benyi Hu, Chi Zhang, Le Wang, Qilin Zhang, and Yuehu Liu. 2020. Multi-label X-ray Imagery Classification via Bottom-up Attention and Meta Fusion. In Proceedings of the Asian Conference on Computer Vision.
[14]
Gao Huang, Zhuang Liu, Laurens Van Der Maaten, and Kilian Q Weinberger. 2017. Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4700–4708.
[15]
Rafflesia Khan, Tarannum Fariha Raisa, and Rameswar Debnath. 2018. An efficient contour based fine-grained algorithm for multi category object detection. Journal of Image and Graphics 6, 2 (2018), 127–136.
[16]
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision. 2980–2988.
[17]
Shikun Liu, Edward Johns, and Andrew J Davison. 2019. End-to-end multi-task learning with attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1871–1880.
[18]
Ying Liu, Luyao Geng, Weidong Zhang, Yanchao Gong, and Zhijie Xu. 2021. Survey of Video Based Small Target Detection. Journal of Image and Graphics 9, 4 (2021).
[19]
Domingo Mery and Carlos Arteta. 2017. Automatic defect recognition in x-ray testing using computer vision. In 2017 IEEE winter conference on applications of computer vision (WACV). IEEE, 1026–1035.
[20]
Domingo Mery and Aggelos K Katsaggelos. 2017. A logarithmic x-ray imaging model for baggage inspection: Simulation and object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 57–65.
[21]
Domingo Mery, Daniel Saavedra, and Mukesh Prasad. 2020. X-Ray Baggage Inspection With Computer Vision: A Survey. IEEE Access 8(2020), 145620–145633.
[22]
Domingo Mery, Erick Svec, and Marco Arias. 2016. Object recognition in X-ray testing using adaptive sparse representations. Journal of Nondestructive Evaluation 35, 3 (2016), 1–9.
[23]
Domingo Mery, Erick Svec, Marco Arias, Vladimir Riffo, Jose M Saavedra, and Sandipan Banerjee. 2016. Modern computer vision techniques for x-ray testing in baggage inspection. IEEE Transactions on Systems, Man, and Cybernetics: Systems 47, 4(2016), 682–692.
[24]
Caijing Miao, Lingxi Xie, Fang Wan, chi Su, Hongye Liu, jianbin Jiao, and Qixiang Ye. 2019. SIXray: A Large-scale Security Inspection X-ray Benchmark for Prohibited Item Discovery in Overlapping Images. In CVPR.
[25]
Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, and Martial Hebert. 2016. Cross-stitch networks for multi-task learning. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3994–4003.
[26]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems 28 (2015), 91–99.
[27]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems 28 (2015), 91–99.
[28]
Vladimir Riffo, Sebastian Flores, and Domingo Mery. 2017. Threat objects detection in X-ray images using an active vision approach. Journal of Nondestructive Evaluation 36, 3 (2017), 1–13.
[29]
Anders Søgaard and Yoav Goldberg. 2016. Deep multi-task learning with low level tasks supervised at lower layers. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 231–235.
[30]
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2818–2826.
[31]
Zhenhui Tang, Engang Tian, Yongxiong Wang, Licheng Wang, and Taicheng Yang. 2020. Nondestructive Defect Detection in Castings by Using Spatial Attention Bilinear Convolutional Neural Network. IEEE Transactions on Industrial Informatics 17, 1 (2020), 82–89.
[32]
Simon Vandenhende, Stamatios Georgoulis, Wouter Van Gansbeke, Marc Proesmans, Dengxin Dai, and Luc Van Gool. 2021. Multi-task learning for dense prediction tasks: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
[33]
Qian Wang, Neelanjan Bhowmik, and Toby P Breckon. 2020. On the evaluation of prohibited item classification and detection in volumetric 3d computed tomography baggage security screening imagery. In 2020 International Joint Conference on Neural Networks (IJCNN). IEEE, 1–8.
[34]
Qian Wang, Khalid N Ismail, and Toby P Breckon. 2020. An approach for adaptive automatic threat recognition within 3D computed tomography images for baggage security screening. Journal of X-ray Science and Technology 28, 1 (2020), 35–58.
[35]
Jian Zhang, Li Zhang, Ziran Zhao, Yaohong Liu, Jianping Gu, Qiang Li, and Duokun Zhang. 2014. Joint shape and texture based x-ray cargo image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 266–273.
[36]
Yue Zhu, Yutao Zhang, Haigang Zhang, Jinfeng Yang, and Zihao Zhao. 2020. Data augmentation of X-ray images in baggage inspection based on generative adversarial networks. IEEE Access 8(2020), 86536–86544.
  1. Prohibited items detection on X-ray images with multi-task learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCAI '22: Proceedings of the 8th International Conference on Computing and Artificial Intelligence
    March 2022
    809 pages
    ISBN:9781450396110
    DOI:10.1145/3532213
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 July 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Multi-task learning
    2. Prohibited items detection
    3. Real-world security inspection scenario
    4. X-ray images

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICCAI '22

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 56
      Total Downloads
    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 21 Sep 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media