Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3512527.3531383acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Automatic Visual Recognition of Unexploded Ordnances Using Supervised Deep Learning

Published: 27 June 2022 Publication History

Abstract

Unexploded Ordnance (UXO) classification is a challenging task which is currently tackled using electromagnetic induction devices that are expensive and may require physical presence in potentially hazardous environments. The limited availability of open UXO data has, until now, impeded the progress of image-based UXO classification, which may offer a safe alternative at a reduced cost. In addition, the existing sporadic efforts focus mainly on small scale experiments using only a subset of common UXO categories. Our work aims to stimulate research interest in image-based UXO classification, with the curation of a novel dataset that consists of over 10000 annotated images from eight major UXO categories. Through extensive experimentation with supervised deep learning we uncover key insights into the challenging aspects of this task. Finally, we set the baseline on our novel benchmark by training state-of-the-art Convolutional Neural Networks and a Vision Transformer that are able to discriminate between highly overlapping UXO categories with 84.33% accuracy.

References

[1]
Roger Achkar, Michel Owayjan, and Carlo Mrad. 2011. Landmine Detection and Classification Using MLP. In 2011 Third International Conference on Computational Intelligence, Modelling & Simulation. IEEE, Langkawi, Malaysia, 1--6. https://doi.org/10.1109/CIMSim.2011.10
[2]
Saed Amer, Amir Shirkhodaie, and Haroun Rababaah. 2008. UXO detection, characterization, and remediation using intelligent robotic systems. In Detection and Sensing of Mines, Explosive Objects, and Obscured Targets XIII, Russell S. Harmon, John H. Holloway Jr., and J. Thomas Broach (Eds.), Vol. 6953. International Society for Optics and Photonics, SPIE, Orlando, FL, 191 -- 202. https://doi.org/10.1117/12.777778
[3]
Steve Branson, Grant Van Horn, Serge Belongie, and Pietro Perona. 2014. Bird species categorization using pose normalized deep convolutional nets. arXiv preprint arXiv:1406.2952 (2014).
[4]
CAT-UXO. 2021. Collective Awareness to Unexploded Ordnance. https://cat-uxo.com/ Retrieved March 15, 2021 from
[5]
Ekin D. Cubuk, Barret Zoph, Jonathon Shlens, and Quoc V. Le. 2019. RandAugment: Practical data augmentation with no separate search. CoRR, Vol. abs/1909.13719 (2019). http://arxiv.org/abs/1909.13719
[6]
Yin Cui, Yang Song, Chen Sun, Andrew Howard, and Serge J. Belongie. 2018. Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CoRR, Vol. abs/1806.06193 (2018). http://arxiv.org/abs/1806.06193
[7]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2020. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. CoRR, Vol. abs/2010.11929 (2020). https://arxiv.org/abs/2010.11929
[8]
Andre Esteva, Katherine Chou, Serena Yeung, Nikhil Naik, Ali Madani, Ali Mottaghi, Yun Liu, Eric Topol, Jeff Dean, and Richard Socher. 2021. Deep learning-enabled medical computer vision. NPJ digital medicine, Vol. 4, 1 (2021), 1--9.
[9]
Ju He, Jieneng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, and Alan L. Yuille. 2021. TransFG: A Transformer Architecture for Fine-grained Recognition. CoRR, Vol. abs/2103.07976 (2021). https://arxiv.org/abs/2103.07976
[10]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Deep Residual Learning for Image Recognition. CoRR, Vol. abs/1512.03385 (2015). http://arxiv.org/abs/1512.03385
[11]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Identity Mappings in Deep Residual Networks. In Computer Vision -- ECCV 2016, Bastian Leibe, Jiri Matas, Nicu Sebe, and Max Welling (Eds.). Springer International Publishing, Cham, 630--645.
[12]
Joel Janai, Fatma Güney, Aseem Behl, Andreas Geiger, et al. 2020. Computer vision for autonomous vehicles: Problems, datasets and state of the art. Foundations and Trends in Computer Graphics and Vision, Vol. 12, 1--3 (2020), 1--308.
[13]
Parneet Kaur, Karan Sikka, and Ajay Divakaran. 2017. Combining weakly and webly supervised learning for classifying food images. arXiv preprint arXiv:1712.08730 (2017).
[14]
Aditya Khosla, Nityananda Jayadevaprakash, Bangpeng Yao, and Li Fei-Fei. 2011. Novel Dataset for Fine-Grained Image Categorization. In First Workshop on Fine-Grained Visual Categorization, IEEE Conference on Computer Vision and Pattern Recognition. Colorado Springs, CO.
[15]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. CoRR, Vol. abs/1412.6980 (2015).
[16]
Piotr Koniusz, Yusuf Tas, Hongguang Zhang, Mehrtash Harandi, Fatih Porikli, and Rui Zhang. 2018. Museum exhibit identification challenge for the supervised domain adaptation and beyond. In Proceedings of the European conference on computer vision (ECCV). 788--804.
[17]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, Vol. 25 (2012), 1097--1105.
[18]
Anderson Lebbad, Garrett Clayton, and C. Nataraj. 2017. Classification of UXO Using Convolutional Networks Trained on a Limited Dataset. In 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA). IEEE, Cancun, Mexico, 1098--1101. https://doi.org/10.1109/ICMLA.2017.000--1
[19]
Yann LeCun, Yoshua Bengio, et al. 1995. Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, Vol. 3361, 10 (1995), 1995.
[20]
Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE, Vol. 86, 11 (1998), 2278--2324.
[21]
Ilya Loshchilov and Frank Hutter. 2016. SGDR: Stochastic Gradient Descent with Restarts. CoRR, Vol. abs/1608.03983 (2016). http://arxiv.org/abs/1608.03983
[22]
Azadeh Nazemi, Niloofar Tavakolian, Donal Fitzpatrick, Ching Y Suen, et al. 2019. Offline handwritten mathematical symbol recognition utilising deep learning. arXiv preprint arXiv:1910.07395 (2019).
[23]
Maria-Elena Nilsback and Andrew Zisserman. 2008. Automated flower classification over a large number of classes. In 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing. IEEE, 722--729.
[24]
Xavier Nú nez-Nieto, Mercedes Solla, Paula Gómez-Pérez, and Henrique Lorenzo. 2014. GPR signal characterization for automated landmine and UXO detection based on machine learning techniques. Remote sensing, Vol. 6, 10 (2014), 9729--9748.
[25]
Francesco Pinto, Philip Torr, and Puneet K Dokania. 2021. Are Vision Transformers Always More Robust Than Convolutional Neural Networks?. In NeurIPS 2021 Workshop on Distribution Shifts: Connecting Methods and Applications.
[26]
Ning Qian. 1999. On the momentum term in gradient descent learning algorithms. Neural networks, Vol. 12, 1 (1999), 145--151.
[27]
Maithra Raghu, Thomas Unterthiner, Simon Kornblith, Chiyuan Zhang, and Alexey Dosovitskiy. 2021. Do vision transformers see like convolutional neural networks? Advances in Neural Information Processing Systems, Vol. 34 (2021).
[28]
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision (IJCV), Vol. 115, 3 (2015), 211--252. https://doi.org/10.1007/s11263-015-0816-y
[29]
Amir Shirkhodaie and Haroun Rababaah. 2007. Visual detection, recognition, and classification of surface-buried UXO based on soft-computing decision fusion. In Detection and Remediation Technologies for Mines and Minelike Targets XII, Russell S. Harmon, J. Thomas Broach, and John H. Holloway Jr. (Eds.), Vol. 6553. International Society for Optics and Photonics, SPIE, 650 -- 661. https://doi.org/10.1117/12.719776
[30]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR, Vol. abs/1409.1556 (2015).
[31]
Andreas Steiner, Alexander Kolesnikov, Xiaohua Zhai, Ross Wightman, Jakob Uszkoreit, and Lucas Beyer. 2021. How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers. CoRR, Vol. abs/2106.10270 (2021). https://arxiv.org/abs/2106.10270
[32]
Mingxing Tan, Bo Chen, Ruoming Pang, Vijay Vasudevan, and Quoc V. Le. 2018. MnasNet: Platform-Aware Neural Architecture Search for Mobile. CoRR, Vol. abs/1807.11626 (2018). http://arxiv.org/abs/1807.11626
[33]
Mingxing Tan and Quoc V. Le. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. CoRR, Vol. Abs/1905.11946 (2019). http://arxiv.org/abs/1905.11946
[34]
Hongkun Tian, Tianhai Wang, Yadong Liu, Xi Qiao, and Yanzhou Li. 2020. Computer vision technology in agricultural automation-A review. Information Processing in Agriculture, Vol. 7, 1 (2020), 1--19.
[35]
Grant Van Horn, Steve Branson, Ryan Farrell, Scott Haber, Jessie Barry, Panos Ipeirotis, Pietro Perona, and Serge Belongie. 2015. Building a bird recognition app and large scale dataset with citizen scientists: The fine print in fine-grained dataset collection. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 595--604. https://doi.org/10.1109/CVPR.2015.7298658
[36]
"VECTOR" project. 2020--2022. Virtual Evidence Capture Tool for Ordnance Recovery. https://projectvector.net/.
[37]
C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie. 2011. The Caltech-UCSD Birds-200--2011 Dataset. Technical Report CNS-TR-2011-001. California Institute of Technology.
[38]
Nicolas E Walsh and Wendy S Walsh. 2003. Rehabilitation of landmine victims: the ultimate challenge. Bulletin of the World Health Organization, Vol. 81 (2003), 665--670.
[39]
David P Williams. 2019. Acoustic-Color-Based Convolutional Neural Networks for UXO Classification with Low-Frequency Sonar. In John S. Papadakis (Hg.): UACE2019-Conference Proceedings. 5th Underwater Acoustics Conference and Exhibition. Hersonissos, Vol. 30. Crete, Greece, 421--428.
[40]
Zhilu Zhang and Mert R. Sabuncu. 2018. Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels. CoRR, Vol. abs/1805.07836 (2018). http://arxiv.org/abs/1805.07836

Cited By

View all
  • (2024)Detection and Identification of Unexploded Ordnance Using a Two-Step Deep Learning Methodology2024 32nd Mediterranean Conference on Control and Automation (MED)10.1109/MED61351.2024.10566207(257-262)Online publication date: 11-Jun-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '22: Proceedings of the 2022 International Conference on Multimedia Retrieval
June 2022
714 pages
ISBN:9781450392389
DOI:10.1145/3512527
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 June 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. UXORD-10K dataset
  2. convolutional neural networks
  3. object recognition
  4. unexploded ordnance
  5. vision transformers

Qualifiers

  • Research-article

Funding Sources

Conference

ICMR '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)34
  • Downloads (Last 6 weeks)1
Reflects downloads up to 09 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Detection and Identification of Unexploded Ordnance Using a Two-Step Deep Learning Methodology2024 32nd Mediterranean Conference on Control and Automation (MED)10.1109/MED61351.2024.10566207(257-262)Online publication date: 11-Jun-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media