research-article

Automatic Visual Recognition of Unexploded Ordnances Using Supervised Deep Learning

Authors:

Georgios Begkas,

Panagiotis Giannakeris,

Konstantinos Ioannidis,

Georgios Kalpakis,

Theodora Tsikrika,

Stefanos Vrochidis,

Ioannis KompatsiarisAuthors Info & Claims

ICMR '22: Proceedings of the 2022 International Conference on Multimedia Retrieval

Pages 286 - 294

https://doi.org/10.1145/3512527.3531383

Published: 27 June 2022 Publication History

Abstract

Unexploded Ordnance (UXO) classification is a challenging task which is currently tackled using electromagnetic induction devices that are expensive and may require physical presence in potentially hazardous environments. The limited availability of open UXO data has, until now, impeded the progress of image-based UXO classification, which may offer a safe alternative at a reduced cost. In addition, the existing sporadic efforts focus mainly on small scale experiments using only a subset of common UXO categories. Our work aims to stimulate research interest in image-based UXO classification, with the curation of a novel dataset that consists of over 10000 annotated images from eight major UXO categories. Through extensive experimentation with supervised deep learning we uncover key insights into the challenging aspects of this task. Finally, we set the baseline on our novel benchmark by training state-of-the-art Convolutional Neural Networks and a Vision Transformer that are able to discriminate between highly overlapping UXO categories with 84.33% accuracy.

References

[1]

Roger Achkar, Michel Owayjan, and Carlo Mrad. 2011. Landmine Detection and Classification Using MLP. In 2011 Third International Conference on Computational Intelligence, Modelling & Simulation. IEEE, Langkawi, Malaysia, 1--6. https://doi.org/10.1109/CIMSim.2011.10

Digital Library

[2]

Saed Amer, Amir Shirkhodaie, and Haroun Rababaah. 2008. UXO detection, characterization, and remediation using intelligent robotic systems. In Detection and Sensing of Mines, Explosive Objects, and Obscured Targets XIII, Russell S. Harmon, John H. Holloway Jr., and J. Thomas Broach (Eds.), Vol. 6953. International Society for Optics and Photonics, SPIE, Orlando, FL, 191 -- 202. https://doi.org/10.1117/12.777778

[3]

Steve Branson, Grant Van Horn, Serge Belongie, and Pietro Perona. 2014. Bird species categorization using pose normalized deep convolutional nets. arXiv preprint arXiv:1406.2952 (2014).

[4]

CAT-UXO. 2021. Collective Awareness to Unexploded Ordnance. https://cat-uxo.com/ Retrieved March 15, 2021 from

[5]

Ekin D. Cubuk, Barret Zoph, Jonathon Shlens, and Quoc V. Le. 2019. RandAugment: Practical data augmentation with no separate search. CoRR, Vol. abs/1909.13719 (2019). http://arxiv.org/abs/1909.13719

[6]

Yin Cui, Yang Song, Chen Sun, Andrew Howard, and Serge J. Belongie. 2018. Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CoRR, Vol. abs/1806.06193 (2018). http://arxiv.org/abs/1806.06193

[7]

Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2020. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. CoRR, Vol. abs/2010.11929 (2020). https://arxiv.org/abs/2010.11929

[8]

Andre Esteva, Katherine Chou, Serena Yeung, Nikhil Naik, Ali Madani, Ali Mottaghi, Yun Liu, Eric Topol, Jeff Dean, and Richard Socher. 2021. Deep learning-enabled medical computer vision. NPJ digital medicine, Vol. 4, 1 (2021), 1--9.

[9]

Ju He, Jieneng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, and Alan L. Yuille. 2021. TransFG: A Transformer Architecture for Fine-grained Recognition. CoRR, Vol. abs/2103.07976 (2021). https://arxiv.org/abs/2103.07976

[10]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Deep Residual Learning for Image Recognition. CoRR, Vol. abs/1512.03385 (2015). http://arxiv.org/abs/1512.03385

[11]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Identity Mappings in Deep Residual Networks. In Computer Vision -- ECCV 2016, Bastian Leibe, Jiri Matas, Nicu Sebe, and Max Welling (Eds.). Springer International Publishing, Cham, 630--645.

[12]

Joel Janai, Fatma Güney, Aseem Behl, Andreas Geiger, et al. 2020. Computer vision for autonomous vehicles: Problems, datasets and state of the art. Foundations and Trends in Computer Graphics and Vision, Vol. 12, 1--3 (2020), 1--308.

Digital Library

[13]

Parneet Kaur, Karan Sikka, and Ajay Divakaran. 2017. Combining weakly and webly supervised learning for classifying food images. arXiv preprint arXiv:1712.08730 (2017).

[14]

Aditya Khosla, Nityananda Jayadevaprakash, Bangpeng Yao, and Li Fei-Fei. 2011. Novel Dataset for Fine-Grained Image Categorization. In First Workshop on Fine-Grained Visual Categorization, IEEE Conference on Computer Vision and Pattern Recognition. Colorado Springs, CO.

[15]

Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. CoRR, Vol. abs/1412.6980 (2015).

[16]

Piotr Koniusz, Yusuf Tas, Hongguang Zhang, Mehrtash Harandi, Fatih Porikli, and Rui Zhang. 2018. Museum exhibit identification challenge for the supervised domain adaptation and beyond. In Proceedings of the European conference on computer vision (ECCV). 788--804.

Digital Library

[17]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems, Vol. 25 (2012), 1097--1105.

Digital Library

[18]

Anderson Lebbad, Garrett Clayton, and C. Nataraj. 2017. Classification of UXO Using Convolutional Networks Trained on a Limited Dataset. In 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA). IEEE, Cancun, Mexico, 1098--1101. https://doi.org/10.1109/ICMLA.2017.000--1

[19]

Yann LeCun, Yoshua Bengio, et al. 1995. Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, Vol. 3361, 10 (1995), 1995.

[20]

Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proc. IEEE, Vol. 86, 11 (1998), 2278--2324.

[21]

Ilya Loshchilov and Frank Hutter. 2016. SGDR: Stochastic Gradient Descent with Restarts. CoRR, Vol. abs/1608.03983 (2016). http://arxiv.org/abs/1608.03983

[22]

Azadeh Nazemi, Niloofar Tavakolian, Donal Fitzpatrick, Ching Y Suen, et al. 2019. Offline handwritten mathematical symbol recognition utilising deep learning. arXiv preprint arXiv:1910.07395 (2019).

[23]

Maria-Elena Nilsback and Andrew Zisserman. 2008. Automated flower classification over a large number of classes. In 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing. IEEE, 722--729.

Digital Library

[24]

Xavier Nú nez-Nieto, Mercedes Solla, Paula Gómez-Pérez, and Henrique Lorenzo. 2014. GPR signal characterization for automated landmine and UXO detection based on machine learning techniques. Remote sensing, Vol. 6, 10 (2014), 9729--9748.

[25]

Francesco Pinto, Philip Torr, and Puneet K Dokania. 2021. Are Vision Transformers Always More Robust Than Convolutional Neural Networks?. In NeurIPS 2021 Workshop on Distribution Shifts: Connecting Methods and Applications.

[26]

Ning Qian. 1999. On the momentum term in gradient descent learning algorithms. Neural networks, Vol. 12, 1 (1999), 145--151.

[27]

Maithra Raghu, Thomas Unterthiner, Simon Kornblith, Chiyuan Zhang, and Alexey Dosovitskiy. 2021. Do vision transformers see like convolutional neural networks? Advances in Neural Information Processing Systems, Vol. 34 (2021).

[28]

Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision (IJCV), Vol. 115, 3 (2015), 211--252. https://doi.org/10.1007/s11263-015-0816-y

Digital Library

[29]

Amir Shirkhodaie and Haroun Rababaah. 2007. Visual detection, recognition, and classification of surface-buried UXO based on soft-computing decision fusion. In Detection and Remediation Technologies for Mines and Minelike Targets XII, Russell S. Harmon, J. Thomas Broach, and John H. Holloway Jr. (Eds.), Vol. 6553. International Society for Optics and Photonics, SPIE, 650 -- 661. https://doi.org/10.1117/12.719776

[30]

Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR, Vol. abs/1409.1556 (2015).

[31]

Andreas Steiner, Alexander Kolesnikov, Xiaohua Zhai, Ross Wightman, Jakob Uszkoreit, and Lucas Beyer. 2021. How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers. CoRR, Vol. abs/2106.10270 (2021). https://arxiv.org/abs/2106.10270

[32]

Mingxing Tan, Bo Chen, Ruoming Pang, Vijay Vasudevan, and Quoc V. Le. 2018. MnasNet: Platform-Aware Neural Architecture Search for Mobile. CoRR, Vol. abs/1807.11626 (2018). http://arxiv.org/abs/1807.11626

[33]

Mingxing Tan and Quoc V. Le. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. CoRR, Vol. Abs/1905.11946 (2019). http://arxiv.org/abs/1905.11946

[34]

Hongkun Tian, Tianhai Wang, Yadong Liu, Xi Qiao, and Yanzhou Li. 2020. Computer vision technology in agricultural automation-A review. Information Processing in Agriculture, Vol. 7, 1 (2020), 1--19.

[35]

Grant Van Horn, Steve Branson, Ryan Farrell, Scott Haber, Jessie Barry, Panos Ipeirotis, Pietro Perona, and Serge Belongie. 2015. Building a bird recognition app and large scale dataset with citizen scientists: The fine print in fine-grained dataset collection. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 595--604. https://doi.org/10.1109/CVPR.2015.7298658

[36]

"VECTOR" project. 2020--2022. Virtual Evidence Capture Tool for Ordnance Recovery. https://projectvector.net/.

[37]

C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie. 2011. The Caltech-UCSD Birds-200--2011 Dataset. Technical Report CNS-TR-2011-001. California Institute of Technology.

[38]

Nicolas E Walsh and Wendy S Walsh. 2003. Rehabilitation of landmine victims: the ultimate challenge. Bulletin of the World Health Organization, Vol. 81 (2003), 665--670.

[39]

David P Williams. 2019. Acoustic-Color-Based Convolutional Neural Networks for UXO Classification with Low-Frequency Sonar. In John S. Papadakis (Hg.): UACE2019-Conference Proceedings. 5th Underwater Acoustics Conference and Exhibition. Hersonissos, Vol. 30. Crete, Greece, 421--428.

[40]

Zhilu Zhang and Mert R. Sabuncu. 2018. Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels. CoRR, Vol. abs/1805.07836 (2018). http://arxiv.org/abs/1805.07836

Cited By

Craioveanu MStamatescu G(2024)Detection and Identification of Unexploded Ordnance Using a Two-Step Deep Learning Methodology2024 32nd Mediterranean Conference on Control and Automation (MED)10.1109/MED61351.2024.10566207(257-262)Online publication date: 11-Jun-2024
https://doi.org/10.1109/MED61351.2024.10566207

Index Terms

Automatic Visual Recognition of Unexploded Ordnances Using Supervised Deep Learning
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object recognition
      2. Computer vision tasks
        Visual content-based indexing and retrieval

Recommendations

Deep face recognition using imperfect facial data
Abstract
Today, computer based face recognition is a mature and reliable mechanism which is being practically utilised for many access control scenarios. As such, face recognition or authentication is predominantly performed using ‘perfect’ ...
Highlights
- We show the performance of machine learning for face recognition using partial faces and other manipulations of the face such as rotation and zooming which ...
A survey on deep learning based face recognition
Abstract
Deep learning, in particular the deep convolutional neural networks, has received increasing interests in face recognition recently, and a number of deep learning methods have been proposed. This paper summarizes about 330 ...
Graphical abstract

Display Omitted
Highlights
- Presents a comprehensive survey of deep learning based face recognition methods.
Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition

Deep-learning neural networks such as convolutional neural network (CNN) have shown great potential as a solution for difficult vision problems, such as object recognition. Spiking neural networks (SNN)-based architectures have shown great potential as ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

ICMR '22: Proceedings of the 2022 International Conference on Multimedia Retrieval

June 2022

714 pages

ISBN:9781450392389

DOI:10.1145/3512527

General Chairs:
Vincent Oria
New Jersey Institute of Technology, USA
,
Maria Luisa Sapino
Università degli Studi di Torino, Italy
,
Shin'ichi Satoh
National Institute of Informatics, Japan
,
Brigitte Kerhervé
Université du Québec à Montréal, Canada
,
Program Chairs:
Wen-Huang Cheng
National Yang Ming Chao Tung University, Taiwan
,
Ichiro Ide
Nagoya University, Japan
,
Vivek Singh
Rutgers University, USA

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 June 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

North Atlantic Treaty Organization

Conference

ICMR '22

Sponsor:

SIGMM

ICMR '22: International Conference on Multimedia Retrieval

June 27 - 30, 2022

NJ, Newark, USA

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
190
Total Downloads

Downloads (Last 12 months)38
Downloads (Last 6 weeks)7

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Craioveanu MStamatescu G(2024)Detection and Identification of Unexploded Ordnance Using a Two-Step Deep Learning Methodology2024 32nd Mediterranean Conference on Control and Automation (MED)10.1109/MED61351.2024.10566207(257-262)Online publication date: 11-Jun-2024
https://doi.org/10.1109/MED61351.2024.10566207

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten