YOLO-Underwater: A Real-Time Object Detection Framework for Enhanced Underwater Robotics Operations

Xie, Weifang; Chen, Cang; Cai, Zhiqi; Zhuang, Mengting; Yu, Jingying; Ge, Huilin; Lu, Yu

doi:10.1007/978-981-97-5675-9_5

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 14879)

Included in the following conference series:

International Conference on Intelligent Computing

Abstract

We propose a YOLOv7-underwater model for real-time underwater object detection, specifically designed to meet the requirements of underwater robotics. The model integrates a new ConvNeXt convolutional layer structure and a wide receptive field module, incorporating techniques such as inverted bottleneck layers, GELU activation functions, and layer normalization. Additionally, it introduces a parameter-free attention module (SimAM) to enhance network performance, addressing challenges posed by varying water conditions and image blurriness. Experimental results demonstrate that the proposed model significantly improves the efficiency and accuracy of underwater object detection and recognition compared to other algorithms, making it suitable for real-time applications in diverse underwater environments.
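The parameter-free attention step mentioned in the abstract can be illustrated with a minimal NumPy sketch of the SimAM weighting as it is generally formulated: each activation is scaled by a sigmoid of its inverse "energy", computed from per-channel mean and variance, so no learnable parameters are introduced. This is an illustration only, not the authors' implementation. The function name `simam`, the (batch, channels, height, width) layout, and the regularizer value `e_lambda = 1e-4` are assumptions, and the abstract does not state where the module is placed inside the YOLOv7-underwater network.

```python
import numpy as np


def simam(x: np.ndarray, e_lambda: float = 1e-4) -> np.ndarray:
    """Apply SimAM-style parameter-free attention to a feature map.

    x is assumed to have shape (batch, channels, height, width) with
    height * width > 1. e_lambda is an assumed regularization constant.
    """
    _, _, h, w = x.shape
    n = h * w - 1  # number of "other" neurons in each channel

    # Squared deviation of every activation from its channel mean.
    d = (x - x.mean(axis=(2, 3), keepdims=True)) ** 2

    # Per-channel variance estimate over the spatial positions.
    v = d.sum(axis=(2, 3), keepdims=True) / n

    # Inverse energy: larger for activations that stand out in their channel.
    e_inv = d / (4.0 * (v + e_lambda)) + 0.5

    # Sigmoid gate; e_inv >= 0.5, so the exponential cannot overflow.
    gate = 1.0 / (1.0 + np.exp(-e_inv))
    return x * gate


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    features = rng.standard_normal((2, 8, 16, 16))
    refined = simam(features)
    print(refined.shape)
```

Because the gate depends only on statistics of the input, the output has the same shape as the input and the module adds no weights to the detector, which is consistent with the real-time requirement stated above.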

Disclosure of Interests

The authors have no competing interests to declare that are relevant to the content of this article.

Acknowledgment

This research is supported by the Research Promotion Project of Key Construction Discipline in Guangdong Province (2022ZDJS112).

Author information

Authors and Affiliations

College of Big Data and Internet, Shenzhen Technology University, Shenzhen, China
Weifang Xie, Zhiqi Cai, Mengting Zhuang, Jingying Yu & Yu Lu
Faculty of Science and Engineering, Waseda University, Tokyo, Japan
Cang Chen
Ocean College, Jiangsu University of Science and Technology, Zhenjiang, China
Huilin Ge

Corresponding author

Correspondence to Yu Lu.

Editor information

Editors and Affiliations

Eastern Institute of Technology, Ningbo, China
De-Shuang Huang
Tianjin University of Science and Technology, Tianjin, China
Xiankun Zhang
Tianjin University of Science and Technology, Tianjin, China
Chuanlei Zhang

About this paper

Cite this paper

Xie, W. et al. (2024). YOLO-Underwater: A Real-Time Object Detection Framework for Enhanced Underwater Robotics Operations. In: Huang, DS., Zhang, X., Zhang, C. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science, vol 14879. Springer, Singapore. https://doi.org/10.1007/978-981-97-5675-9_5

DOI: https://doi.org/10.1007/978-981-97-5675-9_5
Published: 01 August 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5674-2
Online ISBN: 978-981-97-5675-9
eBook Packages: Computer Science, Computer Science (R0)
