Abstract
We propose a YOLOv7-underwater model for real-time underwater object detection, specifically designed to meet the requirements of underwater robotics. The model integrates a new ConvNeXt convolutional layer structure and a wide receptive field module, incorporating techniques such as inverted bottleneck layers, GELU activation functions, and layer normalization. Additionally, it introduces a parameter-free attention module (SimAM) to enhance network performance, addressing challenges posed by varying water conditions and image blurriness. Experimental results demonstrate that the proposed model significantly improves the efficiency and accuracy of underwater object detection and recognition compared to other algorithms, making it suitable for real-time applications in diverse underwater environments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Disclosure of Interests
The authors have no competing interests to declare that are relevant to the content of this article.
References
Zhang, Z.R., Xu, F.B., Li, P.J.: Design of automatic operated modular underwater vehicle system for marine ranch breeding (2021)
Wu, Y., Duan, Y., Wei, Y.: Application of intelligent and unmanned equipment in aquaculture: a review. Comput. Electron. Agric. 199, 107201 (2022)
Ge, H., Dai, Y., Zhu, Z.: A deep learning model applied to optical image target detection and recognition for the identification of underwater biostructures. Machines 10(9), 809 (2022)
Zhang, H., Zhang, S., Wang, Y.: Subsea pipeline leak inspection by autonomous underwater vehicle. Appl. Ocean Res. 107, 102321 (2021)
Gašparović, B., Lerga, J., Mauša, G.: deep learning approach for objects detection in underwater pipeline images. Appl. Artif. Intell. 36(1), 2146853 (2022)
Rumson, A.G.: The application of fully unmanned robotic systems for inspection of subsea pipelines. Ocean Eng. 235, 109214 (2021)
Tang Y., Wang L., Jin S.: AUV-based side-scan sonar real-time method for underwater-target detection. J. Marine Sci. Eng. 11(4), 690 (2023)
Mogstad, A.A., Ødegård, Ø., Nornes, S.M.: Mapping the historical shipwreck figaro in the high arctic using underwater sensor-carrying robots. Remote Sens. 12(6), 997(2020)
Yulin, T., Jin, S., Bian, G.: Shipwreck target recognition in side-scan sonar images by improved YOLOv3 model based on transfer learning. IEEE Access 8, 173450–173460 (2020)
Girshick, R., Donahue, J., Darrell, T.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587 (2014)
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 1440–1448(2015)
Redmon, J., Divvala, S., Girshick, R.: You Only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779–788 (2016)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7263–7271 (2017)
Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Ge, Z., Liu, S., Wang, F.: YOLOX: exceeding YOLO series in 2021. arXiv preprint arXiv:2107.08430 (2021)
Li, C., Li, L., Jiang, H.: YOLOv6: a single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976 (2022)
Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7464–7475 (2023)
Acknowledgment
This research is supported by the Research Promotion Project of Key Construction Discipline in Guangdong Province (2022ZDJS112).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Xie, W. et al. (2024). YOLO-Underwater: A Real-Time Object Detection Framework for Enhanced Underwater Robotics Operations. In: Huang, DS., Zhang, X., Zhang, C. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science(), vol 14879. Springer, Singapore. https://doi.org/10.1007/978-981-97-5675-9_5
Download citation
DOI: https://doi.org/10.1007/978-981-97-5675-9_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5674-2
Online ISBN: 978-981-97-5675-9
eBook Packages: Computer ScienceComputer Science (R0)