Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

YOLO-Underwater: A Real-Time Object Detection Framework for Enhanced Underwater Robotics Operations

  • Conference paper
  • First Online:
Advanced Intelligent Computing Technology and Applications (ICIC 2024)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14879))

Included in the following conference series:

  • 434 Accesses

Abstract

We propose a YOLOv7-underwater model for real-time underwater object detection, specifically designed to meet the requirements of underwater robotics. The model integrates a new ConvNeXt convolutional layer structure and a wide receptive field module, incorporating techniques such as inverted bottleneck layers, GELU activation functions, and layer normalization. Additionally, it introduces a parameter-free attention module (SimAM) to enhance network performance, addressing challenges posed by varying water conditions and image blurriness. Experimental results demonstrate that the proposed model significantly improves the efficiency and accuracy of underwater object detection and recognition compared to other algorithms, making it suitable for real-time applications in diverse underwater environments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Disclosure of Interests

The authors have no competing interests to declare that are relevant to the content of this article.

References

  1. Zhang, Z.R., Xu, F.B., Li, P.J.: Design of automatic operated modular underwater vehicle system for marine ranch breeding (2021)

    Google Scholar 

  2. Wu, Y., Duan, Y., Wei, Y.: Application of intelligent and unmanned equipment in aquaculture: a review. Comput. Electron. Agric. 199, 107201 (2022)

    Google Scholar 

  3. Ge, H., Dai, Y., Zhu, Z.: A deep learning model applied to optical image target detection and recognition for the identification of underwater biostructures. Machines 10(9), 809 (2022)

    Google Scholar 

  4. Zhang, H., Zhang, S., Wang, Y.: Subsea pipeline leak inspection by autonomous underwater vehicle. Appl. Ocean Res. 107, 102321 (2021)

    Article  Google Scholar 

  5. Gašparović, B., Lerga, J., Mauša, G.: deep learning approach for objects detection in underwater pipeline images. Appl. Artif. Intell. 36(1), 2146853 (2022)

    Google Scholar 

  6. Rumson, A.G.: The application of fully unmanned robotic systems for inspection of subsea pipelines. Ocean Eng. 235, 109214 (2021)

    Article  Google Scholar 

  7. Tang Y., Wang L., Jin S.: AUV-based side-scan sonar real-time method for underwater-target detection. J. Marine Sci. Eng. 11(4), 690 (2023)

    Google Scholar 

  8. Mogstad, A.A., Ødegård, Ø., Nornes, S.M.: Mapping the historical shipwreck figaro in the high arctic using underwater sensor-carrying robots. Remote Sens. 12(6), 997(2020)

    Google Scholar 

  9. Yulin, T., Jin, S., Bian, G.: Shipwreck target recognition in side-scan sonar images by improved YOLOv3 model based on transfer learning. IEEE Access 8, 173450–173460 (2020)

    Article  Google Scholar 

  10. Girshick, R., Donahue, J., Darrell, T.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587 (2014)

    Google Scholar 

  11. Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 1440–1448(2015)

    Google Scholar 

  12. Redmon, J., Divvala, S., Girshick, R.: You Only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779–788 (2016)

    Google Scholar 

  13. Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7263–7271 (2017)

    Google Scholar 

  14. Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)

  15. Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)

  16. Ge, Z., Liu, S., Wang, F.: YOLOX: exceeding YOLO series in 2021. arXiv preprint arXiv:2107.08430 (2021)

  17. Li, C., Li, L., Jiang, H.: YOLOv6: a single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976 (2022)

  18. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7464–7475 (2023)

    Google Scholar 

Download references

Acknowledgment

This research is supported by the Research Promotion Project of Key Construction Discipline in Guangdong Province (2022ZDJS112).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yu Lu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xie, W. et al. (2024). YOLO-Underwater: A Real-Time Object Detection Framework for Enhanced Underwater Robotics Operations. In: Huang, DS., Zhang, X., Zhang, C. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science(), vol 14879. Springer, Singapore. https://doi.org/10.1007/978-981-97-5675-9_5

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-5675-9_5

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-5674-2

  • Online ISBN: 978-981-97-5675-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics