A Real-Time Detection Framework for On-Tree Mango Based on SSD Network

Liang, Qiaokang; Zhu, Wei; Long, Jianyong; Wang, Yaonan; Sun, Wei; Wu, Wanneng

doi:10.1007/978-3-319-97589-4_36

Qiaokang Liang^17,18,19,
Wei Zhu^17,18,19,
Jianyong Long^17,18,19,
Yaonan Wang^17,18,19,
Wei Sun^17,18,19 &
…
Wanneng Wu^17,18,19

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10985))

Included in the following conference series:

International Conference on Intelligent Robotics and Applications

3258 Accesses
25 Citations

Abstract

On-tree fruit detection in orchards is important for yield estimation, mapping and automatic harvesting in modern agriculture. This paper proposes a real-time detection framework for on-tree mango based on SSD (Single shot Multi Box Detector) network, a state-of-the-art object detection algorithms based on deep learning. The mango image dataset used in this paper was gathered from outdoor mango orchards. Firstly, the dataset was annotated and converted to a trainable dataset for SSD network. Secondly, the author designed new sampling strategies and image distortions at the image pre-processing stage to optimize data augmentation techniques. Moreover, the default box proposal methods of SSD network were improved by redesigning the shapes of default boxes on multiple feature maps according to our own dataset. Finally, to explore which classification network is most suitable for mango detection, an experiment was presented to compare the detection performance of SSD network with the VGG16 and ZFNet as base network respectively. Almond dataset was also used to verify our proposed method. Experimental results demonstrated that, with optimization of data augmentation techniques and default box proposals, our improved VGG16-based SSD network can achieve higher performance than Faster R-CNN in on-tree mango detection, with F1 score of 0.911 at 35 FPS for 400 × 400 input image, which is a real-time detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

On-tree fruit detection system using Darknet-19 based SSD network

Article 28 June 2024

MangoYOLO5: A Fast and Compact YOLOv5 Model for Mango Detection

A detection algorithm based on improved YOLOv5 for coarse-fine variety fruits

Article 07 December 2023

References

Syal, A., Garg, D., Sharma, S.: Apple fruit detection and counting using computer vision techniques. In: IEEE International Conference on Computational Intelligence & Computing Research, pp. 1–6 (2015)
Google Scholar
Kapach, K., Barnea, E., Mairon, R., Edan, Y., Ben-Shahar, O.: Computer vision for fruit harvesting robots – state of the art and challenges ahead. Int. J. Comput. Vis. Robot. 3(1–2), 4–34 (2012)
Article Google Scholar
Wang, Q., Nuske, S., Bergerman, M., Singh, S.: Automated crop yield estimation for apple orchards. In: Desai, J., Dudek, G., Khatib, O., Kumar, V. (eds.) Experimental Robotics. Springer Tracts in Advanced Robotics, vol. 88, pp. 745–758. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-319-00065-7_50
Chapter Google Scholar
Kadmiry, B., Wong, C.K.: Perception scheme for fruits detection in trees for autonomous agricultural robot applications. In: International Conference on Image & Vision Computing, New Zealand, pp. 1–6 (2016)
Google Scholar
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Article Google Scholar
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. arXiv:1612.08242 [cs.CV]
Liu, W., et al.: SSD: Single Shot MultiBox Detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Suchet, B., James, U.: Deep fruit detection in orchards. arXiv:1610.03677 [cs.RO]
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8691, pp. 346–361. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10578-9_23
Chapter Google Scholar
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Google Scholar
Li, Y., He, K., Sun, J.: R-FCN: object detection via region-based fully convolutional networks. In: Advances in Neural Information Processing Systems, pp. 379–387 (2016)
Google Scholar
Redmon, J., Divvala, S., Girshick, R., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Google Scholar
Stein, M., Bargoti, S., Underwood, J.: Image based mango fruit detection, localisation and yield estimation using multiple view geometry. Sensors 16(11), 1915 (2016). https://doi.org/10.3390/s16111915
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: NIPS (2015)
Google Scholar
Bargoti, S.: Pychet Labeller - an object annotation toolbox (2016). https://github.com/acfr/pychetlabeller
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_53
Chapter Google Scholar

Download references

Acknowledgement

This work was supported in part by the National Nature Science Foundation of China (NSFC 61673163), Hunan Provincial Natural Science Foundation of China (2016JJ3045), and Hunan Key Laboratory of Intelligent Robot Technology in Electronic Manufacturing (No. 2018002).

Author information

Authors and Affiliations

College of Electrical and Information Engineering, Hunan University, Changsha, 410082, China
Qiaokang Liang, Wei Zhu, Jianyong Long, Yaonan Wang, Wei Sun & Wanneng Wu
Hunan Key Laboratory of Intelligent Robot Technology in Electronic Manufacturing, Hunan University, Changsha, 410082, China
Qiaokang Liang, Wei Zhu, Jianyong Long, Yaonan Wang, Wei Sun & Wanneng Wu
National Engineering Laboratory for Robot Vision Perception and Control, Changsha, 410082, China
Qiaokang Liang, Wei Zhu, Jianyong Long, Yaonan Wang, Wei Sun & Wanneng Wu

Authors

Qiaokang Liang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Jianyong Long
View author publications
You can also search for this author in PubMed Google Scholar
Yaonan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Sun
View author publications
You can also search for this author in PubMed Google Scholar
Wanneng Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jianyong Long .

Editor information

Editors and Affiliations

University of Newcastle, Callaghan, New South Wales, Australia
Zhiyong Chen
University of Newcastle, Callaghan, New South Wales, Australia
Alexandre Mendes
University of Newcastle, Callaghan, New South Wales, Australia
Yamin Yan
Shenzhen Institutes of Advanced Technology, Shenzhen, China
Shifeng Chen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liang, Q., Zhu, W., Long, J., Wang, Y., Sun, W., Wu, W. (2018). A Real-Time Detection Framework for On-Tree Mango Based on SSD Network. In: Chen, Z., Mendes, A., Yan, Y., Chen, S. (eds) Intelligent Robotics and Applications. ICIRA 2018. Lecture Notes in Computer Science(), vol 10985. Springer, Cham. https://doi.org/10.1007/978-3-319-97589-4_36

Download citation

DOI: https://doi.org/10.1007/978-3-319-97589-4_36
Published: 04 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97588-7
Online ISBN: 978-3-319-97589-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Real-Time Detection Framework for On-Tree Mango Based on SSD Network

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

On-tree fruit detection system using Darknet-19 based SSD network

MangoYOLO5: A Fast and Compact YOLOv5 Model for Mango Detection

A detection algorithm based on improved YOLOv5 for coarse-fine variety fruits

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Real-Time Detection Framework for On-Tree Mango Based on SSD Network

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

On-tree fruit detection system using Darknet-19 based SSD network

MangoYOLO5: A Fast and Compact YOLOv5 Model for Mango Detection

A detection algorithm based on improved YOLOv5 for coarse-fine variety fruits

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation