Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3441250.3441262acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicaipConference Proceedingsconference-collections
research-article

Object Detection with Auto-Learning Anchor Algorithm

Published: 29 May 2021 Publication History

Abstract

As an effective auxiliary means for object detection task, region anchors are widely adopted in most of state-of-the art detectors. However, anchor's location and shape in those works are normally determined by experience or some preprocessing methods, i.e., clustering, which leads to time consumption and limits the flexibility of anchors. In this paper, we explore the possibility that networks can predict bounding boxes and simultaneously learn anchor's location and shape end to end. Specifically, we propose a new anchoring scheme, named Automatic Anchor Learning, which can be integrated into any object detectors and enable detectors to learn the location and size of anchors while training, without sampling anchors over any predefined set of scales and aspect ratios. The proposed method first predicts where the centers of objects of interest might exist and then predict the shape of anchor that should be placed in this location. By applying the proposed Automatic Anchoring Learning method to Yolov3 model, we achieve around 3.3% and 1.6% higher recall and mAP on MS COCO with 80% less anchors, and 10% more FPS than the original Yolov3. Additionally, we also integrate our method into other object algorithms, i.e., Fast R-CNN and RetinaNet, we respectively improve their detection mAP by 2.5% and 1.1%.

References

[1]
Ke Wei and Zhang Tianliang. Multiple Anchor Learning for Visual Object Detection. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp.10206-10215.
[2]
Redmon, Joseph and Farhadi. YOLOv3: An Incremental Improvement, 2018, arXiv:1804.02767.
[3]
Lu Ruiqi and Ma Huimin. Occluded Pedestrian Detection with Visible IoU and Box Sign Predictor. In: International Conference on Image Processing (ICIP), 2019, pp.1640-1644.
[4]
Ping-Yang Chen and Jun-Wei Hsieh. Residual Bi-Fusion Feature Pyramid Network for Accurate Single-shot Object Detection, 2019, arXiv: 1911.12051.
[5]
Xu Yongchao and Fu Mingtao, Gliding vertex on the horizontal bounding box for multi-oriented object detection. In: Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020, pp.1-1.
[6]
Sheng Yi and Xi Li. WSOD with PSNet and Box Regression, 2019, arXiv:1911.11512.
[7]
Deng J and Dong W. ImageNet: A Large-Scale Hierarchical Image Database. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2009, pp.248-255.
[8]
Mark Everingham, SM Ali Eslami and Luc Van Gool. The pascal visual object classes challenge: Aretrospective. International Journal of Computer Vision, 2015, vol.111, no.1, pp.98-136.
[9]
Andreas Geiger and Philip Lenz.Are we ready for autonomous driving the kitti vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2012, pp.3354-3361.
[10]
Tsung-Yi Lin, Michael Maire and Serge Belongie. Microsoft coco: Common objects in context. In: European Conference on Computer Vision, 2014, pp.740-755.
[11]
J Wang and K Chen. Region Proposal by Guided Anchoring. In: Conference on Computer Vision and Pattern Recognition (CVPR),2019, pp.2960-2969.
[12]
Lin, Tsung-Yi and Goyal. Focal Loss for Dense Object Detection. In: International Conference on Computer Vision (ICCV), 2017, pp.2999-3007.
[13]
Lin, Tsung-Yi and Dollár. Feature Pyramid Networks for Object Detection, 2016, arXiv:1612.03144.
[14]
Zhang, Hanwang and Kyaw. PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN. In: International Conference on Computer Vision (ICCV), 2017, pp.4243-4251.
[15]
Ren Shaoqing and He Kaiming. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Transactions on Pattern Analysis and Machine Intelligence, 2015, vol.39, no.6, pp.1137-1149.
[16]
Li Yanghao and Chen Yuntao. Scale-Aware Trident Networks for Object Detection. In: International Conference on Computer Vision, 2019, pp.6053-6062.
[17]
Peng Junran and Sun Ming. POD: Practical Object Detection with Scale-Sensitive Network, 2019, rXiv:1909.02225.
[18]
Liu Wei and Anguelov Dragomir, SSD: Single Shot MultiBox Detector. In: European Conference on Computer Vision, 2015, pp.21-37.
[19]
G. Ghiasi and T. Lin. NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp.7029-7038.
[20]
Fu Cheng-Yang and Liu Wei. DSSD : Deconvolutional Single Shot Detector, 2017, arXiv:1701.06659.
[21]
Zhang Xiaosong and Wan Fang. FreeAnchor: Learning to Match Anchors for Visual Object Detection, 2019, arXiv:1909.02466.
[22]
Yang Tong and Zhang Xiangyu. MetaAnchor: Learning to Detect Objects with Customized Anchors, 2018, arXiv:1807.00980.
[23]
Jiang Borui and Luo Ruixuan. Acquisition of Localization Confidence for Accurate Object Detectionm, 2018, arXiv:1807.11590.
[24]
Law H and Deng J. CornerNet: Detecting Objects as Paired Keypoints. Journal of Computer Vision, 2020, vol.128, pp.642-656.
[25]
Kaiwen Duan, Song Bai and Lingxi Xie. Centernet: Object detection with keypoint triplets. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2019, arXiv:1904.08189.
[26]
He Kaiming and Zhang Xiangyu. Deep Residual Learning for Image Recognition. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp.770-778.
[27]
Lin Tsung-Yi and Dollár Piotr. Feature Pyramid Networks for Object Detection, 2016, arXiv:1612.03144
[28]
Huang Jonathan and Rathod Vivek. Speed Accuracy Trade-Offs for Modern Convolutional Object Detectors. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp 3296-3297.
[29]
Shrivastava Abhinav and Sukthankar Rahul. Beyond Skip Connections: Top-Down Modulation for Object Detection, 2016, arXiv:1612.06851.

Cited By

View all
  • (2023)Classification of the most common conditionally pathogenic microorganisms on SEM images with YOLO model2023 IX International Conference on Information Technology and Nanotechnology (ITNT)10.1109/ITNT57377.2023.10139188(1-5)Online publication date: 17-Apr-2023
  • (2022)Semi-automatic one-class image labeling using a neural network object detection model2022 VIII International Conference on Information Technology and Nanotechnology (ITNT)10.1109/ITNT55410.2022.9848575(1-5)Online publication date: 23-May-2022

Index Terms

  1. Object Detection with Auto-Learning Anchor Algorithm
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      ICAIP '20: Proceedings of the 4th International Conference on Advances in Image Processing
      November 2020
      191 pages
      ISBN:9781450388368
      DOI:10.1145/3441250
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 29 May 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Anchor Learning
      2. Object Detection
      3. Yolov3

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      • National Key Research and Development Program

      Conference

      ICAIP 2020

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)8
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 03 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Classification of the most common conditionally pathogenic microorganisms on SEM images with YOLO model2023 IX International Conference on Information Technology and Nanotechnology (ITNT)10.1109/ITNT57377.2023.10139188(1-5)Online publication date: 17-Apr-2023
      • (2022)Semi-automatic one-class image labeling using a neural network object detection model2022 VIII International Conference on Information Technology and Nanotechnology (ITNT)10.1109/ITNT55410.2022.9848575(1-5)Online publication date: 23-May-2022

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media