research-article

Object Detection with Auto-Learning Anchor Algorithm

Authors:

Yuanlun XieAuthors Info & Claims

ICAIP '20: Proceedings of the 4th International Conference on Advances in Image Processing

Pages 27 - 34

https://doi.org/10.1145/3441250.3441262

Published: 29 May 2021 Publication History

Abstract

As an effective auxiliary means for object detection task, region anchors are widely adopted in most of state-of-the art detectors. However, anchor's location and shape in those works are normally determined by experience or some preprocessing methods, i.e., clustering, which leads to time consumption and limits the flexibility of anchors. In this paper, we explore the possibility that networks can predict bounding boxes and simultaneously learn anchor's location and shape end to end. Specifically, we propose a new anchoring scheme, named Automatic Anchor Learning, which can be integrated into any object detectors and enable detectors to learn the location and size of anchors while training, without sampling anchors over any predefined set of scales and aspect ratios. The proposed method first predicts where the centers of objects of interest might exist and then predict the shape of anchor that should be placed in this location. By applying the proposed Automatic Anchoring Learning method to Yolov3 model, we achieve around 3.3% and 1.6% higher recall and mAP on MS COCO with 80% less anchors, and 10% more FPS than the original Yolov3. Additionally, we also integrate our method into other object algorithms, i.e., Fast R-CNN and RetinaNet, we respectively improve their detection mAP by 2.5% and 1.1%.

References

[1]

Ke Wei and Zhang Tianliang. Multiple Anchor Learning for Visual Object Detection. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp.10206-10215.

[2]

Redmon, Joseph and Farhadi. YOLOv3: An Incremental Improvement, 2018, arXiv:1804.02767.

[3]

Lu Ruiqi and Ma Huimin. Occluded Pedestrian Detection with Visible IoU and Box Sign Predictor. In: International Conference on Image Processing (ICIP), 2019, pp.1640-1644.

[4]

Ping-Yang Chen and Jun-Wei Hsieh. Residual Bi-Fusion Feature Pyramid Network for Accurate Single-shot Object Detection, 2019, arXiv: 1911.12051.

[5]

Xu Yongchao and Fu Mingtao, Gliding vertex on the horizontal bounding box for multi-oriented object detection. In: Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020, pp.1-1.

[6]

Sheng Yi and Xi Li. WSOD with PSNet and Box Regression, 2019, arXiv:1911.11512.

[7]

Deng J and Dong W. ImageNet: A Large-Scale Hierarchical Image Database. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2009, pp.248-255.

[8]

Mark Everingham, SM Ali Eslami and Luc Van Gool. The pascal visual object classes challenge: Aretrospective. International Journal of Computer Vision, 2015, vol.111, no.1, pp.98-136.

Digital Library

[9]

Andreas Geiger and Philip Lenz.Are we ready for autonomous driving the kitti vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2012, pp.3354-3361.

[10]

Tsung-Yi Lin, Michael Maire and Serge Belongie. Microsoft coco: Common objects in context. In: European Conference on Computer Vision, 2014, pp.740-755.

[11]

J Wang and K Chen. Region Proposal by Guided Anchoring. In: Conference on Computer Vision and Pattern Recognition (CVPR),2019, pp.2960-2969.

[12]

Lin, Tsung-Yi and Goyal. Focal Loss for Dense Object Detection. In: International Conference on Computer Vision (ICCV), 2017, pp.2999-3007.

[13]

Lin, Tsung-Yi and Dollár. Feature Pyramid Networks for Object Detection, 2016, arXiv:1612.03144.

[14]

Zhang, Hanwang and Kyaw. PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN. In: International Conference on Computer Vision (ICCV), 2017, pp.4243-4251.

[15]

Ren Shaoqing and He Kaiming. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Transactions on Pattern Analysis and Machine Intelligence, 2015, vol.39, no.6, pp.1137-1149.

[16]

Li Yanghao and Chen Yuntao. Scale-Aware Trident Networks for Object Detection. In: International Conference on Computer Vision, 2019, pp.6053-6062.

[17]

Peng Junran and Sun Ming. POD: Practical Object Detection with Scale-Sensitive Network, 2019, rXiv:1909.02225.

[18]

Liu Wei and Anguelov Dragomir, SSD: Single Shot MultiBox Detector. In: European Conference on Computer Vision, 2015, pp.21-37.

[19]

G. Ghiasi and T. Lin. NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp.7029-7038.

[20]

Fu Cheng-Yang and Liu Wei. DSSD : Deconvolutional Single Shot Detector, 2017, arXiv:1701.06659.

[21]

Zhang Xiaosong and Wan Fang. FreeAnchor: Learning to Match Anchors for Visual Object Detection, 2019, arXiv:1909.02466.

[22]

Yang Tong and Zhang Xiangyu. MetaAnchor: Learning to Detect Objects with Customized Anchors, 2018, arXiv:1807.00980.

[23]

Jiang Borui and Luo Ruixuan. Acquisition of Localization Confidence for Accurate Object Detectionm, 2018, arXiv:1807.11590.

[24]

Law H and Deng J. CornerNet: Detecting Objects as Paired Keypoints. Journal of Computer Vision, 2020, vol.128, pp.642-656.

[25]

Kaiwen Duan, Song Bai and Lingxi Xie. Centernet: Object detection with keypoint triplets. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2019, arXiv:1904.08189.

[26]

He Kaiming and Zhang Xiangyu. Deep Residual Learning for Image Recognition. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp.770-778.

[27]

Lin Tsung-Yi and Dollár Piotr. Feature Pyramid Networks for Object Detection, 2016, arXiv:1612.03144

[28]

Huang Jonathan and Rathod Vivek. Speed Accuracy Trade-Offs for Modern Convolutional Object Detectors. In: Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp 3296-3297.

[29]

Shrivastava Abhinav and Sukthankar Rahul. Beyond Skip Connections: Top-Down Modulation for Object Detection, 2016, arXiv:1612.06851.

Cited By

Gridin VNovikov ISalem BSolodovnikov V(2023)Classification of the most common conditionally pathogenic microorganisms on SEM images with YOLO model2023 IX International Conference on Information Technology and Nanotechnology (ITNT)10.1109/ITNT57377.2023.10139188(1-5)Online publication date: 17-Apr-2023
https://doi.org/10.1109/ITNT57377.2023.10139188
Salem BSolodovnikov VNovikov IGridin V(2022)Semi-automatic one-class image labeling using a neural network object detection model2022 VIII International Conference on Information Technology and Nanotechnology (ITNT)10.1109/ITNT55410.2022.9848575(1-5)Online publication date: 23-May-2022
https://doi.org/10.1109/ITNT55410.2022.9848575

Index Terms

Object Detection with Auto-Learning Anchor Algorithm
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object recognition
  2. Machine learning
    1. Learning paradigms

Index terms have been assigned to the content through auto-classification.

Recommendations

Soft Anchor-Point Object Detection
Computer Vision – ECCV 2020
Abstract
Recently, anchor-free detection methods have been through great progress. The major two families, anchor-point detection and key-point detection, are at opposite edges of the speed-accuracy trade-off, with anchor-point detectors having the speed ...
Anchor pruning for object detection
Abstract
This paper proposes anchor pruning for object detection in one-stage anchor-based detectors. While pruning techniques are widely used to reduce the computational cost of convolutional neural networks, they tend to focus on optimizing ...
Highlights
- Novel pruning method for object detection models: anchor pruning.
- Remove ...
Performance releaser with smart anchor learning for arbitrary‐oriented object detection
Abstract
Arbitrary‐oriented object detection is widely used in aerial image applications because of its efficient object representation. However, the use of oriented bounding box aggravates the imbalance between positive and negative samples when using one‐...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICAIP '20: Proceedings of the 4th International Conference on Advances in Image Processing

November 2020

191 pages

ISBN:9781450388368

DOI:10.1145/3441250

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 May 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

National Key Research and Development Program

Conference

ICAIP 2020

ICAIP 2020: 2020 4th International Conference on Advances in Image Processing

November 13 - 15, 2020

Chengdu, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
67
Total Downloads

Downloads (Last 12 months)8
Downloads (Last 6 weeks)0

Reflects downloads up to 03 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Gridin VNovikov ISalem BSolodovnikov V(2023)Classification of the most common conditionally pathogenic microorganisms on SEM images with YOLO model2023 IX International Conference on Information Technology and Nanotechnology (ITNT)10.1109/ITNT57377.2023.10139188(1-5)Online publication date: 17-Apr-2023
https://doi.org/10.1109/ITNT57377.2023.10139188
Salem BSolodovnikov VNovikov IGridin V(2022)Semi-automatic one-class image labeling using a neural network object detection model2022 VIII International Conference on Information Technology and Nanotechnology (ITNT)10.1109/ITNT55410.2022.9848575(1-5)Online publication date: 23-May-2022
https://doi.org/10.1109/ITNT55410.2022.9848575

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten