research-article

Objectness Consistent Representation for Weakly Supervised Object Detection

Authors:

Ke Yang,

Yong DouAuthors Info & Claims

MM '20: Proceedings of the 28th ACM International Conference on Multimedia

Pages 1688 - 1696

https://doi.org/10.1145/3394171.3413835

Published: 12 October 2020 Publication History

Get Access

Editorial Notes

The authors have requested minor, non-substantive changes to the VoR and, in accordance with ACM policies, a Corrected VoR was published on March 4, 2021. For reference purposes the VoR may still be accessed via the Supplemental Material section on this page.

Abstract

Weakly supervised object detection aims at learning object detectors with only image-level category labels. Most existing methods tend to solve this problem by using a multiple instance learning detector which is usually trapped to discriminate object parts. In order to select high-quality proposals, recent works leverage objectness scores derived from weakly-supervised segmentation maps to rank the object proposals. Base on our observation, this kind of segmentation guided method always fails due to neglect of the fact that the objectness of all proposals inside the ground-truth box should be consistent. In this paper, we propose a novel object representation named Objectness Consistent Representation (OCRepr) to meet the consistency criterion of objectness. Specifically, we project the segmentation confidence scores into two orthogonal directions, namely vertical and horizontal, to get the OCRepr. With the novel object representation, more high-quality proposals can be mined for learning a much stronger object detector. We obtain 54.6% and 51.1% mAP scores on VOC 2007 and 2012 datasets, significantly outperforming the state-of-the-art and demonstrating the superiority of OCRepr for weakly supervised object detection.

Supplementary Material

MP4 File (3394171.3413835.mp4)

Weakly supervised object detection aims at learning object detectors with only image-level category labels.\r\nMost existing methods tend to solve this problem by using a multiple instance learning detector which is usually trapped to discriminate object parts. In order to select high-quality proposals, recent works leverage objectness scores derived from weakly-supervised segmentation to rank the object proposals. Base on our observation, this kind of segmentation guided method always fails due to neglect of the fact that the objectness of all proposals inside the ground-truth box should be consistent. We propose a novel object representation named Objectness Consistent Representation (OCRepr) to meet the consistency criterion of objectness. Specifically, we project the segmentation confidence scores into two orthogonal directions, namely vertical and horizontal, to get the OCRepr. With the novel object representation, more high-quality proposals can be mined for learning a much stronger object detector.

Download
6.34 MB

References

[1]

Jiwoon Ahn, Sunghyun Cho, and Suha Kwak. [n.d.]. Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations. In CVPR 2019.

Editorial Notes

Abstract

Supplementary Material

References

Cited By

Recommendations

Adaptive Generation of Weakly Supervised Semantic Segmentation for Object Detection

Weakly Supervised Object Detection Based on Active Learning

Proposal-Refined Weakly Supervised Object Detection in Underwater Images

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations