Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3394171.3413835acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Objectness Consistent Representation for Weakly Supervised Object Detection

Published: 12 October 2020 Publication History

Editorial Notes

The authors have requested minor, non-substantive changes to the VoR and, in accordance with ACM policies, a Corrected VoR was published on March 4, 2021. For reference purposes the VoR may still be accessed via the Supplemental Material section on this page.

Abstract

Weakly supervised object detection aims at learning object detectors with only image-level category labels. Most existing methods tend to solve this problem by using a multiple instance learning detector which is usually trapped to discriminate object parts. In order to select high-quality proposals, recent works leverage objectness scores derived from weakly-supervised segmentation maps to rank the object proposals. Base on our observation, this kind of segmentation guided method always fails due to neglect of the fact that the objectness of all proposals inside the ground-truth box should be consistent. In this paper, we propose a novel object representation named Objectness Consistent Representation (OCRepr) to meet the consistency criterion of objectness. Specifically, we project the segmentation confidence scores into two orthogonal directions, namely vertical and horizontal, to get the OCRepr. With the novel object representation, more high-quality proposals can be mined for learning a much stronger object detector. We obtain 54.6% and 51.1% mAP scores on VOC 2007 and 2012 datasets, significantly outperforming the state-of-the-art and demonstrating the superiority of OCRepr for weakly supervised object detection.

Supplementary Material

MP4 File (3394171.3413835.mp4)
Weakly supervised object detection aims at learning object detectors with only image-level category labels.\r\nMost existing methods tend to solve this problem by using a multiple instance learning detector which is usually trapped to discriminate object parts. In order to select high-quality proposals, recent works leverage objectness scores derived from weakly-supervised segmentation to rank the object proposals. Base on our observation, this kind of segmentation guided method always fails due to neglect of the fact that the objectness of all proposals inside the ground-truth box should be consistent. We propose a novel object representation named Objectness Consistent Representation (OCRepr) to meet the consistency criterion of objectness. Specifically, we project the segmentation confidence scores into two orthogonal directions, namely vertical and horizontal, to get the OCRepr. With the novel object representation, more high-quality proposals can be mined for learning a much stronger object detector.

References

[1]
Jiwoon Ahn, Sunghyun Cho, and Suha Kwak. [n.d.]. Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations. In CVPR 2019.
[2]
Jiwoon Ahn and Suha Kwak. [n.d.]. Learning pixel-level semantic affinity with image-level supervision for weakly supervised semantic segmentation. In CVPR 2018.
[3]
Hakan Bilen and Andrea Vedaldi. [n.d.]. Weakly supervised deep detection networks. In CVPR 2016.
[4]
Ramazan Gokberk Cinbis, Jakob Verbeek, and Cordelia Schmid. [n.d.]. Weakly supervised object localization with multi-fold multiple instance learning. IEEE TPAMI 2017 ( [n.,d.]).
[5]
Ali Diba, Vivek Sharma, Ali Mohammad Pazandeh, Hamed Pirsiavash, and Luc Van Gool. [n.d.]. Weakly Supervised Cascaded Convolutional Networks. In CVPR 2017.
[6]
Mark Everingham, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. [n.d.]. The PASCAL Visual Object Classes (VOC) Challenge. IJCV 2010 ( [n.,d.]).
[7]
Weifeng Ge, Sibei Yang, and Yizhou Yu. [n.d.]. Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning. In CVPR 2018.
[8]
Ross Girshick. [n.d.]. Fast R-CNN. In ICCV 2015.
[9]
Judy Hoffman, Deepak Pathak, Trevor Darrell, and Kate Saenko. [n.d.]. Detector discovery in the wild: Joint multiple instance and representation learning. In CVPR 2015.
[10]
Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. [n.d.]. Caffe: Convolutional architecture for fast feature embedding. In ACM MM 2014.
[11]
Zequn Jie, Yunchao Wei, Xiaojie Jin, Jiashi Feng, and Wei Liu. [n.d.]. Deep self-taught learning for weakly supervised object localization. In CVPR 2017.
[12]
Vadim Kantorov, Maxime Oquab, Minsu Cho, and Ivan Laptev. [n.d.]. Contextlocnet: Context-aware deep network models for weakly supervised localization. In ECCV 2016.
[13]
Alexander Kolesnikov and Christoph H Lampert. [n.d.]. Seed, expand and constrain: Three principles for weakly-supervised image segmentation. In ECCV 2016.
[14]
Satoshi Kosugi, Toshihiko Yamasaki, and Kiyoharu Aizawa. 2019. Object-aware instance labeling for weakly supervised object detection. In Proceedings of the IEEE International Conference on Computer Vision. 6064--6072.
[15]
Dong Li, Jia-Bin Huang, Yali Li, Shengjin Wang, and Ming-Hsuan Yang. [n.d.]. Weakly supervised object localization with progressive domain adaptation. In CVPR 2016.
[16]
Xiaoyan Li, Meina Kan, Shiguang Shan, and Xilin Chen. 2019. Weakly Supervised Object Detection With Segmentation Collaboration. In Proceedings of the IEEE International Conference on Computer Vision. 9735--9744.
[17]
Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Yong Jae Lee, Alexander G Schwing, and Jan Kautz. 2020. Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10598--10607.
[18]
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, et al. [n.d.]. Imagenet large scale visual recognition challenge. IJCV 2015 ( [n.,d.]).
[19]
Yunhang Shen, Rongrong Ji, Yan Wang, Yongjian Wu, and Liujuan Cao. [n.d.] a. Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation. In CVPR 2019.
[20]
Yunhan Shen, Rongrong Ji, Shengchuan Zhang, Wangmeng Zuo, Yan Wang, and F Huang. [n.d.] b. Generative adversarial learning towards fast weakly supervised detection. In CVPR 2018.
[21]
Karen Simonyan and Andrew Zisserman. [n.d.]. Very deep convolutional networks for large-scale image recognition. In ICLR 2015.
[22]
Peng Tang, Xinggang Wang, Song Bai, Wei Shen, Xiang Bai, Wenyu Liu, and Alan Yuille. [n.d.] b. PCL: Proposal Cluster Learning for Weakly Supervised Object Detection. IEEE TPAMI 2018 ( [n.,d.]).
[23]
Peng Tang, Xinggang Wang, Xiang Bai, and Wenyu Liu. [n.d.] a. Multiple Instance Detection Network with Online Instance Classifier Refinement. In CVPR 2017.
[24]
Peng Tang, Xinggang Wang, Angtian Wang, Yongluan Yan, Wenyu Liu, Junzhou Huang, and Alan Yuille. [n.d.] c. Weakly Supervised Region Proposal Network and Object Detection. In ECCV 2018.
[25]
Jasper RR Uijlings, Koen EA Van De Sande, Theo Gevers, and Arnold WM Smeulders. 2013. Selective search for object recognition. IJCV (2013).
[26]
Fang Wan, Chang Liu, Wei Ke, Xiangyang Ji, Jianbin Jiao, and Qixiang Ye. 2019. C-MIL: Continuation multiple instance learning for weakly supervised object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2199--2208.
[27]
Yunchao Wei, Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, Yao Zhao, and Shuicheng Yan. [n.d.] a. Object region mining with adversarial erasing: A simple classification to semantic segmentation approach. In CVPR 2017.
[28]
Yunchao Wei, Zhiqiang Shen, Bowen Cheng, Honghui Shi, Jinjun Xiong, Jiashi Feng, and Thomas Huang. [n.d.] b. TS2C: tight box mining with surrounding segmentation context for weakly supervised object detection. In ECCV 2018.
[29]
Ke Yang, Dongsheng Li, and Yong Dou. 2019. Towards Precise End-to-End Weakly Supervised Object Detection Network. In The IEEE International Conference on Computer Vision (ICCV).
[30]
Xiaopeng Zhang, Jiashi Feng, Hongkai Xiong, and Qi Tian. [n.d.] b. Zigzag Learning for Weakly Supervised Object Detection. In CVPR 2018.
[31]
Xiaopeng Zhang, Yang Yang, and Jiashi Feng. [n.d.] c. ML-LocNet: Improving Object Localization with Multi-view Learning Network. In ECCV 2018.
[32]
Yongqiang Zhang, Yancheng Bai, Mingli Ding, Yongqiang Li, and Bernard Ghanem. [n.d.] a. W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection. In CVPR 2018.
[33]
Xiangyun Zhao, Shuang Liang, and Yichen Wei. 2018. Pseudo mask augmented object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4061--4070.

Cited By

View all
  • (2024)Enhanced Spatial Feature Learning for Weakly Supervised Object DetectionIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.317818035:1(961-972)Online publication date: Jan-2024
  • (2024)PanorAMS: Automatic Annotation for Detecting Objects in Urban ContextIEEE Transactions on Multimedia10.1109/TMM.2023.327969626(1281-1294)Online publication date: 1-Jan-2024
  • (2024)Misclassification in Weakly Supervised Object DetectionIEEE Transactions on Image Processing10.1109/TIP.2024.340298133(3413-3427)Online publication date: 2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '20: Proceedings of the 28th ACM International Conference on Multimedia
October 2020
4889 pages
ISBN:9781450379885
DOI:10.1145/3394171
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 October 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. image retrieval
  2. object detection
  3. weakly supervised

Qualifiers

  • Research-article

Funding Sources

  • National Natural Science Foundation of China
  • National Key Research and Development Program of China

Conference

MM '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Enhanced Spatial Feature Learning for Weakly Supervised Object DetectionIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.317818035:1(961-972)Online publication date: Jan-2024
  • (2024)PanorAMS: Automatic Annotation for Detecting Objects in Urban ContextIEEE Transactions on Multimedia10.1109/TMM.2023.327969626(1281-1294)Online publication date: 1-Jan-2024
  • (2024)Misclassification in Weakly Supervised Object DetectionIEEE Transactions on Image Processing10.1109/TIP.2024.340298133(3413-3427)Online publication date: 2024
  • (2024)DAR-ILL: Double-Attention Refining and Iterative Labeling Learning for Weakly Supervised Object DetectionIEEE Transactions on Cognitive and Developmental Systems10.1109/TCDS.2023.331267916:3(899-911)Online publication date: Jun-2024
  • (2023)ALFPN: Adaptive Learning Feature Pyramid Network for Small Object DetectionInternational Journal of Intelligent Systems10.1155/2023/62662092023(1-14)Online publication date: 21-Apr-2023
  • (2023)FI-WSOD: Foreground Information Guided Weakly Supervised Object DetectionIEEE Transactions on Multimedia10.1109/TMM.2022.319801825(1890-1902)Online publication date: 1-Jan-2023
  • (2023)Selecting High-Quality Proposals for Weakly Supervised Object Detection With Bottom-Up Aggregated Attention and Phase-Aware LossIEEE Transactions on Image Processing10.1109/TIP.2022.323174432(682-693)Online publication date: 2023
  • (2023)Hybrid Contrastive Prototypical Network for Few-Shot Scene Classification2023 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP49359.2023.10223151(3588-3592)Online publication date: 8-Oct-2023
  • (2023)Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.00645(6985-6995)Online publication date: 1-Oct-2023
  • (2023)VPPT: Visual Pre-Trained Prompt Tuning Framework for Few-Shot Image ClassificationICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP49357.2023.10095154(1-5)Online publication date: 4-Jun-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media