Context-Transformer: Tackling Object Confusion for Few-Shot Detection

Yang, Ze; Wang, Yali; Chen, Xianyu; Liu, Jianzhuang; Qiao, Yu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2003.07304 (cs)

[Submitted on 16 Mar 2020]

Title:Context-Transformer: Tackling Object Confusion for Few-Shot Detection

Authors:Ze Yang (1), Yali Wang (1), Xianyu Chen (1), Jianzhuang Liu (2), Yu Qiao (1 and 3) ((1) ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, (2) Huawei Noah's Ark Lab, (3) SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society)

View PDF

Abstract:Few-shot object detection is a challenging but realistic scenario, where only a few annotated training images are available for training detectors. A popular approach to handle this problem is transfer learning, i.e., fine-tuning a detector pretrained on a source-domain benchmark. However, such transferred detector often fails to recognize new objects in the target domain, due to low data diversity of training samples. To tackle this problem, we propose a novel Context-Transformer within a concise deep transfer framework. Specifically, Context-Transformer can effectively leverage source-domain object knowledge as guidance, and automatically exploit contexts from only a few training images in the target domain. Subsequently, it can adaptively integrate these relational clues to enhance the discriminative power of detector, in order to reduce object confusion in few-shot scenarios. Moreover, Context-Transformer is flexibly embedded in the popular SSD-style detectors, which makes it a plug-and-play module for end-to-end few-shot learning. Finally, we evaluate Context-Transformer on the challenging settings of few-shot detection and incremental few-shot detection. The experimental results show that, our framework outperforms the recent state-of-the-art approaches.

Comments:	Accepted by AAAI-2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2003.07304 [cs.CV]
	(or arXiv:2003.07304v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2003.07304

Submission history

From: Ze Yang [view email]
[v1] Mon, 16 Mar 2020 16:17:11 UTC (4,795 KB)

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computer Vision and Pattern Recognition

Title:Context-Transformer: Tackling Object Confusion for Few-Shot Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computer Vision and Pattern Recognition

Title:Context-Transformer: Tackling Object Confusion for Few-Shot Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators