Deformable Convolutional Networks

Dai, Jifeng; Qi, Haozhi; Xiong, Yuwen; Li, Yi; Zhang, Guodong; Hu, Han; Wei, Yichen

Computer Science > Computer Vision and Pattern Recognition

arXiv:1703.06211v2 (cs)

[Submitted on 17 Mar 2017 (v1), revised 22 Mar 2017 (this version, v2), latest version 5 Jun 2017 (v3)]

Title:Deformable Convolutional Networks

Authors:Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei

View PDF

Abstract:Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in its building modules. In this work, we introduce two new modules to enhance the transformation modeling capacity of CNNs, namely, deformable convolution and deformable RoI pooling. Both are based on the idea of augmenting the spatial sampling locations in the modules with additional offsets and learning the offsets from target tasks, without additional supervision. The new modules can readily replace their plain counterparts in existing CNNs and can be easily trained end-to-end by standard back-propagation, giving rise to deformable convolutional networks. Extensive experiments validate the effectiveness of our approach on sophisticated vision tasks of object detection and semantic segmentation. The code would be released.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1703.06211 [cs.CV]
	(or arXiv:1703.06211v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1703.06211

Submission history

From: Jifeng Dai [view email]
[v1] Fri, 17 Mar 2017 21:58:20 UTC (6,904 KB)
[v2] Wed, 22 Mar 2017 12:39:32 UTC (6,906 KB)
[v3] Mon, 5 Jun 2017 10:08:50 UTC (6,587 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2017-03

Change to browse by:

References & Citations

3 blog links

(what is this?)

DBLP - CS Bibliography

listing | bibtex

Jifeng Dai
Haozhi Qi
Yuwen Xiong
Yi Li
Guodong Zhang

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Deformable Convolutional Networks

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deformable Convolutional Networks

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators