End-to-End Instance Segmentation with Recurrent Attention

Ren, Mengye; Zemel, Richard S.

Computer Science > Machine Learning

arXiv:1605.09410 (cs)

[Submitted on 30 May 2016 (v1), last revised 13 Jul 2017 (this version, v5)]

Title:End-to-End Instance Segmentation with Recurrent Attention

Authors:Mengye Ren, Richard S. Zemel

View PDF

Abstract:While convolutional neural networks have gained impressive success recently in solving structured prediction problems such as semantic segmentation, it remains a challenge to differentiate individual object instances in the scene. Instance segmentation is very important in a variety of applications, such as autonomous driving, image captioning, and visual question answering. Techniques that combine large graphical models with low-level vision have been proposed to address this problem; however, we propose an end-to-end recurrent neural network (RNN) architecture with an attention mechanism to model a human-like counting process, and produce detailed instance segmentations. The network is jointly trained to sequentially produce regions of interest as well as a dominant object segmentation within each region. The proposed model achieves competitive results on the CVPPP, KITTI, and Cityscapes datasets.

Comments:	CVPR 2017
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1605.09410 [cs.LG]
	(or arXiv:1605.09410v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1605.09410

Submission history

From: Mengye Ren [view email]
[v1] Mon, 30 May 2016 20:40:20 UTC (8,852 KB)
[v2] Tue, 6 Sep 2016 15:09:06 UTC (8,891 KB)
[v3] Sun, 27 Nov 2016 17:41:57 UTC (8,221 KB)
[v4] Mon, 16 Jan 2017 23:08:35 UTC (8,221 KB)
[v5] Thu, 13 Jul 2017 00:53:33 UTC (5,962 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2016-05

Change to browse by:

cs
cs.CV

References & Citations

1 blog link

(what is this?)

DBLP - CS Bibliography

listing | bibtex

Mengye Ren
Richard S. Zemel

export BibTeX citation

Computer Science > Machine Learning

Title:End-to-End Instance Segmentation with Recurrent Attention

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:End-to-End Instance Segmentation with Recurrent Attention

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators