Efficient Regional Memory Network for Video Object Segmentation

Xie, Haozhe; Yao, Hongxun; Zhou, Shangchen; Zhang, Shengping; Sun, Wenxiu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2103.12934 (cs)

[Submitted on 24 Mar 2021 (v1), last revised 27 Apr 2021 (this version, v2)]

Title:Efficient Regional Memory Network for Video Object Segmentation

Authors:Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun

View PDF

Abstract:Recently, several Space-Time Memory based networks have shown that the object cues (e.g. video frames as well as the segmented object masks) from the past frames are useful for segmenting objects in the current frame. However, these methods exploit the information from the memory by global-to-global matching between the current and past frames, which lead to mismatching to similar objects and high computational complexity. To address these problems, we propose a novel local-to-local matching solution for semi-supervised VOS, namely Regional Memory Network (RMNet). In RMNet, the precise regional memory is constructed by memorizing local regions where the target objects appear in the past frames. For the current query frame, the query regions are tracked and predicted based on the optical flow estimated from the previous frame. The proposed local-to-local matching effectively alleviates the ambiguity of similar objects in both memory and query frames, which allows the information to be passed from the regional memory to the query region efficiently and effectively. Experimental results indicate that the proposed RMNet performs favorably against state-of-the-art methods on the DAVIS and YouTube-VOS datasets.

Comments:	CVPR 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2103.12934 [cs.CV]
	(or arXiv:2103.12934v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.12934

Submission history

From: Haozhe Xie [view email]
[v1] Wed, 24 Mar 2021 02:08:46 UTC (4,983 KB)
[v2] Tue, 27 Apr 2021 23:02:25 UTC (4,984 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Haozhe Xie
Hongxun Yao
Shangchen Zhou
Shengping Zhang
Wenxiu Sun

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Efficient Regional Memory Network for Video Object Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Efficient Regional Memory Network for Video Object Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators