Full-Duplex Strategy for Video Object Segmentation

Ji, Ge-Peng; Fan, Deng-Ping; Fu, Keren; Wu, Zhe; Shen, Jianbing; Shao, Ling

doi:10.1007/s41095-021-0262-4

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.03151 (cs)

[Submitted on 6 Aug 2021 (v1), last revised 3 Sep 2021 (this version, v3)]

Title:Full-Duplex Strategy for Video Object Segmentation

Authors:Ge-Peng Ji, Deng-Ping Fan, Keren Fu, Zhe Wu, Jianbing Shen, Ling Shao

View PDF

Abstract:Previous video object segmentation approaches mainly focus on using simplex solutions between appearance and motion, limiting feature collaboration efficiency among and across these two cues. In this work, we study a novel and efficient full-duplex strategy network (FSNet) to address this issue, by considering a better mutual restraint scheme between motion and appearance in exploiting the cross-modal features from the fusion and decoding stage. Specifically, we introduce the relational cross-attention module (RCAM) to achieve bidirectional message propagation across embedding sub-spaces. To improve the model's robustness and update the inconsistent features from the spatial-temporal embeddings, we adopt the bidirectional purification module (BPM) after the RCAM. Extensive experiments on five popular benchmarks show that our FSNet is robust to various challenging scenarios (e.g., motion blur, occlusion) and achieves favourable performance against existing cutting-edges both in the video object segmentation and video salient object detection tasks. The project is publicly available at: this https URL.

Comments:	Accepted at ICCV-2021 (Journal Submission). Project Page: this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.03151 [cs.CV]
	(or arXiv:2108.03151v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.03151
Journal reference:	Comp. Visual Media 9, 155-175 (2023)
Related DOI:	https://doi.org/10.1007/s41095-021-0262-4

Submission history

From: Ge-Peng Ji [view email]
[v1] Fri, 6 Aug 2021 14:50:50 UTC (10,124 KB)
[v2] Thu, 19 Aug 2021 01:11:49 UTC (10,125 KB)
[v3] Fri, 3 Sep 2021 09:24:39 UTC (2,586 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Full-Duplex Strategy for Video Object Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Full-Duplex Strategy for Video Object Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators