K-Net: Towards Unified Image Segmentation

Zhang, Wenwei; Pang, Jiangmiao; Chen, Kai; Loy, Chen Change

Computer Science > Computer Vision and Pattern Recognition

arXiv:2106.14855 (cs)

[Submitted on 28 Jun 2021 (v1), last revised 1 Nov 2021 (this version, v2)]

Title:K-Net: Towards Unified Image Segmentation

Authors:Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy

View PDF

Abstract:Semantic, instance, and panoptic segmentations have been addressed using different and specialized frameworks despite their underlying connections. This paper presents a unified, simple, and effective framework for these essentially similar tasks. The framework, named K-Net, segments both instances and semantic categories consistently by a group of learnable kernels, where each kernel is responsible for generating a mask for either a potential instance or a stuff class. To remedy the difficulties of distinguishing various instances, we propose a kernel update strategy that enables each kernel dynamic and conditional on its meaningful group in the input image. K-Net can be trained in an end-to-end manner with bipartite matching, and its training and inference are naturally NMS-free and box-free. Without bells and whistles, K-Net surpasses all previous published state-of-the-art single-model results of panoptic segmentation on MS COCO test-dev split and semantic segmentation on ADE20K val split with 55.2% PQ and 54.3% mIoU, respectively. Its instance segmentation performance is also on par with Cascade Mask R-CNN on MS COCO with 60%-90% faster inference speeds. Code and models will be released at this https URL.

Comments:	Camera ready for NeurIPS2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2106.14855 [cs.CV]
	(or arXiv:2106.14855v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2106.14855

Submission history

From: Wenwei Zhang [view email]
[v1] Mon, 28 Jun 2021 17:18:21 UTC (24,492 KB)
[v2] Mon, 1 Nov 2021 17:40:49 UTC (13,206 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:K-Net: Towards Unified Image Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:K-Net: Towards Unified Image Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators