Seesaw Loss for Long-Tailed Instance Segmentation

Wang, Jiaqi; Zhang, Wenwei; Zang, Yuhang; Cao, Yuhang; Pang, Jiangmiao; Gong, Tao; Chen, Kai; Liu, Ziwei; Loy, Chen Change; Lin, Dahua

Computer Science > Computer Vision and Pattern Recognition

arXiv:2008.10032v3 (cs)

[Submitted on 23 Aug 2020 (v1), revised 24 Dec 2020 (this version, v3), latest version 17 Jun 2021 (v4)]

Title:Seesaw Loss for Long-Tailed Instance Segmentation

Authors:Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin

View PDF

Abstract:Instance segmentation has witnessed a remarkable progress on class-balanced benchmarks. However, they fail to perform as accurately in real-world scenarios, where the category distribution of objects naturally comes with a long tail. Instances of head classes dominate a long-tailed dataset and they serve as negative samples of tail categories. The overwhelming gradients of negative samples on tail classes lead to a biased learning process for classifiers. Consequently, objects of tail categories are more likely to be misclassified as backgrounds or head categories. To tackle this problem, we propose Seesaw Loss to dynamically re-balance gradients of positive and negative samples for each category, with two complementary factors, i.e., mitigation factor and compensation factor. The mitigation factor reduces punishments to tail categories w.r.t. the ratio of cumulative training instances between different categories. Meanwhile, the compensation factor increases the penalty of misclassified instances to avoid false positives of tail categories. We conduct extensive experiments on Seesaw Loss with mainstream frameworks and different data sampling strategies. With a simple end-to-end training pipeline, Seesaw Loss obtains significant gains over Cross-Entropy Loss, and achieves state-of-the-art performance on LVIS dataset without bells and whistles.

Comments:	Technical Report
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2008.10032 [cs.CV]
	(or arXiv:2008.10032v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2008.10032

Submission history

From: Jiaqi Wang [view email]
[v1] Sun, 23 Aug 2020 12:44:45 UTC (44 KB)
[v2] Mon, 7 Dec 2020 12:38:15 UTC (908 KB)
[v3] Thu, 24 Dec 2020 13:16:00 UTC (916 KB)
[v4] Thu, 17 Jun 2021 15:13:10 UTC (1,098 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Seesaw Loss for Long-Tailed Instance Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Seesaw Loss for Long-Tailed Instance Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators