TETRIS: Towards Exploring the Robustness of Interactive Segmentation

Moskalenko, Andrey; Shakhuro, Vlad; Vorontsova, Anna; Konushin, Anton; Antonov, Anton; Krapukhin, Alexander; Shepelev, Denis; Soshin, Konstantin

doi:10.1609/aaai.v38i5.28225

Computer Science > Computer Vision and Pattern Recognition

arXiv:2402.06132 (cs)

[Submitted on 9 Feb 2024]

Abstract: Interactive segmentation methods rely on user inputs to iteratively update the selection mask. A click specifying the object of interest is arguably the simplest and most intuitive interaction type, and therefore the most common choice for interactive segmentation. However, user clicking patterns in the interactive segmentation context remain unexplored. Accordingly, interactive segmentation evaluation strategies rely more on intuition and common sense than on empirical studies (e.g., assuming that users tend to click in the center of the area with the largest error). In this work, we conduct a user study to investigate real user clicking patterns. This study reveals that the intuitive assumption made in the common evaluation strategy may not hold. As a result, interactive segmentation models may achieve high scores on standard benchmarks, but this does not imply that they would perform well in a real-world scenario. To assess the applicability of interactive segmentation methods, we propose a novel evaluation strategy that provides a more comprehensive analysis of a model's performance. To this end, we propose a methodology for finding extreme user inputs by direct optimization in a white-box adversarial attack on the interactive segmentation model. Based on the performance with such adversarial user inputs, we assess the robustness of interactive segmentation models w.r.t. click positions. In addition, we introduce a novel benchmark for measuring the robustness of interactive segmentation and report the results of an extensive evaluation of dozens of models.
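
The baseline assumption questioned in the abstract (a user clicks in the center of the area with the largest error) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the use of 4-connectivity, and the choice of "center" as the pixel farthest from the region border in city-block distance are all assumptions made for illustration. Masks are plain nested lists of 0/1 values.

```python
from collections import deque


def _neighbors(y, x):
    """4-connected neighbours of a pixel (may fall outside the image)."""
    return ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1))


def largest_error_region(pred, gt):
    """Pixels of the largest connected region where pred disagrees with gt.

    False negatives and false positives are kept in separate regions:
    a region only grows through error pixels with the same gt value.
    """
    h, w = len(gt), len(gt[0])
    seen = [[False] * w for _ in range(h)]
    best = []
    for r in range(h):
        for c in range(w):
            if seen[r][c] or pred[r][c] == gt[r][c]:
                continue
            # Flood-fill one connected error region of a single error type.
            region, queue = [], deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                region.append((y, x))
                for ny, nx in _neighbors(y, x):
                    if (0 <= ny < h and 0 <= nx < w and not seen[ny][nx]
                            and pred[ny][nx] != gt[ny][nx]
                            and gt[ny][nx] == gt[r][c]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(region) > len(best):
                best = region
    return best


def simulate_click(pred, gt):
    """Return (row, col, is_positive) for the simulated click, or None.

    The click is placed at the pixel of the largest error region that is
    farthest from the region border (hypothetical notion of "center").
    """
    region = largest_error_region(pred, gt)
    if not region:
        return None  # prediction already matches the ground truth
    inside = set(region)
    dist, queue = {}, deque()
    # Border pixels of the region have at least one neighbour outside it.
    for p in region:
        if any(n not in inside for n in _neighbors(*p)):
            dist[p] = 1
            queue.append(p)
    # Multi-source BFS inward gives each pixel its distance to the border.
    while queue:
        p = queue.popleft()
        for n in _neighbors(*p):
            if n in inside and n not in dist:
                dist[n] = dist[p] + 1
                queue.append(n)
    y, x = max(region, key=lambda p: dist[p])
    # A missed object pixel needs a positive click, a false alarm a negative one.
    return y, x, bool(gt[y][x])
```

For a 3x3 object centered in a 5x5 image and an empty prediction, `simulate_click` returns the object's central pixel as a positive click. A robustness study of the kind the abstract describes would replace this single deterministic position with other plausible or adversarial positions inside the same region.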

Comments:	Accepted by AAAI 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
MSC classes:	68T45
ACM classes:	I.4.6
Cite as:	arXiv:2402.06132 [cs.CV]
	(or arXiv:2402.06132v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.06132
Related DOI:	https://doi.org/10.1609/aaai.v38i5.28225

Submission history

From: Andrey Moskalenko
[v1] Fri, 9 Feb 2024 01:36:21 UTC (18,354 KB)
