GMT: Guided Mask Transformer for Leaf Instance Segmentation

Chen, Feng; Tsaftaris, Sotirios A.; Giuffrida, Mario Valerio

arXiv:2406.17109 (cs)

[Submitted on 24 Jun 2024 (v1), last revised 11 Sep 2024 (this version, v2)]

Abstract: Leaf instance segmentation is a challenging multi-instance segmentation task, aiming to separate and delineate each leaf in an image of a plant. Accurate segmentation of each leaf is crucial for plant-related applications such as the fine-grained monitoring of plant growth and crop yield estimation. This task is challenging because of the high similarity (in shape and colour), great size variation, and heavy occlusions among leaf instances. Furthermore, the typically small size of annotated leaf datasets makes it more difficult to learn the distinctive features needed for precise segmentation. We hypothesise that the key to overcoming these challenges lies in the specific spatial patterns of leaf distribution. In this paper, we propose the Guided Mask Transformer (GMT), which leverages and integrates leaf spatial distribution priors into a Transformer-based segmentor. These spatial priors are embedded in a set of guide functions that map leaves at different positions into a more separable embedding space. Our GMT consistently outperforms the state-of-the-art on three public plant datasets.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.17109 [cs.CV]
	(or arXiv:2406.17109v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.17109

Submission history

From: Feng Chen
[v1] Mon, 24 Jun 2024 19:52:27 UTC (3,334 KB)
[v2] Wed, 11 Sep 2024 14:32:51 UTC (5,151 KB)
