Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to content

Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything

Notifications You must be signed in to change notification settings

Vibashan/PosSAM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

PosSAM: Panoptic Open-vocabulary Segment Anything

Framework: PyTorch

[Project Page] [arXiv] [PDF] [Slides] [BibTeX]

PWC

Contributions
  • We introduce PosSAM, an open-vocabulary panoptic segmentation model that generates class and instance-aware masks with excellent generalization to a variety of visual concepts by unifying SAM and CLIP in an end-to-end trainable framework.
  • We develop a novel Local Discriminative Pooling (LDP) module to enhance discriminative CLIP features with class-agnostic SAM features for an unbiased OV classification.
  • We introduce the Mask-Aware Selective Ensembling algorithm to adaptively discern between seen and unseen classes by leveraging IoU and LDP confidence scores for each image.
  • We conduct extensive experiments and demonstrate superior performance over existing state-of-the-art open-vocabulary panoptic segmentation methods across multiple benchmark datasets.

Training

'Coming Soon...!!!'

Inference

'Coming Soon...!!!'

About

Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published