DOI: 10.1145/3552458.3556451
Research article

PSINet: Progressive Saliency Iteration Network for RGB-D Salient Object Detection

Published: 10 October 2022
  • Abstract

    RGB-D Salient Object Detection (RGB-D SOD) is a pixel-level dense prediction task that highlights the prominent object in a scene by combining color information with depth constraints. Attention mechanisms have been widely employed in SOD due to their ability to capture important cues. However, most existing attention mechanisms (e.g., spatial attention, channel attention, self-attention) mainly exploit pixel-level attention maps, ignoring the region properties of salient objects. To remedy this issue, we propose a progressive saliency iteration network (PSINet) with region-wise saliency attention to improve the regional integrity of salient objects in an iterative manner. Specifically, two-stream Swin Transformers are first employed to extract RGB and depth features. Second, a multi-modality alternate and inverse module (AIM) is designed to extract complementary features from RGB-D images in an interleaved manner, which breaks down the inconsistency between the cross-modal data while fully capturing their complementarity. Third, a triple progressive iteration decoder (TPID) is proposed to optimize the salient objects: a coarse saliency map, generated by integrating multi-scale features with a U-Net, is used as a region-wise attention map in a region-wise saliency attention module (RSAM), which emphasizes the prominent regions of the features. Finally, the regional integrity of salient objects is gradually optimized from coarse to fine by iterating the above steps in the TPID. Quantitative and qualitative experiments demonstrate that the proposed model performs favorably against 19 state-of-the-art (SOTA) saliency detectors on five benchmark RGB-D SOD datasets.
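    The coarse-to-fine loop described in the abstract (coarse saliency map reused as a region-wise attention map, then iterated through the decoder) can be illustrated with a toy NumPy sketch. This is an illustrative assumption, not the authors' RSAM/TPID implementation: `region_wise_attention` and the sigmoid "decoder" stand-in below are hypothetical simplifications of the paper's learned modules.

    ```python
    import numpy as np

    def region_wise_attention(feat, sal):
        """Toy stand-in for RSAM: emphasize features inside the coarsely
        salient region. feat: (C, H, W) features, sal: (H, W) map in [0, 1]."""
        # Residual re-weighting: salient regions are amplified, the rest kept.
        return feat * (1.0 + sal[None, :, :])

    def progressive_refine(feat, coarse_sal, steps=3):
        """Toy stand-in for the TPID loop: attend -> decode -> new saliency map,
        repeated so the map is refined from coarse to fine."""
        sal = coarse_sal
        for _ in range(steps):
            attended = region_wise_attention(feat, sal)
            # Stand-in "decoder": channel mean squashed to [0, 1] by a sigmoid.
            sal = 1.0 / (1.0 + np.exp(-attended.mean(axis=0)))
        return sal

    rng = np.random.default_rng(0)
    feat = rng.standard_normal((4, 8, 8))   # pretend fused RGB-D features
    coarse = rng.random((8, 8))             # pretend U-Net coarse saliency map
    refined = progressive_refine(feat, coarse)
    print(refined.shape)
    ```

    In the actual network each iteration runs through learned decoder layers and supervision, but the structural point is the same: the previous saliency map gates which regions of the features the next decoding pass emphasizes.
    
    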

    Supplementary Material

    MP4 File (HCMA22-hcma12p.mp4)
    This video introduces our work on salient object detection, titled "Progressive Saliency Iteration Network for RGB-D Salient Object Detection"; it covers the introduction, motivation, solution, experiments, and ablation study.



      Published In

      HCMA '22: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis
      October 2022
      106 pages
      ISBN:9781450394925
      DOI:10.1145/3552458
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Author Tags

      1. attention
      2. progressive iteration
      3. regional integrity
      4. rgb-d images
      5. salient object detection

      Qualifiers

      • Research-article

      Funding Sources

      • University-level key projects of Anhui University of Science and Technology
      • University Synergy Innovation Program of Anhui Province
      • the National Natural Science Foundation of China
      • University-level general projects of Anhui University of Science and Technology
      • Natural Science Research Project of Colleges and Universities in Anhui Province
      • Anhui Natural Science Foundation

      Conference

      MM '22

      Acceptance Rates

      HCMA '22 Paper Acceptance Rate 12 of 21 submissions, 57%;
      Overall Acceptance Rate 12 of 21 submissions, 57%

