research-article

SalGCN: Saliency Prediction for 360-Degree Images Based on Spherical Graph Convolutional Networks

Authors:

Hongkai XiongAuthors Info & Claims

MM '20: Proceedings of the 28th ACM International Conference on Multimedia

Pages 682 - 690

https://doi.org/10.1145/3394171.3413733

Published: 12 October 2020 Publication History

Get Access

Abstract

The non-Euclidean geometry characteristic poses a challenge to the saliency prediction for 360-degree images. Since spherical data cannot be projected onto a single plane without distortion, existing saliency prediction methods based on traditional CNNs are inefficient. In this paper, we propose a saliency prediction framework for 360-degree images based on graph convolutional networks (SalGCN), which directly applies to the spherical graph signals. Specifically, we adopt the Geodesic ICOsahedral Pixelation (GICOPix) to construct a spherical graph signal from a spherical image in equirectangular projection (ERP) format. We then propose a graph saliency prediction network to directly extract the spherical features and generate the spherical graph saliency map, where we design an unpooling method suitable for spherical graph signals based on linear interpolation. The network training process is realized by modeling the node regression problem of the input and output spherical graph signals, where we further design a Kullback-Leibler (KL) divergence loss with sparse consistency to make the sparseness of the saliency map closer to the ground truth. Eventually, to obtain the ERP format saliency map for evaluation, we further propose a spherical crown-based (SCB) interpolation method to convert the output spherical graph saliency map into a saliency map in ERP format. Experiments show that our SalGCN can achieve comparable or even better saliency prediction performance both subjectively and objectively, with a much lower computation complexity.

Supplementary Material

MP4 File (3394171.3413733.mp4)

We propose a saliency prediction architecture for 360-degree images based on graph convolutional networks (SalGCN). This method can achieve excellent performance with very low computational complexity.

Download
17.82 MB

References

[1]

Fang-Yi Chao, Lu Zhang, Wassim Hamidouche, and Olivier Deforges. 2018. SAlGAN360: visual saliency prediction on 360 degree images with generative adversarial networks. In 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 01--04.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Mesh saliency

Relevance of a feed-forward model of visual attention for goal-oriented and free-viewing tasks

Incorporating visual field characteristics into a saliency map

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Upcoming Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Get Access

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations