Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3394171.3413733acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

SalGCN: Saliency Prediction for 360-Degree Images Based on Spherical Graph Convolutional Networks

Published: 12 October 2020 Publication History

Abstract

The non-Euclidean geometry characteristic poses a challenge to the saliency prediction for 360-degree images. Since spherical data cannot be projected onto a single plane without distortion, existing saliency prediction methods based on traditional CNNs are inefficient. In this paper, we propose a saliency prediction framework for 360-degree images based on graph convolutional networks (SalGCN), which directly applies to the spherical graph signals. Specifically, we adopt the Geodesic ICOsahedral Pixelation (GICOPix) to construct a spherical graph signal from a spherical image in equirectangular projection (ERP) format. We then propose a graph saliency prediction network to directly extract the spherical features and generate the spherical graph saliency map, where we design an unpooling method suitable for spherical graph signals based on linear interpolation. The network training process is realized by modeling the node regression problem of the input and output spherical graph signals, where we further design a Kullback-Leibler (KL) divergence loss with sparse consistency to make the sparseness of the saliency map closer to the ground truth. Eventually, to obtain the ERP format saliency map for evaluation, we further propose a spherical crown-based (SCB) interpolation method to convert the output spherical graph saliency map into a saliency map in ERP format. Experiments show that our SalGCN can achieve comparable or even better saliency prediction performance both subjectively and objectively, with a much lower computation complexity.

Supplementary Material

MP4 File (3394171.3413733.mp4)
We propose a saliency prediction architecture for 360-degree images based on graph convolutional networks (SalGCN). This method can achieve excellent performance with very low computational complexity.

References

[1]
Fang-Yi Chao, Lu Zhang, Wassim Hamidouche, and Olivier Deforges. 2018. SAlGAN360: visual saliency prediction on 360 degree images with generative adversarial networks. In 2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 01--04.
[2]
Benjamin Coors, Alexandru Paul Condurache, and Andreas Geiger. 2018. Spherenet: Learning spherical representations for detection and classification in omnidirectional images. In Proceedings of the European Conference on Computer Vision (ECCV). 518--533.
[3]
Hongyang Gao and Shuiwang Ji. 2019. Graph u-nets. arXiv preprint arXiv:1905.05178 (2019).
[4]
Jesús Gutiérrez, Erwan David, Yashas Rai, and Patrick Le Callet. 2018. Toolbox and dataset for the development of saliency and scanpath models for omnidirectional/360 still images. Signal Processing: Image Communication, Vol. 69 (2018), 35--42.
[5]
Le Han, Xuelong Li, and Yongsheng Dong. 2018. SalNet: Edge constraint based end-to-end model for salient object detection. In Chinese Conference on Pattern Recognition and Computer Vision (PRCV). Springer, 186--198.
[6]
Jonathan Harel, Christof Koch, and Pietro Perona. 2007. Graph-based visual saliency. In Advances in neural information processing systems. 545--552.
[7]
Xun Huang, Chengyao Shen, Xavier Boix, and Qi Zhao. 2015. Salicon: Reducing the semantic gap in saliency prediction by adapting deep neural networks. In Proceedings of the IEEE International Conference on Computer Vision. 262--270.
[8]
Srinivas SS Kruthiventi, Kumar Ayush, and R Venkatesh Babu. 2017. Deepfix: A fully convolutional neural network for predicting human eye fixations. IEEE Transactions on Image Processing, Vol. 26, 9 (2017), 4446--4456.
[9]
Pierre Lebreton and Alexander Raake. 2018. GBVS360, BMS360, ProSal: Extending existing saliency prediction models from 2D to omnidirectional images. Signal Processing: Image Communication, Vol. 69 (2018), 69--78.
[10]
Jing Ling, Kao Zhang, Yingxue Zhang, Daiqin Yang, and Zhenzhong Chen. 2018. A saliency prediction model on 360 degree images using color dictionary based sparse representation. Signal Processing: Image Communication, Vol. 69 (2018), 60--68.
[11]
Maxime Meilland, Andrew I Comport, and Patrick Rives. 2010. A spherical robot-centered representation for urban navigation. In 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 5196--5201.
[12]
Rafael Monroy, Sebastian Lutz, Tejo Chalasani, and Aljosa Smolic. 2018. Salnet360: Saliency maps for omni-directional images with cnn. Signal Processing: Image Communication, Vol. 69 (2018), 26--34.
[13]
Cagri Ozcinar, Julián Cabrera, and Aljosa Smolic. 2019. Visual attention-aware omnidirectional video streaming using optimal tiles for virtual reality. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Vol. 9, 1 (2019), 217--230.
[14]
Junting Pan, Cristian Canton Ferrer, Kevin McGuinness, Noel E O'Connor, Jordi Torres, Elisa Sayrol, and Xavier Giro-i Nieto. 2017. Salgan: Visual saliency prediction with generative adversarial networks. arXiv preprint arXiv:1701.01081 (2017).
[15]
Junting Pan, Elisa Sayrol, Xavier Giro-i Nieto, Kevin McGuinness, and Noel E O'Connor. 2016. Shallow and deep convolutional networks for saliency prediction. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 598--606.
[16]
Yashas Rai, Jesús Gutiérrez, and Patrick Le Callet. 2017. A dataset of head and eye movements for 360 degree images. In Proceedings of the 8th ACM on Multimedia Systems Conference. 205--210.
[17]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention. Springer, 234--241.
[18]
Yu-Chuan Su and Kristen Grauman. 2017a. Learning spherical convolution for fast features from 360 imagery. In Advances in Neural Information Processing Systems. 529--539.
[19]
Yu-Chuan Su and Kristen Grauman. 2017b. Making 360 video watchable in 2d: Learning videography for click free viewing. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 1368--1376.
[20]
Yu-Chuan Su and Kristen Grauman. 2019. Kernel transformer networks for compact spherical convolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9442--9451.
[21]
Tatsuya Suzuki and Takao Yamanaka. 2018. Saliency map estimation for omni-directional image considering prior distributions. In 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, 2079--2084.
[22]
Qin Yang, Chenglin Li, Wenrui Dai, Junni Zou, GuoJun Qi, and Hongkai Xiong. 2020. Rotation Equivariant Graph Convolutional Network for Spherical Image Classification. In CVPR 2020. IEEE.
[23]
Qin Yang, Junni Zou, Kexin Tang, Chenglin Li, and Hongkai Xiong. 2019. Single and Sequential Viewports Prediction for 360-Degree Video Streaming. In 2019 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, 1--5.
[24]
Jianming Zhang and Stan Sclaroff. 2013. Saliency detection: A boolean map approach. In Proceedings of the IEEE international conference on computer vision. 153--160.
[25]
Ziheng Zhang, Yanyu Xu, Jingyi Yu, and Shenghua Gao. 2018. Saliency detection in 360 videos. In Proceedings of the European Conference on Computer Vision (ECCV). 488--503.

Cited By

View all
  • (2024)SVGC-AVA: 360-Degree Video Saliency Prediction With Spherical Vector-Based Graph Convolution and Audio-Visual AttentionIEEE Transactions on Multimedia10.1109/TMM.2023.330659626(3061-3076)Online publication date: 2024
  • (2023)View-Aware Salient Object Detection for $360^{\circ }$ Omnidirectional ImageIEEE Transactions on Multimedia10.1109/TMM.2022.320901525(6471-6484)Online publication date: 2023
  • (2023)SAVG360: Saliency-aware Viewport-guidance-enabled 360-video Streaming System2023 IEEE International Symposium on Multimedia (ISM)10.1109/ISM59092.2023.00011(36-43)Online publication date: 11-Dec-2023
  • Show More Cited By

Index Terms

  1. SalGCN: Saliency Prediction for 360-Degree Images Based on Spherical Graph Convolutional Networks

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '20: Proceedings of the 28th ACM International Conference on Multimedia
    October 2020
    4889 pages
    ISBN:9781450379885
    DOI:10.1145/3394171
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 October 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. 360 degree images
    2. graph convolutional neural networks
    3. graph unpooling
    4. interpolation
    5. saliency

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    MM '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 995 of 4,171 submissions, 24%

    Upcoming Conference

    MM '24
    The 32nd ACM International Conference on Multimedia
    October 28 - November 1, 2024
    Melbourne , VIC , Australia

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)58
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 15 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)SVGC-AVA: 360-Degree Video Saliency Prediction With Spherical Vector-Based Graph Convolution and Audio-Visual AttentionIEEE Transactions on Multimedia10.1109/TMM.2023.330659626(3061-3076)Online publication date: 2024
    • (2023)View-Aware Salient Object Detection for $360^{\circ }$ Omnidirectional ImageIEEE Transactions on Multimedia10.1109/TMM.2022.320901525(6471-6484)Online publication date: 2023
    • (2023)SAVG360: Saliency-aware Viewport-guidance-enabled 360-video Streaming System2023 IEEE International Symposium on Multimedia (ISM)10.1109/ISM59092.2023.00011(36-43)Online publication date: 11-Dec-2023
    • (2023) Viewing Bias Matters in 360 ° Videos Visual Saliency Prediction IEEE Access10.1109/ACCESS.2023.326956411(46084-46094)Online publication date: 2023
    • (2022)Machine Learning for Multimedia CommunicationsSensors10.3390/s2203081922:3(819)Online publication date: 21-Jan-2022
    • (2022)Intra- and Inter-Reasoning Graph Convolutional Network for Saliency Prediction on 360° ImagesIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2022.319715932:12(8730-8743)Online publication date: Dec-2022
    • (2022)MFVP: Mobile-Friendly Viewport Prediction for Live 360-Degree Video Streaming2022 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME52920.2022.9859789(1-6)Online publication date: 18-Jul-2022
    • (2022)Saliency-Based Multiple Region of Interest Detection From a Single 360° ImageIEEE Access10.1109/ACCESS.2022.320048610(89124-89133)Online publication date: 2022
    • (2022) SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360 videos Computers & Graphics10.1016/j.cag.2022.06.002106(200-209)Online publication date: Aug-2022
    • (2021)CDPProceedings of the 29th ACM International Conference on Multimedia10.1145/3474085.3475680(5491-5500)Online publication date: 17-Oct-2021
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media