research-article

An Adaptive Two-Layer Light Field Compression Scheme Using GNN-Based Reconstruction

Authors:

Shervin ShirmohammadiAuthors Info & Claims

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 16, Issue 2s

Article No.: 72, Pages 1 - 23

https://doi.org/10.1145/3395620

Published: 21 June 2020 Publication History

Abstract

As a new form of volumetric media, Light Field (LF) can provide users with a true six degrees of freedom immersive experience because LF captures the scene with photo-realism, including aperture-limited changes in viewpoint. But uncompressed LF data is too large for network transmission, which is the reason why LF compression has become an important research topic. One of the more recent approaches for LF compression is to reduce the angular resolution of the input LF during compression and to use LF reconstruction to recover the discarded viewpoints during decompression. Following this approach, we propose a new LF reconstruction algorithm based on Graph Neural Networks; we show that it can achieve higher compression and better quality compared to existing reconstruction methods, although suffering from the same problem as those methods—the inability to deal effectively with high-frequency image components. To solve this problem, we propose an adaptive two-layer compression architecture that separates high-frequency and low-frequency components and compresses each with a different strategy so that the performance can become robust and controllable. Experiments with multiple datasets¹ show that our proposed scheme is capable of providing a decompression quality of above 40 dB, and can significantly improve compression efficiency compared with similar LF reconstruction schemes.

References

[1]

Edward H. Adelson and James R. Bergen. 1991. The Plenoptic Function and the Elements of Early Vision. Vol. 2. Vision and Modeling Group, Media Laboratory, Massachusetts Institute of Technology.

[2]

Amar Aggoun. 2011. Compression of 3D integral images using 3D wavelet transform. Journal of Display Technology 7, 11 (2011), 586--592.

[3]

H. Amirpour, M. Pereira, and A. Pinheiro. 2018. High efficient snake order pseudo-sequence based light field image compression. In 2018 Data Compression Conference. 397--397.

[4]

Computer Graphics Laboratory, Stanford University 2008. Light Field Datasets. Retrieved from http://lightfield.stanford.edu/lfs.html.

[5]

Caroline Conti, Luís Ducla Soares, and Paulo Nunes. 2016. HEVC-based 3D holoscopic video coding using self-similarity compensated prediction. Signal Processing Image Communication 42 (2016), 59--78.

Digital Library

[6]

D. G. Dansereau, O. Pizarro, and S. B. Williams. 2013. Decoding, calibration and rectification for lenselet-based plenoptic cameras. In 2013 IEEE Conference on Computer Vision and Pattern Recognition. 1027--1034.

Digital Library

[7]

Steven J. Gortler, Radek Grzeszczuk, Richard Szeliski, and Michael F. Cohen. 1996. The lumigraph. In 23rd Annual Conference on Computer Graphics and Interactive Techniques. 43--54.

[8]

Bichuan Guo, Yuxing Han, and Jiangtao Wen. 2018. Convex optimization based bit allocation for light field compression under weighting and consistency constraints. In Data Compression Conference. IEEE.

[9]

Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems. 1024--1034.

[10]

Harini Priyadarshini Hariharan, Tobias Lange, and Thorsten Herfet. 2017. Low complexity light field compression based on pseudo-temporal circular sequencing. In IEEE International Symposium on Broadband Multimedia Systems and Broadcasting. 1--5.

[11]

Fatma Hawary, Christine Guillemot, Dominique Thoreau, and Guillaume Boisson. 2017. Scalable light field compression scheme using sparse reconstruction and restoration. In 2017 IEEE International Conference on Image Processing (ICIP). IEEE, 3250--3254.

[12]

Xinjue Hu, Jingming Shan, Yu Liu, and Lin Zhang. 2019. Adaptive two-layer light field compression scheme based on sparse reconstruction. In 10th ACM Multimedia Systems Conference. 74--85.

Digital Library

[13]

Xiaoran Jiang, Mikaël Le Pendu, Reuben A. Farrugia, and Christine Guillemot. 2017. Light field compression with homography-based low rank approximation. IEEE Journal of Selected Topics in Signal Processing PP, 99 (2017), 1--1.

[14]

Shinjini Kundu. 2012. Light field compression using homography and 2D warping. In IEEE International Conference on Acoustics, Speech and Signal Processing. 1349--1352.

[15]

Marc Levoy and Pat Hanrahan. 1996. Light field rendering. In 23rd Annual Conference on Computer Graphics and Interactive Techniques. 31--42.

Digital Library

[16]

Li Li, Zhu Li, Bin Li, Dong Liu, and Houqiang Li. 2017. Pseudo-sequence-based 2-D hierarchical coding structure for light-field image compression. IEEE Journal of Selected Topics in Signal Processing 11, 7 (2017), 1107--1119.

[17]

Yun Li, Marten Sjostrom, Roger Olsson, and Ulf Jennehag. 2014. Efficient intra prediction scheme for light field image compression. In IEEE International Conference on Acoustics, Speech and Signal Processing. 539--543.

[18]

Dong Liu, Lizhi Wang, Li Li, Zhiwei Xiong, Feng Wu, and Wenjun Zeng. 2016. Pseudo-sequence-based light field image compression. In IEEE International Conference on Multimedia and Expo Workshops. 1--4.

[19]

Luís F. R. Lucas, Caroline Conti, Paulo Nunes, Luís Ducla Soares, Nuno M. M. Rodrigues, Carla L. Pagliari, Eduardo A. B. Da Silva, and Sérgio M. M. De Faria. 2014. Locally linear embedding-based prediction for 3D holoscopic image coding using HEVC. In Signal Processing Conference. 11--15.

[20]

Ricardo Monteiro, Luis Lucas, Caroline Conti, Paulo Nunes, Nuno Rodrigues, Sergio Faria, Carla Pagliari, Eduardo Da Silva, and Luis Soares. 2016. Light field HEVC-based image coding using locally linear embedding and self-similarity compensated prediction. In IEEE International Conference on Multimedia and Expo Workshops. 1--4.

[21]

Ren Ng, Marc Levoy, Mathieu Brédif, Gene Duval, Mark Horowitz, Pat Hanrahan, et al. 2005. Light field photography with a hand-held plenoptic camera. Computer Science Technical Report CSTR 2, 11 (2005), 1--11.

[22]

Mathias Niepert, Mohamed Ahmed, and Konstantin Kutzkov. 2016. Learning convolutional neural networks for graphs. In International Conference on Machine Learning.

[23]

C. Perra and P. Assuncao. 2016. High efficiency coding of light field images based on tiling and pseudo-temporal data arrangement. In IEEE International Conference on Multimedia and Expo Workshops. 1--4.

[24]

Cristian Perra and Daniele Giusto. 2017. JPEG 2000 compression of unfocused light field images based on lenslet array slicing. In IEEE International Conference on Consumer Electronics. 27--28.

[25]

Reza Rassool. 2017. VMAF reproducibility: Validating a perceptual practical video quality metric. In IEEE International Symposium on Broadband Multimedia Systems and Broadcasting.

[26]

Franco Scarselli, Marco Gori, Ah Chung Tsoi, Markus Hagenbuchner, and Gabriele Monfardini. 2008. The graph neural network model. IEEE Transactions on Neural Networks 20, 1 (2008), 61--80.

Digital Library

[27]

Lixin Shi, Haitham Hassanieh, Abe Davis, Dina Katabi, and Fredo Durand. 2014. Light field reconstruction using sparsity in the continuous fourier domain. ACM Transactions on Graphics (TOG) 34, 1 (2014), 12.

Digital Library

[28]

Irene Viola, Hermina Petric Maretic, Pascal Frossard, and Touradj Ebrahimi. 2018. A graph learning approach for light field image compression. Applications of Digital Image Processing XLISpie-Int Soc Optical Engineering (2018), 12.

[29]

Gaochang Wu, Belen Masia, Adrian Jarabo, Yuchen Zhang, Liangyong Wang, Qionghai Dai, Tianyou Chai, and Yebin Liu. 2017. Light field image processing: An overview. IEEE Journal of Selected Topics in Signal Processing 11, 7 (2017), 926--954.

[30]

Shan Xu, Zhi-Liang Zhou, and Nicholas Devaney. 2014. Multi-view image restoration from plenoptic raw images. In Asian Conference on Computer Vision. Springer, 3--15.

[31]

Wei Zhang, Dong Liu, Zhiwei Xiong, and Jizheng Xu. 2018. SIFT-based adaptive prediction structure for light field compression. In Visual Communications and Image Processing. 1--4.

[32]

Xiang Zhang, Philip A. Chou, Ming Ting Sun, Maolong Tang, Shanshe Wang, Siwei Ma, and Wen Gao. 2018. Surface light field compression using a point cloud codec. IEEE Journal on Emerging and Selected Topics in Circuits and Systems 9, 1 (2018), 163--176.

Cited By

Yang CYang JZhai YWang R(2024)FICNet: An End to End Network for Free-View Image CodingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.339015134:9(8848-8861)Online publication date: 1-Sep-2024
https://dl.acm.org/doi/10.1109/TCSVT.2024.3390151
Pan YLuo KLiu YXu CLiu YZhang L(2024)Mobile edge assisted multi-view light field video system: Prototype design and empirical evaluationFuture Generation Computer Systems10.1016/j.future.2023.11.023153(154-168)Online publication date: May-2024
https://doi.org/10.1016/j.future.2023.11.023
Liang SMa WXie C(2023)Relation with Free Objects for Action RecognitionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/361759620:2(1-19)Online publication date: 18-Oct-2023
https://dl.acm.org/doi/10.1145/3617596
Show More Cited By

Index Terms

An Adaptive Two-Layer Light Field Compression Scheme Using GNN-Based Reconstruction
1. Computing methodologies
  1. Computer graphics
    1. Graphics systems and interfaces
      1. Virtual reality
    2. Image compression

Recommendations

Adaptive two-layer light field compression scheme based on sparse reconstruction
MMSys '19: Proceedings of the 10th ACM Multimedia Systems Conference

As a new form of volumetric media, the technology of light field and its compression has gradually become the research hotspots in academia. The scheme of compressing using the sparsity of the light field is a very promising idea, which has the ...
Perceptual Light Field Image Coding with CTU Level Bit Allocation
Computer Analysis of Images and Patterns
Abstract
Light field imaging simultaneously records the position and direction information of light in scene, as one of the important techniques for digital media. The amount of light field image (LFI) data is huge, it needs to be effectively compressed. ...
Layered Light Field Reconstruction for Defocus Blur

We present a novel algorithm for reconstructing high-quality defocus blur from a sparsely sampled light field. Our algorithm builds upon recent developments in the area of sheared reconstruction filters and significantly improves reconstruction quality ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 16, Issue 2s

Special Issue on Smart Communications and Networking for Future Video Surveillance and Special Section on Extended MMSYS-NOSSDAV 2019 Best Papers

April 2020

291 pages

ISSN:1551-6857

EISSN:1551-6865

DOI:10.1145/3407689

Editor:
Alberto Del Bimbo
University of Firenze, Italy

Issue’s Table of Contents

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 June 2020

Online AM: 07 May 2020

Accepted: 01 April 2020

Revised: 01 March 2020

Received: 01 December 2019

Published in TOMM Volume 16, Issue 2s

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

23
Total Citations
View Citations
409
Total Downloads

Downloads (Last 12 months)45
Downloads (Last 6 weeks)6

Reflects downloads up to 28 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Yang CYang JZhai YWang R(2024)FICNet: An End to End Network for Free-View Image CodingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.339015134:9(8848-8861)Online publication date: 1-Sep-2024
https://dl.acm.org/doi/10.1109/TCSVT.2024.3390151
Pan YLuo KLiu YXu CLiu YZhang L(2024)Mobile edge assisted multi-view light field video system: Prototype design and empirical evaluationFuture Generation Computer Systems10.1016/j.future.2023.11.023153(154-168)Online publication date: May-2024
https://doi.org/10.1016/j.future.2023.11.023
Liang SMa WXie C(2023)Relation with Free Objects for Action RecognitionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/361759620:2(1-19)Online publication date: 18-Oct-2023
https://dl.acm.org/doi/10.1145/3617596
Wang XZhu LWu FYang Y(2023)A Differentiable Parallel Sampler for Efficient Video ClassificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/356958419:3(1-18)Online publication date: 25-Feb-2023
https://dl.acm.org/doi/10.1145/3569584
Nousias SArvanitis GLalos AMoustakas K(2023)Deep Saliency Mapping for 3D Meshes and ApplicationsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/355007319:2(1-22)Online publication date: 6-Feb-2023
https://dl.acm.org/doi/10.1145/3550073
Hu XPan YWang YZhang LShirmohammadi S(2023)Multiple Description Coding for Best-Effort Delivery of Light Field Video Using GNN-Based CompressionIEEE Transactions on Multimedia10.1109/TMM.2021.312991825(690-705)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1109/TMM.2021.3129918
Xiao ZCheng ZXiong Z(2023)Space-Time Super-Resolution for Light Field VideosIEEE Transactions on Image Processing10.1109/TIP.2023.330012132(4785-4799)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1109/TIP.2023.3300121
Mou YJiang XXu KSun TWang Z(2023)Compressed Video Action Recognition With Dual-Stream and Dual-Modal TransformerIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.331914034:5(3299-3312)Online publication date: 25-Sep-2023
https://dl.acm.org/doi/10.1109/TCSVT.2023.3319140
Hu XWang CZhang LChen GShirmohammadi S(2023)Edge-Assisted Virtual Viewpoint Generation for Immersive Light FieldIEEE MultiMedia10.1109/MMUL.2022.323277130:2(18-27)Online publication date: 1-Apr-2023
https://dl.acm.org/doi/10.1109/MMUL.2022.3232771
Bach NTran CDuc TTan PKamioka E(2023)Message Passing Neural Network based Light Field Image Compression2023 IEEE 6th International Conference on Multimedia Information Processing and Retrieval (MIPR)10.1109/MIPR59079.2023.00028(1-4)Online publication date: Aug-2023
https://doi.org/10.1109/MIPR59079.2023.00028
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Issue’s Table of Contents