short-paper

Octree-Retention Fusion: A High-Performance Context Model for Point Cloud Geometry Compression

Authors:

Zhijing YuAuthors Info & Claims

ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval

Pages 1150 - 1154

https://doi.org/10.1145/3652583.3657620

Published: 07 June 2024 Publication History

Abstract

Point cloud compression is a pivotal technology for efficient storage and transmission of 3D point cloud data, which has significant implications for practical applications in virtual reality, autonomous driving, and cultural heritage preservation. In this paper, we propose a new learning-based model using the Retentive Network (RetNet) for point cloud compression, which achieves a lower bitrate while maintaining a high peak signal-to-noise ratio (PSNR). We first use an octree structure to segment the point cloud objects. Then, we use octree-based contextual windows to extract pivotal features from relevant sibling and ancestor nodes. Finally, we employ our proposed Octree-Retention model to effectively exploit the prior information between the spatially adjacent nodes for compression. The experimental results show that our method outperforms the state-of-the-art methods on both the LIDAR dataset(SemanticKITTI) and the object dataset(MPEG 8i), demonstrating its effectiveness.

References

[1]

Jens Behley, Martin Garbade, Andres Milioto, Jan Quenzel, Sven Behnke, Cyrill Stachniss, and Jurgen Gall. 2019. Semantickitti: A dataset for semantic scene understanding of lidar sequences. In Proceedings of the IEEE/CVF international conference on computer vision. 9297--9307.

[2]

Sourav Biswas, Jerry Liu, Kelvin Wong, Shenlong Wang, and Raquel Urtasun. 2020. Muscle: Multi sweep compression of lidar using deep entropy models. Advances in Neural Information Processing Systems, Vol. 33 (2020), 22170--22181.

[3]

Mingyue Cui, Junhua Long, Mingjian Feng, Boyang Li, and Huang Kai. 2023. OctFormer: Efficient octree-based transformer for point cloud compression with local enhancement. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 470--478.

Digital Library

[4]

Eugene d'Eon, Bob Harrison, Taos Myers, and Philip A Chou. 2017. 8i voxelized full bodies-a voxelized point cloud dataset. ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) input document WG11M40059/WG1M74006, Vol. 7, 8 (2017), 11.

[5]

Yu Feng, Shaoshan Liu, and Yuhao Zhu. 2020. Real-time spatio-temporal lidar point cloud compression. In 2020 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, 10766--10773.

Digital Library

[6]

Chunyang Fu, Ge Li, Rui Song, Wei Gao, and Shan Liu. 2022. Octattention: Octree-based large-scale contexts model for point cloud compression. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 625--633.

[7]

André FR Guarda, Nuno MM Rodrigues, and Fernando Pereira. 2019. Point cloud coding: Adopting a deep learning-based approach. In 2019 Picture Coding Symposium (PCS). IEEE, 1--5.

[8]

Yun He, Xinlin Ren, Danhang Tang, Yinda Zhang, Xiangyang Xue, and Yanwei Fu. 2022. Density-preserving deep point cloud compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2333--2342.

[9]

Lila Huang, Shenlong Wang, Kelvin Wong, Jerry Liu, and Raquel Urtasun. 2020. Octsqueeze: Octree-structured entropy model for lidar compression. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 1313--1323.

[10]

Tianxin Huang and Yong Liu. 2019. 3d point cloud geometry compression on deep learning. In Proceedings of the 27th ACM international conference on multimedia. 890--898.

Digital Library

[11]

Zujie Liang and Fan Liang. 2022. TransPCC: Towards Deep Point Cloud Compression via Transformers. In Proceedings of the 2022 International Conference on Multimedia Retrieval. 1--5.

Digital Library

[12]

Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, and Pierre Duhamel. 2021a. Lossless coding of point cloud geometry using a deep generative model. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 31, 12 (2021), 4617--4629.

[13]

Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, and Pierre Duhamel. 2021b. Multiscale deep context modeling for lossless point cloud geometry compression. In 2021 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 1--6.

[14]

Maurice Quach, Giuseppe Valenzise, and Frederic Dufaux. 2019. Learning convolutional transforms for lossy point cloud geometry compression. In 2019 IEEE international conference on image processing (ICIP). IEEE, 4320--4324.

[15]

Zizheng Que, Guo Lu, and Dong Xu. 2021. Voxelcontext-net: An octree based framework for point cloud compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6042--6051.

[16]

Sebastian Schwarz, Marius Preda, Vittorio Baroncini, Madhukar Budagavi, Pablo Cesar, Philip A Chou, Robert A Cohen, Maja Krivokuća, Sébastien Lasserre, Zhu Li, et al. 2018. Emerging MPEG standards for point cloud compression. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Vol. 9, 1 (2018), 133--148.

[17]

Claude Elwood Shannon. 2001. A mathematical theory of communication. ACM SIGMOBILE mobile computing and communications review, Vol. 5, 1 (2001), 3--55.

[18]

Xuebin Sun, Han Ma, Yuxiang Sun, and Ming Liu. 2019. A novel point cloud compression algorithm based on clustering. IEEE Robotics and Automation Letters, Vol. 4, 2 (2019), 2132--2139.

[19]

Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, and Furu Wei. 2023. Retentive network: A successor to transformer for large language models. arXiv preprint arXiv:2307.08621 (2023).

[20]

Chenxi Tu, Eijiro Takeuchi, Alexander Carballo, and Kazuya Takeda. 2019. Point cloud compression for 3d lidar sensor using recurrent neural network with residual blocks. In 2019 International Conference on Robotics and Automation (ICRA). IEEE, 3274--3280.

Digital Library

[21]

Jianqiang Wang, Dandan Ding, Zhu Li, and Zhan Ma. 2021a. Multiscale point cloud geometry compression. In 2021 Data Compression Conference (DCC). IEEE, 73--82.

[22]

Jianqiang Wang, Hao Zhu, Haojie Liu, and Zhan Ma. 2021b. Lossy point cloud geometry compression via end-to-end learning. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 31, 12 (2021), 4909--4923.

[23]

Kang You and Pan Gao. 2021. Patch-based deep autoencoder for point cloud geometry compression. In ACM Multimedia Asia. 1--7.

[24]

Junteng Zhang, Gexin Liu, Dandan Ding, and Zhan Ma. 2022. Transformer and upsampling-based point cloud compression. In Proceedings of the 1st International Workshop on Advances in Point Cloud Compression, Processing and Analysis. 33--39.

Digital Library

[25]

Lili Zhao, Kai-Kuang Ma, Zhili Liu, Qian Yin, and Jianwen Chen. 2022. Real-time scene-aware LiDAR point cloud compression using semantic prior representation. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 32, 8 (2022), 5623--5637.

Digital Library

Index Terms

Octree-Retention Fusion: A High-Performance Context Model for Point Cloud Geometry Compression
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Reconstruction
      2. Image and video acquisition
        3D imaging
    2. Knowledge representation and reasoning
      1. Probabilistic reasoning

Recommendations

YOGA: Yet Another Geometry-based Point Cloud Compressor
MM '23: Proceedings of the 31st ACM International Conference on Multimedia

A learning-based YOGA (Yet Another Geometry-based Point Cloud Compressor) is proposed. It is flexible, allowing for the separable lossy compression of geometry and color attributes, and variable-rate coding using a single neural model; it is high-...
Block size selection in rate-constrained geometry based point cloud compression
Abstract
In geometry-based point cloud compression, the geometry information is typically compressed using octree coding. In octree coding, the size of the blocks in the voxelized point clouds, i.e., the number of voxels contained in a block, determines ...
Model-Based Rate-Distortion Optimized Video-Based Point Cloud Compression with Differential Evolution
Image and Graphics
Abstract
The Moving Picture Experts Group (MPEG) video-based point cloud compression (V-PCC) standard encodes a dynamic point cloud by first converting it into one geometry video and one color video and then using a video coder to compress the two video ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval

May 2024

1379 pages

ISBN:9798400706196

DOI:10.1145/3652583

General Chairs:
Cathal Gurrin
Dublin City University, Ireland
,
Rachada Kongkachandra
Thammasat University, Thailand
,
Klaus Schoeffmann
Klagenfurt University, Austria
,
Program Chairs:
Duc-Tien Dang-Nguyen
University of Bergen, Norway
,
Luca Rossetto
University of Zurich, Switzerland
,
Shin'ichi Satoh
National Institute of Informatics, Japan
,
Liting Zhou
Dublin City University, Ireland

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 June 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

Yongjiang Sci-Tech Innovation 2035
Ningbo Municipal Major Project of Science and Technology Innovation 2025
Zhejiang Provincial Natural Science Foundation of China
National Natural Science Foundation of China,

Conference

ICMR '24

Sponsor:

ICMR '24: International Conference on Multimedia Retrieval

June 10 - 14, 2024

Phuket, Thailand

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
92
Total Downloads

Downloads (Last 12 months)92
Downloads (Last 6 weeks)14

Reflects downloads up to 08 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten