Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3652583.3657620acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
short-paper

Octree-Retention Fusion: A High-Performance Context Model for Point Cloud Geometry Compression

Published: 07 June 2024 Publication History

Abstract

Point cloud compression is a pivotal technology for efficient storage and transmission of 3D point cloud data, which has significant implications for practical applications in virtual reality, autonomous driving, and cultural heritage preservation. In this paper, we propose a new learning-based model using the Retentive Network (RetNet) for point cloud compression, which achieves a lower bitrate while maintaining a high peak signal-to-noise ratio (PSNR). We first use an octree structure to segment the point cloud objects. Then, we use octree-based contextual windows to extract pivotal features from relevant sibling and ancestor nodes. Finally, we employ our proposed Octree-Retention model to effectively exploit the prior information between the spatially adjacent nodes for compression. The experimental results show that our method outperforms the state-of-the-art methods on both the LIDAR dataset(SemanticKITTI) and the object dataset(MPEG 8i), demonstrating its effectiveness.

References

[1]
Jens Behley, Martin Garbade, Andres Milioto, Jan Quenzel, Sven Behnke, Cyrill Stachniss, and Jurgen Gall. 2019. Semantickitti: A dataset for semantic scene understanding of lidar sequences. In Proceedings of the IEEE/CVF international conference on computer vision. 9297--9307.
[2]
Sourav Biswas, Jerry Liu, Kelvin Wong, Shenlong Wang, and Raquel Urtasun. 2020. Muscle: Multi sweep compression of lidar using deep entropy models. Advances in Neural Information Processing Systems, Vol. 33 (2020), 22170--22181.
[3]
Mingyue Cui, Junhua Long, Mingjian Feng, Boyang Li, and Huang Kai. 2023. OctFormer: Efficient octree-based transformer for point cloud compression with local enhancement. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 470--478.
[4]
Eugene d'Eon, Bob Harrison, Taos Myers, and Philip A Chou. 2017. 8i voxelized full bodies-a voxelized point cloud dataset. ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) input document WG11M40059/WG1M74006, Vol. 7, 8 (2017), 11.
[5]
Yu Feng, Shaoshan Liu, and Yuhao Zhu. 2020. Real-time spatio-temporal lidar point cloud compression. In 2020 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, 10766--10773.
[6]
Chunyang Fu, Ge Li, Rui Song, Wei Gao, and Shan Liu. 2022. Octattention: Octree-based large-scale contexts model for point cloud compression. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 625--633.
[7]
André FR Guarda, Nuno MM Rodrigues, and Fernando Pereira. 2019. Point cloud coding: Adopting a deep learning-based approach. In 2019 Picture Coding Symposium (PCS). IEEE, 1--5.
[8]
Yun He, Xinlin Ren, Danhang Tang, Yinda Zhang, Xiangyang Xue, and Yanwei Fu. 2022. Density-preserving deep point cloud compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2333--2342.
[9]
Lila Huang, Shenlong Wang, Kelvin Wong, Jerry Liu, and Raquel Urtasun. 2020. Octsqueeze: Octree-structured entropy model for lidar compression. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 1313--1323.
[10]
Tianxin Huang and Yong Liu. 2019. 3d point cloud geometry compression on deep learning. In Proceedings of the 27th ACM international conference on multimedia. 890--898.
[11]
Zujie Liang and Fan Liang. 2022. TransPCC: Towards Deep Point Cloud Compression via Transformers. In Proceedings of the 2022 International Conference on Multimedia Retrieval. 1--5.
[12]
Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, and Pierre Duhamel. 2021a. Lossless coding of point cloud geometry using a deep generative model. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 31, 12 (2021), 4617--4629.
[13]
Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, and Pierre Duhamel. 2021b. Multiscale deep context modeling for lossless point cloud geometry compression. In 2021 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 1--6.
[14]
Maurice Quach, Giuseppe Valenzise, and Frederic Dufaux. 2019. Learning convolutional transforms for lossy point cloud geometry compression. In 2019 IEEE international conference on image processing (ICIP). IEEE, 4320--4324.
[15]
Zizheng Que, Guo Lu, and Dong Xu. 2021. Voxelcontext-net: An octree based framework for point cloud compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6042--6051.
[16]
Sebastian Schwarz, Marius Preda, Vittorio Baroncini, Madhukar Budagavi, Pablo Cesar, Philip A Chou, Robert A Cohen, Maja Krivokuća, Sébastien Lasserre, Zhu Li, et al. 2018. Emerging MPEG standards for point cloud compression. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Vol. 9, 1 (2018), 133--148.
[17]
Claude Elwood Shannon. 2001. A mathematical theory of communication. ACM SIGMOBILE mobile computing and communications review, Vol. 5, 1 (2001), 3--55.
[18]
Xuebin Sun, Han Ma, Yuxiang Sun, and Ming Liu. 2019. A novel point cloud compression algorithm based on clustering. IEEE Robotics and Automation Letters, Vol. 4, 2 (2019), 2132--2139.
[19]
Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, and Furu Wei. 2023. Retentive network: A successor to transformer for large language models. arXiv preprint arXiv:2307.08621 (2023).
[20]
Chenxi Tu, Eijiro Takeuchi, Alexander Carballo, and Kazuya Takeda. 2019. Point cloud compression for 3d lidar sensor using recurrent neural network with residual blocks. In 2019 International Conference on Robotics and Automation (ICRA). IEEE, 3274--3280.
[21]
Jianqiang Wang, Dandan Ding, Zhu Li, and Zhan Ma. 2021a. Multiscale point cloud geometry compression. In 2021 Data Compression Conference (DCC). IEEE, 73--82.
[22]
Jianqiang Wang, Hao Zhu, Haojie Liu, and Zhan Ma. 2021b. Lossy point cloud geometry compression via end-to-end learning. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 31, 12 (2021), 4909--4923.
[23]
Kang You and Pan Gao. 2021. Patch-based deep autoencoder for point cloud geometry compression. In ACM Multimedia Asia. 1--7.
[24]
Junteng Zhang, Gexin Liu, Dandan Ding, and Zhan Ma. 2022. Transformer and upsampling-based point cloud compression. In Proceedings of the 1st International Workshop on Advances in Point Cloud Compression, Processing and Analysis. 33--39.
[25]
Lili Zhao, Kai-Kuang Ma, Zhili Liu, Qian Yin, and Jianwen Chen. 2022. Real-time scene-aware LiDAR point cloud compression using semantic prior representation. IEEE Transactions on Circuits and Systems for Video Technology, Vol. 32, 8 (2022), 5623--5637.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval
May 2024
1379 pages
ISBN:9798400706196
DOI:10.1145/3652583
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 June 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. 3d reconstruction
  2. point cloud compression
  3. retnet

Qualifiers

  • Short-paper

Funding Sources

  • Yongjiang Sci-Tech Innovation 2035
  • Ningbo Municipal Major Project of Science and Technology Innovation 2025
  • Zhejiang Provincial Natural Science Foundation of China
  • National Natural Science Foundation of China,

Conference

ICMR '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 92
    Total Downloads
  • Downloads (Last 12 months)92
  • Downloads (Last 6 weeks)14
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media