research-article

3D Point Cloud Geometry Compression on Deep Learning

Authors:

Yong LiuAuthors Info & Claims

MM '19: Proceedings of the 27th ACM International Conference on Multimedia

Pages 890 - 898

https://doi.org/10.1145/3343031.3351061

Published: 15 October 2019 Publication History

Abstract

3D point cloud presentation has been widely used in computer vision, automatic driving, augmented reality, smart cities and virtual reality. 3D point cloud compression method with higher compression ratio and tiny loss is the key to improve data transportation efficiency. In this paper, we propose a new 3D point cloud geometry compression method based on deep learning, also an auto-encoder performing better than other networks in detail reconstruction. It can reach much higher compression ratio than the state-of-art while keeping tolerable loss. It also supports parallel compressing multiple models by GPU, which can improve processing efficiency greatly. The compression process is composed of two parts. Firstly, Raw data is compressed into codeword by extracting feature of raw model with encoder. Then, the codeword is further compressed with sparse coding. Decompression process is implemented in reverse order. Codeword is recovered and fed into decoder to reconstruct point cloud. Detail reconstruction ability is improved by a hierarchical structure in our decoder. Latter outputs are grown from former fuzzier outputs. In this way, details are added to former output by latter layers step by step to make a more precise prediction. We compare our method with PCL compression and Draco compression on ShapeNet40 part dataset. Our method may be the first deep learning-based point cloud compression algorithm. The experiments demonstrate it is superior to former common compression algorithms with large compression ratio, which can also reserve original shapes with tiny loss.

References

[1]

A. Brock, T. Lim, J. M. Ritchie, and N. Weston. Generative and discriminative voxel modeling with convolutional neural networks. Advances in Neural Information Processing Systems, Workshop on 3D learning, 2017. 3

[2]

A. Dai, A. X. Chang, M. Savva, M. Halber, T. Funkhouser, and M. Nießner. Scannet: Richly-annotated 3D reconstructions of indoor scenes. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017. 3

[3]

D. Maturana and S. Scherer. Voxnet: A 3D convolutional neural network for real-time object recognition. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 922--928. IEEE, 2015. 3

[4]

Riegler G, Osman Ulusoy A, Geiger A. Octnet: Learning deep 3d representations at high resolutions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 3577--3586.

[5]

Klokov R, Lempitsky V. Escape from cells: Deep kd-networks for the recognition of 3d point cloud models[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017: 863--872.

[6]

Zhu Z, Wang X, Bai S, et al. Deep learning representation using autoencoder for 3D shape retrieval[J]. Neurocomputing, 2016, 204: 41--50.

Digital Library

[7]

Chen D Y, Tian X P, Shen Y T, et al. On visual similarity based 3D model retrieval[C]//Computer graphics forum. Oxford, UK: Blackwell Publishing, Inc, 2003, 22(3): 223--232.

[8]

Qi C R, Su H, Mo K, et al. Pointnet: Deep learning on point sets for 3d classification and segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 652--660.

[9]

Qi C R, Yi L, Su H, et al. Pointnet++: Deep hierarchical feature learning on point sets in a metric space[C]//Advances in Neural Information Processing Systems. 2017: 5099--5108.

[10]

Yang Y, Feng C, Shen Y, et al. Foldingnet: Point cloud auto-encoder via deep grid deformation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 206--215.

[11]

Mandikal P, Babu R V. Dense 3D Point Cloud Reconstruction Using a Deep Pyramid Network[J].

[12]

Achlioptas P, Diamanti O, Mitliagkas I, et al. Learning Representations and Generative Models for 3D Point Clouds[C]//International Conference on Machine Learning. 2018: 40--49.

[13]

Goodfellow I, Pouget-Abadie J, Mirza M, et al. Generative adversarial nets[C]//Advances in neural information processing systems. 2014: 2672--2680.

[14]

Li J, Chen B M, Hee Lee G. So-net: Self-organizing network for point cloud analysis[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 9397--9406.

[15]

Shao Y, Zhang Q, Li G, et al. Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction[C]//2018 ACM Multimedia Conference on Multimedia Conference. ACM, 2018: 1199--1207.

[16]

Schnabel R, Klein R. Octree-based Point-Cloud Compression[J]. Spbg, 2006, 6: 111--120.

[17]

Gumhold S, Kami Z, Isenburg M, et al. Predictive point-cloud compression[C]//Siggraph Sketches. 2005: 137.

[18]

Morell V, Orts S, Cazorla M, et al. Geometric 3D point cloud compression[J]. Pattern Recognition Letters, 2014, 50: 55--62.

Digital Library

[19]

de Queiroz R L, Chou P A. Transform coding for point clouds using a gaussian process model[J]. IEEE Transactions on Image Processing, 2017, 26(7): 3507--3517.

Digital Library

[20]

Kammerl J, Blodow N, Rusu R B, et al. Real-time compression of point cloud streams[C]//2012 IEEE International Conference on Robotics and Automation. IEEE, 2012: 778--785.

[21]

Rusu R B, Cousins S. Point cloud library (pcl)[C]//2011 IEEE International Conference on Robotics and Automation. 2011: 1--4.

[22]

"Draco 3D graphics compression," https://google.github.io/draco/, Accessed: 2018-01--10.

[23]

Yu L, Li X, Fu C W, et al. Pu-net: Point cloud upsampling network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 2790--2799.

[24]

Rubner, Y., Tomasi, C., and Guibas, L. J. The earth mover's distance as a metric for image retrieval. IJCV, 2000.

[25]

Yi L, Kim V G, Ceylan D, et al. A scalable active framework for region annotation in 3d shape collections[J]. ACM Transactions on Graphics (TOG), 2016, 35(6): 210.

Digital Library

[26]

D. Kingma and J. Ba. Adam: A method for stochastic optimization. In Int. Conf.on Learning Representations (ICLR), 2015.

Cited By

hauwermeiren BDenis LMunteanu A(2025)Non-Uniform Voxelisation for Point Cloud CompressionSensors10.3390/s2503086525:3(865)Online publication date: 31-Jan-2025
https://doi.org/10.3390/s25030865
Wang JXue RLi JDing DLin YMa Z(2025)A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding – Part I: GeometryIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.346293847:1(269-287)Online publication date: Jan-2025
https://doi.org/10.1109/TPAMI.2024.3462938
Liu GZhu JDing DMa ZLarson K(2024)Encoding auxiliary information to restore compressed point cloud geometryProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/242(2189-2197)Online publication date: 3-Aug-2024
https://dl.acm.org/doi/10.24963/ijcai.2024/242
Show More Cited By

Index Terms

3D Point Cloud Geometry Compression on Deep Learning
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Reconstruction
      2. Image and video acquisition
        3D imaging
  2. Computer graphics
    1. Image compression

Recommendations

Patch-Based Deep Autoencoder for Point Cloud Geometry Compression
MMAsia '21: Proceedings of the 3rd ACM International Conference on Multimedia in Asia

The ever-increasing 3D application makes the point cloud compression unprecedentedly important and needed. In this paper, we propose a patch-based compression process using deep learning, focusing on the lossy point cloud geometry compression. Unlike ...
Vertex Data Compression through Vector Quantization

Rendering geometrically detailed 3D models requires the transfer and processing of large amounts of triangle and vertex geometry data. Compressing the geometry bitstream can reduce bandwidth requirements and alleviate transmission bottlenecks. In this ...
LMDC: Learning a multiple description codec for deep learning-based image compression
Abstract
Although deep learning technique has been widely leveraged to compress image, few attentions are paid to multiple description (MD) coding based on this technique. Meanwhile, packet loss and bit error may occur inevitably during transmission over ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '19: Proceedings of the 27th ACM International Conference on Multimedia

October 2019

2794 pages

ISBN:9781450368896

DOI:10.1145/3343031

General Chairs:
Laurent Amsaleg
CNRS-IRISA, France
,
Benoit Huet
EURECOM, France
,
Martha Larson
Radboud University and TU Delft (Netherlands)
,
Program Chairs:
Guillaume Gravier
CNRS-IRISA, France
,
Hayley Hung
Delft University of Technology Netherlands
,
Chong-Wah Ngo
City University of Hong Kong Hong Kong
,
Wei Tsang Ooi
National University of Singapore Singapore

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China

Conference

MM '19

Sponsor:

SIGMM

MM '19: The 27th ACM International Conference on Multimedia

October 21 - 25, 2019

Nice, France

Acceptance Rates

MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

123
Total Citations
View Citations
2,200
Total Downloads

Downloads (Last 12 months)287
Downloads (Last 6 weeks)23

Reflects downloads up to 20 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

hauwermeiren BDenis LMunteanu A(2025)Non-Uniform Voxelisation for Point Cloud CompressionSensors10.3390/s2503086525:3(865)Online publication date: 31-Jan-2025
https://doi.org/10.3390/s25030865
Wang JXue RLi JDing DLin YMa Z(2025)A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding – Part I: GeometryIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.346293847:1(269-287)Online publication date: Jan-2025
https://doi.org/10.1109/TPAMI.2024.3462938
Liu GZhu JDing DMa ZLarson K(2024)Encoding auxiliary information to restore compressed point cloud geometryProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/242(2189-2197)Online publication date: 3-Aug-2024
https://dl.acm.org/doi/10.24963/ijcai.2024/242
Xu YZhang YYang QXu XLiu S(2024)Compressed Point Cloud Quality Index by Combining Global Appearance and Local DetailsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/367256720:9(1-22)Online publication date: 15-Jun-2024
https://dl.acm.org/doi/10.1145/3672567
Cabrero Barros SElosegi ATamayo IDominguez AZorrilla M(2024)Volumetric Video on the Web: a platform prototype and empirical studyProceedings of the 29th International ACM Conference on 3D Web Technology10.1145/3665318.3677170(1-10)Online publication date: 25-Sep-2024
https://dl.acm.org/doi/10.1145/3665318.3677170
Xie LGao WZheng HLi GCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)ROI-Guided Point Cloud Geometry Compression Towards Human and Machine VisionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681301(3741-3750)Online publication date: 28-Oct-2024
https://dl.acm.org/doi/10.1145/3664647.3681301
Zheng HGao WYu ZZhao TLi GCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)ViewPCGC: View-Guided Learned Point Cloud Geometry CompressionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681225(7152-7161)Online publication date: 28-Oct-2024
https://dl.acm.org/doi/10.1145/3664647.3681225
Zhang WWang ZXu LYang XLiu JCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)Informative Point cloud Dataset Extraction for Classification via Gradient-based Points MovingProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680767(6384-6393)Online publication date: 28-Oct-2024
https://dl.acm.org/doi/10.1145/3664647.3680767
Gomes PRossi SToni L(2024)AGAR - Attention Graph-RNN for Adaptative Motion Prediction of Point Clouds of Deformable ObjectsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366218320:8(1-25)Online publication date: 13-Jun-2024
https://dl.acm.org/doi/10.1145/3662183
Zhang ZZhu ZBai YWang MYu ZGurrin CKongkachandra RSchoeffmann KDang-Nguyen DRossetto LSatoh SZhou L(2024)Octree-Retention Fusion: A High-Performance Context Model for Point Cloud Geometry CompressionProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3657620(1150-1154)Online publication date: 30-May-2024
https://dl.acm.org/doi/10.1145/3652583.3657620
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten