Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3343031.3351061acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

3D Point Cloud Geometry Compression on Deep Learning

Published: 15 October 2019 Publication History

Abstract

3D point cloud presentation has been widely used in computer vision, automatic driving, augmented reality, smart cities and virtual reality. 3D point cloud compression method with higher compression ratio and tiny loss is the key to improve data transportation efficiency. In this paper, we propose a new 3D point cloud geometry compression method based on deep learning, also an auto-encoder performing better than other networks in detail reconstruction. It can reach much higher compression ratio than the state-of-art while keeping tolerable loss. It also supports parallel compressing multiple models by GPU, which can improve processing efficiency greatly. The compression process is composed of two parts. Firstly, Raw data is compressed into codeword by extracting feature of raw model with encoder. Then, the codeword is further compressed with sparse coding. Decompression process is implemented in reverse order. Codeword is recovered and fed into decoder to reconstruct point cloud. Detail reconstruction ability is improved by a hierarchical structure in our decoder. Latter outputs are grown from former fuzzier outputs. In this way, details are added to former output by latter layers step by step to make a more precise prediction. We compare our method with PCL compression and Draco compression on ShapeNet40 part dataset. Our method may be the first deep learning-based point cloud compression algorithm. The experiments demonstrate it is superior to former common compression algorithms with large compression ratio, which can also reserve original shapes with tiny loss.

References

[1]
A. Brock, T. Lim, J. M. Ritchie, and N. Weston. Generative and discriminative voxel modeling with convolutional neural networks. Advances in Neural Information Processing Systems, Workshop on 3D learning, 2017. 3
[2]
A. Dai, A. X. Chang, M. Savva, M. Halber, T. Funkhouser, and M. Nießner. Scannet: Richly-annotated 3D reconstructions of indoor scenes. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017. 3
[3]
D. Maturana and S. Scherer. Voxnet: A 3D convolutional neural network for real-time object recognition. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 922--928. IEEE, 2015. 3
[4]
Riegler G, Osman Ulusoy A, Geiger A. Octnet: Learning deep 3d representations at high resolutions[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 3577--3586.
[5]
Klokov R, Lempitsky V. Escape from cells: Deep kd-networks for the recognition of 3d point cloud models[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017: 863--872.
[6]
Zhu Z, Wang X, Bai S, et al. Deep learning representation using autoencoder for 3D shape retrieval[J]. Neurocomputing, 2016, 204: 41--50.
[7]
Chen D Y, Tian X P, Shen Y T, et al. On visual similarity based 3D model retrieval[C]//Computer graphics forum. Oxford, UK: Blackwell Publishing, Inc, 2003, 22(3): 223--232.
[8]
Qi C R, Su H, Mo K, et al. Pointnet: Deep learning on point sets for 3d classification and segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 652--660.
[9]
Qi C R, Yi L, Su H, et al. Pointnet++: Deep hierarchical feature learning on point sets in a metric space[C]//Advances in Neural Information Processing Systems. 2017: 5099--5108.
[10]
Yang Y, Feng C, Shen Y, et al. Foldingnet: Point cloud auto-encoder via deep grid deformation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 206--215.
[11]
Mandikal P, Babu R V. Dense 3D Point Cloud Reconstruction Using a Deep Pyramid Network[J].
[12]
Achlioptas P, Diamanti O, Mitliagkas I, et al. Learning Representations and Generative Models for 3D Point Clouds[C]//International Conference on Machine Learning. 2018: 40--49.
[13]
Goodfellow I, Pouget-Abadie J, Mirza M, et al. Generative adversarial nets[C]//Advances in neural information processing systems. 2014: 2672--2680.
[14]
Li J, Chen B M, Hee Lee G. So-net: Self-organizing network for point cloud analysis[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 9397--9406.
[15]
Shao Y, Zhang Q, Li G, et al. Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction[C]//2018 ACM Multimedia Conference on Multimedia Conference. ACM, 2018: 1199--1207.
[16]
Schnabel R, Klein R. Octree-based Point-Cloud Compression[J]. Spbg, 2006, 6: 111--120.
[17]
Gumhold S, Kami Z, Isenburg M, et al. Predictive point-cloud compression[C]//Siggraph Sketches. 2005: 137.
[18]
Morell V, Orts S, Cazorla M, et al. Geometric 3D point cloud compression[J]. Pattern Recognition Letters, 2014, 50: 55--62.
[19]
de Queiroz R L, Chou P A. Transform coding for point clouds using a gaussian process model[J]. IEEE Transactions on Image Processing, 2017, 26(7): 3507--3517.
[20]
Kammerl J, Blodow N, Rusu R B, et al. Real-time compression of point cloud streams[C]//2012 IEEE International Conference on Robotics and Automation. IEEE, 2012: 778--785.
[21]
Rusu R B, Cousins S. Point cloud library (pcl)[C]//2011 IEEE International Conference on Robotics and Automation. 2011: 1--4.
[22]
"Draco 3D graphics compression," https://google.github.io/draco/, Accessed: 2018-01--10.
[23]
Yu L, Li X, Fu C W, et al. Pu-net: Point cloud upsampling network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 2790--2799.
[24]
Rubner, Y., Tomasi, C., and Guibas, L. J. The earth mover's distance as a metric for image retrieval. IJCV, 2000.
[25]
Yi L, Kim V G, Ceylan D, et al. A scalable active framework for region annotation in 3d shape collections[J]. ACM Transactions on Graphics (TOG), 2016, 35(6): 210.
[26]
D. Kingma and J. Ba. Adam: A method for stochastic optimization. In Int. Conf.on Learning Representations (ICLR), 2015.

Cited By

View all
  • (2025)Non-Uniform Voxelisation for Point Cloud CompressionSensors10.3390/s2503086525:3(865)Online publication date: 31-Jan-2025
  • (2025)A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding – Part I: GeometryIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.346293847:1(269-287)Online publication date: Jan-2025
  • (2024)Encoding auxiliary information to restore compressed point cloud geometryProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/242(2189-2197)Online publication date: 3-Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '19: Proceedings of the 27th ACM International Conference on Multimedia
October 2019
2794 pages
ISBN:9781450368896
DOI:10.1145/3343031
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. 3d point cloud
  2. auto-encoder
  3. detail reconstruction
  4. geometry compression
  5. hierarchical structure

Qualifiers

  • Research-article

Funding Sources

  • National Natural Science Foundation of China

Conference

MM '19
Sponsor:

Acceptance Rates

MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)287
  • Downloads (Last 6 weeks)23
Reflects downloads up to 20 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Non-Uniform Voxelisation for Point Cloud CompressionSensors10.3390/s2503086525:3(865)Online publication date: 31-Jan-2025
  • (2025)A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding – Part I: GeometryIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.346293847:1(269-287)Online publication date: Jan-2025
  • (2024)Encoding auxiliary information to restore compressed point cloud geometryProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/242(2189-2197)Online publication date: 3-Aug-2024
  • (2024)Compressed Point Cloud Quality Index by Combining Global Appearance and Local DetailsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/367256720:9(1-22)Online publication date: 15-Jun-2024
  • (2024)Volumetric Video on the Web: a platform prototype and empirical studyProceedings of the 29th International ACM Conference on 3D Web Technology10.1145/3665318.3677170(1-10)Online publication date: 25-Sep-2024
  • (2024)ROI-Guided Point Cloud Geometry Compression Towards Human and Machine VisionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681301(3741-3750)Online publication date: 28-Oct-2024
  • (2024)ViewPCGC: View-Guided Learned Point Cloud Geometry CompressionProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681225(7152-7161)Online publication date: 28-Oct-2024
  • (2024)Informative Point cloud Dataset Extraction for Classification via Gradient-based Points MovingProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680767(6384-6393)Online publication date: 28-Oct-2024
  • (2024)AGAR - Attention Graph-RNN for Adaptative Motion Prediction of Point Clouds of Deformable ObjectsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366218320:8(1-25)Online publication date: 13-Jun-2024
  • (2024)Octree-Retention Fusion: A High-Performance Context Model for Point Cloud Geometry CompressionProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3657620(1150-1154)Online publication date: 30-May-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media