Transformer and Upsampling-Based Point Cloud Compression

Authors: Junteng Zhang, Gexin Liu, Dandan Ding, Zhan Ma

APCCPA '22: Proceedings of the 1st International Workshop on Advances in Point Cloud Compression, Processing and Analysis

Pages 33 - 39

https://doi.org/10.1145/3552457.3555731

Published: 10 October 2022

Abstract

Learning-based point cloud compression has exhibited superior coding performance over traditional methods such as MPEG G-PCC. Because conventional point cloud representation formats (e.g., octree or voxel) introduce additional errors and degrade reconstruction quality, we use the point-based representation directly and develop a framework that leverages transformer and upsampling techniques for point cloud compression. To extract latent features that characterize an input point cloud well, we build an end-to-end learning framework: at the encoder side, we leverage cascading transformers to extract and enhance useful features for entropy coding; at the decoder side, in addition to the transformers, an upsampling module that uses both coordinates and features is devised to reconstruct the point cloud progressively. Experimental results demonstrate that the proposed method achieves the best coding performance among state-of-the-art point-based methods, e.g., gains of more than 1 dB in D1 and D2 PSNR at a bitrate of 0.10 bpp, together with more visually pleasing reconstructions. Extensive ablation studies also confirm the effectiveness of the transformer and upsampling modules.
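The abstract reports quality as D1 PSNR. As a rough illustration only, the following sketch computes a symmetric point-to-point (D1-style) PSNR between two small point clouds with a brute-force nearest-neighbour search. The function names and the `peak` parameter are hypothetical and chosen for this example, and the formula assumes the simple form 10·log10(peak² / MSE); evaluation tools may use a different peak definition or an extra constant factor, so this is not the paper's exact evaluation procedure.

```python
import math


def _nn_sq_dists(src, dst):
    """Squared distance from each point in src to its nearest neighbour in dst."""
    return [
        min(sum((a - b) ** 2 for a, b in zip(p, q)) for q in dst)
        for p in src
    ]


def d1_psnr(original, reconstructed, peak):
    """Symmetric point-to-point PSNR (assumed form: 10*log10(peak^2 / MSE)).

    original, reconstructed: non-empty lists of (x, y, z) tuples.
    peak: assumed signal peak, e.g. the bounding-box extent of the original.
    """
    # Mean squared nearest-neighbour error in both directions.
    mse_fwd = sum(_nn_sq_dists(original, reconstructed)) / len(original)
    mse_bwd = sum(_nn_sq_dists(reconstructed, original)) / len(reconstructed)
    # Keep the worse direction so missing and spurious points are both penalized.
    mse = max(mse_fwd, mse_bwd)
    if mse == 0:
        return math.inf
    return 10 * math.log10(peak ** 2 / mse)
```

The brute-force search is quadratic in the number of points, which is acceptable for a toy example; a spatial index such as a k-d tree would be the usual choice for real point clouds.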

Supplementary Material

MP4 File (APCCPA22-apccpa08.mp4)

Learning-based point cloud geometry compression (PCGC) has received extensive attention. Existing methods can be divided into two categories: octree/voxel-based methods and point-based methods. A typical octree-based example is MPEG G-PCC, which mainly handles large-scale point clouds. Point-based methods generally employ PointNet to exploit correlations between points and are widely used for small-scale point clouds. This work also follows the point-based approach. We construct an end-to-end framework: the encoder first downsamples the point cloud and captures its multi-scale features, then enhances them for entropy coding; the decoder follows the reverse procedure, enhancing the received features for coordinate reconstruction. Furthermore, we embed a Transformer in the feature enhancement stage of both the encoder and the decoder to aggregate and emphasize valuable features. Experimental results show that our method greatly outperforms state-of-the-art PCGC methods, e.g., by 39% D1 BDBR and 43% D2 BDBR on average.
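To make the idea of using attention to aggregate and emphasize per-point features more concrete, here is a minimal sketch of single-head scaled dot-product self-attention over a list of feature vectors. It is an assumption-laden illustration, not the authors' module: there are no learned query/key/value projections, no positional encoding, and no multi-head structure, and the names `softmax` and `self_attention` are chosen for this example.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(features):
    """Single-head self-attention with queries = keys = values = features.

    features: list of N feature vectors (lists of floats), each of length d.
    Returns N vectors of length d; each is a weighted mix of all inputs.
    """
    d = len(features[0])
    out = []
    for q in features:
        # Similarity of this point's feature to every point's feature.
        scores = [
            sum(a * b for a, b in zip(q, k)) / math.sqrt(d)
            for k in features
        ]
        weights = softmax(scores)
        # Aggregate all features, emphasizing the most similar ones.
        out.append([
            sum(w * v[i] for w, v in zip(weights, features))
            for i in range(d)
        ])
    return out
```

For example, calling `self_attention([[1.0, 0.0], [0.0, 1.0]])` returns two vectors in which each point keeps most of its own feature while mixing in part of the other's. A trained transformer block would add learned projections, residual connections, and normalization on top of this core operation.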


