The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine

Guarda, André F. R.; Rodrigues, Nuno M. M.; Pereira, Fernando

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2409.08130 (eess)

[Submitted on 12 Sep 2024]

Title:The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine

Authors:André F. R. Guarda (1), Nuno M. M. Rodrigues (1 and 2), Fernando Pereira (1 and 3) ((1) Instituto de Telecomunicações, Lisbon, Portugal, (2) ESTG, Politécnico de Leiria, Leiria, Portugal, (3) Instituto Superior Técnico - Universidade de Lisboa, Lisbon, Portugal)

View PDF

Abstract:Efficient point cloud coding has become increasingly critical for multiple applications such as virtual reality, autonomous driving, and digital twin systems, where rich and interactive 3D data representations may functionally make the difference. Deep learning has emerged as a powerful tool in this domain, offering advanced techniques for compressing point clouds more efficiently than conventional coding methods while also allowing effective computer vision tasks performed in the compressed domain thus, for the first time, making available a common compressed visual representation effective for both man and machine. Taking advantage of this potential, JPEG has recently finalized the JPEG Pleno Learning-based Point Cloud Coding (PCC) standard offering efficient lossy coding of static point clouds, targeting both human visualization and machine processing by leveraging deep learning models for geometry and color coding. The geometry is processed directly in its original 3D form using sparse convolutional neural networks, while the color data is projected onto 2D images and encoded using the also learning-based JPEG AI standard. The goal of this paper is to provide a complete technical description of the JPEG PCC standard, along with a thorough benchmarking of its performance against the state-of-the-art, while highlighting its main strengths and weaknesses. In terms of compression performance, JPEG PCC outperforms the conventional MPEG PCC standards, especially in geometry coding, achieving significant rate reductions. Color compression performance is less competitive but this is overcome by the power of a full learning-based coding framework for both geometry and color and the associated effective compressed domain processing.

Comments:	28 pages, 12 figures, submitted to IEEE Access
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2409.08130 [eess.IV]
	(or arXiv:2409.08130v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2409.08130

Submission history

From: André Guarda [view email]
[v1] Thu, 12 Sep 2024 15:20:23 UTC (1,659 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators