CMC-Bench: Towards a New Paradigm of Visual Signal Compression

Li, Chunyi; Wu, Xiele; Wu, Haoning; Feng, Donghui; Zhang, Zicheng; Lu, Guo; Min, Xiongkuo; Liu, Xiaohong; Zhai, Guangtao; Lin, Weisi

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.09356 (cs)

[Submitted on 13 Jun 2024]

Title:CMC-Bench: Towards a New Paradigm of Visual Signal Compression

Authors:Chunyi Li, Xiele Wu, Haoning Wu, Donghui Feng, Zicheng Zhang, Guo Lu, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

View PDF HTML (experimental)

Abstract:Ultra-low bitrate image compression is a challenging and demanding topic. With the development of Large Multimodal Models (LMMs), a Cross Modality Compression (CMC) paradigm of Image-Text-Image has emerged. Compared with traditional codecs, this semantic-level compression can reduce image data size to 0.1\% or even lower, which has strong potential applications. However, CMC has certain defects in consistency with the original image and perceptual quality. To address this problem, we introduce CMC-Bench, a benchmark of the cooperative performance of Image-to-Text (I2T) and Text-to-Image (T2I) models for image compression. This benchmark covers 18,000 and 40,000 images respectively to verify 6 mainstream I2T and 12 T2I models, including 160,000 subjective preference scores annotated by human experts. At ultra-low bitrates, this paper proves that the combination of some I2T and T2I models has surpassed the most advanced visual signal codecs; meanwhile, it highlights where LMMs can be further optimized toward the compression task. We encourage LMM developers to participate in this test to promote the evolution of visual signal codec protocols.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2406.09356 [cs.CV]
	(or arXiv:2406.09356v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.09356

Submission history

From: Chunyi Li [view email]
[v1] Thu, 13 Jun 2024 17:41:37 UTC (41,699 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CMC-Bench: Towards a New Paradigm of Visual Signal Compression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CMC-Bench: Towards a New Paradigm of Visual Signal Compression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators