Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation Model

Chen, Yijia; Chen, Pinghua; Zhou, Xiangxin; Lei, Yingtie; Zhou, Ziyang; Li, Mingxian

arXiv:2404.07072v2 (cs)

[Submitted on 10 Apr 2024 (v1), last revised 27 Apr 2024 (this version, v2)]

Abstract: In the field of computer vision, visible light images often exhibit low contrast in low-light conditions, presenting a significant challenge. While infrared imagery provides a potential solution, its utilization entails high costs and practical limitations. Recent advancements in deep learning, particularly the deployment of Generative Adversarial Networks (GANs), have facilitated the transformation of visible light images to infrared images. However, these methods often experience unstable training phases and may produce suboptimal outputs. To address these issues, we propose a novel end-to-end Transformer-based model that efficiently converts visible light images into high-fidelity infrared images. Initially, the Texture Mapping Module and Color Perception Adapter collaborate to extract texture and color features from the visible light image. The Dynamic Fusion Aggregation Module subsequently integrates these features. Finally, the transformation into an infrared image is refined through the synergistic action of the Color Perception Adapter and the Enhanced Perception Attention mechanism. Comprehensive benchmarking experiments confirm that our model outperforms existing methods, producing infrared images of markedly superior quality, both qualitatively and quantitatively. Furthermore, the proposed model enables more effective downstream applications for infrared images than other methods.
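
The stage ordering described in the abstract (texture and color extraction, then fusion, then refinement) can be illustrated with a minimal data-flow sketch. This is not the authors' implementation: every function name and every operation below is a hypothetical stand-in chosen only to make the flow runnable, and the real modules are learned Transformer components whose internals are not given in the abstract. Images are assumed to be nested lists of (r, g, b) tuples with values in [0, 1].

```python
import math

# Hypothetical stand-ins only: these hand-written operations do NOT
# reproduce the paper's learned modules; they only mirror the stage order.

def texture_features(rgb):
    """Stand-in for the Texture Mapping Module: horizontal luminance differences."""
    gray = [[sum(px) / 3.0 for px in row] for row in rgb]
    return [
        [abs(row[min(x + 1, len(row) - 1)] - row[x]) for x in range(len(row))]
        for row in gray
    ]

def color_features(rgb):
    """Stand-in for the Color Perception Adapter: a fixed channel mix (assumed weights)."""
    return [[0.6 * r + 0.3 * g + 0.1 * b for (r, g, b) in row] for row in rgb]

def fuse(texture, color, alpha=0.5):
    """Stand-in for the Dynamic Fusion Aggregation Module: a fixed convex blend."""
    return [
        [alpha * t + (1.0 - alpha) * c for t, c in zip(t_row, c_row)]
        for t_row, c_row in zip(texture, color)
    ]

def refine(fused, color):
    """Stand-in for the final refinement: gate the fused map by a sigmoid of the color map."""
    return [
        [min(1.0, max(0.0, f / (1.0 + math.exp(-c)))) for f, c in zip(f_row, c_row)]
        for f_row, c_row in zip(fused, color)
    ]

def visible_to_infrared(rgb):
    """Run the stages in the order the abstract describes; returns a single-channel map."""
    texture = texture_features(rgb)
    color = color_features(rgb)
    fused = fuse(texture, color)
    return refine(fused, color)
```

Calling `visible_to_infrared` on an H x W visible-light image returns an H x W single-channel map with values clamped to [0, 1]. The sketch fixes only the order in which the named modules act, which is the one structural detail the abstract provides.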

Comments:	Accepted by IJCNN 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.07072 [cs.CV]
	(or arXiv:2404.07072v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.07072

Submission history

From: Ziyang Zhou
[v1] Wed, 10 Apr 2024 15:02:26 UTC (1,767 KB)
[v2] Sat, 27 Apr 2024 07:39:45 UTC (1,767 KB)
