Conditional diffusion and multi-channel high-low frequency parallel fusion of infrared and visible light images

Jing DI; Heran WANG; Chan LIANG; Jizhao LIU; Jing LIAN

doi:10.37188/OPE.20253301.0148

Optics and Precision Engineering, Volume. 33, Issue 1, 148(2025)

Conditional diffusion and multi-channel high-low frequency parallel fusion of infrared and visible light images

Jing DI1... Heran WANG1,*, Chan LIANG1, Jizhao LIU2 and Jing LIAN1 |Show fewer author(s)

Author Affiliations

¹School of Electronic and Information Engineering， Lanzhou Jiaotong University， Lanzhou730070， China

²School of Information Science and Engineering， Lanzhou University， Lanzhou730000， China

show less

Abstract Get PDF(in Chinese)

To address the challenges of the absence of baseline ground truth and the underutilization of visible light information in infrared and visible light image fusion using denoising diffusion models， this study introduces a novel conditional diffusion and multi-channel high-low frequency parallel infrared and visible light image fusion model. First， a conditional diffusion model is developed， employing a splicing technique to generate spliced source images as ground truth during training， thereby facilitating an optimal prior distribution for feature extraction in infrared and visible images. During the reverse denoising process， a multi-channel likelihood correction module is incorporated to effectively model the intricate multi-channel distribution of these images. Subsequently， a detail-adaptive denoising network is proposed to perform multi-channel high- and low-frequency feature extraction for infrared and visible light images. The model also integrates a multi-channel high- and low-frequency parallel fusion module within the fusion network， which utilizes a regional consistency fusion network and a multi-channel low-frequency feature fusion network to merge high- and low-frequency features， respectively. This approach introduces a trainable diffusion-based paradigm for feature extraction in infrared and visible light image fusion tasks， leveraging specialized convolutional neural networks for feature integration. Comparative experiments on the MSRS and RoadScene datasets， against nine state-of-the-art methods， reveal that the proposed model improves the average performance across eight objective evaluation metrics by 4.52% to 59.62%. The method demonstrates superior performance in maintaining color fidelity and preserving texture details， aligning well with human visual perception， and proves robust in handling diverse lighting and environmental conditions for infrared and visible light image fusion tasks.

AI Picture Guide

AI One Sentence

AI Short Abstract

Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.

Keywords

conditional diffusion model detail adaptive denoising block image fusion infrared and visible multi-channel high-low frequency parallel fusion block

Tools

Get Citation

Copy Citation Text

Jing DI, Heran WANG, Chan LIANG, Jizhao LIU, Jing LIAN. Conditional diffusion and multi-channel high-low frequency parallel fusion of infrared and visible light images[J]. Optics and Precision Engineering, 2025, 33(1): 148

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Aug. 26, 2024

Accepted: --

Published Online: Apr. 1, 2025

The Author Email: WANG Heran (838129431@qq.com)

DOI:10.37188/OPE.20253301.0148

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology

微信扫一扫：分享