Optics and Precision Engineering, Volume. 33, Issue 1, 148(2025)

Conditional diffusion and multi-channel high-low frequency parallel fusion of infrared and visible light images

Jing DI1... Heran WANG1,*, Chan LIANG1, Jizhao LIU2 and Jing LIAN1 |Show fewer author(s)
Author Affiliations
  • 1School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou730070, China
  • 2School of Information Science and Engineering, Lanzhou University, Lanzhou730000, China
  • show less

    To address the challenges of the absence of baseline ground truth and the underutilization of visible light information in infrared and visible light image fusion using denoising diffusion models, this study introduces a novel conditional diffusion and multi-channel high-low frequency parallel infrared and visible light image fusion model. First, a conditional diffusion model is developed, employing a splicing technique to generate spliced source images as ground truth during training, thereby facilitating an optimal prior distribution for feature extraction in infrared and visible images. During the reverse denoising process, a multi-channel likelihood correction module is incorporated to effectively model the intricate multi-channel distribution of these images. Subsequently, a detail-adaptive denoising network is proposed to perform multi-channel high- and low-frequency feature extraction for infrared and visible light images. The model also integrates a multi-channel high- and low-frequency parallel fusion module within the fusion network, which utilizes a regional consistency fusion network and a multi-channel low-frequency feature fusion network to merge high- and low-frequency features, respectively. This approach introduces a trainable diffusion-based paradigm for feature extraction in infrared and visible light image fusion tasks, leveraging specialized convolutional neural networks for feature integration. Comparative experiments on the MSRS and RoadScene datasets, against nine state-of-the-art methods, reveal that the proposed model improves the average performance across eight objective evaluation metrics by 4.52% to 59.62%. The method demonstrates superior performance in maintaining color fidelity and preserving texture details, aligning well with human visual perception, and proves robust in handling diverse lighting and environmental conditions for infrared and visible light image fusion tasks.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Jing DI, Heran WANG, Chan LIANG, Jizhao LIU, Jing LIAN. Conditional diffusion and multi-channel high-low frequency parallel fusion of infrared and visible light images[J]. Optics and Precision Engineering, 2025, 33(1): 148

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Aug. 26, 2024

    Accepted: --

    Published Online: Apr. 1, 2025

    The Author Email: WANG Heran (838129431@qq.com)

    DOI:10.37188/OPE.20253301.0148

    Topics