Laser & Optoelectronics Progress, Vol. 62, Issue 14, 1439003 (2025)
Infrared-Visible Image Fusion Network Based on Dual-Branch Feature Decomposition
Multimodal image fusion integrates information from different sensors to obtain complementary modal features, and infrared-visible image fusion is a popular topic among multimodal tasks. However, existing methods struggle to effectively integrate these distinct modal features and to generate comprehensive feature representations. To address this issue, we propose a dual-branch feature-decomposition network (DBDFuse). A dual-branch feature extraction structure is introduced, in which an Outlook Attention Transformer (OAT) block extracts high-frequency local features, while newly designed fold-and-unfold modules in the Stoken Transformer (ST) efficiently capture low-frequency global dependencies. The ST decomposes the original global attention into the product of a sparse correlation map and a low-dimensional attention, thereby capturing low-frequency global features at reduced cost. Experimental results demonstrate that DBDFuse outperforms state-of-the-art (SOTA) methods for infrared-visible image fusion: the fused images exhibit higher visual clarity and better detail retention while strengthening the complementarity between modalities. The fused images also improve downstream performance, achieving a mean average precision (mAP) of 80.98% on the M3FD object detection task and a mean intersection over union (mIoU) of 63.9% on the LLVIP semantic segmentation task.
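The dual-branch decomposition described above can be illustrated with a minimal PyTorch sketch. This is an assumption-laden illustration, not the authors' released DBDFuse implementation: the class names (`LocalHighFreqBranch`, `SuperTokenLowFreqBranch`, `DualBranchBlock`), the outlook-style unfold aggregation, the super-token grid size, and the 1×1 fusion convolution are all hypothetical stand-ins for the paper's OAT and ST blocks. The high-frequency branch weights a k×k neighbourhood per pixel via unfold; the low-frequency branch pools to a small token grid (playing the role of the sparse correlation map), runs full attention only among those few tokens (the low-dimensional attention), and broadcasts the result back.

```python
# Minimal sketch of the dual-branch idea, assuming standard PyTorch components.
# Not the authors' code; names and hyperparameters are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F


class LocalHighFreqBranch(nn.Module):
    """High-frequency branch: outlook-style local aggregation.

    Approximates local window attention by unfolding k x k neighbourhoods
    and applying a learned per-pixel weighting over the window positions.
    """
    def __init__(self, dim, k=3):
        super().__init__()
        self.k = k
        self.attn = nn.Conv2d(dim, k * k, kernel_size=1)  # per-pixel window weights
        self.proj = nn.Conv2d(dim, dim, kernel_size=1)

    def forward(self, x):                                  # x: (B, C, H, W)
        B, C, H, W = x.shape
        pad = self.k // 2
        # Gather k*k neighbours for every pixel: (B, C, k*k, H*W)
        nbrs = F.unfold(x, self.k, padding=pad).view(B, C, self.k * self.k, H * W)
        w = self.attn(x).view(B, 1, self.k * self.k, H * W).softmax(dim=2)
        out = (nbrs * w).sum(dim=2).view(B, C, H, W)       # weighted local average
        return self.proj(out)


class SuperTokenLowFreqBranch(nn.Module):
    """Low-frequency branch: attention over a coarse 'super token' grid.

    Pooling to an s x s grid stands in for the sparse correlation map; full
    attention is computed only among the few pooled tokens (low-dimensional
    attention) before broadcasting back to the full resolution.
    """
    def __init__(self, dim, grid=8, heads=4):
        super().__init__()
        self.grid = grid
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x):                                  # x: (B, C, H, W)
        B, C, H, W = x.shape
        t = F.adaptive_avg_pool2d(x, self.grid)            # (B, C, s, s)
        t = t.flatten(2).transpose(1, 2)                   # (B, s*s, C)
        t, _ = self.attn(t, t, t)                          # cheap global mixing
        t = t.transpose(1, 2).view(B, C, self.grid, self.grid)
        return F.interpolate(t, size=(H, W), mode="bilinear", align_corners=False)


class DualBranchBlock(nn.Module):
    """Fuse high-frequency local detail with low-frequency global context."""
    def __init__(self, dim):
        super().__init__()
        self.high = LocalHighFreqBranch(dim)
        self.low = SuperTokenLowFreqBranch(dim)
        self.fuse = nn.Conv2d(2 * dim, dim, kernel_size=1)

    def forward(self, x):
        return self.fuse(torch.cat([self.high(x), self.low(x)], dim=1)) + x


if __name__ == "__main__":
    feats = torch.randn(1, 32, 64, 64)          # e.g. shallow features of one modality
    print(DualBranchBlock(32)(feats).shape)     # torch.Size([1, 32, 64, 64])
```

The design choice mirrored here is the cost argument: full attention over an H×W feature map costs O((HW)²), whereas attending among s² pooled tokens costs O(s⁴) with s ≪ H, W, leaving the unfold-based branch to recover the high-frequency detail that pooling discards.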
Xundong Gao, Hui Chen, Yaning Yao, Chengcheng Zhang. Infrared-Visible Image Fusion Network Based on Dual-Branch Feature Decomposition[J]. Laser & Optoelectronics Progress, 2025, 62(14): 1439003
Category: AI for Optics
Received: Dec. 23, 2024
Accepted: Mar. 2, 2025
Published Online: Jul. 16, 2025
The Author Email: Xundong Gao (xundonggao@guet.edu.cn), Hui Chen (Chenhui02@guet.edu.cn)
CSTR:32186.14.LOP242481