Infrared and Laser Engineering, Volume. 53, Issue 11, 20240305(2024)

Triple multi-modal image fusion algorithm based on mixed difference convolution and efficient vision Transformer network

Kunyu SI and Chunhui NIU
Author Affiliations
  • School of Instrument Science and Photoelectric Engineering, Beijing Information Science & Technology University, Beijing 100192, China
  • show less
    References(19)

    [1] ZHANG F, PENG H, YU L et al. Dual-modality space-time memory network for RGBT tracking[J]. IEEE Transactions on instrumentation and Measurement, 72, 1-11(2023).

    [5] LI H, WU X J. DenseFuse: a fusion approach to infrared and visible images[J]. IEEE Transactions on Image Processing, 28, 2614-2623(2018).

    [7] XU H, MA J, JIANG J et al. U2Fusion: A unified unsupervised image fusion network[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 502-518(2020).

    [10] [10] SU Z, LIU W, YU Z, et al. Pixel difference wks f efficient edge detection[C]Proceedings of the IEEECVF International Conference on Computer Vision, 2021: 51175127.

    [11] VASWANI A, SHAZEER N, PARMAR N et al. Attention is all you need[J]. Advances in Neural Information Processing Systems, 30, 5998-6008(2017).

    [12] WANG Z, CHEN Y, SHAO W et al. SwinFuse: A residual swin transformer fusion network for infrared and visible images[J]. IEEE Transactions on Instrumentation and Measurement, 71, 1-12(2022).

    [13] [13] ZHAO Z, Bai H, Zhang J, et al. Cddfuse: Crelationdriven dualbranch feature decomposition f multimodality image fusion[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition, 2023: 59065916.

    [14] [14] LIU X, PENG H, ZHENG N, et al. EfficientViT: Memy efficient vision transfmer with caded group attention[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition, 2023: 1442014430.

    [15] [15] HOU Q, ZHOU D, Feng J. Codinate attention f efficient mobile wk design[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition, 2021: 1371313722.

    [17] XIANG Z, HAO J, QI G et al. MFST: Multi-modal feature self-adaptive transformer for infrared and visible image fusion[J]. Remote Sensing, 14, 3233-3233(2022).

    [18] KARIM S, TONG G, Li J et al. MTDFusion: A multilayer triple dense network for infrared and visible image fusion[J]. IEEE Transactions on Instrumentation and Measurement, 73, 1-17(2023).

    Tools

    Get Citation

    Copy Citation Text

    Kunyu SI, Chunhui NIU. Triple multi-modal image fusion algorithm based on mixed difference convolution and efficient vision Transformer network[J]. Infrared and Laser Engineering, 2024, 53(11): 20240305

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: 图像处理

    Received: Jul. 8, 2024

    Accepted: --

    Published Online: Dec. 13, 2024

    The Author Email:

    DOI:10.3788/IRLA20240305

    Topics