Infrared Technology, Volume. 46, Issue 9, 1070(2024)

Infrared Small Target Detection Method with Vision Transformer and Dual Decoder

Shaosheng DAI1, Kesheng LIU1, Lian HUANG2、*, Ziqiang HE1, Xinghua MAO1, and Wenhao REN1
Author Affiliations
  • 1Department of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  • 2Department of Electrical and Electronic Engineering, Chongqing University of Technology, Chongqing 400054, China
  • show less

    The existing infrared small-target detection method based on convolutional neural networks (CNN) exhibits the problem of a limited receptive field in the encoder stage, and the decoder lacks an effective feature interaction when fusing multiscale features. To address the aforementioned issues, in this study, a new method is proposed based on an encoder–decoder structure. Specifically, a vision transformer is used as an encoder to extract multiscale features from small infrared target images. The vision transformer is an emerging deep-learning architecture that uses a self-attention mechanism to capture the global relationship between all pixels in the input image, thereby effectively processing long-range dependencies and contextual information in the image. Furthermore, a dual-decoder module, comprising an interactive decoder and auxiliary decoder, is proposed to improve the ability of the decoder to reconstruct small infrared targets. The dual-decoder module can make full use of the complementary information between different features, promote interaction between deep and shallow features, and better reconstruct small infrared targets by combining the results of the two decoders. Experimental results on widely used public datasets show that the proposed method outperforms other methods in terms of two evaluation indicators: F1 and mIoU.

    Tools

    Get Citation

    Copy Citation Text

    DAI Shaosheng, LIU Kesheng, HUANG Lian, HE Ziqiang, MAO Xinghua, REN Wenhao. Infrared Small Target Detection Method with Vision Transformer and Dual Decoder[J]. Infrared Technology, 2024, 46(9): 1070

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: May. 24, 2023

    Accepted: Jan. 21, 2025

    Published Online: Jan. 21, 2025

    The Author Email: Lian HUANG (hlcysxxy@163.com)

    DOI:

    Topics