Laser & Infrared, Volume. 55, Issue 2, 304(2025)

Infrared small dim target data augmentation algorithm based on image translation

LIAO Yan-bin1, JI Yu-xiang2, FU Zhi-ling1, YANG Hai1, and WANG Zhe1、*
Author Affiliations
  • 1School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
  • 2Shanghai Science and Technology Museum, Shanghai 200127, China
  • show less
    References(14)

    [1] [1] GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks[J]. Communications of the ACM, 2020, 63(11): 139-144.

    [2] [2] WANG T C, LIU M Y, ZHU J Y, et al. High-resolution image synthesis and semantic manipulation with conditional gans[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 8798-8807.

    [3] [3] PARK T C, LIU M Y, WANG T C, et al. Semantic image synthesis with spatially-adaptive normalization[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019: 2337-2346.

    [4] [4] PARK T, ZHU J Y, WANG O, et al. Swapping autoencoder for deep image manipulation[J]. Adv Neural Inf Process Syst, 2020, 33: 7198-7211.

    [5] [5] ZHU J Y, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017: 2223-2232.

    [6] [6] PARK T, EFROS A A, ZHANG R, ZHU J Y. Contrastive learning for unpaired image-to-image translation[C]//Proceedings of European Conference on Computer Vision (ECCV), 2020: 319-345.

    [9] [9] KIM J H; HWANG Y. Gan-based synthetic data augmentation for infrared small target detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 1-12.

    [10] [10] KIM J, KIM M, KANG H, et al. U-gat-it: Unsupervised generative attentional networks with adaptive layer-instance normalization for image-to-image translation[C]//Proceedings of the International Conference on Learning Representations, 2020.

    [11] [11] ZHOU B, KHOSLA A, LAPEDRIZA A, et al. Learning deep features for discriminative localization[C]//Proceedings of the IEEE International Conference on Computer Vision, 2016: 2921-2929.

    [12] [12] HE K, CHEN X, XIE S, et al. Masked autoencoders are scalable vision learners[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, 16000-16009.

    [13] [13] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: transformers for image recognition at scale[C]//Proceedings of the International Conference on Learning Representations, 2021.

    [15] [15] REDMON J, FARHADI A. Yolov3: an incremental improvement[J]. arXiv preprint arXiv: 1804.02767, 2018.

    [16] [16] LI B, XIAO C, WANG Y, et al. Dense nested attention network for infrared small target detection[J/OL]. http://arxiv.org/pdf/2106.04487.

    [17] [17] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detector[C]//Proceedings of European Conference on Computer Vision (ECCV), 2016: 21-37.

    Tools

    Get Citation

    Copy Citation Text

    LIAO Yan-bin, JI Yu-xiang, FU Zhi-ling, YANG Hai, WANG Zhe. Infrared small dim target data augmentation algorithm based on image translation[J]. Laser & Infrared, 2025, 55(2): 304

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Jun. 7, 2024

    Accepted: Apr. 3, 2025

    Published Online: Apr. 3, 2025

    The Author Email: WANG Zhe (64252310wangzhe@ecust.edu.cn)

    DOI:10.3969/j.issn.1001-5078.2025.02.021

    Topics