Laser & Infrared, Vol. 55, Issue 2, 304 (2025)
Infrared small dim target data augmentation algorithm based on image translation
[1] GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks[J]. Communications of the ACM, 2020, 63(11): 139-144.
[2] WANG T C, LIU M Y, ZHU J Y, et al. High-resolution image synthesis and semantic manipulation with conditional GANs[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 8798-8807.
[3] PARK T, LIU M Y, WANG T C, et al. Semantic image synthesis with spatially-adaptive normalization[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019: 2337-2346.
[4] PARK T, ZHU J Y, WANG O, et al. Swapping autoencoder for deep image manipulation[J]. Adv Neural Inf Process Syst, 2020, 33: 7198-7211.
[5] ZHU J Y, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017: 2223-2232.
[6] PARK T, EFROS A A, ZHANG R, et al. Contrastive learning for unpaired image-to-image translation[C]//Proceedings of European Conference on Computer Vision (ECCV), 2020: 319-345.
[9] KIM J H, HWANG Y. GAN-based synthetic data augmentation for infrared small target detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 1-12.
[10] KIM J, KIM M, KANG H, et al. U-GAT-IT: Unsupervised generative attentional networks with adaptive layer-instance normalization for image-to-image translation[C]//Proceedings of the International Conference on Learning Representations, 2020.
[11] ZHOU B, KHOSLA A, LAPEDRIZA A, et al. Learning deep features for discriminative localization[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 2921-2929.
[12] HE K, CHEN X, XIE S, et al. Masked autoencoders are scalable vision learners[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 16000-16009.
[13] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: transformers for image recognition at scale[C]//Proceedings of the International Conference on Learning Representations, 2021.
[15] REDMON J, FARHADI A. YOLOv3: an incremental improvement[J]. arXiv preprint arXiv:1804.02767, 2018.
[16] LI B, XIAO C, WANG Y, et al. Dense nested attention network for infrared small target detection[J/OL]. http://arxiv.org/pdf/2106.04487.
[17] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detector[C]//Proceedings of European Conference on Computer Vision (ECCV), 2016: 21-37.
LIAO Yan-bin, JI Yu-xiang, FU Zhi-ling, YANG Hai, WANG Zhe. Infrared small dim target data augmentation algorithm based on image translation[J]. Laser & Infrared, 2025, 55(2): 304
Received: Jun. 7, 2024
Accepted: Apr. 3, 2025
Published Online: Apr. 3, 2025
The Author Email: WANG Zhe (64252310wangzhe@ecust.edu.cn)