Infrared small dim target data augmentation algorithm based on image translation

LIAO Yan-bin; JI Yu-xiang; FU Zhi-ling; YANG Hai; WANG Zhe

doi:10.3969/j.issn.1001-5078.2025.02.021

Laser & Infrared, Volume. 55, Issue 2, 304(2025)

Infrared small dim target data augmentation algorithm based on image translation

LIAO Yan-bin¹, JI Yu-xiang², FU Zhi-ling¹, YANG Hai¹, and WANG Zhe^1、*

Author Affiliations

¹School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China

²Shanghai Science and Technology Museum, Shanghai 200127, China

show less

Abstract Get PDF(in Chinese)

References(14)

[1] [1] GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks[J]. Communications of the ACM, 2020, 63(11): 139-144.

[2] [2] WANG T C, LIU M Y, ZHU J Y, et al. High-resolution image synthesis and semantic manipulation with conditional gans[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 8798-8807.

[3] [3] PARK T C, LIU M Y, WANG T C, et al. Semantic image synthesis with spatially-adaptive normalization[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019: 2337-2346.

[4] [4] PARK T, ZHU J Y, WANG O, et al. Swapping autoencoder for deep image manipulation[J]. Adv Neural Inf Process Syst, 2020, 33: 7198-7211.

[5] [5] ZHU J Y, PARK T, ISOLA P, et al. Unpaired image-to-image translation using cycle-consistent adversarial networks[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017: 2223-2232.

[6] [6] PARK T, EFROS A A, ZHANG R, ZHU J Y. Contrastive learning for unpaired image-to-image translation[C]//Proceedings of European Conference on Computer Vision (ECCV), 2020: 319-345.

[9] [9] KIM J H; HWANG Y. Gan-based synthetic data augmentation for infrared small target detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 1-12.

[10] [10] KIM J, KIM M, KANG H, et al. U-gat-it: Unsupervised generative attentional networks with adaptive layer-instance normalization for image-to-image translation[C]//Proceedings of the International Conference on Learning Representations, 2020.

[11] [11] ZHOU B, KHOSLA A, LAPEDRIZA A, et al. Learning deep features for discriminative localization[C]//Proceedings of the IEEE International Conference on Computer Vision, 2016: 2921-2929.

[12] [12] HE K, CHEN X, XIE S, et al. Masked autoencoders are scalable vision learners[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, 16000-16009.

[13] [13] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: transformers for image recognition at scale[C]//Proceedings of the International Conference on Learning Representations, 2021.

[15] [15] REDMON J, FARHADI A. Yolov3: an incremental improvement[J]. arXiv preprint arXiv: 1804.02767, 2018.

[16] [16] LI B, XIAO C, WANG Y, et al. Dense nested attention network for infrared small target detection[J/OL]. http://arxiv.org/pdf/2106.04487.

[17] [17] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detector[C]//Proceedings of European Conference on Computer Vision (ECCV), 2016: 21-37.

Tools

Get Citation

Copy Citation Text

LIAO Yan-bin, JI Yu-xiang, FU Zhi-ling, YANG Hai, WANG Zhe. Infrared small dim target data augmentation algorithm based on image translation[J]. Laser & Infrared, 2025, 55(2): 304

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Jun. 7, 2024

Accepted: Apr. 3, 2025

Published Online: Apr. 3, 2025

The Author Email: WANG Zhe (64252310wangzhe@ecust.edu.cn)

DOI:10.3969/j.issn.1001-5078.2025.02.021

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology