Acta Photonica Sinica, Volume 54, Issue 5, 0510002 (2025)

Combining Generative Adversarial Network and Contrastive Learning for Infrared Image Generation

Guodong YU1, Jianguo ZHU2, Chunyang WANG1,*, Jianghai FENG1, Xubin FENG3, Shi LIU2, Pengyu XU1, Zhongqi LI1, and Xiaochen LIU1
Author Affiliations
  • 1PLA Army Unit 63869, Baicheng 137001, China
  • 2PLA Army Unit 63856, Baicheng 137001, China
  • 3Xi'an Institute of Optics and Precision Mechanics of Chinese Academy of Sciences, Xi'an 710119, China

    Guodong YU, Jianguo ZHU, Chunyang WANG, Jianghai FENG, Xubin FENG, Shi LIU, Pengyu XU, Zhongqi LI, Xiaochen LIU. Combining Generative Adversarial Network and Contrastive Learning for Infrared Image Generation[J]. Acta Photonica Sinica, 2025, 54(5): 0510002

    Paper Information

    Received: Oct. 30, 2024

    Accepted: Jan. 17, 2025

    Published Online: Jun. 18, 2025

    Author Email: Chunyang WANG (fjh879211@163.com)

    DOI: 10.3788/gzxb20255405.0510002
