Infrared image conversion technology based on improved pix2pix

To address the differing cost of acquiring images in different spectral bands, an image conversion method based on an improved pix2pix is proposed. The improvements target both the generator and the discriminator. In the generator, a residual-structure generator replaces the original U-Net generator to alleviate the vanishing-gradient problem; deformable convolution is introduced to improve the generation of target edges and small targets; and the BAM attention mechanism is introduced to strengthen feature extraction for the main target in the image, improving the quality of the generated image. In the discriminator, the number of convolutional layers in PatchGAN is varied (the original PatchGAN uses three convolutional layers), and a controlled experiment is run to find the layer count that gives the best conversion result. A subset of the KAIST dataset is used for training and testing. The experimental results show that the Root Mean Square Error (MSE) of the improved algorithm is reduced by 31.4% and the Structural Similarity (SSIM) is increased by 11.2%, so the method better realizes the conversion between infrared and visible images.
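One way to see why the number of PatchGAN convolutional layers matters is to look at the receptive field of each output patch score. The sketch below is only illustrative: it assumes the layer layout of the widely used reference pix2pix discriminator (4×4 kernels, `n_layers` stride-2 convolutions followed by two stride-1 convolutions), which the abstract does not confirm for this paper. The function names `patchgan_layers` and `receptive_field` are hypothetical helpers introduced here.

```python
def patchgan_layers(n_layers):
    """Return (kernel, stride) pairs for an ASSUMED pix2pix-style PatchGAN.

    Assumption: n_layers stride-2 4x4 convolutions, then two stride-1 4x4
    convolutions (the last one produces the one-channel patch score map).
    The paper's exact configuration is not given in the abstract.
    """
    return [(4, 2)] * n_layers + [(4, 1), (4, 1)]


def receptive_field(layers):
    """Receptive field (in input pixels) of one output unit.

    Walks the layers from output to input, applying
    rf_in = (rf_out - 1) * stride + kernel.
    """
    rf = 1
    for kernel, stride in reversed(layers):
        rf = (rf - 1) * stride + kernel
    return rf


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, receptive_field(patchgan_layers(n)))
    # 1 16
    # 2 34
    # 3 70
    # 4 142
    # 5 286
```

Under these assumptions, the three-layer setting corresponds to the familiar 70×70 patch, and each added layer roughly doubles the image region that a single real/fake decision covers. This is the trade-off a layer-count control experiment would be probing: smaller patches emphasize local texture, larger ones emphasize global structure.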

Keywords

generative adversarial network; image conversion; pix2pix; residual structures

Citation

YE Ming-liang, SHI Chun-jing, HAO Yong-ping, LI Da-Wei. Infrared image conversion technology based on improved pix2pix[J]. Laser & Infrared, 2024, 54(7): 1157

Paper Information

Received: Nov. 15, 2023

Accepted: Apr. 30, 2025

Published Online: Apr. 30, 2025

Author Email: HAO Yong-ping (yphsit@126.com)

DOI: 10.3969/j.issn.1001-5078.2024.07.024
