Laser & Infrared, Volume. 54, Issue 7, 1157(2024)
Infrared image conversion technology based on improved pix2pix
In order to solve the problem of different cost of image acquisition in different light segments, an image conversion method based on pix2pix was proposed. It mainly focuses on the generator and discriminator. In terms of generators, the residual structures generator was used instead of the original U-Net generator to alleviate the gradient vanishing problem. Deformable convolution is introduced to improve the generation effect of target edges and small targets. The BAM attention mechanism is introduced to improve the feature extraction ability of the algorithm for the main target in the image to improve the image generation effect. In terms of discriminators: change the number of convolutional layers in PatchGAN (the original PatchGAN is 3-layer convolution), and set up a control experiment to find the convolutional layer with the best conversion effect. Some KAIST datasets are selected for training and testing. The experimental results show that the Root Mean Square Error (MSE) of the improved algorithm is reduced by 31.4% and the Structural Similarity (SSIM) is increased by 11.2%, which can better realize the conversion between infrared and visible images.
Get Citation
Copy Citation Text
YE Ming-liang, SHI Chun-jing, HAO Yong-ping, LI Da-Wei. Infrared image conversion technology based on improved pix2pix[J]. Laser & Infrared, 2024, 54(7): 1157
Category:
Received: Nov. 15, 2023
Accepted: Apr. 30, 2025
Published Online: Apr. 30, 2025
The Author Email: HAO Yong-ping (yphsit@126.com)