Acta Optica Sinica, Volume 45, Issue 17, 1720017 (2025)

Review of Infrared Image Colorization Technology (Invited)

Xiubao Sui1,*, Yuan Liu1, Tong Jiang1, Tingting Liu1, and Qian Chen1,2
Author Affiliations
  • 1School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, Jiangsu, China
  • 2School of Information and Communication Engineering, North University of China, Taiyuan 030051, Shanxi, China
    Figures & Tables (22)
    Classification of infrared image colorization methods
    Principle block diagram based on preset mapping strategy
    Pseudo-color effect of infrared image based on preset mapping strategy. (a) Infrared image; (b) pseudo-color image
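    The preset mapping strategy is, in essence, a fixed lookup table from gray level to a color triple. A minimal sketch using OpenCV's built-in JET colormap; the file names and the specific LUT are illustrative assumptions, not the surveyed implementations:

```python
import cv2
import numpy as np

# Preset-mapping pseudo-color: stretch the single-channel infrared frame to
# 8 bits, then index a fixed 256-entry color lookup table.
ir = cv2.imread("ir_frame.png", cv2.IMREAD_GRAYSCALE)            # hypothetical input file
ir8 = cv2.normalize(ir, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
pseudo = cv2.applyColorMap(ir8, cv2.COLORMAP_JET)                # gray level -> preset BGR triple
cv2.imwrite("pseudo_color.png", pseudo)
```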
    Principle diagram of color transfer method based on global matching
    Principle diagram of color transfer method based on pixel matching
    Colorization effect of infrared image based on color transfer. (a)(b) Input image; (c)(d) reference image; (e)(f) colorization result
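    Global matching, as illustrated above, aligns channel-wise color statistics between the input and a reference image. A minimal Reinhard-style sketch in Lab space, assuming an OpenCV/NumPy environment; the surveyed pipelines build luminance lookup tables and texture matching on top of this basic idea:

```python
import cv2
import numpy as np

def global_color_transfer(src_gray, ref_bgr):
    """Shift/scale each Lab channel of the (replicated) source so its
    mean/std match the reference. For a pure grayscale source the a/b
    channels are constant, so the chroma collapses to the reference mean,
    which is the classic limitation of purely global matching."""
    src_bgr = cv2.cvtColor(src_gray, cv2.COLOR_GRAY2BGR)
    src = cv2.cvtColor(src_bgr, cv2.COLOR_BGR2LAB).astype(np.float32)
    ref = cv2.cvtColor(ref_bgr, cv2.COLOR_BGR2LAB).astype(np.float32)
    for c in range(3):
        s_mean, s_std = src[..., c].mean(), src[..., c].std() + 1e-6
        r_mean, r_std = ref[..., c].mean(), ref[..., c].std()
        src[..., c] = (src[..., c] - s_mean) * (r_std / s_std) + r_mean
    return cv2.cvtColor(np.clip(src, 0, 255).astype(np.uint8), cv2.COLOR_LAB2BGR)
```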
    Colorization effect of infrared image based on multi-spectral fusion. (a) Input image; (b) reference image; (c) output image
    Overview of infrared image colorization methods based on deep learning
    Several typical infrared image colorization network structures and learning strategies. (a) CNN-based infrared image colorization strategy; (b) supervised GAN-based infrared image colorization strategy (e.g., Pix2pix[27]); (c) one-way unpaired GAN-based contrastive loss strategy (e.g., CUT[38]); (d) one-way unpaired GAN-based attention-guided contrastive loss strategy (e.g., QS-Attn[41]); (e) two-way unpaired GAN-based cycle consistency loss strategy (e.g., CycleGAN[33]); (f) two-way unpaired GAN-based contrastive loss strategy (e.g., DCLGAN[43])
    Generative adversarial network model
    Network architecture of CycleGAN
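    The two-generator layout in the CycleGAN diagram is enforced by a cycle consistency term: each image must survive a round trip through both generators. A minimal PyTorch sketch; the generator modules and the weight lam=10 follow the original CycleGAN formulation, not code from this review:

```python
import torch.nn as nn

l1 = nn.L1Loss()

def cycle_loss(G_ir2rgb, G_rgb2ir, ir, rgb, lam=10.0):
    # Forward cycle: IR -> fake RGB -> reconstructed IR
    ir_rec = G_rgb2ir(G_ir2rgb(ir))
    # Backward cycle: RGB -> fake IR -> reconstructed RGB
    rgb_rec = G_ir2rgb(G_rgb2ir(rgb))
    # L1 round-trip penalty, weighted as in the original CycleGAN paper
    return lam * (l1(ir_rec, ir) + l1(rgb_rec, rgb))
```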
    Basic framework of contrastive learning
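    In the contrastive framework above, corresponding patches of the input infrared image and the colorized output form positive pairs, while other patches in the same image serve as negatives. A sketch of the InfoNCE objective used by CUT-style methods, assuming pre-extracted per-patch feature vectors; the temperature 0.07 is CUT's default:

```python
import torch
import torch.nn.functional as F

def patch_nce_loss(feat_q, feat_k, tau=0.07):
    """feat_q, feat_k: (N, C) embeddings of N patch locations from the
    output and input images. The matching row is the positive; every
    other row in the batch acts as a negative."""
    feat_q = F.normalize(feat_q, dim=1)
    feat_k = F.normalize(feat_k, dim=1)
    logits = feat_q @ feat_k.t() / tau                       # (N, N) similarity matrix
    targets = torch.arange(feat_q.size(0), device=feat_q.device)
    return F.cross_entropy(logits, targets)                  # positives sit on the diagonal
```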
    Colorization results of typical infrared image colorization networks
    Detection results of YOLOv7 on colorized images. (a) Input nighttime infrared image; (b) nighttime RGB image; (c) FRAGAN result; (d) LKAT-GAN result
    Principle framework of TeX-Net[50]
    Physically driven colorization of infrared hyperspectral image based on HADAR. (a)(b)(e)(f) Input image; (c)(d)(g)(h) colorized image
    High frequency similarity between infrared image and visible image. (a) Visible image; (b) infrared image; (c) high-frequency information of visible image; (d) high-frequency information of infrared image
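    The premise of the figure above, that infrared and visible images share high-frequency structure, can be checked by subtracting a low-pass (Gaussian-blurred) copy from each image. A minimal sketch; kernel size and file names are illustrative assumptions:

```python
import cv2
import numpy as np

def high_freq(img_gray, ksize=21, sigma=5):
    """High-pass residual: original minus its Gaussian-blurred (low-pass) copy."""
    low = cv2.GaussianBlur(img_gray, (ksize, ksize), sigma)
    return img_gray.astype(np.int16) - low.astype(np.int16)  # signed detail layer

# File names are placeholders for an aligned visible/infrared pair.
hf_vis = high_freq(cv2.imread("visible.png", cv2.IMREAD_GRAYSCALE))
hf_ir = high_freq(cv2.imread("infrared.png", cv2.IMREAD_GRAYSCALE))
```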
    Zero-shot cross-modal colorization method framework[51]
    Colorization effect of infrared image based on zero-shot learning[51]
    • Table 0. Summary of infrared image colorization methods based on deep learning (continued)

      | Type | Method | Network | Learning strategy | Strength | Weakness |
      |------|--------|---------|-------------------|----------|----------|
      | GAN | CTSC[37] | Uses CUT architecture | Unsupervised | Introduces topology-aware GNN and attention module | Requires graph construction and feature propagation; data-dependent adjacency |
      | GAN | CUT[38], FastCUT[38], FRAGAN_O[31] | CUT/FastCUT generator: ResNet; FRAGAN_O: CUT structure with improved UNet++ generator; PatchGAN discriminator | Unsupervised | Proposes contrastive loss; trains on unpaired data | Mode collapse may occur; performs poorly in complex scenes |
      | GAN | IRC[39], CCLGAN[40] | Generator: UNet++; based on CUT architecture | Unsupervised | Trains on unpaired data; improves the generator; adds perceptual loss on top of CUT | Perceptual loss may degrade quality |
      | GAN | QS-Attn[41], CFSA-ICGAN[42] | Generator: ResNet; based on CUT architecture | Unsupervised | Trains on unpaired data; adds attention to the contrastive loss to improve key-patch selection | Contrastive loss is costly; simple generators underperform in complex scenarios |
      | GAN | DCLGAN[43], DC-Net[44] | Generator: ResNet; improved CycleGAN architecture | Unsupervised | Trains on unpaired data; proposes bilateral contrastive loss to improve CycleGAN's cycle consistency loss | Computationally expensive GAN design; simplistic generators struggle in complex scenes; color distortion |

    • Table 1. Summary of infrared image colorization methods based on deep learning

      | Type | Method | Network | Learning strategy | Strength | Weakness |
      |------|--------|---------|-------------------|----------|----------|
      | CNN | TIR[24] | UNet | Supervised | The first CNN infrared image colorization network; simple architecture; easy to implement | Limited performance; requires paired data |
      | CNN | SNet[25] | Improved UNet | Supervised | Auxiliary network designed within UNet; simple architecture; easy to implement | Limited performance; requires precisely paired training data |
      | CNN | AED[26] | Improved UNet | Supervised | Uses a weight-graph-based multiresolution fusion approach; simple architecture; easy to implement | Tailored for near-infrared (NIR); needs paired data |
      | GAN | Pix2pix[27], TICC-GAN[28] | Generator: UNet; PatchGAN discriminator | Supervised | Simple architecture; easy to implement; performs well in simple scenarios; TICC-GAN integrates perceptual loss into Pix2pix | Limited performance in complex scenes; needs paired data |
      | GAN | DDGAN[29], LKAT-GAN[30], FRAGAN_P[31], MUGAN[32] | Improved from TICC-GAN; DDGAN generator: dense connections; LKAT-GAN generator: ViT+UNet; FRAGAN_P generator: improved UNet++; MUGAN: improved UNet3+ | Supervised | Improved generators capture complex textures | Needs exactly paired data; high computational cost |
      | GAN | CycleGAN[33], FRAGAN_T[31] | CycleGAN generator: ResNet; FRAGAN_T: CycleGAN with improved UNet++ generator; PatchGAN discriminator | Unsupervised | Trains without paired data; introduces cycle consistency loss; mitigates mode collapse | Colors appear unnatural; dual generators and discriminators increase cost |
      | GAN | PearlGAN[34], MornGAN[35], FoalGAN[36] | CycleGAN structure with improved ResNet generator | Unsupervised | PearlGAN introduces structured gradient alignment loss; MornGAN incorporates semantic segmentation loss; FoalGAN introduces subclass appearance consistency loss; effectively reduces the loss of subclass targets | Colors appear unnatural; segmentation loss depends on model quality; distortion persists in complex scenes |

    • Table 2. Colorization test results of different methods

      | Dataset | Method | PSNR (↑) | SSIM (↑) | MSE (↓) | Colorfulness (↑) | FID (↓) |
      |---------|--------|----------|----------|---------|------------------|---------|
      | KAIST | TIR | 13.06 | 0.42 | 0.064 | 0.038 | 167.928 |
      | KAIST | Pix2pix | 16.16 | 0.54 | 0.040 | 0.136 | 129.833 |
      | KAIST | CUT | 13.60 | 0.43 | 0.058 | 0.158 | 152.386 |
      | KAIST | QS-Attn | 16.83 | 0.58 | 0.033 | 0.417 | 84.191 |
      | KAIST | CycleGAN | 14.73 | 0.44 | 0.059 | 0.271 | 224.823 |
      | KAIST | DCLGAN | 15.16 | 0.54 | 0.055 | 0.060 | 171.339 |
      | FLIR | TIR | 12.13 | 0.47 | 0.099 | 0.377 | 253.398 |
      | FLIR | Pix2pix | 16.138 | 0.526 | 0.048 | 0.198 | 179.159 |
      | FLIR | CUT | 13.52 | 0.47 | 0.091 | 0.052 | 169.721 |
      | FLIR | QS-Attn | 16.07 | 0.54 | 0.051 | 0.095 | 176.752 |
      | FLIR | CycleGAN | 15.75 | 0.51 | 0.056 | 0.207 | 176.779 |
      | FLIR | DCLGAN | 15.15 | 0.54 | 0.055 | 0.075 | 172.146 |
      | NIR | TIR | 15.52 | 0.51 | 0.058 | 0.266 | 193.416 |
      | NIR | Pix2pix | 18.60 | 0.68 | 0.032 | 0.319 | 119.104 |
      | NIR | CUT | 17.89 | 0.62 | 0.034 | 0.090 | 114.065 |
      | NIR | QS-Attn | 18.83 | 0.69 | 0.031 | 0.147 | 105.650 |
      | NIR | CycleGAN | 17.79 | 0.59 | 0.036 | 0.063 | 89.577 |
      | NIR | DCLGAN | 17.81 | 0.62 | 0.041 | 0.118 | 239.018 |
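    For reference, the full-reference scores in Table 2 can be reproduced with standard implementations such as scikit-image (FID additionally requires a pretrained Inception network, e.g. via the pytorch-fid package, and is omitted here). A sketch assuming 8-bit RGB predictions and ground truth; the review does not state its exact evaluation code:

```python
import numpy as np
from skimage.metrics import (peak_signal_noise_ratio,
                             structural_similarity,
                             mean_squared_error)

def reference_metrics(pred: np.ndarray, gt: np.ndarray) -> dict:
    """pred/gt: uint8 RGB arrays of equal shape."""
    return {
        "PSNR": peak_signal_noise_ratio(gt, pred, data_range=255),
        "SSIM": structural_similarity(gt, pred, channel_axis=-1, data_range=255),
        # Computed on [0, 1] intensities to match the magnitude of Table 2.
        "MSE": mean_squared_error(gt / 255.0, pred / 255.0),
    }
```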
    Citation: Xiubao Sui, Yuan Liu, Tong Jiang, Tingting Liu, Qian Chen. Review of Infrared Image Colorization Technology (Invited)[J]. Acta Optica Sinica, 2025, 45(17): 1720017

    Paper Information

    Category: Optics in Computing

    Received: Jun. 8, 2025

    Accepted: Aug. 5, 2025

    Published Online: Sep. 3, 2025

    The Author Email: Xiubao Sui (sxb@njust.edu.cn)

    DOI: 10.3788/AOS251235

    CSTR: 32393.14.AOS251235
