Introduction
Visible, infrared, and infrared polarization images captured by different sensors present complementary information about the same scene, and they can be combined by image fusion technology to obtain a new, more accurate, comprehensive, and reliable description of the scene [1]. Fusion methods vary with image sources, fusion requirements, and purposes [2-4]. In general, fusion methods can be classified into pixel-, feature-, and decision-level methods. Compared with the latter two levels, pixel-level fusion preserves the source image data as much as possible, so it plays an important role in most image processing tasks. Major pixel-level image fusion methods can be put into four groups according to their adopted theories [5], namely multi-scale decomposition-based methods, sparse representation-based methods, methods in other domains, and methods combining different transforms. For the multi-scale decomposition-based methods, the decomposition scheme and the fusion rules are the two aspects that determine fusion quality and efficiency.
For the decomposition schemes, various methods have been proposed, such as the discrete wavelet transform (DWT) [6], dual-tree complex wavelet transform (DTCWT) [7], stationary wavelet transform (SWT) [8], wavelet packet transform (WPT) [9], non-subsampled contourlet transform (NSCT) [10-11], and non-subsampled shearlet transform (NSST) [12]. Much practice has shown that NSCT and NSST usually outperform other multi-scale decompositions in representing the 2-D singular signals contained in digital images [13]. However, the design of the multi-directional filter banks for NSCT and NSST is relatively complex and computationally expensive, which greatly reduces the efficiency of image fusion.
Fusion rules generally include rules for the low- and high-frequency coefficients. The AVG-ABS rule is a simple fusion rule that combines the low-frequency coefficients by averaging and the high-frequency coefficients by choosing the absolute maximum. The AVG-ABS rule is easy to compute and simple to implement; however, it often causes distortions and artifacts [14-15]. To overcome these shortcomings and improve fusion quality, a large number of rules have been proposed [15-20]. The rules in Refs. 15-20 achieve satisfactory results, but at the cost of high computational complexity.
To ensure both fusion quality and computational efficiency, a novel multi-scale decomposition-based fusion method with dual decomposition structures is proposed. Our method improves fusion quality and efficiency through the image decomposition scheme, while for the fusion rules it only uses the simple AVG-ABS rule. Firstly, inspired by the construction of octaves in the SIFT [21] and SURF [22] algorithms, the source images are decomposed into a series of detail and approximation images by multi-scale Gaussian filters with increasing standard deviation and up-scaling size, forming undecimated pyramid structures. Secondly, for the approximation images, i.e., the top layers of the undecimated pyramid structures, multi-scale morphological top- and bottom-hat decompositions [23-24] are applied to fully extract bright and dark details of different scales from the background, and the contrast of the fused layer is improved by the absolute maximum rule. Thirdly, multi-scale morphological inner- and outer-boundary decompositions are constructed based on the idea behind the multi-scale top- and bottom-hat decompositions. Each detail image is decomposed by these two morphological decompositions to extract boundary information, and the decomposed coefficients are combined by choosing the absolute maximum. Finally, the fused image is reconstructed by taking the inverse transforms corresponding to the decompositions above.
1 Related theories and work
1.1 The pyramid transforms
The theory and mathematical representation for constructing a multiresolution pyramid transform scheme were presented in Ref. 25 and extended in Ref. 26. A domain of signals $V_j$ is assigned to each level $j$: the analysis operator $\psi_j^{\uparrow}: V_j \rightarrow V_{j+1}$ maps an image to a higher level in the pyramid, while the synthesis operator $\psi_j^{\downarrow}: V_{j+1} \rightarrow V_j$ maps an image to a lower level. The detail signal $y_j$ contains the information of $x_j$ which does not exist in $\psi_j^{\downarrow}(\psi_j^{\uparrow}(x_j))$, i.e., $y_j = x_j \,\dot{-}\, \psi_j^{\downarrow}(\psi_j^{\uparrow}(x_j))$, where $\dot{-}$ is a subtraction operator mapping $V_j \times V_j$ into the set $Y_j$ of detail signals. The decomposition process of an input image $f$ is expressed as Eq. (1):

$$ f = x_0 \mapsto \{x_1, y_0\} \mapsto \{x_2, y_1, y_0\} \mapsto \cdots \mapsto \{x_K, y_{K-1}, \ldots, y_0\} \tag{1} $$

where

$$ x_{j+1} = \psi_j^{\uparrow}(x_j), \qquad y_j = x_j \,\dot{-}\, \psi_j^{\downarrow}(x_{j+1}), \qquad j = 0, 1, \ldots, K-1 \tag{2} $$

The reconstruction process through the backward recursion is expressed as Eq. (3):

$$ x_j = \psi_j^{\downarrow}(x_{j+1}) \,\dot{+}\, y_j, \qquad j = K-1, \ldots, 1, 0 \tag{3} $$

where $\dot{+}$ is the addition operator corresponding to $\dot{-}$. Eq. (1) and Eq. (3) are called the pyramid transform and the inverse pyramid transform, respectively.
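As an illustration of Eqs. (1)-(3), the following Python sketch (ours, not code from the paper) implements the generic recursions with pluggable analysis and synthesis operators, using ordinary image subtraction and addition for $\dot{-}$ and $\dot{+}$:

```python
import numpy as np

def pyramid_decompose(f, analysis, synthesis, levels):
    """Pyramid transform of Eqs. (1)-(2):
    x_{j+1} = analysis(x_j),  y_j = x_j - synthesis(x_{j+1})."""
    x, details = f.astype(np.float64), []
    for _ in range(levels):
        x_next = analysis(x)
        details.append(x - synthesis(x_next))  # detail signal y_j
        x = x_next
    return details, x                          # {y_0, ..., y_{K-1}} and x_K

def pyramid_reconstruct(details, approx, synthesis):
    """Inverse pyramid transform of Eq. (3): x_j = synthesis(x_{j+1}) + y_j."""
    x = approx
    for y in reversed(details):
        x = synthesis(x) + y
    return x
```

The schemes in the following subsections can be read as instances of this skeleton with particular choices of the analysis and synthesis operators.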
1.2 Scale space representation and multi-scale Gaussian filtering
The scale space of an image can be generated by convolving the image with Gaussian filters, and it has been successfully applied in SIFT [21] to detect key points that are invariant to scale. In SIFT, the scale space is divided into octaves. For each octave, the initial image is iteratively convolved with Gaussians of increasing standard deviation to generate a set of scale-space images (Gaussian images), and one of the Gaussian images is downsampled to obtain the initial image of the next octave. The Difference-of-Gaussians (DoG) images are then obtained by subtracting adjacent Gaussian images. In SURF [22], the down-sampling step is omitted by increasing the size of the filter instead.
Inspired by the above algorithms, the source image is repeatedly convolved with Gaussian filters whose standard deviation and size increase simultaneously, constructing an undecimated pyramid structure; the DoG images are then produced by subtracting adjacent Gaussian images. Accordingly, the transform scheme of such a pyramid is given by Eq. (4):

$$ x_{j+1} = G_{\sigma_j} * x_j, \qquad y_j = x_j - x_{j+1}, \qquad j = 0, 1, \ldots, K-1 \tag{4} $$

where

$$ G_{\sigma_j}(m, n) = \frac{1}{2\pi\sigma_j^2}\exp\left(-\frac{m^2 + n^2}{2\sigma_j^2}\right) \tag{5} $$

is the Gaussian kernel (filter), whose support size grows with $\sigma_j$, and $*$ denotes the convolution operation. The parameter $\sigma_j$ is the standard deviation, which increases with $j$; in this paper $\sigma_j = \sigma_0 k^j$. The source image $f$ can thus be decomposed into an approximation image $x_K$ and a set of detail images $\{y_0, y_1, \ldots, y_{K-1}\}$ as in scheme (1), and it can be exactly reconstructed through the following recursion:

$$ x_j = x_{j+1} + y_j, \qquad j = K-1, \ldots, 1, 0 \tag{6} $$
The four-level decomposition scheme is illustrated in Fig. 1.

Figure 1. Example of four-level decomposition by multi-scale Gaussian filtering
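A minimal sketch of this undecimated Gaussian decomposition, assuming SciPy's `gaussian_filter` and the relation $\sigma_j = \sigma_0 k^j$ above (since the pyramid is undecimated, analysis is Gaussian smoothing and synthesis is the identity); the defaults follow the infrared-visible setting of Table 2:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def gaussian_decompose(f, sigma0=0.6, k=1.4, levels=3):
    """Multi-scale Gaussian decomposition of Eq. (4):
    x_{j+1} = G_{sigma_j} * x_j, y_j = x_j - x_{j+1}, sigma_j = sigma0 * k**j."""
    x, details = f.astype(np.float64), []
    for j in range(levels):
        x_next = gaussian_filter(x, sigma=sigma0 * k ** j)  # kernel support grows with sigma_j
        details.append(x - x_next)                          # DoG detail image y_j
        x = x_next
    return details, x                                       # {y_j} and approximation x_K

def gaussian_reconstruct(details, approx):
    """Exact inverse of Eq. (6): x_j = x_{j+1} + y_j."""
    return approx + sum(details)
```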
1.3 Multi-scale morphological transforms
The multi-scale top-hat transform using structuring elements of up-scaling size can extract the bright and dark details at different image scales in image fusion [24]. Based on the idea of constructing the multi-scale top-hat transform, the multi-scale morphological inner-boundary transform is constructed. These two kinds of morphological transforms can be expressed as Eq. (4), with the Gaussian kernel replaced by the morphological opening operation $\circ$ and the erosion operation $\ominus$, respectively, i.e., $x_{l+1} = x_l \circ B_l$ or $x_{l+1} = x_l \ominus B_l$, with detail images $y_l = x_l - x_{l+1}$. For the purpose of extracting details of different scales, the scale of the structuring element $B_l$ increases with $l$. The inverse transforms can be expressed as Eq. (6).

The multi-scale morphological bottom-hat transform and its inverse are shown as follows:

$$ x_{l+1} = x_l \bullet B_l, \qquad y_l = x_{l+1} - x_l \tag{7} $$

$$ x_l = x_{l+1} - y_l \tag{8} $$

where the analysis operator is the morphological closing operation $\bullet$, with the scale of $B_l$ also increasing with $l$. The multi-scale morphological outer-boundary transform and its inverse are similar to the bottom-hat transform and its inverse, with $\bullet$ replaced by the dilation operation $\oplus$.
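A sketch of the four multi-scale morphological transforms and their inverses, assuming SciPy's grayscale morphology; the square structuring elements of size $2l+3$ are an illustrative choice standing in for the paper's disk- and square-shaped elements:

```python
import numpy as np
from scipy.ndimage import grey_opening, grey_closing, grey_erosion, grey_dilation

_ANALYSIS = {"tophat": grey_opening, "inner": grey_erosion,
             "bottomhat": grey_closing, "outer": grey_dilation}

def morph_decompose(x, levels, mode):
    """Top-hat/inner-boundary: x_{l+1} = opening/erosion of x_l, y_l = x_l - x_{l+1}.
    Bottom-hat/outer-boundary (Eq. 7): x_{l+1} = closing/dilation, y_l = x_{l+1} - x_l."""
    x, details = x.astype(np.float64), []
    for l in range(levels):
        size = 2 * l + 3                                  # SE scale increases with l
        x_next = _ANALYSIS[mode](x, size=(size, size))
        if mode in ("tophat", "inner"):
            details.append(x - x_next)                    # bright details / inner boundaries
        else:
            details.append(x_next - x)                    # dark details / outer boundaries
        x = x_next
    return details, x

def morph_reconstruct(details, approx, mode):
    """Inverses (Eqs. 6 and 8): add back top-hat/inner details, subtract
    bottom-hat/outer ones."""
    sign = 1.0 if mode in ("tophat", "inner") else -1.0
    return approx + sign * sum(details)
```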
2 Proposed method framework
The proposed fusion method comprises three processes: multi-scale decomposition, fusion, and reconstruction.
2.1 Multi-scale decomposition process
The $K$-level decomposition of a given source image $f$ by scheme (4) has the form

$$ f \mapsto \{y_0, y_1, \ldots, y_{K-1}, x_K\} \tag{9} $$

where $y_j$ ($j = 0, 1, \ldots, K-1$) represents the detail image at level $j$ and $x_K$ denotes the approximation image of this multi-scale structure.

$x_K$ is a coarse representation of $f$ and usually still inherits a few bright and dark details, thus the multi-scale top- and bottom-hat decompositions are used to extract bright objects on a dark background and dark objects on a bright background at different scales, respectively. Hence, $x_K$ can be decomposed by the schemes mentioned in subsection 1.3 as

$$ x_K \mapsto \{d_0^{t}, \ldots, d_{M-1}^{t}, a_M^{t}\}, \qquad x_K \mapsto \{d_0^{b}, \ldots, d_{M-1}^{b}, a_M^{b}\} \tag{10} $$

where $d_l^{t}$ and $d_l^{b}$ represent the detail images at level $l$ obtained by the top- and bottom-hat decomposition processes, respectively, and $a_M^{t}$ and $a_M^{b}$ denote the approximation images of the multi-scale top- and bottom-hat structures, respectively. Figure 2 gives an example of three-level top- and bottom-hat decompositions.

Figure 2. Example of three-level top- and bottom-hat decompositions
The detail images $y_j$ in scheme (9) comprise various details such as edges and lines, thus the multi-scale inner- and outer-boundary transforms mentioned in subsection 1.3 are used to extract inner- and outer-boundary information of different scales. Hence, each $y_j$ can be decomposed as

$$ y_j \mapsto \{d_0^{in,j}, \ldots, d_{N_{j+1}-1}^{in,j}, a_{N_{j+1}}^{in,j}\}, \qquad y_j \mapsto \{d_0^{out,j}, \ldots, d_{N_{j+1}-1}^{out,j}, a_{N_{j+1}}^{out,j}\} \tag{11} $$

where $d_l^{in,j}$ and $d_l^{out,j}$ represent the detail images at level $l$ of $y_j$ obtained by the inner- and outer-boundary decomposition processes, respectively, and $a_{N_{j+1}}^{in,j}$ and $a_{N_{j+1}}^{out,j}$ are the approximation images of $y_j$ at the highest level $N_{j+1}$ of the multi-scale inner- and outer-boundary structures, respectively. Figure 3 gives an example of three-level inner- and outer-boundary decompositions.

Figure 3. Example of three-level inner- and outer-boundary decompositions
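Putting the pieces together, the decomposition process of schemes (9)-(11) can be sketched as follows, reusing `gaussian_decompose` and `morph_decompose` from the sketches above; the parameter defaults mirror the infrared-visible setting of Table 2:

```python
def decompose_source(f, K=3, M=2, N=(0, 1, 2), sigma0=0.6, k=1.4):
    """Dual decomposition of one source image: scheme (9) for the Gaussian
    pyramid, scheme (10) for x_K, scheme (11) for each detail image y_j."""
    details, x_K = gaussian_decompose(f, sigma0, k, K)            # scheme (9)
    hat = (morph_decompose(x_K, M, "tophat"),
           morph_decompose(x_K, M, "bottomhat"))                  # scheme (10)
    boundary = [(morph_decompose(y, n, "inner"),
                 morph_decompose(y, n, "outer"))
                for y, n in zip(details, N)]                      # scheme (11)
    return hat, boundary
```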
2.2 Fusion process
In this paper, the composite approximation coefficients of the approximation images in the multi-scale top- and bottom-hat structures are taken as the average of the source approximations. For the composite detail coefficients of the detail images, the absolute maximum selection rule is used.
2.2.1 Fusion rules for the multi-scale top- and bottom-hat structures
The vector coordinate $\vec{r}$ is used here to denote a location in an image. For instance, $d_l^{t,A}(\vec{r})$ represents the detail coefficient of the multi-scale top-hat structure at location $\vec{r}$ within level $l$ of source image $A$, and the notation without the coordinate denotes an image, e.g., $d_l^{t,A}$ refers to the detail image itself.

An arbitrary fused detail coefficient $d_l^{t,F}(\vec{r})$ and the fused approximation coefficient $a_M^{t,F}(\vec{r})$ of the multi-scale top-hat structure are obtained through

$$ d_l^{t,F}(\vec{r}) = \max\left(d_l^{t,A}(\vec{r}),\, d_l^{t,B}(\vec{r})\right), \qquad a_M^{t,F}(\vec{r}) = w_A\, a_M^{t,A}(\vec{r}) + w_B\, a_M^{t,B}(\vec{r}) \tag{12} $$

The weights $w_A$ and $w_B$ take 0.5, which preserves the mean intensity of the two source images. Likewise, $d_l^{b,F}(\vec{r})$ and $a_M^{b,F}(\vec{r})$ of the multi-scale bottom-hat structure are obtained through

$$ d_l^{b,F}(\vec{r}) = \max\left(d_l^{b,A}(\vec{r}),\, d_l^{b,B}(\vec{r})\right), \qquad a_M^{b,F}(\vec{r}) = w_A\, a_M^{b,A}(\vec{r}) + w_B\, a_M^{b,B}(\vec{r}) \tag{13} $$

with $w_A = w_B = 0.5$. Since the hat details are nonnegative, the maximum in Eqs. (12) and (13) coincides with the absolute maximum rule.
The selection rule in Eq. (12) means that we choose the brighter ones among the bright details, and the selection rule in Eq. (13) means that we choose the darker ones among the dark details. In this way, the bright and dark details of different scales can be fully extracted, and hence the contrast at each layer can be improved.
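In code, Eqs. (12) and (13) amount to an element-wise maximum on the hat details and a 0.5/0.5 average on the approximations; a sketch under these assumptions:

```python
import numpy as np

def fuse_hat(structA, structB, wA=0.5, wB=0.5):
    """Fuse a top-hat (or bottom-hat) structure of two sources, Eqs. (12)-(13).
    Each structure is (list_of_detail_images, approximation_image)."""
    (dA, aA), (dB, aB) = structA, structB
    fused_details = [np.maximum(x, y) for x, y in zip(dA, dB)]  # brighter bright / darker dark details
    fused_approx = wA * aA + wB * aB                            # preserves mean intensity
    return fused_details, fused_approx
```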
2.2.2 Fusion rules for the multi-scale inner- and outer-boundary structures
For an arbitrary fused detail coefficient $d_l^{in,F}(\vec{r})$ of the multi-scale inner-boundary structures (the index $j$ is omitted for brevity, as the structure of each detail image $y_j$ is fused independently), only the absolute maximum selection rule is used:

$$ d_l^{in,F}(\vec{r}) = \begin{cases} d_l^{in,A}(\vec{r}), & \left|d_l^{in,A}(\vec{r})\right| \ge \left|d_l^{in,B}(\vec{r})\right| \\ d_l^{in,B}(\vec{r}), & \text{otherwise} \end{cases} \tag{14} $$

The fused approximation coefficient $a_{N}^{in,F}(\vec{r})$ is obtained in the same way. In this way, boundary information such as edges and lines of different scales can be well preserved. Likewise, arbitrary $d_l^{out,F}(\vec{r})$ and $a_{N}^{out,F}(\vec{r})$ of the multi-scale outer-boundary structures are also obtained by the absolute maximum selection rule.
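Applied to whole images, the absolute maximum selection of Eq. (14) is a one-liner in NumPy:

```python
import numpy as np

def abs_max(cA, cB):
    """Eq. (14): keep, at every pixel, the coefficient with the larger magnitude."""
    return np.where(np.abs(cA) >= np.abs(cB), cA, cB)
```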
2.3 Reconstruction process
According to Eqs. (6) and (8), the reconstruction of the fused approximation image $x_K^F$ is obtained through the multi-scale top- and bottom-hat inverse transforms as

$$ x_K^F = \frac{1}{2}\left(a_M^{t,F} + \sum_{l=0}^{M-1} d_l^{t,F}\right) + \frac{1}{2}\left(a_M^{b,F} - \sum_{l=0}^{M-1} d_l^{b,F}\right) \tag{15} $$

where the two halves are weighted equally, which means that the bright and dark information are of equal importance to the source image. In addition, we attach equal importance to the features of different scale levels, thus the level weights in Eq. (15) are all set to 1.

Similarly, the inner- and outer-boundary information are considered equally important to the source image, and so are the features of different scale levels. Thus, according to Eqs. (6) and (8), the reconstruction of an arbitrary fused detail image $y_j^F$ through the multi-scale inner- and outer-boundary inverse transforms is obtained as

$$ y_j^F = \frac{1}{2}\left(a_{N_{j+1}}^{in,F} + \sum_{l=0}^{N_{j+1}-1} d_l^{in,F}\right) + \frac{1}{2}\left(a_{N_{j+1}}^{out,F} - \sum_{l=0}^{N_{j+1}-1} d_l^{out,F}\right) \tag{16} $$

At last, the fused image $F$ is reconstructed by

$$ F = x_K^F + \sum_{j=0}^{K-1} y_j^F \tag{17} $$
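A sketch of the whole reconstruction, consuming fused structures shaped like the output of `decompose_source` above (Python's built-in `sum` adds the detail images element-wise and returns 0 for an empty level list):

```python
def reconstruct_fused(hat, boundary):
    """Eqs. (15)-(17): invert the hat structures to get x_K^F, the boundary
    structures to get each y_j^F, then invert the Gaussian pyramid."""
    (dT, aT), (dB, aB) = hat
    x_K = 0.5 * (aT + sum(dT)) + 0.5 * (aB - sum(dB))           # Eq. (15)
    y = [0.5 * (aI + sum(dI)) + 0.5 * (aO - sum(dO))            # Eq. (16)
         for (dI, aI), (dO, aO) in boundary]
    return x_K + sum(y)                                         # Eq. (17)
```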
3 Experiments
3.1 Experimental setups
To validate the performance of the proposed method, experiments are conducted on two categories of source images: ten pairs of infrared-visible images (Fig. 4(a)) and eight pairs of infrared intensity-polarization images (Fig. 4(b)). The two source images in each pair are pre-registered, and the size of each image is 256×256 pixels. The experiments are implemented in Matlab 2016b and run on a desktop with an Intel(R) Core(TM) i5-6500 CPU @ 3.20 GHz and 16.0 GB RAM.

Figure 4. The two kinds of source images: (a) infrared-visible images, (b) infrared intensity-polarization images
Various pixel-level multi-scale decomposition-based methods, including DWT, DTCWT, SWT, WPT, NSCT, and NSST, are compared with the proposed method. All the compared methods adopt the simple AVG-ABS rule. According to Ref. 13, most of the above methods perform well when their decomposition levels are set to 3; thus, to make the comparisons reliable and persuasive, the decomposition levels of all these methods are set to 3. To let each method achieve good performance, the other parameters are also set as suggested by Ref. 13, some of which are listed in Table 1.

Table 1. The parameters set in the compared methods. 'Filter' represents the orientation filter; 'Levels' denotes the decomposition levels and the corresponding number of orientations for each level.

| Methods | Pyramid filter | Filter | Levels |
|---|---|---|---|
| DWT | rbio1.3 | | 3 |
| DTCWT | 5-7 | q-6 | 3 |
| SWT | bior1.3 | | 3 |
| WPT | bior1.3 | | 3 |
| NSCT | maxflat | dmaxflat5 | 4,8,16 |
| NSST | maxflat | | 4,8,16 |
For NSST, the sizes of the local support of the shear filters at each level are selected as 8, 16, and 32. For the proposed method, the parameters $\sigma_0$ and $k$ of the multi-scale Gaussian filtering in Eq. (5) are selected experimentally. In this experiment, the source images are decomposed by a 3-layer multi-scale Gaussian decomposition, and different fused images are obtained by varying $\sigma_0$ and $k$; the AVG-ABS rule is again adopted during fusion. For each pair of $\sigma_0$ and $k$ values, every fused image is evaluated by the seven objective assessment metrics described in subsection 3.2. For each metric, its mean value is obtained by averaging the evaluation results over the fused images, and the seven mean values are summed to give an overall score. Figure 5 gives surface plots showing the variation of this sum with $\sigma_0$ and $k$, from which the optimal values of $\sigma_0$ and $k$ for each category of images are obtained. The structuring elements in the multi-scale inner- and outer-boundary decompositions are squares, and those in the multi-scale top- and bottom-hat decompositions are disks. $\sigma_0$ and $k$ in Eq. (5) and the parameters $K$, $M$, $N_1$, $N_2$, and $N_3$ in schemes (9), (10), and (11) are set as shown in Table 2 to make the proposed method achieve good performance.
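The selection of $\sigma_0$ and $k$ can be sketched as a simple grid search; the grid ranges below are hypothetical, and `fuse` and `metrics` stand in for the proposed fusion pipeline and the seven metrics of subsection 3.2:

```python
import numpy as np

def select_parameters(image_pairs, fuse, metrics,
                      sigma0_grid=np.arange(0.2, 1.21, 0.1),  # hypothetical search ranges
                      k_grid=np.arange(1.1, 2.01, 0.1)):
    """Pick (sigma0, k) maximizing the sum of the mean values of all metrics."""
    best, best_score = None, -np.inf
    for s0 in sigma0_grid:
        for k in k_grid:
            fused = [fuse(a, b, sigma0=s0, k=k) for a, b in image_pairs]
            score = sum(np.mean([m(f) for f in fused]) for m in metrics)
            if score > best_score:
                best, best_score = (s0, k), score
    return best
```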

Figure 5. Variation of the sum of the seven objective metrics with $\sigma_0$ and $k$

Table 2. The parameters of the proposed method for the two categories of source images.

| Source images | $\sigma_0$ | $k$ | $[K, M, N_1, N_2, N_3]$ |
|---|---|---|---|
| Infrared-visible | 0.6 | 1.4 | [3, 2, 0, 1, 2] |
| Infrared intensity-polarization | 0.6 | 1.1 | [3, 2, 1, 1, 2] |
3.2 Objective assessment metrics
Seven representative metrics, i.e., Q0 [27], QE [28], QAB/F [29], information entropy (IE) [30], mutual information (MI) [31], Tamura contrast (TC) [32], and visual information fidelity (VIF) [33], are employed to evaluate the proposed method comprehensively. The parameter in TC is chosen to be 4.
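As one concrete example of these metrics, information entropy is computed from the normalized gray-level histogram; a sketch for 8-bit images:

```python
import numpy as np

def information_entropy(img, bins=256):
    """IE = -sum_i p_i * log2(p_i) over the gray-level histogram."""
    hist, _ = np.histogram(img, bins=bins, range=(0, 256))
    p = hist / hist.sum()
    p = p[p > 0]                 # skip empty bins to avoid log(0)
    return float(-(p * np.log2(p)).sum())
```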
3.3 Experimental results
3.3.1 Subjective assessment
In this section, the fusion methods are assessed subjectively by comparing the visual results of the compared methods and the proposed method. One sample pair from each type of source images is selected for visual comparison, as shown in Figs. 6 and 7.

Figure 6. Fusion results of one pair of the infrared-visible images: (a) infrared image, (b) visible image, (c)-(i) the fusion results of the DWT, DTCWT, SWT, WPT, NSCT, NSST, and proposed methods.

Figure 7. Fusion results of one pair of the infrared intensity-polarization images: (a) infrared intensity image, (b) infrared polarization image, (c)-(i) the fusion results of the DWT, DTCWT, SWT, WPT, NSCT, NSST, and proposed methods.
In Fig. 6, both the DWT and WPT methods distort the edges of the roof, as shown clearly in the magnified squares. The DTCWT, SWT, NSCT, and NSST methods produce artificial edges in the sky around the roof, while the result of the proposed method is free from such artifacts and brightness distortions. In addition, the walls and the clouds in the sky in Fig. 6(i) are brighter than those in Figs. 6(g) and (h), which means that the fused image of the proposed method has better contrast.
The edges of the car are distorted heavily in Fig. 7(f) and slightly in Figs. 7(c)-(e), as shown more clearly in the corresponding magnified regions. Figures 7(c)-(h) also show some artifacts around the edges of the car, whereas Fig. 7(i) is free of such distortions and artifacts. In addition, the car in the magnified square of Fig. 7(i) is darker than those in Figs. 7(g) and (h), which demonstrates that the proposed method achieves better contrast.
The above experiments confirm that the proposed method achieves better visual effects on both categories of source images. Although it adopts the simple AVG-ABS rule, the proposed method does not generate noticeable artifacts or distortions, while preserving the detail information of the source images as much as possible.
3.3.2 Objective assessment
The objective assessment of the seven multi-scale decomposition-based methods is shown in Table 3. For the infrared-visible images, the proposed method performs the best on all seven metrics. For the infrared intensity-polarization images, the proposed method performs the best on five metrics and the second best on Q0 and QE. Table 3 also shows that, compared with the other methods, the proposed method always has the best scores on QAB/F, IE, MI, TC, and VIF. This means that the proposed method sufficiently transfers the original information of the source images, including edge and brightness details, to the fused image, and improves the contrast of the fused image.

Table 3. Objective assessment of all methods (the best result of each metric is highlighted in bold).

| Images | Methods | Q0 | QAB/F | QE | IE | MI | TC | VIF |
|---|---|---|---|---|---|---|---|---|
| Infrared-visible | DWT | 0.4391 | 0.4858 | 0.2268 | 6.6601 | 2.1658 | 0.2588 | 0.2936 |
| | DTCWT | 0.4446 | 0.5173 | 0.2579 | 6.6830 | 2.2235 | 0.2937 | 0.2949 |
| | SWT | 0.4452 | 0.5097 | 0.2457 | 6.6155 | 2.1872 | 0.2203 | 0.2784 |
| | WPT | 0.4079 | 0.3952 | 0.1614 | 6.6385 | 2.1949 | 0.2745 | 0.2738 |
| | NSCT | 0.4669 | 0.5281 | 0.2595 | 6.6961 | 2.2633 | 0.2940 | 0.3145 |
| | NSST | 0.4653 | 0.5231 | 0.2570 | 6.6858 | 2.2575 | 0.2902 | 0.3103 |
| | Proposed | **0.4757** | **0.5356** | **0.2689** | **6.7359** | **2.4707** | **0.3177** | **0.3626** |
| Infrared intensity-polarization | DWT | 0.3853 | 0.4206 | 0.1676 | 6.4782 | 2.2664 | 0.3476 | 0.2196 |
| | DTCWT | 0.3944 | 0.4585 | **0.2089** | 6.5707 | 2.3415 | 0.4684 | 0.2437 |
| | SWT | 0.3875 | 0.4391 | 0.1931 | 6.4730 | 2.3429 | 0.3308 | 0.2300 |
| | WPT | 0.3469 | 0.3439 | 0.1198 | 6.4052 | 2.2917 | 0.4437 | 0.1972 |
| | NSCT | 0.4133 | 0.4675 | 0.1977 | 6.5646 | 2.3917 | 0.4585 | 0.2574 |
| | NSST | **0.4138** | 0.4641 | 0.1995 | 6.5740 | 2.3898 | 0.4597 | 0.2592 |
| | Proposed | 0.4134 | **0.4690** | 0.2013 | **6.6580** | **2.6241** | **0.5478** | **0.3137** |
3.3.3 Comparison of computational efficiency
To verify the efficiency of the proposed method, an experiment is conducted on the image sequences named "Nato_camp", "Tree", and "Dune" from the TNO Image Fusion Dataset [34]. Table 4 shows the average processing time per frame for all methods. Compared with the DWT, DTCWT, SWT, and WPT methods, the proposed method is more time-consuming because these four methods contain one type of multi-scale decomposition while the proposed method contains two, i.e., the multi-scale Gaussian decomposition and the multi-scale morphological decomposition, as described in Sec. 2. Compared with the NSCT and NSST methods, which also contain two kinds of multi-scale decomposition, the proposed method is far more efficient, mainly because the design of the multi-directional filter banks for NSCT and NSST is relatively complex and multi-directional filtering is much slower than multi-scale morphological operations.

Table 4. Average processing time (unit: s) comparison of the seven methods. Each value represents the average run time per frame of a given sequence.

| Image sequences | DWT | DTCWT | SWT | WPT | NSCT | NSST | Proposed |
|---|---|---|---|---|---|---|---|
| Nato_camp | 0.0180 | 0.0362 | 0.0647 | 0.1401 | 24.5173 | 2.3072 | 0.1419 |
| Tree | 0.0165 | 0.0357 | 0.0643 | 0.1398 | 24.8215 | 2.2923 | 0.1411 |
| Dune | 0.0171 | 0.0361 | 0.0641 | 0.1406 | 24.5841 | 2.2881 | 0.1412 |
4 Conclusions
Experiments on both visual quality and objective assessment demonstrate that, although it adopts the simple AVG-ABS rule, the proposed method does not generate noticeable artifacts or distortions and performs very well in information preservation and contrast improvement. Under the premise of ensuring fusion quality, the proposed method also proves computationally efficient. It thus provides an option for fusion scenarios that demand both high quality and high computational efficiency, such as fast high-resolution image fusion and video fusion.