Laser & Optoelectronics Progress, Volume. 61, Issue 10, 1037004(2024)
Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention
At present, most of the multi-spectral pedestrian detection algorithms focus on the fusion methods of visible light and infrared images, but the number of parameters to fully fuse multi-spectral images is huge, resulting in lower detection speed. To solve this problem, we propose a multi-spectral pedestrian detection algorithm based on YOLOv5s with high timeliness. To ensure the detection speed of the algorithm, we select the merging method of visible light and infrared light channel direction as the input of the network, and improve the detection accuracy by improving the traditional algorithm. First, some standard convolution is replaced by deformable convolution to enhance the ability of the network to extract irregular shape feature objects. Second, the spatial pyramid pooling module in the network is replaced by multi-scale residual attention module, which weakens the interference of the background to the pedestrian target and improves the detection accuracy. Finally, by changing the connection mode and adding the large-scale feature splicing layer, the minimum detection scale of the network is increased, and the detection effect of the network for small targets is improved. Experimental results show that the improved algorithm has obvious advantages in detection speed, and improves the mAP@0.5 and mAP@0.5∶0.95 by 5.1 and 1.9 percentage points over the original algorithm, respectively.
Get Citation
Copy Citation Text
Guoli Zhang, Shuai Chang, Yansong Song, Tianci Liu. Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention[J]. Laser & Optoelectronics Progress, 2024, 61(10): 1037004
Category: Digital Image Processing
Received: Sep. 15, 2023
Accepted: Oct. 20, 2023
Published Online: Mar. 20, 2024
The Author Email: Chang Shuai (changshuai@cust.edu.cn)