Laser & Optoelectronics Progress, Volume. 61, Issue 10, 1037004(2024)

Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention

Guoli Zhang1,2, Shuai Chang1,2、*, Yansong Song1,2, and Tianci Liu1,2
Author Affiliations
  • 1College of Opto-Electronic Engineering, Changchun University of Science and Technology, Changchun 130022, Jilin, China
  • 2Institute of Space Photoelectric Technology, Changchun University of Science and Technology, Changchun 130022, Jilin, China
  • show less

    At present, most of the multi-spectral pedestrian detection algorithms focus on the fusion methods of visible light and infrared images, but the number of parameters to fully fuse multi-spectral images is huge, resulting in lower detection speed. To solve this problem, we propose a multi-spectral pedestrian detection algorithm based on YOLOv5s with high timeliness. To ensure the detection speed of the algorithm, we select the merging method of visible light and infrared light channel direction as the input of the network, and improve the detection accuracy by improving the traditional algorithm. First, some standard convolution is replaced by deformable convolution to enhance the ability of the network to extract irregular shape feature objects. Second, the spatial pyramid pooling module in the network is replaced by multi-scale residual attention module, which weakens the interference of the background to the pedestrian target and improves the detection accuracy. Finally, by changing the connection mode and adding the large-scale feature splicing layer, the minimum detection scale of the network is increased, and the detection effect of the network for small targets is improved. Experimental results show that the improved algorithm has obvious advantages in detection speed, and improves the mAP@0.5 and mAP@0.5∶0.95 by 5.1 and 1.9 percentage points over the original algorithm, respectively.

    Tools

    Get Citation

    Copy Citation Text

    Guoli Zhang, Shuai Chang, Yansong Song, Tianci Liu. Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention[J]. Laser & Optoelectronics Progress, 2024, 61(10): 1037004

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Digital Image Processing

    Received: Sep. 15, 2023

    Accepted: Oct. 20, 2023

    Published Online: Mar. 20, 2024

    The Author Email: Chang Shuai (changshuai@cust.edu.cn)

    DOI:10.3788/LOP232131

    Topics