Laser & Optoelectronics Progress, Vol. 61, Issue 10, 1037004 (2024)

Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention

Guoli Zhang1,2, Shuai Chang1,2,*, Yansong Song1,2, and Tianci Liu1,2
Author Affiliations
  • 1College of Opto-Electronic Engineering, Changchun University of Science and Technology, Changchun 130022, Jilin, China
  • 2Institute of Space Photoelectric Technology, Changchun University of Science and Technology, Changchun 130022, Jilin, China
    Figures & Tables(13)
    Sampling positions. (a) Regular convolution; (b) DCN
    3×3 DCN
    Schematic diagram of SPP module
    Schematic diagram of multi-scale residual attention module. (a) Overall structure; (b) structure of UP module
    Neck before improvement
    Neck after improvement
    Improved YOLOv5s network
    P-R curves
    Visual comparison of detection results of different algorithms. (a) Complex background scene; (b) heavily occluded pedestrian target scene; (c) blurred background scene with small targets; (d) pedestrian target scene with weak features
    • Table 1. Experimental environment configuration

      Parameter | Experimental environment
      Operating system | Windows 11
      CPU | Intel Core i5-12500H @ 2.40 GHz
      GPU | GeForce RTX 3060
      Memory | 32 GB
      Python | 3.8
      Deep learning framework | PyTorch 1.7.0, CUDA 11.1
    • Table 2. Ablation experimental results

      Group | DCN | MRA | Small target | mAP@0.5 /% | mAP@0.5:0.95 /% | FPS /(frame/s)
      1 |  |  |  | 49.7 | 26.3 | 63.8
      2 |  |  |  | 51.1 | 26.9 | 60.7
      3 |  |  |  | 50.6 | 26.5 | 61.4
      4 |  |  |  | 51.7 | 27.0 | 60.3
      5 |  |  |  | 51.9 | 27.3 | 58.7
      6 |  |  |  | 53.5 | 27.7 | 56.4
      7 |  |  |  | 52.6 | 27.5 | 56.8
      8 |  |  |  | 54.8 | 28.2 | 54.4
    • Table 3. Comparative experimental results of fusion methods

      Method | Platform | FPS /(frame/s)
      ACF | MATLAB | 0.37
      Halfway Fusion | TITAN X | 2.36
      IAF-RCNN | TITAN X | 4.76
      CIAN | 1080Ti | 14.28
      DSMN | 3090Ti | 1.32
      MBNet | 1080Ti | 14.29
      Ours | 3060Ti | 54.40
    • Table 4. Comparative experimental results of different methods

      Method | mAP@0.5 /% | FPS /(frame/s)
      Faster R-CNN | 46.7 | 14.8
      SSD | 38.6 | 42.9
      YOLOv4-tiny | 42.5 | 53.6
      YOLOv5s | 49.7 | 63.8
      YOLOv7 | 51.6 | 39.3
      Ours | 54.8 | 54.4
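
As a reading aid for Table 4, the following minimal Python sketch computes how the proposed model compares with the YOLOv5s baseline, using only the values listed in the table. The dictionary layout and the helper name `delta_vs` are illustrative assumptions, not part of the paper.

```python
# Values copied from Table 4: (mAP@0.5 in %, speed in frame/s).
results = {
    "Faster R-CNN": (46.7, 14.8),
    "SSD": (38.6, 42.9),
    "YOLOv4-tiny": (42.5, 53.6),
    "YOLOv5s": (49.7, 63.8),
    "YOLOv7": (51.6, 39.3),
    "Ours": (54.8, 54.4),
}


def delta_vs(baseline, candidate, table=results):
    """Return (mAP change in percentage points, FPS change) of candidate vs. baseline."""
    base_map, base_fps = table[baseline]
    cand_map, cand_fps = table[candidate]
    # Round to one decimal place to match the precision reported in the table.
    return round(cand_map - base_map, 1), round(cand_fps - base_fps, 1)


map_gain, fps_change = delta_vs("YOLOv5s", "Ours")
print(f"mAP@0.5 change: {map_gain:+.1f} points, FPS change: {fps_change:+.1f} frame/s")
```

With the table values, this gives a gain of 5.1 percentage points in mAP@0.5 at a cost of 9.4 frame/s relative to YOLOv5s.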
    Guoli Zhang, Shuai Chang, Yansong Song, Tianci Liu. Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention[J]. Laser & Optoelectronics Progress, 2024, 61(10): 1037004

    Paper Information

    Category: Digital Image Processing

    Received: Sep. 15, 2023

    Accepted: Oct. 20, 2023

    Published Online: Mar. 20, 2024

    Corresponding author email: Shuai Chang (changshuai@cust.edu.cn)

    DOI: 10.3788/LOP232131
