Laser & Optoelectronics Progress, Volume. 58, Issue 24, 2415009(2021)

Two-Channel SSD Pedestrian Head Detection Algorithm Based on Multi-Scale Feature fusion

Yongfu Zhou1, Wenlong Li1,2, and Ranran Hu2、*
Author Affiliations
  • 1School of Management Engineering, Jilin Communications Polytechnic, Changchun, Jilin 130012, China
  • 2School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun, Jilin 130022, China
  • show less
    Figures & Tables(16)
    Structure of SSD300 network
    Structure of two-channel network
    Improved two-channel SSD network model based on feature fusion
    Partial depth images. (a) Image 1; (b) image 2; (c) image 3
    Color images and depth images under different operations. (a) Original images; (b) 180 ° rotation; (c) x-axis reversal; (d) y-axis reversal
    Training loss and accuracy curves. (a) Training loss curve; (b) accuracy curve
    Relationship between precision and recall of different networks
    Detection results of SSD network and two-channel SSD network. (a) Original images; (b)SSD network; (c) two-channel SSD network
    Detection results of SSD network and improved SSD network with multi-scale feature fusion. (a) Original images; (b) SSD network; (c) two-channel SSD network
    Detection results of each algorithm under different illumination variation conditions. (a) SSD algorithm; (b) DSSD algorithm; (c) improved algorithm
    Detection results of each algorithm under different occlusion conditions. (a) SSD algorithm; (b) DSSD algorithm; (c) improved algorithm
    • Table 1. Size of default box for each layer

      View table

      Table 1. Size of default box for each layer

      ParameterMDConv 4_3_fusionFC 7_fusionConv 8_2_fusion
      Feature map size/(pixel×pixel)38×3819×1910×10
      Number of default boxes463
      ar{1,1,2,1/2}{1,1,2,1/2,3,1/3}{1,2,1/2}
      Small side length3060111
      Large side length60111-
    • Table 2. Average accuracy and speed of target detection in different prior frame networks

      View table

      Table 2. Average accuracy and speed of target detection in different prior frame networks

      NetworkmAP/%FPS
      Simplified SSD network83.7033
      SSD network84.9029
    • Table 3. Average detection accuracy of two-channel SSD network unit: %

      View table

      Table 3. Average detection accuracy of two-channel SSD network unit: %

      Fusion modeWeight ratioRGB imageDepth imageTwo-channel SSD
      Concat83.7082.6189.86
      Eltwise
      0.7∶0.393.59
      0.5∶0.591.94
      0.3∶0.787.46
    • Table 4. Average detection accuracy of multi-scale feature fusion network unit: %

      View table

      Table 4. Average detection accuracy of multi-scale feature fusion network unit: %

      ParameterSSDDSSDMulti-scale SSD
      mAP83.7088.4391.47
    • Table 5. Average detection accuracy of different models unit: %

      View table

      Table 5. Average detection accuracy of different models unit: %

      ParameterSSDRGB-D+YOLOv2RGB-D+Faster-RCNNRef. [18]Ref. [19]Ref. [25]Proposed model
      mAP84.9092.9593.1492.0190.7895.4797.80
    Tools

    Get Citation

    Copy Citation Text

    Yongfu Zhou, Wenlong Li, Ranran Hu. Two-Channel SSD Pedestrian Head Detection Algorithm Based on Multi-Scale Feature fusion[J]. Laser & Optoelectronics Progress, 2021, 58(24): 2415009

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Jul. 26, 2021

    Accepted: Sep. 2, 2021

    Published Online: Dec. 1, 2021

    The Author Email: Ranran Hu (huranran111@126.com)

    DOI:10.3788/LOP202158.2415009

    Topics