Acta Optica Sinica, Volume. 42, Issue 14, 1415003(2022)

Occluded Pedestrian Detection Algorithm Based on Improved YOLOv3

Xiang Li1,2,3,4, Miao He1,2,3, and Haibo Luo1,2,3、*
Author Affiliations
  • 1Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang 110016, Liaoning , China
  • 2Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, Liaoning , China
  • 3Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, Liaoning , China
  • 4University of Chinese Academy of Sciences, Beijing 100049, China
  • show less
    Figures & Tables(13)
    Illustration of difficulties in occluded object detection. (a) Loose prediction boxes of heavily overlapped objects; (b) center points of prediction boxes of heavily overlapped objects locate in same feature grid; (c) most regions in occluded object box occupied by foreground object
    Convergence results before and after introducing Tight Loss function. (a) Variance of convergence result of prediction box is relatively larger without Tight Loss function; (b) prediction boxes with different anchor frames as starting points tend to be consistent after introducing Tight Loss function
    Schematic diagrams of high-resolution feature pyramid and insertion position of spatial attention prediction head in network. (a) YOLOv3 network; (b) high-resolution feature pyramid; (c) center points of heavily overlapped objects locate in same grid in original feature pyramid; (d) center points of heavily overlapped objects locate in different grids in high resolution feature pyramid
    Schematic diagram of redundant bounding boxes with high confidence in high-resolution feature pyramid. (a) Target box in original feature pyramid and its confidence prediction; (b) confidence prediction and redundant prediction boxes generated by upsampling mechanism; (c) confidence and prediction boxes filtered by spatial attention mechanism
    Spatial attention module
    Schematic diagrams of spatial attention prediction head and spatial attention residual block. (a) Spatial attention prediction head; (b) spatial attention residual block
    Heat map of target confidence
    Influence of Tight Loss function on model performance. (a)(c) Prediction results after Tight Loss fine-tuning; (b)(d) prediction results without Tight Loss adjustment
    Comparison of comprehensive performance of models. (a)(c) Prediction results generated by improved YOLOv3; (b)(d) prediction results generated by original YOLOv3
    • Table 1. Results of ablation experiments based on YOLOv3

      View table

      Table 1. Results of ablation experiments based on YOLOv3

      Tight lossHRFPSAPHMAP /%MMR /%MRecall /%Reasoning speed /(frames-1
      86.8449.5290.6838
      86.8649.3890.6538
      89.5348.3093.8026
      89.5748.1493.7226
      89.7248.3493.8132
      89.7548.2893.8832
    • Table 2. Influence of spatial attention mechanism on number of predicted boxes

      View table

      Table 2. Influence of spatial attention mechanism on number of predicted boxes

      Mode00.10.20.30.40.50.60.70.80.9
      Variation-16199-344+1946+2313+2043+1768+1593+1184+883+429
      H3749111419251073159047777812651935149136305217428725
      H+S3587121415811092619279079855669615308437489225759154
    • Table 3. Performance comparison of models under different NMS methods

      View table

      Table 3. Performance comparison of models under different NMS methods

      IndexOriginal NMSSoft NMSAdaptive NMS
      YOLOv3H+T+SYOLOv3H+T+SYOLOv3H+T+S
      MAP84.9788.3889.0491.4186.8489.75
      MMR50.3949.1450.2649.0749.5248.28
      MRecall88.8592.6694.9797.3290.6893.88
    • Table 4. Comparison between proposed algorithm and current advanced occluded pedestrian detection algorithms

      View table

      Table 4. Comparison between proposed algorithm and current advanced occluded pedestrian detection algorithms

      AlgorithmNMS MethodMAPMMRMRecall
      RetinaNet15Original NMS78.3365.2294.13
      IterDet(RetinaNet)26Original NMS84.7756.2191.49
      Faster RCNN15Original NMS83.0752.3590.57
      PS-RCNN27Original NMS86.0593.77

      IterDet

      (Faster RCNN)26]

      Original NMS88.0849.4495.80
      YOLOv3+H+S+TOriginal NMS88.3849.1492.66
      RetinaNet15Soft NMS78.1066.3495.37
      Faster RCNN15Soft NMS83.9251.9791.73
      YOLOv3+H+S+TSoft NMS91.4149.0797.32
      RetinaNet15Adaptive NMS79.6763.0394.77
      Faster RCNN15Adaptive NMS84.7149.7391.27
      YOLOv3+H+S+TAdaptive NMS89.7548.2893.88
    Tools

    Get Citation

    Copy Citation Text

    Xiang Li, Miao He, Haibo Luo. Occluded Pedestrian Detection Algorithm Based on Improved YOLOv3[J]. Acta Optica Sinica, 2022, 42(14): 1415003

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Jan. 7, 2022

    Accepted: Feb. 14, 2022

    Published Online: Jul. 15, 2022

    The Author Email: Luo Haibo (luohb@sia.cn)

    DOI:10.3788/AOS202242.1415003

    Topics