Acta Optica Sinica, Volume. 40, Issue 5, 0504001(2020)

Multi-Scale Infrared Pedestrian Detection Based on Deep Attention Mechanism

Bin Zhao, Chunping Wang*, Qiang Fu, and Yichao Chen
Author Affiliations
  • Department of Electronic and Optical Engineering, Shijiazhuang Campus of Army Engineering University, Shijiazhuang, Hebei 050003, China
  • show less
    Figures & Tables(13)
    Characteristic of pedestrian in U-FOV infrared images. (a) Large and medium scale pedestrians; (b) small scale pedestrians
    Architecture of multi-scale infrared pedestrian detection network based on Darknet53
    Architecture of attention module
    Residual module
    Principle of pedestrian detection
    Learning rate and loss curves. (a) Learning rate on Caltech dataset; (b) loss on Caltech dataset; (c) learning rate on U-FOV dataset; (d) loss on U-FOV dataset
    Salient coefficient and feature maps
    Distribution of pedestrian size in U-FOV test set
    Visualization results of infrared pedestrian detection
    P-R curves under different IoU thresholds. (a) IoU threshold is 0.3; (b) IoU threshold is 0.45; (c) IoU threshold is 0.5; (d) IoU threshold is 0.7
    Visualization results of infrared pedestrian detection on LTIR dataset at different scenes
    • Table 1. Average precision of pedestrian detection under different IoU thresholds

      View table

      Table 1. Average precision of pedestrian detection under different IoU thresholds

      MethodBackboneDatasetAP
      IoU is 0.3IoU is 0.45IoU is 0.5IoU is 0.7
      Faster R-CNNResNet101U-FOV IR0.5932
      SSDMobilenet_v1U-FOV IR0.5584
      R-FCNResNet101U-FOV IR0.6312
      YOLOv3Darknet53U-FOV IR0.65950.66710.66280.6461
      MS-IRPDDarknet53U-FOV IR0.88800.88700.88280.8511
      MS-IRPDDarknet53Caltech+IR0.90570.90780.90840.8961
      MS-IRPD-attentionDarknet53Caltech+IR0.92950.93450.93400.9120
    • Table 2. Total times of U-FOV test set

      View table

      Table 2. Total times of U-FOV test set

      MethodYOLOv3YOLOv3+AttentionMS-IRPD-Attention
      Total time /s90.7595.32125.21
      FPS7.286.935.28
    Tools

    Get Citation

    Copy Citation Text

    Bin Zhao, Chunping Wang, Qiang Fu, Yichao Chen. Multi-Scale Infrared Pedestrian Detection Based on Deep Attention Mechanism[J]. Acta Optica Sinica, 2020, 40(5): 0504001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Detectors

    Received: Sep. 23, 2019

    Accepted: Nov. 27, 2019

    Published Online: Mar. 10, 2020

    The Author Email: Wang Chunping (wang_c_p@163.com)

    DOI:10.3788/AOS202040.0504001

    Topics