Infrared and Laser Engineering, Volume 51, Issue 9, 20210924 (2022)

Semantic enhanced guide feature reconstruction for occluded pedestrian detection

Xudan Sun, Qing Wu, Chunyan Zhao, and Mandun Zhang
Author Affiliations
  • School of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, China
    Figures & Tables(13)
    Framework diagram of proposed network model
    (a) Semantic feature enhancement structure diagram; (b) Improved global context block
    Structure diagram of adaptive feature reconstruction
    Comparison of original image and different segmentation images. (a) Original image; (b) Fine segmentation map; (c) Rough segmentation map
    Comparison of feature visualization of Conv5_3 layer before and after adding each module. (a) Original image; (b) Baseline; (c) SFEM; (d) AFRM; (e) Proposed method
    Comparison of some test results. (a) CSP test results; (b) Proposed method test results
    Model failure scene
    • Table 1. Standards for dividing data set subsets

      Type        Bare    Reasonable  Heavy     Small     All
      Height      h>50    h>50        h>50      50<h<75   h>20
      Visibility  v>90%   v>65%       0<v<65%   v>65%     v>20%
    • Table 2. Comparison of AFRM fusion methods

      Baseline  Cat  Sum  Multiply  Reasonable  Heavy
                                    11.00%      49.30%
                                    10.60%      48.74%
                                    11.20%      49.10%
                                    10.26%      48.21%
    • Table 3. Comparison experiment of ablation of each module

      Baseline  SFEM  AFRM  Reasonable  Heavy
                            11.00%      49.30%
                            10.82%      48.77%
                            10.26%      48.21%
                            9.85%       47.32%
    • Table 4. Comparison results of different NMS types at different thresholds

      Type                        IoU=0.5  IoU=0.6  IoU=0.7
      Baseline (NMS)              49.30    49.91    53.47
      Proposed method (NMS)       47.32    47.85    51.00
      Proposed method (DIoU-NMS)  47.28    47.85    50.75
    • Table 5. Comparison of CityPersons data sets

      Method    Reasonable  Bare  Heavy  Small  All    Time
      RepLoss   13.20       7.60  56.90  42.60  44.45  -
      OR-CNN    12.80       6.70  55.70  42.30  42.32  -
      PRNet     10.80       6.80  53.30  -      -      -
      PEN       10.40       7.00  47.40  -      -      0.36
      MSAF      9.50        7.10  48.40  15.50  -      -
      IDC       10.70       -     50.60  14.70  41.40  -
      R2NMS     11.10       -     53.30  -      -      -
      CSANet    12.00       7.30  51.30  -      -      0.32
      APD       10.60       7.10  49.80  15.70  -      0.16
      Couple    12.30       -     49.81  38.31  40.39  -
      CSP       11.00       7.03  49.30  16.00  -      0.33
      Proposed  9.85        6.82  47.28  13.93  36.65  0.36
    • Table 6. Comparison of Caltech data set

      Method    Backbone   Reasonable  Heavy  All
      ATT-part  VGG-16     10.30       45.18  -
      RepLoss   ResNet-50  5.00        47.90  59.00
      OR-CNN    ResNet-50  4.10        45.00  -
      SSNet     ResNet-50  6.30        -      -
      PAMS_FCN  ResNet-50  4.50        47.40  53.70
      Bi-Box    VGG-16     7.61        44.40  -
      AR-Ped    ResNet-50  4.36        48.80  -
      CSP       ResNet-50  4.54        45.80  56.94
      Proposed  ResNet-50  4.32        44.04  56.34
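
As an illustration of how the subset criteria in Table 1 partition annotations, the following Python sketch assigns a pedestrian box to every subset whose bounds it satisfies. The `Box` type, the `SUBSETS` mapping, and `subsets_for` are hypothetical names that do not come from the paper. Heights are assumed to be in pixels, visibility is assumed to be a fraction in [0, 1], all bounds are treated as strict, and the Heavy range is assumed to be 0<v<65%.

```python
from dataclasses import dataclass
from math import inf


@dataclass
class Box:
    height: float      # box height, assumed to be in pixels
    visibility: float  # visible fraction of the pedestrian, in [0, 1]


# Assumed encoding of Table 1 as (min_h, max_h, min_v, max_v).
# All bounds are strict; inf means "no upper bound".
SUBSETS = {
    "Bare":       (50, inf, 0.90, inf),
    "Reasonable": (50, inf, 0.65, inf),
    "Heavy":      (50, inf, 0.00, 0.65),  # assumed range 0 < v < 65%
    "Small":      (50, 75,  0.65, inf),
    "All":        (20, inf, 0.20, inf),
}


def subsets_for(box: Box) -> list[str]:
    """Return the names of all subsets whose criteria the box meets."""
    return [
        name
        for name, (min_h, max_h, min_v, max_v) in SUBSETS.items()
        if min_h < box.height < max_h and min_v < box.visibility < max_v
    ]


if __name__ == "__main__":
    print(subsets_for(Box(height=60, visibility=0.95)))
    print(subsets_for(Box(height=100, visibility=0.40)))
```

The subsets overlap by design: a box can count toward several of them at once, which is why the function returns a list rather than a single label.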
    Citation

    Xudan Sun, Qing Wu, Chunyan Zhao, Mandun Zhang. Semantic enhanced guide feature reconstruction for occluded pedestrian detection[J]. Infrared and Laser Engineering, 2022, 51(9): 20210924

    Paper Information

    Category: Image processing

    Received: Nov. 30, 2021

    Published Online: Jan. 6, 2023

    DOI:10.3788/IRLA20210924