Laser & Optoelectronics Progress, Volume. 61, Issue 18, 1837010(2024)

Improved Two-Stage 3D Object Detection Algorithm for Roadside Scenes with Enhanced PointPillars and Transformer

Liangzi Wang1,2,3, Miaohua Huang1,2,3、*, Ruoying Liu1,2,3, Chengcheng Bi1,2,3, and Yongkang Hu1,2,3
Author Affiliations
  • 1Hubei Key Laboratory of Advanced Technology for Automotive Components, Wuhan University of Technology, Wuhan 430070, Hubei, China
  • 2Hubei Collaborative Innovation Center for Automotive Components Technology, Wuhan University of Technology, Wuhan 430070, Hubei, China
  • 3Hubei Research Center for New Energy & Intelligent Connected Vehicle, Wuhan University of Technology, Wuhan 430070, Hubei, China
  • show less
    Figures & Tables(14)
    Network architecture diagram
    SimAM attention mechanism. (a) SimAM structure; (b) comparison diagram
    Residual downsampling module. (a) Residual structure; (b) ResBlock structure; (c) BasicBlock structure; (d) BasicBlock_DS structure
    Encoder implementation process
    Detection results in DAIR-V2X-I roadside scenes
    Detection results in KITTI vehicle scenes
    Comparison of detection results for distant vehicle detection between proposed algorithm and PointPillars on the DAIR-V2X-I dataset. (a) PointPillars detection results; (b) BEV detection results of PointPillars; (c) enlarged detection results of PointPillars for distant vehicles; (d) detection results of proposed algorithm; (e) BEV detection results of proposed algorithm; (f) enlarged detection results of proposed algorithm for distant vehicles
    Comparison of detection results for pedestrians on both sides of the road between proposed algorithm and PointPillars on the DAIR-V2X-I dataset. (a) PointPillars detection results; (b) enlarged BEV detection results of PointPillars for roadside pedestrians; (c) detection results of proposed algorithm in this paper; (d) enlarged BEV detection results of proposed algorithm for roadside pedestrians
    Comparison of detection results for distant vehicles and pedestrians on both sides of the road between proposed algorithm and PointPillars on the KITTI dataset. (a) (d) Camera perspective; (b) (e) BEV detection results of PointPillars; (c) (f) BEV detection results of proposed algorithm
    • Table 1. Comparison results between proposed algorithm and single-stage algorithms on DAIR-V2X-I validation set

      View table

      Table 1. Comparison results between proposed algorithm and single-stage algorithms on DAIR-V2X-I validation set

      Algorithmcar /%pedestrian /%cyclist /%

      mAP /%

      (moderate)

      FPS /

      (frame/s)

      easymoderatehardeasymoderatehardeasymoderatehard
      IA-SSD2587.9078.0778.1356.5055.6255.6476.6757.4557.3163.7136.76
      SECOND87.7478.6278.6660.8755.7255.8980.5561.8361.8565.3925.77
      PointPillars87.5978.5178.5759.1857.9958.0480.3560.5960.5765.7030.67
      Proposed algorithm89.8780.4180.4570.5868.4968.6182.9262.7062.6270.5314.60
    • Table 2. Comparison results between proposed algorithm and a two-stage algorithm on DAIR-V2X-I validation set

      View table

      Table 2. Comparison results between proposed algorithm and a two-stage algorithm on DAIR-V2X-I validation set

      Algorithmcar /%pedestrian /%cyclist /%

      mAP /%

      (moderate)

      FPS /

      (frame/s)

      easymoderatehardeasymoderatehardeasymoderatehard
      Part-A289.5279.9179.9468.9464.6264.6483.2263.9964.1269.5114.49
      Proposed algorithm89.8780.4180.4570.5868.4968.6182.9262.7062.6270.5314.60
    • Table 3. Comparison results between proposed algorithm and single-stage algorithms on KITTI validation set

      View table

      Table 3. Comparison results between proposed algorithm and single-stage algorithms on KITTI validation set

      Algorithmcar /%pedestrian /%cyclist /%

      mAP /%

      (moderate)

      FPS /

      (frame/s)

      easymoderatehardeasymoderatehardeasymoderatehard
      IA-SSD88.8679.0977.8056.0952.4747.5987.3872.2667.2267.9453.35
      SECOND88.0377.7775.3253.6048.6944.1181.1768.3263.7564.9342.19
      PointPillars87.3377.0174.6154.3549.5545.0780.6563.7659.9563.4448.08
      Proposed algorithm89.6979.3578.0462.4454.2850.8686.5671.9366.7968.5328.74
    • Table 4. Comparison results between proposed algorithm and a two-stage algorithm on KITTI validation set

      View table

      Table 4. Comparison results between proposed algorithm and a two-stage algorithm on KITTI validation set

      Algorithmcar /%pedestrian /%cyclist /%

      mAP /%

      (moderate)

      FPS /

      (frame/s)

      easymoderatehardeasymoderatehardeasymoderatehard
      Part-A289.2778.9678.1761.5453.6049.1185.8672.4668.1068.3426.91
      Proposed algorithm89.6979.3578.0462.4454.2850.8686.5671.9366.7968.5328.74
    • Table 5. Impact of different modules on detection

      View table

      Table 5. Impact of different modules on detection

      AlgorithmSimAMResBlock

      Refinement

      network

      moderate
      car /%pedestrian /%cyclist /%mAP /%
      Baseline78.5157.9960.5965.70
      Experiment 178.7659.0161.9866.58
      Experiment 278.4159.7261.5066.54
      Experiment 378.6959.4662.1266.76
      Experiment 480.3567.1361.3569.61
      Experiment 580.3767.9261.8870.06
      Proposed algorithm80.4168.4962.7070.53
    Tools

    Get Citation

    Copy Citation Text

    Liangzi Wang, Miaohua Huang, Ruoying Liu, Chengcheng Bi, Yongkang Hu. Improved Two-Stage 3D Object Detection Algorithm for Roadside Scenes with Enhanced PointPillars and Transformer[J]. Laser & Optoelectronics Progress, 2024, 61(18): 1837010

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Digital Image Processing

    Received: Jan. 12, 2024

    Accepted: Feb. 5, 2024

    Published Online: Sep. 14, 2024

    The Author Email: Miaohua Huang (mh_huang@163.com)

    DOI:10.3788/LOP240516

    CSTR:32186.14.LOP240516

    Topics