Chinese Journal of Liquid Crystals and Displays, Volume. 37, Issue 10, 1355(2022)

3D object detection in voxelized point cloud scene

Rui-long LI1,2, Chuan WU1,2、*, and Ming ZHU1,2
Author Affiliations
  • 1Changchun Institute of Optics,Fine Mechanics and Physics,Chinese Academy of Sciences,Changchun 130033,China
  • 2University of Chinese Academy of Sciences,Beijing 100049,China
  • show less
    Figures & Tables(8)
    Framework of Pillar RCNN
    Sample images of sparse 3D convolutional(a)and submanifold 3D sparse convolutional(b)
    2D diagram of multi-scale voxel RoI aggregation module
    Projection of 3D detection effects in images and point clouds
    • Table 1. Accuracy comparison of max pooling layer and average pooling layer in 3D object detection of car AP40

      View table
      View in Article

      Table 1. Accuracy comparison of max pooling layer and average pooling layer in 3D object detection of car AP40

      EasyModHard
      Max92.3484.6382.48
      Average92.1783.0182.27
    • Table 2. Comparison of car AP40 effects of different neighborhood thresholds on multi-scale ROI feature aggregation module in 3D object detection

      View table
      View in Article

      Table 2. Comparison of car AP40 effects of different neighborhood thresholds on multi-scale ROI feature aggregation module in 3D object detection

      [block3][block4]EasyModHard
      1-2][2492.4285.4383.04
      24][2492.3484.6382.48
      24][4892.3785.1582.98
    • Table 3. Performance comparison on the KITTI test set with AP calculated by 11 recall positions of 3D object

      View table
      View in Article

      Table 3. Performance comparison on the KITTI test set with AP calculated by 11 recall positions of 3D object

      ModalityFPSCarPedestrianCyclist
      EasyModHardEasyModHardEasyModHard
      MV3DLidar & Img.371.1956.655.3N/AN/AN/AN/AN/AN/A
      SECONDLidar30.487.4376.4869.1N/AN/AN/AN/AN/AN/A
      VoxelNetLidar4.4481.9765.4662.8557.8653.4248.8767.1747.6545.11
      Voxel R-CNNLidar25.289.4184.5278.93N/AN/AN/AN/AN/AN/A
      PointPillarsLidar62.585.2276.5269.7363.8758.0852.877.1659.2255.91
      Pillar RCNNLidar40.389.2683.6178.5865.0358.8054.4785.3572.5668.68
    • Table 4. Performance comparison on the KITTI test set with AP calculated by 11 recall positions of BEV object

      View table
      View in Article

      Table 4. Performance comparison on the KITTI test set with AP calculated by 11 recall positions of BEV object

      ModalityFPSCarPedestrianCyclist
      EasyModHardEasyModHardEasyModHard
      MV3DLidar & Img.386.1877.3276.33N/AN/AN/AN/AN/AN/A
      PIXORLidar & Img.10.686.7980.7576.6N/AN/AN/AN/AN/AN/A
      SECONDLidar30.489.9687.0779.66N/AN/AN/AN/AN/AN/A
      VoxelNetLidar4.4489.684.8178.5769.9561.0556.9874.4152.1850.49
      PointPillarsLidar62.590.0787.4484.7871.1865.9261.3682.262.8159.77
      Pillar RCNNLidar40.390.1588.1387.6566.1361.1658.0392.2173.8570.65
    Tools

    Get Citation

    Copy Citation Text

    Rui-long LI, Chuan WU, Ming ZHU. 3D object detection in voxelized point cloud scene[J]. Chinese Journal of Liquid Crystals and Displays, 2022, 37(10): 1355

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Research Articles

    Received: Mar. 10, 2022

    Accepted: --

    Published Online: Oct. 10, 2022

    The Author Email: Chuan WU (wuchuan0458@sina.com)

    DOI:10.37188/CJLCD.2022-0082

    Topics