Laser & Optoelectronics Progress, Volume. 61, Issue 12, 1228007(2024)

Fast Two-Stage 3D Object Detection with Semantic Guidance

Mang Huang1,2,3,4, Bin Hui1,2、*, Zhaoji Liu1,2, and Tianming Jin1,2
Author Affiliations
  • 1Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang 110016, Liaoning, China
  • 2Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, Liaoning, China
  • 3Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, Liaoning, China
  • 4University of Chinese Academy of Sciences, Beijing 100049, China
  • show less

    With the continuous increase in the sampling rate of LiDAR, systems can rapidly acquire high-resolution point cloud data of scenes. Dense point clouds are advantageous for improving the accuracy of 3D object detection; however, they increase the computational load. In addition, point-based 3D object detection methods encounter the challenges of balancing speed and accuracy. To enhance the computational efficiency of multilevel downsampling in 3D object detection and address issues such as low foreground point recall rate and size ambiguity of the one-stage network, a fast two-stage method based on semantic guidance is proposed herein. In the first stage, a semantic-guided downsampling method is introduced to enable deep neural networks to efficiently perceive foreground points. In the second stage, a channel-aware pooling method is employed to aggregate semantic information of the sampled points by adding pooled points, thereby enrich the feature description of regions of interest, and obtain more accurate proposal boxes. Test results on the KITTI dataset reveal that compared with similar two-stage baseline methods, the proposed method achieves the highest detection-accuracy improvements of 4.62 percentage points, 1.44 percentage points, and 3.91 percentage points for cars, pedestrians, and cyclists, respectively. Furthermore, the inference speed reaches 55.6 frame/s, surpassing the fastest benchmark by 31.1%. The algorithm exhibits strong performance in accuracy and speed, holding practical value for real-world applications.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Mang Huang, Bin Hui, Zhaoji Liu, Tianming Jin. Fast Two-Stage 3D Object Detection with Semantic Guidance[J]. Laser & Optoelectronics Progress, 2024, 61(12): 1228007

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Remote Sensing and Sensors

    Received: Jul. 19, 2023

    Accepted: Sep. 6, 2023

    Published Online: Jun. 5, 2024

    The Author Email: Bin Hui (huibin@sia.cn)

    DOI:10.3788/LOP231763

    CSTR:32186.14.LOP231763

    Topics