Opto-Electronic Engineering, Volume. 48, Issue 5, 200418(2021)

Fusing point cloud with image for object detection using convolutional neural networks

Zhang Jiesong, Huang Yingping*, and Zhang Rui
Author Affiliations
  • [in Chinese]
  • show less

    Addressing on the issues like varying object scale, complicated illumination conditions, and lack of reliable distance information in driverless applications, this paper proposes a multi-modal fusion method for object detection by using convolutional neural networks. The depth map is generated by mapping LiDAR point cloud onto the image plane and taken as input data together with the RGB image. The input data is also processed by the sliding window to reduce information loss. Two feature extracting networks are used to extract features of the image and the depth map respectively. The generated feature maps are fused through a connection layer. The objects are detected by processing the fused feature map through position regression and object classification. Non-maximal suppression is used to optimize the detection results. The experimental results on the KITTI dataset show that the proposed method is robust in various illumination conditions and especially effective on detecting small objects. Compared with other methods, the proposed method exhibits integrated advantages in terms of detection accuracy and speed.

    Tools

    Get Citation

    Copy Citation Text

    Zhang Jiesong, Huang Yingping, Zhang Rui. Fusing point cloud with image for object detection using convolutional neural networks[J]. Opto-Electronic Engineering, 2021, 48(5): 200418

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Article

    Received: Nov. 10, 2020

    Accepted: --

    Published Online: Sep. 4, 2021

    The Author Email: Yingping Huang (huangyingping@usst.edu.cn。)

    DOI:10.12086/oee.2021.200418

    Topics