Chinese Journal of Lasers, Volume. 51, Issue 17, 1710002(2024)

Semantic Segmentation of Large‑Scale Laser Point Cloud in Mines Based on Local Feature Enhancement

Hongxiang Dong1, Yi An1,2、*, Lirong Xie1, Zhiyong Yang3, and Kai Zhang1
Author Affiliations
  • 1School of Electrical Engineering, Xinjiang University, Urumqi 830017, Xinjiang , China
  • 2School of Control Science and Engineering, Dalian University of Technology, Dalian 116024, Liaoning , China
  • 3Xinjiang Tianchi Energy Co., Ltd., Fukang831500, Xinjiang , China
  • show less

    Objective

    To improve the efficiency of coal mining and ensure production safety, the construction of intelligent coal mines is being vigorously promoted. With the rapid development of three-dimensional (3D) ranging technology, lidar can acquire high-density 3D laser point clouds in a short time. Semantic segmentation of 3D laser point cloud data can provide accurate environment perception for unmanned mining trucks and realize autonomous operation of those trucks. Point cloud data are characterized by disorder, unstructuredness, and sparsity, which brings difficulties to point cloud data processing and data analysis. To fully learn the local geometric features and the contextual information of mining point clouds and improve the accuracy of semantic segmentation, we propose a new local feature enhancement-based semantic segmentation network called LFE-Net for large-scale laser point clouds in mines.

    Methods

    The LFE-Net input is N×6 mining point cloud. Each point goes through a shared multilayer perceptron (MLP) to obtain eight dimensions as inputs to the encoder-decoder. The encoder-decoder consists of five encoding layers and five decoding layers. Each encoding layer consists of a dilated residual block and a random downsampling operation. The dilated residual block consists of two local feature extraction modules and two local feature aggregation modules, and it is used to enlarge the receptive field of each point. The local feature extraction module learns the spatial position information and surface orientation change information of a given point and its neighborhood points, and utilizes the spatial distance weights to enhance the semantic features of neighborhood points. The local feature aggregation module utilizes mixed pooling to aggregate the features of neighborhood points. Subsequently, we downsample the point cloud and reserve 1/4 of the point cloud for each downsampling. Each decoding layer contains an upsampling operation and an MLP. The decoding layers use upsampling operation to continuously restore the spatial resolution of point cloud features and connect with the intermediate features generated by the encoding layers through skip connections. Finally, the semantic segmentation results are output through three fully connected layers and a dropout layer. Moreover, to alleviate the problem of sample imbalance, a weighted cross-entropy loss function is adopted to make the network pay more attention to small sample classes to improve the accuracy of semantic segmentation.

    Results and Discussions

    To conduct experiments on semantic segmentation of the mining point clouds, we utilize a mining truck equipped with lidar sensors to collect laser point cloud data from the open pit mine. We add real semantic labels to the collected point cloud data and create a 3D laser point cloud semantic segmentation dataset of the mine. To fully evaluate the effectiveness of the LFE-Net, we compare its experimental results with other large-scale point cloud semantic segmentation methods by using the mining dataset. The overall accuracy (OA) and the mean intersection over union (mIoU) of the LFE-Net are 97.92% and 87.578%, respectively, which are higher than those of other methods. Additionally, we conduct ablation experiments on the local feature extraction module and the local feature aggregation module to verify the effectiveness of each module.

    Conclusion

    To fully learn the local geometric features and the contextual information of mining point clouds and improve the accuracy of semantic segmentation, we propose a new local feature enhancement-based semantic segmentation network LFE-Net for large-scale laser point clouds in mines. The main technical contributions of this paper are given as follows. 1) To solve the problem of difficulty in extracting the features of mining trucks, we propose a spatial position encoding unit in this paper. This unit encodes spatial position information of the point clouds and utilizes spatial distance weights to enhance the semantic features of neighborhood points to improve the segmentation accuracy of the algorithm. 2) To solve the problem of difficulty in extracting ground edge features, we propose a spatial normal vector encoding unit. By adding normal vector information, the network can enhance the perception ability of geometric structure features and improve the accuracy of ground edge segmentation. 3) To solve the problem of feature loss caused by max pooling, we propose a mixed pooling. The mixed pooling consists of max pooling and attention pooling, which enriches local features. The LFE-Net is tested on the mining dataset and excellent performance is achieved, with an OA of 97.92% and an mIoU of 87.578%. These experimental results validate the effectiveness of LFE-Net. The network proposed in this paper provides a theoretical basis for the application of unmanned mining trucks in open pit mines, which has significant implications for the unmanned operation of mining trucks.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Hongxiang Dong, Yi An, Lirong Xie, Zhiyong Yang, Kai Zhang. Semantic Segmentation of Large‑Scale Laser Point Cloud in Mines Based on Local Feature Enhancement[J]. Chinese Journal of Lasers, 2024, 51(17): 1710002

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: remote sensing and sensor

    Received: Nov. 22, 2023

    Accepted: Feb. 19, 2024

    Published Online: Aug. 29, 2024

    The Author Email: An Yi (anyi@dlut.edu.cn)

    DOI:10.3788/CJL231425

    CSTR:32183.14.CJL231425

    Topics