Laser & Optoelectronics Progress, Volume. 60, Issue 20, 2015002(2023)

Semantic Segmentation Method of Point Cloud Based on Sparse Convolution and Attention Mechanism

Meng Zuo1,2,3,4, Yiyang Liu1,2,3、*, Hao Cui1,2,3, and Hongfei Bai2
Author Affiliations
  • 1Key Laboratory Networked Control Systems, Chinese Academy of Sciences, Shenyang 110016, Liaoning , China
  • 2Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, Liaoning , China
  • 3Institutes for Robotics & Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, Liaoning , China
  • 4University of Chinese Academy of Sciences, Beijing 100049, China
  • show less
    Figures & Tables(15)
    Point cloud semantic segmentation network model
    Feature extraction network based on sparse convolution and improved attention mechanism
    Residual block based on sparse convolution
    Comparison between ordinary convolution and sparse convolution. (a) Ordinary convolution; (b) sparse convolution
    Non Local Block structure
    Spatial pyramid sampling
    Non Local Block combined with spatial pyramid sampling
    Scannet V2 dataset segmentation visualization. (a) True value label; (b) PointNet++; (c) FPConv; (d) SSCN; (e) Minkowski; (f) proposed network
    S3DIS AREA 5 segmentation visualization. (a) True value label; (b) PointNet; (c) KPConv; (d) Minkowski; (e) proposed network
    • Table 1. Comparison of different voxel resolution parameters

      View table

      Table 1. Comparison of different voxel resolution parameters

      Voxel resolution /cmMIOU /%Number of totle voxelsNumber of active voxels
      168.2141.440×1010112541
      270.8211.800×10995145
      370.1545.333×10887165
      468.6572.250×10881104
      567.2411.152×10879514
      666.1426.667×10772015
    • Table 2. Comparison of sampling parameters in different spatial pyramids

      View table

      Table 2. Comparison of sampling parameters in different spatial pyramids

      Sampling methodSample sizeSize of SMIOU /%
      Pyramid random1,4,9,365070.461
      Pyramid max1,4,9,365071.324
      Pyramid average1,4,9,365071.640
      Pyramid average1,9,36,6411071.825
      Pyramid average1,16,64,14422571.833
    • Table 3. Comparison of experimental results of Scannet V2 test set

      View table

      Table 3. Comparison of experimental results of Scannet V2 test set

      ClassPointNet++FPConvSSCNMinkowskiProposed algorithm
      MIOU33.963.970.870.671.8
      Wall52.379.983.684.583.8
      Floor67.794.895.195.994.9
      Cabinet25.660.365.363.968.4
      Bed47.876.080.780.880.4
      Chair36.079.890.490.191.2
      Sofa34.669.682.081.580.2
      Table23.261.472.270.973.5
      Door26.152.464.359.867.2
      Window25.256.760.560.664.1
      Bookshelf45.871.378.075.476.0
      Picture11.725.031.331.535.1
      Counter25.039.262.566.061.2
      Desk27.86.358.760.563.9
      Curtain24.753.475.871.376.2
      Refrigerator21.253.849.455.656.2
      Shower curtain58.472.370.866.572.2
      toilet14.587.293.090.384.2
      Sink54.859.863.965.262.5
      Bathtub36.478.587.493.588.1
      other18.345.751.456.657.2
    • Table 4. Comparison of experimental results of the S3DIS AREA 5

      View table

      Table 4. Comparison of experimental results of the S3DIS AREA 5

      ClassPointNetKPConvMinkowskiProposed network
      MIOU41.167.165.470.5
      Calling88.892.891.892.5
      Floor97.397.398.798.4
      Wall69.882.486.289.4
      Beam0.10.00.00.0
      Column3.923.934.154.2
      Window46.358.048.961.2
      Door10.869.062.465.1
      Table59.081.581.682.1
      Chair52.691.089.892.0
      Sofa5.975.447.278.2
      Bookcase40.375.374.974.2
      Board26.466.774.475.2
      Clutter33.258.958.654.4
    • Table 5. Comparison of segmentation accuracy of Non Local Block inserted into different layers after spatial pyramid sampling

      View table

      Table 5. Comparison of segmentation accuracy of Non Local Block inserted into different layers after spatial pyramid sampling

      LayerSSCNSSCN+NonLocal BlockSSCN+SPSNB
      170.82171.03471.034
      270.82171.34271.214
      370.82171.421
      470.82171.825
      570.82171.641
      670.82171.322
    • Table 6. Comparison of time for Non Local Block insertion into different layers of forward reasoning after a spatial pyramid sampling

      View table

      Table 6. Comparison of time for Non Local Block insertion into different layers of forward reasoning after a spatial pyramid sampling

      LayerSSCNSSCN+Non Local BlockSSCN+SPSNB
      173.1073.3273.32
      273.1077.3473.85
      373.1074.42
      473.1075.35
      573.1076.52
      673.1077.82
    Tools

    Get Citation

    Copy Citation Text

    Meng Zuo, Yiyang Liu, Hao Cui, Hongfei Bai. Semantic Segmentation Method of Point Cloud Based on Sparse Convolution and Attention Mechanism[J]. Laser & Optoelectronics Progress, 2023, 60(20): 2015002

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Oct. 18, 2022

    Accepted: Dec. 12, 2022

    Published Online: Oct. 13, 2023

    The Author Email: Liu Yiyang (sialiuyiyang@sia.cn)

    DOI:10.3788/LOP222819

    Topics