Laser & Optoelectronics Progress, Volume. 62, Issue 14, 1415001(2025)

Point Cloud Semantic Segmentation Method Based on Improved Point Transformer v2

Qi Chen*, Kangnian Wang, Zhiqiang Jiao, and Zhanhua Huang
Author Affiliations
  • Key Laboratory of Opto-Electronics Information Technology, Ministry of Education, School of Precision Instrument and Opto-Electronics Engineering, Tianjin University, Tianjin 300072, China
  • show less
    Figures & Tables(9)
    Improved Point Transformer v2 model structure
    Detailed structure design of each module. (a) Input embedding module; (b) encoder; (c) decoder; (d) semantic segmentation head
    Construction of full-scale aggregation feature sequence FDE, 2 of the second decoder
    Channel attention mechanism
    Vector attention mechanisms. (a) Vector attention mechanism encoded by linear layer; (b) grouped vector attention mechanism encoded by grouped linear layer; (c) grouped vector attention mechanism encoded by linear layer
    Visualization of semantic segmentation results of Point Transformer v2 and proposed model on S3DIS
    • Table 1. Results of semantic segmentation on S3DIS

      View table

      Table 1. Results of semantic segmentation on S3DIS

      Method

      OA/

      %

      mAcc/

      %

      mIoU/

      %

      mIoU /% for each category
      CeilingFloorWallBeamColumnWindowDoorTableChairSofaBookcaseBoardClutter
      PointNet12-49.041.188.897.369.80.13.946.310.859.052.65.940.326.433.2
      PointCNN1685.963.957.392.398.279.40.017.622.862.174.480.631.766.762.156.7
      SPGraph2086.466.558.089.469.978.10.042.848.961.684.775.469.852.62.156.2
      PAT23-70.860.193.098.572.31.041.558.138.257.783.648.167.061.333.6
      PCT39-67.761.392.598.480.60.019.461.648.076.685.246.267.767.952.3
      PTv12589.474.368.192.498.382.50.034.951.469.479.991.276.476.172.559.8
      Stratified Transformer4090.076.370.191.997.284.50.130.558.073.282.392.679.276.084.560.5
      Fast Point Transformer41-74.868.890.096.486.20.251.058.369.081.288.662.274.478.858.5
      Point Transformer v22690.175.769.692.698.384.00.033.056.677.981.192.774.475.776.861.4
      Proposed90.577.371.591.498.285.30.037.858.880.882.495.381.677.181.359.7
    • Table 2. Network learnable parameters and forward time on S3DIS of Point Transformer v2 and the proposed model

      View table

      Table 2. Network learnable parameters and forward time on S3DIS of Point Transformer v2 and the proposed model

      MethodParametersForward time /s
      Point Transformer v244427551.6335
      Proposed38134451.6329
    • Table 3. Results of module design ablation experiments

      View table

      Table 3. Results of module design ablation experiments

      ModelModuleParametersmIoU /%
      Grouped linear-GVA444275569.6
      Linear-GVA390846169.7
      Linear-GVA+FSSC374259771.1
      Linear-GVA+FSSC+CA381344571.5
    Tools

    Get Citation

    Copy Citation Text

    Qi Chen, Kangnian Wang, Zhiqiang Jiao, Zhanhua Huang. Point Cloud Semantic Segmentation Method Based on Improved Point Transformer v2[J]. Laser & Optoelectronics Progress, 2025, 62(14): 1415001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Dec. 3, 2024

    Accepted: Jan. 13, 2025

    Published Online: Jul. 2, 2025

    The Author Email: Qi Chen (chenqi_1032@163.com)

    DOI:10.3788/LOP242365

    CSTR:32186.14.LOP242365

    Topics