Laser & Optoelectronics Progress, Volume. 59, Issue 22, 2210010(2022)

Three-Dimensional Object Detection in Substation Operation Scene Based on Attention Mechanism

Wei Gao1, Boyang He1, Ting Zhang2, Meiqing Guo2, Jun Liu2, Huimin Wang2, and Xingzhong Zhang2、*
Author Affiliations
  • 1Internet Department, State Grid Shanxi Electric Power Company, Taiyuan 030021, Shanxi , China
  • 2College of Software, Taiyuan University of Technology, Jinzhong 030600, Shanxi , China
  • show less
    Figures & Tables(13)
    PowerNet structure
    Local area. (a) Local area input; (b) local area representation
    Channel direction attention structure diagram
    Point direction attention structure diagram
    Serial attention structure diagram
    Dataset collector and sample illustration. (a) Dataset collector; (b) sample illustration
    Data annotation. (a) PCAT annotated point cloud; (b) LabelImg annotated images; (c) label format
    Loss curve and performance curve
    Test result. (a) RGB images; (b) point cloud diagrams
    • Table 1. Comparison of effects of different combinations of attention on network performance

      View table

      Table 1. Comparison of effects of different combinations of attention on network performance

      Channel-direction attentionPoint-direction attentionParallelSerial
      APmAPAPmAP
      PedestrianTransformerPedestrianTransformer
      Two-layer MLP7×7 filter0.5500.7760.6630.5720.7970.685
      Two-layer MLP5×5 filter0.5590.7810.6700.5760.8000.688
      Four-layer MLP7×7 filter0.5720.7940.6830.5910.8490.720
      Four-layer MLP5×5 filter0.5790.8020.6910.6020.8670.735
    • Table 2. Choice of attention structure

      View table

      Table 2. Choice of attention structure

      Channel-direction attention

      (four-layer MLP)

      Point-direction attention

      (5×5 filter)

      APmAP
      PedestrianTransformer
      --0.5450.7750.660
      -0.5720.7900.681
      -0.5600.7790.670
      0.6020.8670.735
    • Table 3. Choice of loss function

      View table

      Table 3. Choice of loss function

      Cross entropy lossFocal lossAPmAP
      PedestrianTransformer
      -0.5720.8680.720
      -0.6020.8670.735
    • Table 4. Performance comparison results of mainstream detection models

      View table

      Table 4. Performance comparison results of mainstream detection models

      MethodModelAPmAP
      PedestrianTransformer
      3D to 2DPIXOR90.5270.7550.641
      Complex-YOLO100.5330.7790.656
      VoxelizationVote3Deep150.5370.7330.635
      VoxelNet130.5310.8020.667
      Original point cloudPointNet170.5400.7620.651
      PointNet++180.5450.7750.660
      Proposed method0.6020.8670.735
    Tools

    Get Citation

    Copy Citation Text

    Wei Gao, Boyang He, Ting Zhang, Meiqing Guo, Jun Liu, Huimin Wang, Xingzhong Zhang. Three-Dimensional Object Detection in Substation Operation Scene Based on Attention Mechanism[J]. Laser & Optoelectronics Progress, 2022, 59(22): 2210010

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Aug. 23, 2021

    Accepted: Oct. 27, 2021

    Published Online: Sep. 23, 2022

    The Author Email: Xingzhong Zhang (zhangxzsubmit@126.com)

    DOI:10.3788/LOP202259.2210010

    Topics