Laser & Optoelectronics Progress, Volume. 61, Issue 24, 2437003(2024)

Semantic Segmentation Network Based on V-Shaped Pyramid Bilateral Feature Fusion

Zheng Wang* and Wenyuan Li
Author Affiliations
  • School of Microelectronics, Tianjin University, Tianjin 300072, China
  • show less
    Figures & Tables(9)
    VPBF-Net overall architecture
    Schematic diagrams of the structure of the VASPP module and the coordinate attention module. (a) VASPP module; (b) coordinate attention module
    Schematic diagrams of the structure of the BAFA module. (a) BAFA module; (b) CA module; (c) SA module
    Visualization results on the PASCAL VOC 2012 dataset
    Visualization results on the Cityscapes dataset
    • Table 1. Comparison of MIoU results of different networks on the PASCAL VOC 2012 dataset

      View table

      Table 1. Comparison of MIoU results of different networks on the PASCAL VOC 2012 dataset

      MethodBackboneParams /MMIoU /%
      PSPNet31ResNet-10151.8680.23
      DeepLabV3+17ResNet-10168.3778.85
      WASPnet32ResNet-10147.4880.22
      DECANet33ResNet-10181.08
      CFANet34ResNet-5081.34
      N-Deeplabv3+35Xception37.3881.97
      Method of reference [36EfficientNetV255.5181.19
      Method of reference [37ResNet-10160.4081.13
      DeepLabV3+17Xception54.7180.94
      VPBF-NetXception42.4183.25
    • Table 2. Comparison with the quantitative information of the DeepLabV3+

      View table

      Table 2. Comparison with the quantitative information of the DeepLabV3+

      MethodBackboneMloU /%MPA /%Params /106Time /msSpeed /(frame/s)
      DeepLabV3+Xception80.9487.2954.7145.5521.96
      VPBF-NetXception83.2589.5342.4143.3223.08
      DeepLabV3+MobileNetV272.3182.525.8226.4437.82
      VPBF-NetMobileNetV273.1483.954.0725.5839.09
    • Table 3. Evaluation results of ablation experiments at different improvement points

      View table

      Table 3. Evaluation results of ablation experiments at different improvement points

      BackboneASPPVASPPCABAFAMloU /%MPA /%
      80.9487.29
      81.9188.36
      81.2687.78
      82.0388.12
      82.5989.08
      83.2589.53
    • Table 4. Comparison of MIoU results for different networks on the Cityscapes dataset

      View table

      Table 4. Comparison of MIoU results for different networks on the Cityscapes dataset

      MethodBackboneParams /106MloU /%
      WASPnet32ResNet-10147.4873.58
      DECANet33ResNet-10176.01
      CFANet34ResNet-5076.27
      Method of reference [38ResNet-5043.1676.23
      Method of reference [39Swin Transformer123.7775.18
      N-Deeplabv3+35Xception37.3876.31
      DeepLabV3+17Xception54.7176.23
      VPBF-NetXception42.4177.21
    Tools

    Get Citation

    Copy Citation Text

    Zheng Wang, Wenyuan Li. Semantic Segmentation Network Based on V-Shaped Pyramid Bilateral Feature Fusion[J]. Laser & Optoelectronics Progress, 2024, 61(24): 2437003

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Digital Image Processing

    Received: Mar. 29, 2024

    Accepted: Apr. 29, 2024

    Published Online: Dec. 17, 2024

    The Author Email: Zheng Wang (wangxiaozheng@tju.edu.cn)

    DOI:10.3788/LOP240990

    CSTR:32186.14.LOP240990

    Topics