Laser & Optoelectronics Progress, Volume. 61, Issue 24, 2437003(2024)

Semantic Segmentation Network Based on V-Shaped Pyramid Bilateral Feature Fusion

Zheng Wang* and Wenyuan Li
Author Affiliations
  • School of Microelectronics, Tianjin University, Tianjin 300072, China
  • show less

    Herein, a V-shaped pyramid bilateral feature fusion network (VPBF-Net) is proposed to address small-scale target missing segmentation, inaccurate edge segmentation, and inefficient fusion of deep and shallow feature information in current semantic segmentation networks. In the encoding stage, a V-shaped atrous spatial pyramid pooling (VASPP) module adopts multiple-parallel-branch interactive connection structures to enhance the information exchange between the local semantic information of each branch. In addition, multibranch feature hierarchical fusion is adopted to reduce grid artifact effects. Furthermore, a coordinate attention module is used to assign weights to the extracted deep semantic information, enhancing the network's attention to the segmentation target. In the decoding stage, a bilateral attention feature aggregation module is designed to guide shallow feature fusion through multiscale deep semantic information, thereby capturing different-scaled shallow feature representations and achieving more efficient deep and shallow feature fusion. Experiments are conducted on the PASCAL VOC 2012 dataset and Cityscapes dataset, the proposed method achieves average intersection to union ratios of 83.25% and 77.21%, respectively, indicating advanced results. Compared with other methods, the proposed method can more accurately perform small-scale object segmentation, alleviating missed segmentation and misclassification.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Zheng Wang, Wenyuan Li. Semantic Segmentation Network Based on V-Shaped Pyramid Bilateral Feature Fusion[J]. Laser & Optoelectronics Progress, 2024, 61(24): 2437003

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Digital Image Processing

    Received: Mar. 29, 2024

    Accepted: Apr. 29, 2024

    Published Online: Dec. 17, 2024

    The Author Email: Zheng Wang (wangxiaozheng@tju.edu.cn)

    DOI:10.3788/LOP240990

    CSTR:32186.14.LOP240990

    Topics