Laser & Optoelectronics Progress, Vol. 61, Issue 18, 1837015 (2024)

Semantic Segmentation of Dual-Source Remote Sensing Images Based on Gated Attention and Multiscale Residual Fusion

Wen Guo1, Hong Yang1, and Chang Liu2,*
Author Affiliations
  • 1School of Science, Beijing Information Science and Technology University, Beijing 100029, China
  • 2Institute of Applied Mathematics, Beijing Information Science and Technology University, Beijing 100101, China

    Semantic segmentation of remote sensing images is a crucial step in geographic object-based analysis of remote sensing images. Combining remote sensing image data with elevation data enhances feature complementarity and thereby improves pixel-level segmentation accuracy. This study proposes a dual-source remote sensing image semantic segmentation model, STAM-SegNet, that uses a Swin Transformer backbone network to extract multiscale features. The model integrates an adaptive gated attention mechanism and a multiscale residual fusion strategy. The adaptive gated attention mechanism comprises gated channel attention and gated spatial attention. Gated channel attention strengthens the correlation between the features of the two data sources through competition/cooperation mechanisms, effectively extracting their complementary features. Gated spatial attention, in turn, uses spatial contextual information to dynamically filter high-level semantic features and select accurate detail features. The multiscale residual fusion strategy captures multiscale contextual information through multiscale refinement and a residual structure, thereby emphasizing detail features such as shadows and boundaries and accelerating model training. Experiments on the Vaihingen and Potsdam datasets show that the proposed model achieves average F1-scores of 89.66% and 92.75%, respectively, surpassing networks such as DeepLabV3+, UperNet, DANet, TransUNet, and Swin-UNet in segmentation accuracy.
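The abstract does not give the equations behind the gated channel attention, so the following is only a minimal sketch of the general idea, not the authors' STAM-SegNet implementation. It assumes two feature maps of shape (C, H, W), one from the image branch and one from the elevation branch, and a single hypothetical linear layer (`weight`, `bias`) that turns pooled channel descriptors into a per-channel gate. The function name `gated_channel_fusion`, the convex-combination form of the gate, and the residual connection to the image features are all illustrative assumptions.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, maps real values into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def gated_channel_fusion(f_img, f_dsm, weight, bias):
    """Fuse two (C, H, W) feature maps with a per-channel gate.

    Hypothetical illustration only: the gate form and the residual
    connection are assumptions, not taken from the paper.
    """
    # Global average pooling: one descriptor per channel and per source.
    desc = np.concatenate([f_img.mean(axis=(1, 2)), f_dsm.mean(axis=(1, 2))])  # (2C,)
    # Assumed linear layer + sigmoid yields one gate value per channel.
    gate = sigmoid(weight @ desc + bias)  # (C,)
    g = gate[:, None, None]  # broadcast over H and W
    # The gate decides, per channel, how much each source contributes.
    fused = g * f_img + (1.0 - g) * f_dsm
    # Assumed residual connection that keeps the image features intact.
    return fused + f_img, gate


# Toy usage with random features (all sizes are arbitrary).
rng = np.random.default_rng(0)
C, H, W = 4, 8, 8
f_img = rng.standard_normal((C, H, W))
f_dsm = rng.standard_normal((C, H, W))
weight = 0.1 * rng.standard_normal((C, 2 * C))
bias = np.zeros(C)
out, gate = gated_channel_fusion(f_img, f_dsm, weight, bias)
```

With an all-zero `weight` and `bias`, every gate value is 0.5 and the two sources are averaged before the residual is added, which gives a simple sanity check for the sketch.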

    Wen Guo, Hong Yang, Chang Liu. Semantic Segmentation of Dual-Source Remote Sensing Images Based on Gated Attention and Multiscale Residual Fusion[J]. Laser & Optoelectronics Progress, 2024, 61(18): 1837015

    Paper Information

    Category: Digital Image Processing

    Received: Jan. 15, 2024

    Accepted: Feb. 26, 2024

    Published Online: Sep. 14, 2024

    The Author Email: Chang Liu (liuchang@bistu.edu.cn)

    DOI:10.3788/LOP240534

    CSTR:32186.14.LOP240534
