Acta Optica Sinica, Volume. 45, Issue 12, 1210001(2025)

Research on Thin Cloud Removal Based on Generative Adversarial Network with CBAM and Multi-Scale Attention

Yang Wang1, Guokun Chen1,2,3、*, Xingwu Duan4,5, Qingke Wen6,7, Jiatian Li1, and Zhen Zhang1,2,3
Author Affiliations
  • 1Faculty of Land Resources Engineering, Kunming University of Science and Technology, Kunming 650093, Yunnan , China
  • 2Yunnan Key Laboratory of Quantitative Remote Sensing, Kunming 650093, Yunnan , China
  • 3Yunnan International Joint Laboratory for Integrated Sky-Ground Intelligent Monitoring of Mountain Hazards, Kunming 650093, Yunnan , China
  • 4Institute of International Rivers and Eco-Security, Yunnan University, Kunming 650500, Yunnan , China
  • 5Yunnan Key Laboratory of Soil Erosion Prevention and Green Development, Yunnan University, Kunming 650500, Yunnan , China
  • 6Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
  • 7National Engineering Research Center for Geomatics, Beijing 100101, China
  • show less

    Objective

    Thin cloud contamination in remote sensing images presents a significant challenge affecting data quality, resulting in imprecise analysis and interpretation across applications including land cover classification, environmental monitoring, and disaster assessment. Conventional thin cloud removal methods typically depend on feature extraction at a single scale and inadequately capture the multi-scale characteristics of clouds, leading to suboptimal declouding results. Furthermore, deep learning-based approaches, particularly those utilizing generative adversarial network (GAN), frequently encounter detail loss and texture blur in generated images and demonstrate limited capability in modeling local features accurately. To address these challenges, this study introduces a novel GAN-based method incorporating a convolutional block attention module (CBAM) and a multi-scale attention mechanism. The proposed approach aims to enhance the accuracy of thin cloud removal while maintaining the spectral and spatial details of the original imagery, thus improving the overall quality of remote sensing data.

    Methods

    The proposed framework integrates the GAN architecture with CBAM and multi-scale attention mechanism for effective thin cloud removal. The generator network is engineered to capture global and local features of the input image, enabling the model to restore detailed surface information while removing thin clouds effectively. The discriminator network assesses the authenticity of the generated image, ensuring high similarity to the real cloud-free image. The multi-scale attention mechanism serves a crucial function by implementing parallel convolution branches with independent parameter optimization strategies. This approach enables differentiated feature expression, enhancing the model’s capacity to process cloud contamination and underlying surface features at various scales. Furthermore, CBAM is integrated for enhanced feature extraction at different scales. CBAM applies sequential channel and spatial attention to feature maps, adaptively emphasizing important features while suppressing irrelevant noise. This integration of multi-scale attention and CBAM substantially improves the model’s capability to restore image brightness and recover fine details. Comprehensive experiments were conducted on the RICE1 dataset and a custom remote sensing cloud removal dataset based on Sentinel-2 imagery. The model’s performance is evaluated using quantitative metrics including peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM). The proposed method is compared against several state-of-the-art thin cloud removal techniques, including Haze Removal, FFA-Net, C2PNet, CGAN, and SpA-GAN, to demonstrate its effectiveness.

    Results and Discussions

    Experimental results show that the proposed method surpasses traditional thin cloud removal techniques in both visual quality and quantitative metrics. The integration of CBAM with the multi-scale attention mechanism substantially enhances the model’s ability to recover detailed surface information while effectively removing thin clouds (Figs. 8?11). Comparative analysis reveals that the proposed method achieves a PSNR of 31.321 dB and an SSIM of 0.894, exceeding the performance of state-of-the-art methods (Tables 1 and 2). The generated images are further analyzed based on the average brightness of the RGB channels (Figs. 11 and 12). The results indicate that the cloud-free images generated by the proposed method most closely match the real images in terms of RGB channel brightness, validating the method’s effectiveness in preserving spectral details. An ablation study examines the synergistic contribution of the two attention mechanisms (Table 4 and Fig. 14). The results confirm that their combination significantly enhances model performance, demonstrating their complementary role in improving image quality. Specifically, the multi-scale attention mechanism facilitates feature capture at different scales, while CBAM enhances feature extraction accuracy through channel and spatial dimension focus.

    Conclusions

    This study presents a novel method for thin cloud removal from remote sensing images based on GAN enhanced with CBAM and multi-scale attention mechanism. The proposed approach enhances cloud removal accuracy while preserving the spectral and spatial details of the original image. Experimental results validate the effectiveness and robustness of the proposed method, demonstrating its superior performance compared to state-of-the-art techniques in terms of visual quality and quantitative metrics. The integration of CBAM and multi-scale attention mechanism proves instrumental in achieving these results, underscoring their significance in enhancing model performance. The proposed method offers a promising solution for improving remote sensing data quality. Future research will concentrate on optimizing the model architecture and expanding its applicability to additional types of cloud pollution and remote sensing datasets.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Yang Wang, Guokun Chen, Xingwu Duan, Qingke Wen, Jiatian Li, Zhen Zhang. Research on Thin Cloud Removal Based on Generative Adversarial Network with CBAM and Multi-Scale Attention[J]. Acta Optica Sinica, 2025, 45(12): 1210001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Feb. 5, 2025

    Accepted: Apr. 24, 2025

    Published Online: Jun. 23, 2025

    The Author Email: Guokun Chen (chengk@radi.ac.cn)

    DOI:10.3788/AOS250568

    CSTR:32393.14.AOS250568

    Topics