Laser & Optoelectronics Progress, Volume. 62, Issue 16, 1628001(2025)

Small Object Detection Based on Spatial-Frequency Separated Cross-Attention Swin Transformer

Keping Wang1,2, Bingqian Suo1,2、*, Gaopeng Zhang3, Yi Yang1,2, and Wei Qian1
Author Affiliations
  • 1School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo 454003, Henan , China
  • 2Henan International Joint Laboratory of Direct Drive and Control of Intelligent Equipment, Henan Polytechnic University, Jiaozuo 454003, Henan , China
  • 3Xi'an Institute of Optics and Precision Mechanics of CAS, Xi'an 710119, Shaanxi , China
  • show less

    To address the challenges of small object detection that information loss during the down-sampling and neglect of target details by deep features, this research proposes a small object detection algorithm based on the spatial-frequency separated cross-attention Swin Transformer (SF-SCST). The SF-SCST algorithm distinguishes objects from the background in the frequency domain through the wavelet decomposition and feature concatenation (WDFC) module and a feature channel spatial-frequency decomposition and fusion (SFDF) module. Then, these features are fused with spatial domain information to enhance the target contour, effectively preserving small object features during down-sampling. Additionally, the cross-self attention Swin Transformer (CS-swin) module performs dual-attention calculation on deep and shallow features to supplement the small object information lost in the deep features and to capture the contextual information of targets. Experimental results show that the SF-SCST algorithm achieves the mean average precision with intersection over union of 0.5 (mAP50) of 69.3% and 46.0% on the UAV-DA and VisDrone datasets, respectively. The performance of proposed algorithm is superior compared with the other six algorithms, significantly improving the detection accuracy of small objects.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Keping Wang, Bingqian Suo, Gaopeng Zhang, Yi Yang, Wei Qian. Small Object Detection Based on Spatial-Frequency Separated Cross-Attention Swin Transformer[J]. Laser & Optoelectronics Progress, 2025, 62(16): 1628001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Remote Sensing and Sensors

    Received: Dec. 31, 2024

    Accepted: Mar. 5, 2025

    Published Online: Jul. 24, 2025

    The Author Email: Bingqian Suo (suobingqian@home.hpu.edu.cn)

    DOI:10.3788/LOP242531

    CSTR:32186.14.LOP242531

    Topics