Laser & Optoelectronics Progress, Volume. 62, Issue 16, 1628001(2025)

Small Object Detection Based on Spatial-Frequency Separated Cross-Attention Swin Transformer

Keping Wang1,2, Bingqian Suo1,2、*, Gaopeng Zhang3, Yi Yang1,2, and Wei Qian1
Author Affiliations
  • 1School of Electrical Engineering and Automation, Henan Polytechnic University, Jiaozuo 454003, Henan , China
  • 2Henan International Joint Laboratory of Direct Drive and Control of Intelligent Equipment, Henan Polytechnic University, Jiaozuo 454003, Henan , China
  • 3Xi'an Institute of Optics and Precision Mechanics of CAS, Xi'an 710119, Shaanxi , China
  • show less
    Figures & Tables(15)
    Structure of SF-SCST
    Structure of stage 1 in SF-SCST network
    Structure of SFDF module
    Structure of CS-swin module
    Comparison of detection results for small targets in simple environment
    Comparison of detection results for small targets in complex environment
    Comparison of detection results for small targets in dark environment
    Heat map comparison of different algorithms
    Comparison of model accuracy and loss between SF-SCST and Swin Transformer algorithms. (a) Model accuracy; (b) classification loss; (c) regression loss
    Visualization of the module outputs for SF-SCST and Swin Transformer algorithms
    Accuracies of SF-SCST algorithm for small objects with different categories and scales on two datasets. (a) different categories; (b) different scales
    • Table 1. Comparison experimental results of different algorithms on two datasets

      View table

      Table 1. Comparison experimental results of different algorithms on two datasets

      AlgorithmUAV-DA datasetVisDrone datasetGFLOPsParams /106
      mAP /%mAP50 /%mAPS /%mAPM /%mAP /%mAP50 /%mAPS /%mAPM /%
      IterDet24.465.717.023.926.345.717.038.023.3641.36
      RetinaNet23.762.114.223.821.436.211.234.210.0736.13
      GCNet24.765.815.727.225.643.016.137.423.3751.13
      GFL26.068.717.928.924.642.615.535.710.0432.04
      TridentNet23.461.915.122.720.636.511.830.5738.1732.77
      Swin Transformer25.267.717.223.227.345.217.639.723.3944.79
      Swin+WDFC25.769.118.227.728.647.118.841.524.0145.19
      Swin+SFDF25.669.317.629.227.945.917.940.624.7348.64
      SF-SCST (proposed)25.569.318.130.228.146.018.340.626.6881.91
    • Table 2. Comparison experimental results between SF-SCST and Swin Transformer algorithms on the Wider Person dataset

      View table

      Table 2. Comparison experimental results between SF-SCST and Swin Transformer algorithms on the Wider Person dataset

      AlgorithmmAPmAP50mAPSmAPL
      Swin Transformer12.324.85.818.7
      SF-SCST (proposed)12.324.86.119.1
    • Table 3. Results of ablation experiments on different datasets

      View table

      Table 3. Results of ablation experiments on different datasets

      WDFCSFDFCS-swinUAV-DA datasetVisDrone dataset
      mAPmAP50mAPSmAPMmAPmAP50mAPSmAPM
      25.267.717.223.227.345.217.639.7
      25.769.118.227.728.647.118.841.5
      25.669.317.629.227.945.917.940.6
      26.469.817.630.628.546.718.841.1
      26.169.518.730.128.346.618.940.5
      25.769.918.629.727.946.218.440.1
      25.669.418.029.127.245.017.539.5
      25.569.318.130.228.146.018.340.6
    • Table 4. Comparison experimental results of WDFC module implementation

      View table

      Table 4. Comparison experimental results of WDFC module implementation

      AlgorithmUAV-DA datasetVisDrone dataset
      mAPmAP50mAPSmAPMmAPmAP50mAPSmAPM
      Swin Transformer25.267.717.223.227.345.217.639.7
      + WDFC(2×2)25.769.118.227.728.647.118.841.5
      + WDFC(1×4)26.069.918.027.628.447.018.541.2
    Tools

    Get Citation

    Copy Citation Text

    Keping Wang, Bingqian Suo, Gaopeng Zhang, Yi Yang, Wei Qian. Small Object Detection Based on Spatial-Frequency Separated Cross-Attention Swin Transformer[J]. Laser & Optoelectronics Progress, 2025, 62(16): 1628001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Remote Sensing and Sensors

    Received: Dec. 31, 2024

    Accepted: Mar. 5, 2025

    Published Online: Jul. 24, 2025

    The Author Email: Bingqian Suo (suobingqian@home.hpu.edu.cn)

    DOI:10.3788/LOP242531

    CSTR:32186.14.LOP242531

    Topics