Opto-Electronic Engineering, Volume. 51, Issue 9, 240149-1(2024)

Adaptive foreground focusing for target detection in UAV aerial images

Zhenjiu Xiao, Zhengwei Wu, Jiehao Zhang, and Haicheng Qu
Author Affiliations
  • School of Software, Liaoning University of Engineering and Technology, Huludao, Liaoning 125105, China
  • show less
    Figures & Tables(13)
    Overall model structure
    Panoramic feature classification layer (PFRC) structure
    Adaptive two-dimensional feature sampling (ATFS) structure
    Structure of multi-path full-text information integration (MPFT)
    Broadcast self-attention (BSA) mechanism structure
    SC_FCI structure
    Adaptive foreground focus detect head (AFF_Detect) structure
    Comparison of evaluation indicators between the YOLOv8s and improved mode
    Comparison test visualisation results. (a) RetinaNet; (b) YOLOv5s;(c) Faster-RCNN; (d) TPH-YOLOv5; (e) YOLOv7-tiny; (f) YOLOv8s;(g) YOLOv10s; (h) Improved YOLOv5; (i) Deformmable-DETR; (j) Ours
    • Table 1. Ablation experiments of the proposed algorithm in the VisDrone2019 dataset

      View table
      View in Article

      Table 1. Ablation experiments of the proposed algorithm in the VisDrone2019 dataset

      NumberYOLOv8sPFRCATFSMPFTAFFDHPrecision/%Recall/%mAP@0.5/%GFLOPs
      1××××49.737.538.528.8
      2×××52.741.740.934.3
      3×××53.442.841.827.1
      4×××54.541.542.725.5
      5×××52.441.439.826.8
      6××54.043.443.232.0
      7×55.844.244.329.2
      858.144.645.128.9
    • Table 2. Comparison results of the accuracy of ablation experiments by category/%

      View table
      View in Article

      Table 2. Comparison results of the accuracy of ablation experiments by category/%

      NumberPedestrianPeopleBicycleCarVanTruckTricycleAwning-tricycleBusMotormAP@0.5
      137.227.614.777.442.939.023.821.556.139.438.5
      238.222.318.981.044.141.218.725.558.135.640.9
      339.031.018.480.944.942.924.619.960.339.741.8
      441.833.217.683.845.440.625.826.161.545.642.7
      533.322.116.373.439.441.021.122.957.440.939.8
      644.834.718.984.746.650.726.825.458.349.143.2
      752.842.020.183.346.743.828.026.760.852.744.3
      853.941.324.187.850.545.331.628.962.655.245.1
    • Table 3. Results of comparison experiments on VisDrone2019 dataset

      View table
      View in Article

      Table 3. Results of comparison experiments on VisDrone2019 dataset

      ModelPrecision/%Recall/%mAP@0.5/%mAP@0.5:0.95/%GFLOPsFPS
      YOLOv5s48.735.034.418.515.7120
      RetinaNet35.521.920.312.592.926
      TPH-YOLOv549.437.036.919.114.6125
      YOLOv7-tiny50.141.237.622.612.9113
      Deformmable-DETR52.445.044.225.7179.369
      YOLOv8s49.737.538.522.128.8129
      YOLOv10s55.440.741.123.821.6133
      Improved YOLOv557.743.043.924.934.399
      Faster-RCNN48.035.135.021.842.523
      Ours58.144.645.128.328.9145
    • Table 4. Results of comparison experiments on VisDrone2021 dataset

      View table
      View in Article

      Table 4. Results of comparison experiments on VisDrone2021 dataset

      ModelPrecision/%Recall/%mAP@0.5/%mAP@0.5:0.95/%GFLOPsFPS
      YOLOv5s46.633.131.916.115.7115
      RetinaNet31.518.915.310.592.921
      TPH-YOLOv545.434.033.616.314.6119
      YOLOv7-tiny48.839.234.619.912.9105
      Deformmable-DETR51.542.142.022.4179.362
      YOLOv8s47.737.537.419.828.8120
      YOLOv10s50.738.140.521.221.6127
      Improved YOLOv5s52.240.742.122.634.388
      Faster-RCNN46.133.632.318.842.520
      Ours53.142.643.124.328.9138
    Tools

    Get Citation

    Copy Citation Text

    Zhenjiu Xiao, Zhengwei Wu, Jiehao Zhang, Haicheng Qu. Adaptive foreground focusing for target detection in UAV aerial images[J]. Opto-Electronic Engineering, 2024, 51(9): 240149-1

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Article

    Received: Jun. 28, 2024

    Accepted: Aug. 6, 2024

    Published Online: Dec. 12, 2024

    The Author Email:

    DOI:10.12086/oee.2024.240149

    Topics