Acta Optica Sinica, Volume. 40, Issue 4, 0415001(2020)

Dynamic Receptive Field-Based Object Detection in Aerial Imaging

Xueli Xie, Chuanxiang Li, Xiaogang Yang, Jianxiang Xi*, and Tong Chen
Author Affiliations
  • College of Missile Engineering, Rocket Force University of Engineering, Xi'an, Shaanxi 710025, China
  • show less
    Figures & Tables(19)
    Structure of RetinaNet
    Internal structure of dual attention SE-ResNeXt module
    Bottom-up short connection
    Structure of pixel-wise addition module
    Structure of GCU module
    Structure of object detection subnet
    Structure of DRF module
    Partial sample of VISDrone-g dataset
    Statistical of the VISDrone-g. (a) Object scale distribution characteristics; (b) object frame length and width proportional distribution characteristics
    Detailed explanation of COCO object detection and evaluation indexes [26]
    Visual contrast between DRF-RetinaNet and RetinaNet*. (a)(c)(e) DRF-RetinaNet's detection result; (b)(d)(f) RetinaNet's detection result
    Detection results of dim light
    Detection results of dense objects
    Detection results of oblique view
    Detection results of down view
    • Table 1. Focal Loss parameter tuning

      View table

      Table 1. Focal Loss parameter tuning

      αtγAP /%AP50 /%F1-score
      0.202.023.9338.1847.24
      0.252.024.3739.9548.43
      0.253.025.1442.6252.47
      0.303.024.8241.2350.72
    • Table 2. Performance comparison of model components

      View table

      Table 2. Performance comparison of model components

      ModuleWhether or not it contains
      RetinaNet*
      SE-ResNeXt
      Bottom-up
      GCU
      DRF detection subnet
      AP /%18.9720.2221.3322.0523.1725.14
      AP50 /%28.6530.6432.7834.3337.8342.62
      Note:*indicates that anchor parameters have been adjusted according to section 4.2.
    • Table 3. Performance comparison of each algorithm

      View table

      Table 3. Performance comparison of each algorithm

      MethodInput sizeBasebone NetworkAP /%AP50 /%AP75 /%AR1 /%AR10 /%AR100 /%Time /ms
      Faster R-CNN600Resnet-5016.7224.3214.154.2112.4716.65137
      R-FCN600Resnet-10119.3530.1819.525.6518.7322.56178
      SSD512Vgg-1612.2317.2911.543.7111.2215.4154
      RFB-Net512Resnet-5014.8722.1712.064.3413.1517.3875
      YOLO v3416Darknet-5314.7521.8612.174.1212.9317.4167
      RetinaNet608Resnet-5016.3523.1813.924.8514.7518.3685
      RetinaNet*608Resnet-5018.9728.6517.424.9217.2520.5288
      DRF-RetinaNet608SE-ResNeXt-5025.1442.6224.717.8224.2231.24103
      Note:*indicates that anchor parameters have been adjusted according to section 4.2.
    • Table 4. Performance comparison of algorithms for different scales object detection

      View table

      Table 4. Performance comparison of algorithms for different scales object detection

      MethodAPsmall /%APmedium /%APlarge /%ARsmall /%ARmedium /%ARlarge /%F1-score
      Faster R-CNN7.1424.4236.7310.6226.7541.4133.15
      R-FCN9.8526.1340.2514.5732.7147.7940.67
      SSD5.8520.0334.077.6324.9738.6826.41
      RFB-Net6.6222.1834.289.5525.7740.8233.13
      YOLO v36.2522.2636.179.7225.7240.2732.73
      RetinaNet7.2723.9536.7210.3126.6342.2332.69
      RetinaNet*9.8225.3538.3114.9331.9144.8237.92
      DRF-RetinaNet13.6240.3455.9517.4249.9761.5352.47
      Note:*indicates that anchor parameters have been adjusted according to section 4.2.
    Tools

    Get Citation

    Copy Citation Text

    Xueli Xie, Chuanxiang Li, Xiaogang Yang, Jianxiang Xi, Tong Chen. Dynamic Receptive Field-Based Object Detection in Aerial Imaging[J]. Acta Optica Sinica, 2020, 40(4): 0415001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Aug. 29, 2019

    Accepted: Nov. 6, 2019

    Published Online: Feb. 11, 2020

    The Author Email: Xi Jianxiang (xijx07@mails.tsinghua.edu.cn)

    DOI:10.3788/AOS202040.0415001

    Topics