Acta Optica Sinica, Volume. 39, Issue 11, 1128002(2019)

Object Detection in Remote Sensing Images Using Multiscale Convolutional Neural Networks

Qunli Yao1,2、*, Xian Hu1,2, and Hong Lei1
Author Affiliations
  • 1Department of Space Microwave Remote Sensing Systems, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China
  • 2School of Electronics, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
  • show less
    Figures & Tables(8)
    Target detection framework of MSCNN
    Structures of EFPN and dilated bottleneck. (a) Structure of EFPN module; (b) dilated bottleneck structure; (c) dilated bottleneck structure with 1×1 Conv
    Network structure of EFPN-NoProj
    Visual detection results of MSCNN
    • Table 1. Definition of bounding box areas based on distribution of instance scales

      View table

      Table 1. Definition of bounding box areas based on distribution of instance scales

      SmallMediumLarge
      (0,60](60,120](120,+¥)
    • Table 2. Comparison of the detection precision of different algorithms on NWPU VHR-10 dataset

      View table

      Table 2. Comparison of the detection precision of different algorithms on NWPU VHR-10 dataset

      MethodRICNN[25]FRCN-VGG-16[16]YOLO[15]SSD[13]R-FCN[26]FRCN-Deform[27]FPN[18]MSDN[28]MSCNN
      Airplane0.8840.8300.8740.9560.9610.9830.9640.9980.994
      Ship0.7730.7760.8470.9370.9830.8920.9310.9720.953
      Storage tank0.8530.5250.4270.6170.7250.8170.9140.8380.918
      Baseball diamond0.8810.9630.9310.9950.9940.9840.9470.9910.963
      Tennis court0.4080.6290.6580.8600.9070.8590.9440.9730.954
      Basketball court0.5850.6880.8700.9440.9780.9270.9590.9990.967
      Ground track field0.8670.9840.9750.9870.9810.9880.9900.9860.993
      Harbor0.6860.8190.8000.9500.9240.9460.9210.9720.955
      Bridge0.6150.7930.9030.9660.9340.9470.8380.9270.972
      Vehicle0.7110.6390.7040.7450.8840.8160.9000.9010.933
      mAP0.7260.7640.7990.8940.9280.9170.9310.9560.960
    • Table 3. Ablation experimental parameters of MSCNN

      View table

      Table 3. Ablation experimental parameters of MSCNN

      BackboneAverage precision
      IOU=0.5:0.95IOU=0.5IOU=0.75SmallMediumLarge
      RetinaNet0.6900.9450.8090.5320.5590.682
      EFPN0.7060.9600.8240.5470.5780.701
      EFPN-NoProj0.7000.9500.8190.5440.5620.698
    • Table 4. Average precision and average recall under different IOU thresholds and different bounding box areas

      View table

      Table 4. Average precision and average recall under different IOU thresholds and different bounding box areas

      BackboneAverage precisionAverage precision(IOU =0.5:0.95)Average recall(IOU=0.5:0.95)
      IOU=0.5:0.95IOU=0.5IOU=0.75SmallMediumLargeSmallMediumLarge
      RetinaNet0.6900.9450.8090.5320.5590.6820.5730.5860.754
      MSCNN0.7060.9600.8240.5470.5780.7010.6000.6050.755
      EFPN-NoProj0.7000.9500.8190.5440.5620.6980.5970.6000.753
    Tools

    Get Citation

    Copy Citation Text

    Qunli Yao, Xian Hu, Hong Lei. Object Detection in Remote Sensing Images Using Multiscale Convolutional Neural Networks[J]. Acta Optica Sinica, 2019, 39(11): 1128002

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Remote Sensing and Sensors

    Received: Apr. 8, 2019

    Accepted: Jul. 26, 2019

    Published Online: Nov. 6, 2019

    The Author Email: Yao Qunli (yaoqunli15@mails.ucas.ac.cn)

    DOI:10.3788/AOS201939.1128002

    Topics