Laser & Optoelectronics Progress, Volume. 59, Issue 18, 1815015(2022)

Object Detection Based on Semantic Sampling and Localization Refinement

Yu Li1,2, Shaoyan Gai1,2,3, Feipeng Da1,2,3、*, and Ru Hong1,2
Author Affiliations
  • 1School of Automation, Southeast University, Nanjing 210096, Jiangsu, China
  • 2Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education,Southeast University, Nanjing 210096, Jiangsu, China
  • 3Shenzhen Research Institute, Southeast University, Shenzhen 518063, Guangdong, China
  • show less
    Figures & Tables(10)
    Comparison of object detection architectures. (a) Architecture of RepPoints[14]; (b) architecture of proposed detector
    Overall architecture of proposed object detector. Dashed box: semantic based positioning module; dashed box: feature enhancement module
    Architecture of localization subnet and classification subnet
    Training samplers for object detection. (a) Center-based training sampler; (b) semantic-based training sampler
    Qualitative results of proposed method on different datasets. (a), (b) VOC 2007; (c), (d) MS COCO
    • Table 1. Experimental result comparison of proposed method to other methods on MS COCO test-dev

      View table

      Table 1. Experimental result comparison of proposed method to other methods on MS COCO test-dev

      MethodBackboneAPAP50AP75APsAPmAPlTime /ms
      RetinaNet22ResNet-10139.159.142.321.842.750.290
      FCOS6ResNet-10141.560.745.024.444.851.691
      FSAF5ResNet-10140.961.544.024.044.251.3138
      ATSS10ResNet-10143.662.147.426.147.053.693
      RepPoints14ResNet-10141.062.944.323.644.151.787
      CornerNet4Hourglass-10440.556.543.119.442.753.9227
      ExtremeNet7Hourglass-10440.255.543.220.443.253.1348
      CenterNet8Hourglass-10442.161.145.924.145.552.8298
      LSNet21ResNeXt-10143.963.147.826.647.155.4138
      ProposedResNet-10142.865.146.326.147.355.878
    • Table 2. Experimental result comparison of integrating three modules into RepPoints

      View table

      Table 2. Experimental result comparison of integrating three modules into RepPoints

      MethodAPAP50AP75APsAPmAPlTime /ms
      Baseline RepPoints39.160.542.121.241.250.085
      RepPoints + LB40.461.943.623.143.852.485
      RepPoints + LB + FE41.062.644.324.344.653.386
      RepPoints + EO39.261.742.221.442.850.676
      Proposed(LB + FE+EO)41.263.044.224.645.053.977
    • Table 3. Comparison of different transformation conversions

      View table

      Table 3. Comparison of different transformation conversions

      Conversion function NLAPAP50AP75Time /ms
      1×1 conv40.460.243.685
      Max-pooling40.360.143.485
      Max-pooling + avg-pooling40.460.343.886
    • Table 4. Detection results with different location thresholds

      View table

      Table 4. Detection results with different location thresholds

      ϵLAPAP50AP75
      0.00039.159.942.1
      0.00539.860.142.1
      0.01040.460.243.6
      0.05040.260.143.3
      0.10039.859.942.9
    • Table 5. Impact of applying efficient refinement optimization to different networks

      View table

      Table 5. Impact of applying efficient refinement optimization to different networks

      MethodEOFLOPs /GParam /MFPS /(frame·s-1Time /msAP
      RepPoints190.1636.6211.88539.1
      190.0635.9713.17639.2
      Proposed190.1836.6211.68641.0
      190.0735.9712.97741.2
    Tools

    Get Citation

    Copy Citation Text

    Yu Li, Shaoyan Gai, Feipeng Da, Ru Hong. Object Detection Based on Semantic Sampling and Localization Refinement[J]. Laser & Optoelectronics Progress, 2022, 59(18): 1815015

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Jul. 23, 2021

    Accepted: Aug. 31, 2021

    Published Online: Sep. 5, 2022

    The Author Email: Da Feipeng (qxxymm@163.com)

    DOI:10.3788/LOP202259.1815015

    Topics