Laser & Optoelectronics Progress, Volume. 59, Issue 12, 1215016(2022)

Faster R-CNN Target-Detection Algorithm Fused with Adaptive Attention Mechanism

Qisheng Wang1,2,3, Fengsui Wang1,2,3、*, Jingang Chen1,2,3, and Furong Liu1,2,3
Author Affiliations
  • 1School of Electrical Engineering, Anhui Polytechnic University, Wuhu 241000, Anhui , China
  • 2Anhui Key Laboratory of Detection Technology and Energy Saving Devices, Wuhu 241000, Anhui , China
  • 3Key Laboratory of Advanced Perception and Intelligent Control of High-End Equipment, Ministry of Education, Wuhu 241000, Anhui , China
  • show less
    Figures & Tables(7)
    Structure diagram of MA
    Comparison diagram of test results. (a) Original algorithm detection results; (b) Detection results of improved algorithm based on MA
    • Table 1. Comparison of modified ResNet-50 structure based on MA

      View table

      Table 1. Comparison of modified ResNet-50 structure based on MA

      Layer nameOutput sizeResNet-50ResNet-50_MA*ResNet-50_MA
      Conv1112×1127×7,64,stride 2
      Attention 0112×112MA
      Conv2_x56×563×3 max pool,stride 2
      1×1,643×3,641×1,256×3
      Conv3_x28×281×1,1283×3,1281×1,512×4
      Conv4_x14×141×1,2563×3,2561×1,1024×6
      Conv5_x7×71×1,5123×3,5121×1,2048×3
      Attention 17×7MA*
      1×1Average pool,fc,softmax
    • Table 2. Experimental environment configuration

      View table

      Table 2. Experimental environment configuration

      NameModel number
      SystemWindows 10
      CPUCore i9-10900 @ 3.7 GHz
      GPUNvidia GeForce RTX 2080Ti(11 GB)
      FrameworkPytorch 1.2.0
      LanguagePython
    • Table 3. PASCAL VOC dataset training and test data statistics

      View table

      Table 3. PASCAL VOC dataset training and test data statistics

      DataTrainvalTest
      ImagesObjectsImagesObjects
      VOC2007501112608495212032
      VOC2012115402745000
      Total1655140058495212032
    • Table 4. Comparison of detection accuracy of 20 target classes

      View table

      Table 4. Comparison of detection accuracy of 20 target classes

      CategoryFRFR+MA*FR+MA
      mAP74.2575.01+0.7675.16+0.91
      Cat87.2387.31+0.0889.28+2.05
      Car86.6086.33-0.2785.85-0.75
      Horse84.8786.14+1.2786.50+1.63
      Dog83.6282.87-0.7584.29+0.67
      Bus80.2982.55+2.2683.45+3.16
      Train82.4983.12+0.6382.96+0.47
      Motorbike83.4883.62+0.1483.07-0.41
      Bicycle79.5683.14+3.5882.72+3.16
      Person79.7580.09+0.3480.38+0.63
      Aeroplane79.7576.09-3.6674.28-5.47
      Sheep75.2278.26+3.0473.20-2.02
      Bird74.1476.45+2.3177.53+3.39
      Cow74.7378.78+4.0574.70-0.03
      Tvmonitor74.3473.33-1.0173.09-1.25
      Diningtable72.2671.63-0.6373.96+1.70
      Sofa70.4475.12+4.6873.31+2.87
      Boat65.6962.92-2.7766.60+0.91
      Chair54.1951.53-2.6656.31+2.12
      Bottle52.0657.03+4.9756.48+4.42
      Pottedplant45.9643.85-2.1145.25-0.71
    • Table 5. Comparison of experimental results of ablation of MA

      View table

      Table 5. Comparison of experimental results of ablation of MA

      CategoryFRFR+MA1FR+MA2
      mAP74.2574.61+0.3674.70+0.45
      Cat87.2387.10-0.1390.07+2.84
      Car86.6085.94-0.6684.66-1.94
      Horse84.8785.20+0.3385.90+1.03
      Dog83.6286.41+2.7985.11+1.49
      Bus80.2983.17+2.8885.36+5.07
      Train82.4981.98-0.5182.09-0.40
      Motorbike83.4881.38-2.1081.37-2.11
      Bicycle79.5681.80+2.2483.38+3.82
      Person79.7578.64-1.1179.61-0.14
      Aeroplane79.7577.68-2.0776.20-3.55
      Sheep75.2275.62+0.4073.97-1.25
      Bird74.1475.58+1.4475.87+1.73
      Cow74.7378.09+3.3673.06-1.67
      Tvmonitor74.3472.20-2.1473.79-0.55
      Diningtable72.2670.78-1.4870.58-1.68
      Sofa70.4474.22+3.7873.24+2.80
      Boat65.6966.58+0.8966.29+0.66
      Chair54.1953.49-0.7051.81-2.38
      Bottle52.0651.17-0.8955.37+3.31
      Pottedplant45.9645.20-0.7646.21+0.25
    Tools

    Get Citation

    Copy Citation Text

    Qisheng Wang, Fengsui Wang, Jingang Chen, Furong Liu. Faster R-CNN Target-Detection Algorithm Fused with Adaptive Attention Mechanism[J]. Laser & Optoelectronics Progress, 2022, 59(12): 1215016

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Aug. 2, 2021

    Accepted: Aug. 31, 2021

    Published Online: May. 23, 2022

    The Author Email: Fengsui Wang (fswang@ahpu.edu.ac.cn)

    DOI:10.3788/LOP202259.1215016

    Topics