Laser & Optoelectronics Progress, Volume. 58, Issue 20, 2020001(2021)

Improved Encoder-Decoder Temporal Action Detection Algorithm

Yue Wang, Hansong Su, and Gaohua Liu*
Author Affiliations
  • School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
  • show less
    Figures & Tables(11)
    Structure of the improved encoder-decoder temporal convolutional neural network
    Structure of the residual module
    Different feature fusion methods
    Schematic diagram of traditional upsampling and improved upsampling. (a) Traditional upsampling; (b)improved upsampling
    Detection example of MERL Shopping dataset
    Detection example of GTEA dataset
    • Table 1. Parameters of feature extraction network

      View table

      Table 1. Parameters of feature extraction network

      BlockKernel sizeNumber of channels
      Conv17×764
      Conv2_x1×13×31×1×36464256×3
      Conv3_x1×13×31×1×4128128512×4
      Conv4_x1×13×31×1×62562561024×6
      Conv5_x1×13×31×1×35125122048×3
    • Table 2. Recognition accuracy rate of each action

      View table

      Table 2. Recognition accuracy rate of each action

      DatasetActionAccuracy /%
      MERL ShoppingReach to shelf77.8
      Retract from shelf79.3
      Hand in shelf81.6
      Inspect the product80.4
      Inspect the shelf81.2
    • Table 3. Effectiveness of various module on the algorithm

      View table

      Table 3. Effectiveness of various module on the algorithm

      DatasetVggNet16ResNet50ED-TCNImproved ED-TCNmAP /%
      MERL Shopping24.3
      MERL Shopping25.6
      MERL Shopping29.3
      GTEA25.8
      GTEA27.2
      GTEA30.2
    • Table 4. Seg-F1 of different algorithms on different datasets

      View table

      Table 4. Seg-F1 of different algorithms on different datasets

      DatasetED-TCNImproved ED-TCNSeg-F1@10Seg-F1@25Seg-F1@50
      MERL Shopping86.785.172.9
      MERL Shopping89.287.474.8
      GTEA72.269.356.0
      GTEA76.871.958.5
    • Table 5. Comparison of results of different algorithms on the MERL Shopping dataset

      View table

      Table 5. Comparison of results of different algorithms on the MERL Shopping dataset

      AlgorithmAccuracy /%mAP /%Seg-F1@10Seg-F1@25Seg-F1@50
      MSN Det64.629.546.442.625.6
      MSN Seg76.324.280.078.365.4
      Dilated TCN76.426.379.978.067.5
      ED-TCN79.025.586.785.172.9
      Improved ED-TCN82.429.389.287.474.8
    Tools

    Get Citation

    Copy Citation Text

    Yue Wang, Hansong Su, Gaohua Liu. Improved Encoder-Decoder Temporal Action Detection Algorithm[J]. Laser & Optoelectronics Progress, 2021, 58(20): 2020001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Optics in Computing

    Received: Sep. 24, 2020

    Accepted: Dec. 8, 2020

    Published Online: Oct. 15, 2021

    The Author Email: Liu Gaohua (suppig@126.com)

    DOI:10.3788/LOP202158.2020001

    Topics