Laser & Optoelectronics Progress, Vol. 58, Issue 20, 2020001 (2021)

Improved Encoder-Decoder Temporal Action Detection Algorithm

Yue Wang, Hansong Su, and Gaohua Liu*
Author Affiliations
  • School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China

    Temporal action detection is a fundamental task in video understanding, with applications in human-computer interaction, video surveillance, and intelligent security. This paper proposes an improved encoder-decoder temporal action detection algorithm based on convolutional neural networks. The algorithm works in two stages. First, the feature extraction network is replaced with a residual network that extracts deep features from the video frames. Second, an encoder-decoder temporal convolutional network is constructed, in which features are fused by concatenation (concat) and the upsampling method is improved. To improve detection accuracy, the network is trained with the LReLU activation function. Experimental results show that the proposed algorithm improves accuracy on the temporal action detection datasets MERL Shopping and GTEA.
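    The building blocks named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract gives no layer sizes, kernel widths, LReLU slope, or details of the improved upsampling, so the slope of 0.01, max pooling with stride 2, nearest-neighbour upsampling, and all function names below are assumptions chosen for illustration. Per-frame features stand in for the output of the residual feature extractor.

```python
from typing import List

Frames = List[List[float]]  # one feature vector (channels) per time step


def leaky_relu(x: float, slope: float = 0.01) -> float:
    """LReLU: identity for x >= 0, small linear slope otherwise (slope is assumed)."""
    return x if x >= 0.0 else slope * x


def temporal_conv(frames: Frames, kernel: List[List[List[float]]]) -> Frames:
    """Zero-padded 1D convolution over time, followed by LReLU.

    kernel[k][i][o] weights input channel i at time offset k for output channel o.
    """
    width, half = len(kernel), len(kernel) // 2
    c_in, c_out = len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(len(frames)):
        row = [0.0] * c_out
        for k in range(width):
            src = t + k - half
            if 0 <= src < len(frames):  # positions outside the clip count as zero
                for i in range(c_in):
                    for o in range(c_out):
                        row[o] += frames[src][i] * kernel[k][i][o]
        out.append([leaky_relu(v) for v in row])
    return out


def max_pool(frames: Frames, stride: int = 2) -> Frames:
    """Encoder step: channel-wise maximum over every `stride` time steps."""
    return [
        [max(col) for col in zip(*frames[t:t + stride])]
        for t in range(0, len(frames), stride)
    ]


def upsample(frames: Frames, factor: int = 2) -> Frames:
    """Decoder step: nearest-neighbour upsampling (an assumed, plain variant)."""
    return [list(f) for f in frames for _ in range(factor)]


def concat_fuse(a: Frames, b: Frames) -> Frames:
    """Fuse two equally long sequences by concatenating channels per time step."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [x + y for x, y in zip(a, b)]


def encode_decode(frames: Frames) -> Frames:
    """One encoder level and one decoder level joined by a concatenation skip."""
    pooled = max_pool(frames)                    # T -> ceil(T / 2) time steps
    restored = upsample(pooled)[:len(frames)]    # back to T time steps
    return concat_fuse(frames, restored)         # C -> 2C channels


if __name__ == "__main__":
    clip = [[1.0, -2.0], [3.0, 0.5], [-1.0, 4.0], [2.0, 2.0]]
    print(encode_decode(clip))
```

    Concatenation keeps the fine-grained encoder features next to the coarser upsampled ones, at the cost of doubling the channel count that the next layer has to process.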

    Yue Wang, Hansong Su, Gaohua Liu. Improved Encoder-Decoder Temporal Action Detection Algorithm[J]. Laser & Optoelectronics Progress, 2021, 58(20): 2020001

    Paper Information

    Category: Optics in Computing

    Received: Sep. 24, 2020

    Accepted: Dec. 8, 2020

    Published Online: Oct. 15, 2021

    Author Email: Gaohua Liu (suppig@126.com)

    DOI:10.3788/LOP202158.2020001

    Topics