Laser & Optoelectronics Progress, Volume. 58, Issue 20, 2020001(2021)
Improved Encoder-Decoder Temporal Action Detection Algorithm
Temporal action detection is a fundamental task in video understanding that is commonly used in the fields of human-computer interaction, video surveillance, intelligent security, and other fields. An improved encoder-decoder temporal action detection algorithm based on the convolutional neural network is proposed. The improved algorithm is applied in two stages: first, the feature extraction network is replaced and the residual structure network is used to extract the deep features of the video frame; and second, the encoder-decoder temporal convolutional network is constructed. The feature fusion is conducted via contact, and the method of upsampling is improved. To improve the detection accuracy of the network, the proposed algorithm employs the appropriate activation function LReLU for training. The experimental results show that the accuracy of the proposed algorithm on the temporal action detection datasets MERL Shopping and GTEA has improved.
Get Citation
Copy Citation Text
Yue Wang, Hansong Su, Gaohua Liu. Improved Encoder-Decoder Temporal Action Detection Algorithm[J]. Laser & Optoelectronics Progress, 2021, 58(20): 2020001
Category: Optics in Computing
Received: Sep. 24, 2020
Accepted: Dec. 8, 2020
Published Online: Oct. 15, 2021
The Author Email: Liu Gaohua (suppig@126.com)