Laser & Optoelectronics Progress, Volume. 57, Issue 18, 181506(2020)
Human Action Recognition Algorithm Based on Spatio-Temporal Interactive Attention Model
A human action recognition algorithm is proposed based on spatio-temporal interactive attention model (STIAM) to solve the problem of low recognition accuracy. This problem is caused by the incapability of the two-stream network to effectively extract the valid frames in each video and the valid regions in each frame. Initially, the proposed algorithm applies two different deep learning networks to extract spatial and temporal features respectively. Subsequently, a mask-guided spatial attention model is designed to calculate the salient regions in each frame. Then, an optical flow-guided temporal attention model is designed to locate the saliency frames in each video. Finally, the weights obtained from temporal and spatial attention are weighted respectively with spatial features and temporal features to make this model realize the spatio-temporal interaction. Compared with the existing methods on UCF101 and Penn Action datasets, the experimental results show that STIAM has high feature extraction performance and the accuracy of action recognition is obviously improved.
Get Citation
Copy Citation Text
Na Pan, Min Jiang, Jun Kong. Human Action Recognition Algorithm Based on Spatio-Temporal Interactive Attention Model[J]. Laser & Optoelectronics Progress, 2020, 57(18): 181506
Category: Machine Vision
Received: Dec. 23, 2019
Accepted: Feb. 14, 2020
Published Online: Sep. 2, 2020
The Author Email: Jiang Min (minjiang@jiangnan.edu.cn)