Infrared and Laser Engineering, Volume. 47, Issue 2, 203007(2018)
Action recognition method of spatio-temporal feature fusion deep learning network
Action recognition from natural scene was affected by complex illumination conditions and cluttered backgrounds. There was a growing interest in solving these problems by using 3D skeleton data. Firstly, considering the spatio-temporal features of human actions, a spatio-temporal fusion deep learning network for action recognition was proposed; Secondly, view angle invariant character was constructed based on geometric features of the skeletons. Local spatial character was extracted by short-time CNN networks. A spatio-LSTM network was used to learn the relation between joints of a skeleton frame. Temporal LSTM was used to learn spatio-temporal relation between skeleton sequences. Lastly, NTU RGB+D datasets were used to evaluate this network, the network proposed achieved the state-of-the-art performance for 3D human action analysis. Experimental results show that this network has strong robustness for view invariant sequences.
Get Citation
Copy Citation Text
Pei Xiaomin, Fan Huijie, Tang Yandong. Action recognition method of spatio-temporal feature fusion deep learning network[J]. Infrared and Laser Engineering, 2018, 47(2): 203007
Category: 特约专栏—“深度学习及其应用”
Received: Aug. 10, 2017
Accepted: Oct. 28, 2017
Published Online: Apr. 26, 2018
The Author Email: Xiaomin Pei (pxm_neu@126.com)