Infrared and Laser Engineering, Volume. 47, Issue 2, 203007(2018)

Action recognition method of spatio-temporal feature fusion deep learning network

Pei Xiaomin1,2、*, Fan Huijie2, and Tang Yandong2
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • show less

    Action recognition from natural scene was affected by complex illumination conditions and cluttered backgrounds. There was a growing interest in solving these problems by using 3D skeleton data. Firstly, considering the spatio-temporal features of human actions, a spatio-temporal fusion deep learning network for action recognition was proposed; Secondly, view angle invariant character was constructed based on geometric features of the skeletons. Local spatial character was extracted by short-time CNN networks. A spatio-LSTM network was used to learn the relation between joints of a skeleton frame. Temporal LSTM was used to learn spatio-temporal relation between skeleton sequences. Lastly, NTU RGB+D datasets were used to evaluate this network, the network proposed achieved the state-of-the-art performance for 3D human action analysis. Experimental results show that this network has strong robustness for view invariant sequences.

    Tools

    Get Citation

    Copy Citation Text

    Pei Xiaomin, Fan Huijie, Tang Yandong. Action recognition method of spatio-temporal feature fusion deep learning network[J]. Infrared and Laser Engineering, 2018, 47(2): 203007

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: 特约专栏—“深度学习及其应用”

    Received: Aug. 10, 2017

    Accepted: Oct. 28, 2017

    Published Online: Apr. 26, 2018

    The Author Email: Xiaomin Pei (pxm_neu@126.com)

    DOI:10.3788/irla201847.0203007

    Topics