Semiconductor Optoelectronics, Volume. 41, Issue 3, 414(2020)

Chinese Sign Language Recognition Based on Spatial-Temporal Attention Network

LUO Yuan1,*... LI Dan1 and ZHANG Yi2 |Show fewer author(s)
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • show less

    Sign language recognition is widely used in communication between deaf-mute and ordinary people. In adequate extraction of spatial-temporal features in sign language recognition task is likely to result in low recognition rate. In this paper, proposed is a novel sign language recognition model based on spatial-temporal attention which can learn more discriminative spatial-temporal features. Specially, a new spatial attention module based on residual 3D convolutional neural network (Res3DCNN) is proposed, which automatically focus on the salient areas in the spatial region. Then, to measure the importance of video frames, a new temporal attention module based on convolutional long short-term memory (ConvLSTM) is introduced. The crucial purpose of the proposed model is to focus on the salient areas spatially and pay attention to the key video frames temporally. Lastly, experimental results demonstrate the efficiency of the proposed method on the Chinese sign language (CSL) dataset.

    Tools

    Get Citation

    Copy Citation Text

    LUO Yuan, LI Dan, ZHANG Yi. Chinese Sign Language Recognition Based on Spatial-Temporal Attention Network[J]. Semiconductor Optoelectronics, 2020, 41(3): 414

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Dec. 30, 2019

    Accepted: --

    Published Online: Jun. 18, 2020

    The Author Email: Yuan LUO (luoyuan@cqupt.edu.cn)

    DOI:10.16818/j.issn1001-5868.2020.03.021

    Topics