Laser & Optoelectronics Progress, Volume. 58, Issue 2, 0210007(2021)

Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network

Wenqiang Zhang, Zengqiang Wang, and Liang Zhang*
Author Affiliations
  • Tianjin Key Laboratory of Advanced Signal and Image Processing, Civil Aviation University of China, Tianjin 300300, China
  • show less
    References(24)

    [1] Zhu Y, Zhao J K, Wang Y N et al. Areview of human action recognition based on deep learning[J]. Acta Automatica Sinica, 42, 848-857(2016).

    [2] Li Y P, Liu T T, Zhang L. Human action recognition based on deep learning[J]. Application Research of Computers, 37, 304-307, 316(2020).

    [3] Luo H L, Tong K, Kong F S. Theprogress of human action recognition in videos based on deep learning: a review[J]. Acta Electronica Sinica, 47, 1162-1173(2019).

    [8] Wang H, Schmid C. Actionrecognition with improved trajectories[C]∥2013 IEEE International Conference on Computer Vision, December 1-8, 2013, Sydney, NSW, Australia., 3551-3558(2013).

    [9] [9] Sun SY, Kuang ZH, ShengL, et al.Optical flow guided feature: a fast and robust motion representation for video action recognition[C]∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE Press, 2018: 1390- 1399.

    [10] Zhang B W, Wang L M, Wang Z et al. Real-time action recognition with enhanced motion vector CNNs[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA., 2718-2726(2016).

    [11] Wang L L, Ge L Z, Li R F et al. Three-stream CNNs for action recognition[J]. Pattern Recognition Letters, 92, 33-40(2017).

    [12] Shi Y M, Tian Y H, Wang Y W et al. Sequential deep trajectory descriptor for action recognition with three-stream CNN[J]. IEEE Transactions on Multimedia, 19, 1510-1520(2017).

    [13] Chen S H, Chen Z Z. On human behavior recognition with deep learning and IR spectral signal restoration technologies in a natural classroom[J]. Infrared Physics & Technology, 105, 103167(2020).

    [14] Arivazhagan S, Shebiah R N, Harini R et al. Human action recognition from RGB-D data using complete local binary pattern[J]. Cognitive Systems Research, 58, 94-104(2019).

    [15] Fernando B, Gavves E, Oramas M J et al. Rank pooling for action recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 773-787(2017).

    [16] Karpathy A, Toderici G, Shetty S et al. Large-scale video classification with convolutional neural networks[C]∥2014 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA., 1725-1732(2014).

    [17] Simonyan K. -11-12)[2020-07-07]. https:∥arxiv., org/abs/1406, 2199(2014).

    [18] Tran D, Bourdev L, Fergus R et al. Learning spatiotemporal features with 3D convolutional networks[C]∥2015 IEEE International Conference on Computer Vision (ICCV), December 7-13, 2015, Santiago, Chile., 4489-4497(2015).

    [19] Wang L M, Xiong Y J, Wang Z et al. -08-02)[2020-07-07]. https: ∥arxiv., org/abs/1608, 00859(2016).

    [20] Lan Z Z, Zhu Y, Hauptmann A G et al. Deep local video feature for action recognition[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), July 21-26, 2017, Honolulu, HI, USA., 1219-1225(2017).

    [21] Ng Y H, Hausknecht M, Vijayanarasimhan S et al. Beyond short snippets: deep networks for video classification[J]. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4694-4702(2015).

    [22] Wang L M, Qiao Y, Tang X O. Action recognition with trajectory-pooled deep-convolutional descriptors[C]∥2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA., 4305-4314(2015).

    [23] Zhu W J, Hu J, Sun G et al. A key volume mining deep framework for action recognition[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA., 1991-1999(2016).

    [24] Carreira J, Zisserman A. Quovadis, action recognition? A new model and the kinetics dataset[J]. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4724-4733(2017).

    Tools

    Get Citation

    Copy Citation Text

    Wenqiang Zhang, Zengqiang Wang, Liang Zhang. Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network[J]. Laser & Optoelectronics Progress, 2021, 58(2): 0210007

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Jun. 5, 2020

    Accepted: Jul. 7, 2020

    Published Online: Jan. 5, 2021

    The Author Email: Zhang Liang (l-zhang@cauc.edu.cn)

    DOI:10.3788/LOP202158.0210007

    Topics