Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network

Wenqiang Zhang; Zengqiang Wang; Liang Zhang

doi:10.3788/LOP202158.0210007

Laser & Optoelectronics Progress, Vol. 58, Issue 2, 0210007 (2021)

Tianjin Key Laboratory of Advanced Signal and Image Processing, Civil Aviation University of China, Tianjin 300300, China

References(24)

[1] Zhu Y, Zhao J K, Wang Y N et al. A review of human action recognition based on deep learning[J]. Acta Automatica Sinica, 42, 848-857(2016).

[2] Li Y P, Liu T T, Zhang L. Human action recognition based on deep learning[J]. Application Research of Computers, 37, 304-307, 316(2020).

[3] Luo H L, Tong K, Kong F S. The progress of human action recognition in videos based on deep learning: a review[J]. Acta Electronica Sinica, 47, 1162-1173(2019).

[4] Li Q H, Li A H, Wang T et al. Double-stream convolutional networks with sequential optical flow image for action recognition[J]. Acta Optica Sinica, 38, 0615002(2018).

[5] Liu F, Yu F Q. Human action recognition based on global and local features[J]. Laser & Optoelectronics Progress, 57, 021004(2020).

[6] Huang Y W, Wan C L, Feng H. Multi-feature fusion human behavior recognition algorithm based on convolutional neural network and long short term memory neural network[J]. Laser & Optoelectronics Progress, 56, 071505(2019).

[7] Wang H, Kläser A, Schmid C et al. Dense trajectories and motion boundary descriptors for action recognition[J]. International Journal of Computer Vision, 103, 60-79(2013).

[8] Wang H, Schmid C. Action recognition with improved trajectories[C]∥2013 IEEE International Conference on Computer Vision, December 1-8, 2013, Sydney, NSW, Australia, 3551-3558(2013).

[9] Sun S Y, Kuang Z H, Sheng L et al. Optical flow guided feature: a fast and robust motion representation for video action recognition[C]∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA, 1390-1399(2018).

[10] Zhang B W, Wang L M, Wang Z et al. Real-time action recognition with enhanced motion vector CNNs[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA, 2718-2726(2016).

[11] Wang L L, Ge L Z, Li R F et al. Three-stream CNNs for action recognition[J]. Pattern Recognition Letters, 92, 33-40(2017).

[12] Shi Y M, Tian Y H, Wang Y W et al. Sequential deep trajectory descriptor for action recognition with three-stream CNN[J]. IEEE Transactions on Multimedia, 19, 1510-1520(2017).

[13] Chen S H, Chen Z Z. On human behavior recognition with deep learning and IR spectral signal restoration technologies in a natural classroom[J]. Infrared Physics & Technology, 105, 103167(2020).

[14] Arivazhagan S, Shebiah R N, Harini R et al. Human action recognition from RGB-D data using complete local binary pattern[J]. Cognitive Systems Research, 58, 94-104(2019).

[15] Fernando B, Gavves E, Oramas M J et al. Rank pooling for action recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 773-787(2017).

[16] Karpathy A, Toderici G, Shetty S et al. Large-scale video classification with convolutional neural networks[C]∥2014 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA, 1725-1732(2014).

[17] Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos[EB/OL]. (2014-11-12)[2020-07-07]. https://arxiv.org/abs/1406.2199.

[18] Tran D, Bourdev L, Fergus R et al. Learning spatiotemporal features with 3D convolutional networks[C]∥2015 IEEE International Conference on Computer Vision (ICCV), December 7-13, 2015, Santiago, Chile, 4489-4497(2015).

[19] Wang L M, Xiong Y J, Wang Z et al. Temporal segment networks: towards good practices for deep action recognition[EB/OL]. (2016-08-02)[2020-07-07]. https://arxiv.org/abs/1608.00859.

[20] Lan Z Z, Zhu Y, Hauptmann A G et al. Deep local video feature for action recognition[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), July 21-26, 2017, Honolulu, HI, USA, 1219-1225(2017).

[21] Ng Y H, Hausknecht M, Vijayanarasimhan S et al. Beyond short snippets: deep networks for video classification[C]∥2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4694-4702(2015).

[22] Wang L M, Qiao Y, Tang X O. Action recognition with trajectory-pooled deep-convolutional descriptors[C]∥2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA, 4305-4314(2015).

[23] Zhu W J, Hu J, Sun G et al. A key volume mining deep framework for action recognition[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA, 1991-1999(2016).

[24] Carreira J, Zisserman A. Quo vadis, action recognition? A new model and the Kinetics dataset[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4724-4733(2017).

Wenqiang Zhang, Zengqiang Wang, Liang Zhang. Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network[J]. Laser & Optoelectronics Progress, 2021, 58(2): 0210007

Paper Information

Category: Image Processing

Received: Jun. 5, 2020

Accepted: Jul. 7, 2020

Published Online: Jan. 5, 2021

Author email: Liang Zhang (l-zhang@cauc.edu.cn)
