Journal of Optoelectronics · Laser, Vol. 34, Issue 12, 1298 (2023)
Human motion recognition based on ConvGRU and attention feature fusion
CHENG Nana, ZHANG Rongfen, LIU Yuhong, LIU Yuan, LIU Xingfei, YANG Shuang. Human motion recognition based on ConvGRU and attention feature fusion[J]. Journal of Optoelectronics · Laser, 2023, 34(12): 1298
Received: Mar. 21, 2023
Accepted: --
Published Online: Sep. 25, 2024
The Author Email: ZHANG Rongfen (rfzhang@gzu.edu.cn)