Laser & Optoelectronics Progress, Volume. 58, Issue 2, 0210007(2021)

Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network

Wenqiang Zhang, Zengqiang Wang, and Liang Zhang*
Author Affiliations
  • Tianjin Key Laboratory of Advanced Signal and Image Processing, Civil Aviation University of China, Tianjin 300300, China
  • show less
    Figures & Tables(10)
    Overall flow diagram of action representation
    Static video frames and corresponding timing dynamic diagrams. (a) Static images; (b) timing dynamic diagrams; (c) optical flow diagrams
    TS-CNN network framework
    Recognition results of different subsequence lengths
    • Table 1. Recognition accuracy of UCF101 dataset with different input modes unit: %

      View table

      Table 1. Recognition accuracy of UCF101 dataset with different input modes unit: %

      MethodSplit1Split2Split3Accuracy
      SI84.684.985.084.8
      SOF87.389.991.089.4
      FSDI83.983.883.183.6
      BSDI84.183.384.383.9
      SDI85.786.285.585.8
      ESDI87.286.887.687.2
      SI+SOF93.294.094.293.8
      ESDI+SOF94.894.695.394.9
    • Table 2. Recognition accuracy of HMDB51 dataset with different input modes unit: %

      View table

      Table 2. Recognition accuracy of HMDB51 dataset with different input modes unit: %

      MethodSplit1Split2Split3Accuracy
      SI54.850.449.651.6
      SOF64.263.662.763.5
      FSDI50.751.453.651.9
      BSDI51.651.554.152.4
      SDI54.552.953.753.7
      ESDI53.655.555.654.9
      SI+SOF68.767.568.468.2
      ESDI+SOF69.671.271.670.8
    • Table 3. Recognition accuracy of different fusion methods on dataset unit: %

      View table

      Table 3. Recognition accuracy of different fusion methods on dataset unit: %

      Consensus functionUCF101HMDB51
      Max93.069.1
      Average94.970.8
      Weighted average93.869.7
    • Table 4. Recognition accuracy of different network models on dataset unit: %

      View table

      Table 4. Recognition accuracy of different network models on dataset unit: %

      Network structureUCF101HMDB51
      Resnet10193.668.4
      Bn-inception94.268.2
      InceptionV394.970.8
    • Table 5. Recognition accuracy of different human behavior recognition models unit: %

      View table

      Table 5. Recognition accuracy of different human behavior recognition models unit: %

      NetworkUCF101HMDB51
      Spatial stream84.851.4
      Temproral stream89.463.5
      Original two-stream88.059.4
      Ref. [19]94.069.4
      Appearance and long-sequential stream87.254.9
      Short sequential stream89.964
      TS-CNN94.970.8
    • Table 6. Recognition accuracy of different algorithms unit: %

      View table

      Table 6. Recognition accuracy of different algorithms unit: %

      Feature extractionMethodUCF101HMDB51
      TraditionRef. [7]84.857.2
      Ref. [8]87.961.1
      Deep learningRef. [17]88.059.4
      Ref. [21]88.6--
      Ref. [22]91.565.9
      Ref. [23]93.163.3
      Ref. [24]93.466.4
      Ref. [19]94.069.4
      Proposed94.970.8
    Tools

    Get Citation

    Copy Citation Text

    Wenqiang Zhang, Zengqiang Wang, Liang Zhang. Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network[J]. Laser & Optoelectronics Progress, 2021, 58(2): 0210007

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Jun. 5, 2020

    Accepted: Jul. 7, 2020

    Published Online: Jan. 5, 2021

    The Author Email: Zhang Liang (l-zhang@cauc.edu.cn)

    DOI:10.3788/LOP202158.0210007

    Topics