Laser & Optoelectronics Progress, Volume 56, Issue 12, 121004 (2019)

Video Classification Based on Three-Dimensional Squeeze Excitation Module

Ningxiao Li, Guodong Wang*, Yanjie Wang, Shiyu Hu, and Liangliang Wang
Author Affiliations
  • College of Computer Science & Technology, Qingdao University, Qingdao, Shandong 266071, China

    To address the problem of fusing temporal features in video classification, this paper proposes a new three-dimensional (3D) squeeze-and-excitation (SE) module, constructed by combining the SE network used in two-dimensional convolutional neural networks (CNNs) with a 3D convolutional residual network. Compared with an SE module transformed directly to 3D, the new module adds an extra time-dimension coefficient to the coefficient set, allowing it to record how the motion trajectories of the objects under study change over time. The proposed module can not only record the characteristics of a specific time point but also strengthen the relevance among multiple time points. To assess the effectiveness of the module, an SE network with spatial and temporal dimensions was used to perform human action recognition. The experimental results indicate that the module accelerates loss convergence and effectively improves the accuracy of video classification.
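The abstract does not give the module's exact layout, so the following is only a minimal NumPy sketch of one way an SE block could carry an extra time-dimension coefficient. Everything in it is an assumption made for illustration, not the authors' design: the names `se3d`, `excite`, and `init_params`, the reduction ratio of 4, the random weights, and the choice to multiply a per-channel gate and a per-frame gate into the feature map.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def excite(desc, w1, w2):
    """Bottleneck gate used in SE blocks: linear, ReLU, linear, sigmoid."""
    hidden = np.maximum(desc @ w1, 0.0)
    return sigmoid(hidden @ w2)


def init_params(c, t, reduction=4, seed=0):
    """Random placeholder weights (hypothetical; a real model would learn them)."""
    rng = np.random.default_rng(seed)
    cr, tr = max(c // reduction, 1), max(t // reduction, 1)
    return {
        "wc1": rng.standard_normal((c, cr)) * 0.1,
        "wc2": rng.standard_normal((cr, c)) * 0.1,
        "wt1": rng.standard_normal((t, tr)) * 0.1,
        "wt2": rng.standard_normal((tr, t)) * 0.1,
    }


def se3d(x, params):
    """Rescale a feature map x of shape (N, C, T, H, W).

    Channel gate: squeeze over (T, H, W), as in a directly transformed 3D SE.
    Time gate (assumed form of the extra coefficient): squeeze over (C, H, W),
    giving one weight per frame.
    """
    chan_desc = x.mean(axis=(2, 3, 4))                            # (N, C)
    chan_gate = excite(chan_desc, params["wc1"], params["wc2"])   # (N, C)
    time_desc = x.mean(axis=(1, 3, 4))                            # (N, T)
    time_gate = excite(time_desc, params["wt1"], params["wt2"])   # (N, T)
    return (x
            * chan_gate[:, :, None, None, None]
            * time_gate[:, None, :, None, None])


# Toy input: 2 clips, 8 channels, 4 frames, 6x6 spatial resolution.
x = np.random.default_rng(1).standard_normal((2, 8, 4, 6, 6))
y = se3d(x, init_params(c=8, t=4))
```

Because both gates lie in (0, 1), the output keeps the input shape and only attenuates each channel and frame by a learned factor; in a residual network such a block would typically sit on the residual branch before the skip connection is added.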


    Ningxiao Li, Guodong Wang, Yanjie Wang, Shiyu Hu, Liangliang Wang. Video Classification Based on Three-Dimensional Squeeze Excitation Module[J]. Laser & Optoelectronics Progress, 2019, 56(12): 121004

    Paper Information

    Category: Image Processing

    Received: Nov. 29, 2018

    Accepted: Jan. 11, 2019

    Published Online: Jun. 13, 2019

    Corresponding Author Email: Guodong Wang (doctorwgd@gmail.com)

    DOI:10.3788/LOP56.121004

    Topics