Journal of Applied Optics, Vol. 45, Issue 6, 1204 (2024)

Video smoke recognition based on random patch shift and deformable attention

Yehui XIE and Haitao ZHAO*
Author Affiliations
  • School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
    Figures & Tables(12)
    Overall framework diagram of network
    Different patch shift patterns
    Schematic diagram of deformable attention
    Deformable attention
    Overview of datasets
    Grad-CAM visualization
    • Table 1. Comparison of different methods on RISE test set

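      F1 score by split (S0 to S5):

      | Method | S0 | S1 | S2 | S3 | S4 | S5 |
      |---|---|---|---|---|---|---|
      | Flow-SVM | 0.42 | 0.59 | 0.47 | 0.63 | 0.52 | 0.47 |
      | Flow-I3D | 0.55 | 0.58 | 0.51 | 0.68 | 0.65 | 0.50 |
      | SVM | 0.57 | 0.70 | 0.67 | 0.67 | 0.57 | 0.53 |
      | I3D | 0.80 | 0.84 | 0.82 | 0.87 | 0.82 | 0.75 |
      | I3D-ND | 0.76 | 0.79 | 0.81 | 0.86 | 0.76 | 0.68 |
      | I3D-FP | 0.76 | 0.81 | 0.82 | 0.87 | 0.81 | 0.71 |
      | I3D-TSM | 0.81 | 0.84 | 0.82 | 0.87 | 0.80 | 0.74 |
      | I3D-LSTM | 0.80 | 0.84 | 0.82 | 0.85 | 0.83 | 0.74 |
      | I3D-TC | 0.81 | 0.84 | 0.84 | 0.87 | 0.81 | 0.77 |
      | CNN-NonFFM[12] | 0.83 | 0.82 | 0.84 | 0.85 | 0.78 | 0.83 |
      | EFFNet[12] | 0.84 | 0.83 | 0.86 | 0.86 | 0.80 | 0.83 |
      | AFSNet[13] | 0.85 | 0.86 | 0.82 | 0.91 | 0.81 | 0.80 |
      | Proposed method | 0.85 | 0.85 | 0.86 | 0.88 | 0.84 | 0.79 |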
    • Table 2. Performance comparison of different methods

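      | Method | Parameters/M | FLOPs/G | FPS |
      |---|---|---|---|
      | I3D | 12.3 | 62.7 | 32.71 |
      | I3D-TSM | 12.3 | 62.7 | 31.40 |
      | I3D-LSTM | 38.0 | 62.9 | 32.25 |
      | I3D-TC | 12.3 | 62.7 | 32.88 |
      | EFFNet[12] | 27.2 | 34.6 | 42.57 |
      | AFSNet[13] | 30.8 | 40.6 | 34.87 |
      | Proposed method | 24.2 | 68.4 | 31.78 |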
    • Table 3. Ablation experiments of RPS and DA

      | Model | F1 score | Acc | Pr | Re |
      |---|---|---|---|---|
      | Swin | 0.5802 | 0.7049 | 0.6157 | 0.5486 |
      | Swin+RPS | 0.8465 | 0.8869 | 0.8538 | 0.8394 |
      | Swin+RPS+DA | 0.8508 | 0.8925 | 0.8792 | 0.8242 |
    • Table 4. Ablation experiments of RPS in different layers of Swin Transformer

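      Layer columns 1, 2, 3, 4: the per-row marks are not preserved in the source; rows are listed in their original order.

      | F1 score | Acc | Pr | Re |
      |---|---|---|---|
      | 0.8089 | 0.8727 | 0.9156 | 0.7244 |
      | 0.8299 | 0.8809 | 0.8843 | 0.7819 |
      | 0.8458 | 0.8881 | 0.8669 | 0.8259 |
      | 0.8465 | 0.8869 | 0.8538 | 0.8394 |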
    • Table 5. Different patterns of RPS

      | Pattern | F1 score | Acc | Pr | Re |
      |---|---|---|---|---|
      | pattern-a | 0.8387 | 0.8831 | 0.8611 | 0.8174 |
      | pattern-b | 0.8347 | 0.8831 | 0.8800 | 0.7937 |
      | pattern-c | 0.8508 | 0.8925 | 0.8792 | 0.8242 |
    • Table 6. Ablation experiments of DA in different layers of Swin Transformer+RPS

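      Layer columns 3, 4: the per-row marks are not preserved in the source; rows are listed in their original order.

      | F1 score | Acc | Pr | Re |
      |---|---|---|---|
      | 0.8508 | 0.8925 | 0.8792 | 0.8242 |
      | 0.8092 | 0.8664 | 0.8630 | 0.7616 |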
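
The F1, Pr and Re columns in Table 3 can be cross-checked against each other. The sketch below assumes the reported F1 score is the usual harmonic mean of precision (Pr) and recall (Re); this page does not state the formula, so that is an assumption. The function name `f1_score` and the dictionary `table3` are illustrative only, and the numbers are copied from Table 3.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (Pr, Re, reported F1) copied from Table 3; the names are illustrative.
table3 = {
    "Swin": (0.6157, 0.5486, 0.5802),
    "Swin+RPS": (0.8538, 0.8394, 0.8465),
    "Swin+RPS+DA": (0.8792, 0.8242, 0.8508),
}

for model, (pr, re, reported) in table3.items():
    computed = f1_score(pr, re)
    print(f"{model}: computed F1 = {computed:.4f}, reported F1 = {reported:.4f}")
```

Under this assumption the recomputed values agree with the reported F1 scores to four decimal places. That fits the pattern in Table 3, where adding DA raises precision and lowers recall while F1 changes only slightly.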

    Yehui XIE, Haitao ZHAO. Video smoke recognition based on random patch shift and deformable attention[J]. Journal of Applied Optics, 2024, 45(6): 1204

    Paper Information

    Received: Sep. 21, 2023

    Accepted: --

    Published Online: Jan. 14, 2025

    Corresponding author: ZHAO Haitao (赵海涛)

    DOI:10.5768/JAO202445.0602005
