Acta Optica Sinica, Volume. 39, Issue 9, 0915006(2019)

Video-Based Person Re-Identification via Combined Multi-Level Deep Feature Representation and Ordered Weighted Distance Fusion

Rui Sun1,2, Qiheng Huang1,2、*, Weiming Lu1,2, and Jun Gao1
Author Affiliations
  • 1 School of Computer and Information, Hefei University of Technology, Hefei, Anhui 230009, China
  • 2 Anhui Provincial Key Laboratory of Industry Safety and Emergency Technology, Hefei, Anhui 230009, China
  • show less
    Figures & Tables(18)
    Challenges and difficulties in video-based person re-identification
    Architecture of video-based person re-identification algorithm combining multi-level depth feature representation and ordered weighted distance fusion
    Structure of RNN
    Flowchart of distance metric learning algorithm used for computing distance between persons
    Algorithm of ordered weighted distance fusion
    CMCs of comparison with still-image-based person re-identification methods on i-LIDS and PRID-2011 datasets. (a) i-LIDS dataset; (b) PRID-2011 dataset
    CMCs of comparison with video-based person re-identification methods on i-LIDS and PRID-2011 datasets. (a) i-LIDS dataset; (b) PRID-2011 dataset
    CMCs of comparison with deep-learning-based person re-identification methods on i-LIDS and PRID-2011 dataset. (a) i-LIDS dataset; (b) PRID-2011 dataset
    • Table 1. Structure parameters of global and local appearance depth feature network

      View table

      Table 1. Structure parameters of global and local appearance depth feature network

      Network layer/Parameter
      Conv1/Filters(96,11×11),Stride 4
      Max pooling1/Filters(3×3),Stride 2
      Conv2/Filters(256,5×5),Stride 1,pad 2
      Max pooling2/Filters(3×3),Stride 2
      Conv3/Filters(384,3×3),Stride 1,pad 1
      Conv4/Filters(384,3×3),Stride 1,pad 1
      Conv5/Filters(256,3×3),Stride 1,pad 1
      Max pooling5/Filters(3×3),Stride 2
      Fc6 layerSlice layer
      Fc6_Lh layerFc6_Ll layer
      Fc7 layerConcat layer
    • Table 1. 0 Results of comparison with key factors on PRID 2011 dataset

      View table

      Table 1. 0 Results of comparison with key factors on PRID 2011 dataset

      MethodMatching rate of PRID-2011 dataset /%
      Rank-1Rank-5Rank-10Rank-20
      Proposed71.890.494.197.5
      LAF-112.430.341.661.8
      SF-133.765.275.387.6
      GAF-140.464.079.891.0
      LAF+ST28.153.968.580.9
      GAF+ST42.767.481.889.9
    • Table 2. Public datasets for person re-identification

      View table

      Table 2. Public datasets for person re-identification

      Experimental datasetCharacteristicObject
      PRID-2011Viewpoint, lighting, and background changesOutdoor person
      i-LIDSViewpoint and lighting changes, occlusionsAirport person
      VIPeRViewpoint, lighting, and background changesSchool person
      CAVIAR4REIDLighting changes and occlusionsShopping person
      GRIDViewpoint and lighting changes, low resolutionSubway person
    • Table 3. Results of comparison with still-image-based person re-identification methods on i-LIDS dataset

      View table

      Table 3. Results of comparison with still-image-based person re-identification methods on i-LIDS dataset

      MethodMatching rate of i-LIDS dataset (image-based methods) /%
      Rank-1Rank-5Rank-10Rank-20
      Proposed55.784.795.396.7
      GRDL25.749.963.277.6
      DVDL25.948.257.368.9
      SRID24.944.555.666.2
      Salience10.224.835.552.9
    • Table 4. Results of comparison with still-image-based person re-identification methods on PRID 2011 dataset

      View table

      Table 4. Results of comparison with still-image-based person re-identification methods on PRID 2011 dataset

      MethodMatching rate of PRID-2011 dataset (image-based methods) /%
      Rank-1Rank-5Rank-10Rank-20
      Proposed71.890.494.197.5
      GRDL41.676.484.989.9
      DVDL40.669.777.885.6
      SRID35.159.469.879.7
      Salience25.843.652.662.0
    • Table 5. Results of comparison with video-based person re-identification methods on i-LIDS dataset

      View table

      Table 5. Results of comparison with video-based person re-identification methods on i-LIDS dataset

      MethodMatching rate of i-LIDS dataset (video-based methods) /%
      Rank-1Rank-5Rank-10Rank-20
      Proposed55.784.795.396.7
      TDL56.387.695.698.3
      DGM+IDE37.262.673.480.8
      VR23.342.455.368.4
      PAMM30.356.370.382.7
      UNKISS35.963.374.983.4
      ISR11.622.127.436.7
    • Table 6. Results of comparison with video-based person re-identification methods on PRID 2011 dataset

      View table

      Table 6. Results of comparison with video-based person re-identification methods on PRID 2011 dataset

      MethodMatching rate of PRID-2011 dataset (video-based methods) /%
      Rank-1Rank-5Rank-10Rank-20
      Proposed71.890.494.197.5
      TDL56.780.087.693.6
      DGM+IDE61.689.094.898.2
      VR28.955.365.582.8
      PAMM45.072.085.092.5
      UNKISS58.181.989.696.0
      ISR17.635.843.054.4
    • Table 7. Results of comparison with deep-learning-based person re-identification methods on i-LIDS dataset

      View table

      Table 7. Results of comparison with deep-learning-based person re-identification methods on i-LIDS dataset

      MethodMatching rate of i-LIDS dataset (deep learning methods) /%
      Rank-1Rank-5Rank-10Rank-20
      Proposed55.784.795.396.7
      RFA49.376.885.390.0
      Deep-RCN42.670.286.492.3
      CNN-KISS48.875.6-92.6
      BRNN55.385.091.795.1
      SRM-TAM55.286.5-97.0
      TRL57.781.7-94.1
    • Table 8. Results of comparison with deep-learning-based person re-identification methods on PRID 2011 dataset

      View table

      Table 8. Results of comparison with deep-learning-based person re-identification methods on PRID 2011 dataset

      MethodMatching rate of PRID-2011 dataset (deep learning methods) /%
      Rank-1Rank-5Rank-10Rank-20
      Proposed71.890.494.197.5
      RFA58.285.893.497.9
      Deep-RCN49.877.490.794.6
      CNN-KISS69.990.6-98.2
      BRNN72.892.095.197.6
      SRM-TAM79.494.4-99.3
      TRL87.897.499.3
    • Table 9. Results of comparison with key factors on i-LIDS dataset

      View table

      Table 9. Results of comparison with key factors on i-LIDS dataset

      MethodMatching rate of i-LIDS dataset /%
      Rank-1Rank-5Rank-10Rank-20
      Proposed55.784.795.396.7
      LAF-15.314.021.332.7
      ST-138.771.382.388.0
      GAF-146.074.079.389.3
      LAF+ST24.152.070.683.4
      GAF+ST31.561.471.384.0
    Tools

    Get Citation

    Copy Citation Text

    Rui Sun, Qiheng Huang, Weiming Lu, Jun Gao. Video-Based Person Re-Identification via Combined Multi-Level Deep Feature Representation and Ordered Weighted Distance Fusion[J]. Acta Optica Sinica, 2019, 39(9): 0915006

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Jan. 14, 2019

    Accepted: May. 31, 2019

    Published Online: Sep. 9, 2019

    The Author Email: Huang Qiheng (jchqh123@163.com)

    DOI:10.3788/AOS201939.0915006

    Topics