Optics and Precision Engineering, Volume. 33, Issue 4, 653(2025)

A Transformer-based visual tracker via knowledge distillation

Na LI*, Mengqiao LIU, Jinting PAN, Kai HUANG, and Xingxuan JIA
Author Affiliations
  • School of Communication and Information Engineering, Xi’an University of Posts and Telecommunications, Xi’an710121, China
  • show less
    Figures & Tables(10)
    Overall framework of our algorithm
    Dynamic allocation process of weights
    Block of the encoder
    Visualization of feature maps sequentially passing through Conv, BN and ReLU for the predicted frame and current frame
    Comparison with advanced trackers on LaSOT
    Comparison with advanced trackers for different attributes on LaSOT
    Screenshots of tracking results for different algorithms on sequences bird1 in OTB
    • Table 1. Details of KTransT and KTransT-T

      View table
      View in Article

      Table 1. Details of KTransT and KTransT-T

      MethodBlockParams/MFLOPs/GLatency/ms
      KTransT-T12896610
      KTransT646335
    • Table 2. Ablation experiments on LaSOT

      View table
      View in Article

      Table 2. Ablation experiments on LaSOT

      MethodAUCPNP
      Baseline-T64.369.174.0
      +OD65.170.275.0
      +Conv_164.569.374.1
      +Conv_364.769.474.5
      +OD+Conv_3(KTransT-T)67.372.076.6
      Baseline-S55.055.767.4
      +RF56.059.368.3
      +KD60.062.869.7
      +KDL62.565.171.6
      +RF+KDL(KTransT)64.167.973.3
    • Table 3. Comparison of different algorithms on GOT-10k, LaSOT, OTB100, UAV123 and TrackingNet

      View table
      View in Article

      Table 3. Comparison of different algorithms on GOT-10k, LaSOT, OTB100, UAV123 and TrackingNet

      MethodGOT-10kLaSOTOTB100TrackingNetUAV123GPU Speed
      AOSR0.5SR0.75AUCPAUCPAUCPAUCP
      SiamFC3035.539.011.833.633.957.876.557.153.352.373.1330
      SiamBAN3113.68.71.451.452.164.085.9--55.673.3138
      SiamGAT3262.774.348.853.953.063.182.375.369.861.481.3196
      ATOM3355.663.440.249.348.260.380.970.364.858.979.654
      DiMP503461.171.749.256.556.964.085.574.068.759.678.657
      TansT1172.382.468.264.268.267.287.781.480.361.979.884
      E.T.Track3556.864.942.859.060.467.087.375.070.662.780.964
      LightTrack-Mobile3658.267.144.253.853.766.186.072.569.546.960.971
      MixFormerV2-S3761.370.551.860.660.463.784.475.870.465.186.6240
      HCAT3865.376.85759.160.768.1-76.672.963.6-113
      HiT-Tity3952.659.342.754.852.954.370.774.668.858.776.673
      SeqTrack1567.376.660.764.369.166.288.479.376.665.385.987
      KTransT-T72.482.868.967.372.067.991.079.886.466.285.791
      KTransT69.879.464.264.167.964.386.077.380.161.881.3158
    Tools

    Get Citation

    Copy Citation Text

    Na LI, Mengqiao LIU, Jinting PAN, Kai HUANG, Xingxuan JIA. A Transformer-based visual tracker via knowledge distillation[J]. Optics and Precision Engineering, 2025, 33(4): 653

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Sep. 24, 2024

    Accepted: --

    Published Online: May. 20, 2025

    The Author Email: Na LI (lina114@xupt.edu.cn)

    DOI:10.37188/OPE.20253304.0653

    Topics