Optics and Precision Engineering, Vol. 33, Issue 4, 653 (2025)
A Transformer-based visual tracker via knowledge distillation
To achieve high-precision, real-time tracking with limited computing resources, a Transformer-based visual tracker via knowledge distillation was proposed. By introducing an image dynamic correction module, the tracker fused the search image of the current frame with an image predicted from optical flow, which effectively handled challenges such as fast motion and motion blur. To reduce model complexity, a knowledge distillation learning strategy was adopted to compress the model. By introducing homoscedastic uncertainty into the loss function, the loss weights of the different subtasks were learned by the network itself, avoiding cumbersome and difficult manual parameter tuning. Additionally, a random blurring strategy was employed during training of the student network to enhance model robustness. Two tracking frameworks of different complexities, named KTransT-T and KTransT, were proposed and compared with 12 algorithms on 5 public datasets. Experimental results show that KTransT-T has significant advantages in precision and success rate, while KTransT has lower model complexity and competitive tracking performance. KTransT runs at up to 158 frames per second, meeting the requirements of real-time tracking.
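The abstract mentions learning the loss weights of the different subtasks through homoscedastic uncertainty. As a minimal sketch of that idea, the snippet below implements the standard homoscedastic-uncertainty weighting of Kendall et al. in PyTorch; the class name, the number of subtasks, and the usage are illustrative assumptions, not the authors' actual implementation.

```python
import torch
import torch.nn as nn

class UncertaintyWeightedLoss(nn.Module):
    """Combine sub-task losses (e.g. classification and box regression)
    with learnable homoscedastic-uncertainty weights:
        L = sum_i exp(-s_i) * L_i + s_i,
    where s_i = log(sigma_i^2) is learned jointly with the network,
    so no manual tuning of per-task loss coefficients is required."""

    def __init__(self, num_tasks: int = 2):
        super().__init__()
        # one log-variance parameter per sub-task, initialised to 0 (sigma = 1)
        self.log_vars = nn.Parameter(torch.zeros(num_tasks))

    def forward(self, *task_losses: torch.Tensor) -> torch.Tensor:
        total = torch.zeros((), device=self.log_vars.device)
        for s, loss in zip(self.log_vars, task_losses):
            # exp(-s) down-weights noisier tasks; the +s term keeps sigma from growing unboundedly
            total = total + torch.exp(-s) * loss + s
        return total

# Hypothetical usage with two sub-task losses (names are placeholders):
# criterion = UncertaintyWeightedLoss(num_tasks=2)
# total_loss = criterion(cls_loss, reg_loss)
# total_loss.backward()
```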
Na LI, Mengqiao LIU, Jinting PAN, Kai HUANG, Xingxuan JIA. A Transformer-based visual tracker via knowledge distillation[J]. Optics and Precision Engineering, 2025, 33(4): 653
Received: Sep. 24, 2024
Accepted: --
Published Online: May 20, 2025
The Author Email: Na LI (lina114@xupt.edu.cn)