Laser & Optoelectronics Progress, Volume. 60, Issue 6, 0610014(2023)
Multitarget Real-Time Tracking Algorithm Based on Transformer and BYTE Data
Hao Pan1, Xiang Liu1、*, Jingwen Zhao1, and Xing Zhang2,3
Author Affiliations
1School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China2School of Management, Shanghai University of Engineering Science, Shanghai 201620, China3Automotive Engineering Research Institute, Jiangsu University, Zhenjiang 212013, Jiangsu , Chinashow less
To solve the problems of trajectory missed detection, misdetection, and identity switching in complex multitarget tracking, this paper proposes a multitarget tracking algorithm based on improved YOLOX and BYTE data association methods. First, to enhance YOLOX's target detection capabilities in complex environments, we combine the YOLOX backbone network and Vision Transformer to improve the network's local feature extraction capability and add the loss function to further improve the regression accuracy of the network bounding box. Second, to meet the real-time requirements of the algorithm, we employ the BYTE data association method, abandon the traditional feature rerecognition (Re-ID) network, and further improving the speed of the proposed multitarget tracking algorithm. Finally, to mitigate the tracking problems in complex environments, such as illumination and occlusion, we adopt the extended Kalman filter, which is more adaptive to the nonlinear system, to improve the prediction accuracy of the network for tracking trajectory in complex scenes. The experimental results show that the multiple object tracking accuracy (MOTA) and identity F1-measure (IDF1) of the proposed algorithm on the MOT17 dataset are 73.0% and 70.2%, respectively, compared with the current optimal algorithm ByteTrack, they are improved by 1.3 percentage points and 2.1 percentage points, respectively, whereas number of identity switches (IDSW) is reduced by 3.7%. Meanwhile, the proposed algorithm achieves a tracking speed of 51.2 frames/s, which meets the real-time requirements of the system.