Electronics Optics & Control, Volume. 31, Issue 3, 1(2024)
A Review of UAV Aerial Photography Target Detection and Tracking Methods Based on Deep Learning
[2] [2] CIAPARRONE G,SNCHEZ F L,TABIK S,et al.Deep learning in video multi-object tracking:a survey[J].Neurocomputing,2020,381:61-88.
[3] [3] PRICE E,LAWLESS G,LUDWIG R,et al.Deep neural network-based cooperative visual tracking through multiple micro aerial vehicles[J].IEEE Robotics and Automation Letters,2018,3(4):3193-3200.
[4] [4] GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition.Columbus:IEEE,2014:580-587.
[5] [5] GIRSHICK R.Fast R-CNN[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision.Washington:IEEE,2015:1440-1448.
[6] [6] REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:unified,real-time object detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE,2016:779-788.
[7] [7] REDMON J,FARHADI A.YOLO9000:better,faster,stronger[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE,2017:6517-6525.
[8] [8] REDMON J,FARHADI A.YOLOv3:an incremental improvement[R].Los Alamos:arXiv Preprint,2018:arXiv:1804.02767.
[9] [9] WANG C Y,BOCHKOVSKIY A,LIAO H Y M.Scaled-YOLOv4:scaling cross stage partial network[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Nashville:IEEE,2021:13029-13038.
[10] [10] LI C Y,LI L,JIANG H L,et al.YOLOv6:a single-stage object detection framework for industrial applications[R].Los Alamos:arXiv Preprint,2022:arXiv:2209.02976.
[11] [11] WANG C Y,BOCHKOVSKIY A,LIAO H Y M.YOLOv7:trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Vancouver:IEEE,2023:7464-7475.
[12] [12] LIU W,ANGUELOV D,ERHAN D,et al.SSD:single shot multibox detector[C]//European Conference on Computer Vision.Cham:Springer,2016:21-37.
[13] [13] YE M S,XU S J,CAO T Y.HVNet:hybrid voxel network for lidar based 3D object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE,2020:1631-1640.
[14] [14] YANG Z T,SUN Y N,LIU S,et al.3DSSD:point-based 3D single stage object detector[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE,2020:11040-11048.
[15] [15] ZHENG W,TANG W L,LI J,et al.SE-SSD:self-ensembling single-stage object detector from point cloud[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE,2021:14494-14503.
[16] [16] BOLME D S,BEVERIDGE J R,DRAPER B A,et al.Visual object tracking using adaptive correlation filters[C]//2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.San Francisco:IEEE,2010:2544-2550.
[17] [17] HENRIQUES J F,CASEIRO R,MARTINS P,et al.Exploiting the circulant structure of tracking-by-detection with kernels[C]//The 12th European Conference on Computer Vision.Berlin:Springer Berlin Heidelberg, 2012:702-715.
[18] [18] HENRIQUES J F,CASEIRO R,MARTINS P,et al.High-speed tracking with kernelized correlation filters[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2014,37(3):583-596.
[19] [19] HUANG Z Y,FU C H,LI Y M,et al.Learning aberrance repressed correlation filters for real-time UAV tracking[C]//IEEE International Conference on Computer Vision.Piscataway:IEEE,2019:2891-2900.
[20] [20] FU C H,XU J T,LIN F L,et al.Object saliency-aware dual regularized correlation filter for real-time aerial tracking[J].IEEE Transactions on Geoscience and Remote Sensing,2020,58(12):8940-8951.
[21] [21] LI Y M,FU C H,DING F Q,et al.AutoTrack:towards high-performance visual tracking for UAV with automatic spatio-temporal regularization[C]//IEEE Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE,2020:11920-11929.
[22] [22] HONG S H,YOU T G,KWAK S,et al.Online tracking by learning discriminative saliency map with convolutional neural network[C]//International Conference on Machine Learning.New York:PMLR,2015:597-606.
[23] [23] NAM H,HAN B.Learning multi-domain convolutional neural networks for visual tracking[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE,2016:4293-4302.
[24] [24] QI Y K,QIN L,ZHANG S P,et al.Robust visual tracking via scale-and-state-awareness[J].Neurocomputing,2018, 329:75-85.
[25] [25] TAO R,GAVVES E,SMEULDERS A W M.Siamese instance search for tracking[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE,2016:1420-1429.
[26] [26] BERTINETTO L,VALMADRE J,HENRIQUES J F,et al.Fully-convolutional Siamese networks for object tracking[C]//Computer Vision-ECCV 2016 Workshops:Amsterdam.Cham:Springer International Publishing,2016:850-865.
[27] [27] LI B,YAN J J,WU W,et al.High performance visual tracking with Siamese region proposal network[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake City:IEEE,2018:8971-8980.
[30] [30] ZHAO J H,XIAO G,ZHANG X C,et al.A survey on object tracking in aerial surveillance[C]//Proceedings of International Conference on Aerospace System Science and Engineering 2018.Singapore:Springer,2018:53-68.
[31] [31] KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenet classification with deep convolutional neural networks[J].Communications of the ACM,2017,60(6):84-90.
[32] [32] UIJLINGS J R R,VAN DE SANDE K E A,GEVERS T,et al.Selective search for object recognition[J].International Journal of Computer Vision,2013,104:154-171.
[33] [33] ZHANG H K,CHANG H,MA B P,et al.Dynamic R-CNN:towards high quality object detection via dynamic training[C]//Computer Vision-ECCV 2020:16th European Conference.Berlin:Springer-Verlag,2020:260-275.
[34] [34] SHI S S,GUO C X,LI J,et al.PV-RCNN:point-voxel feature set abstraction for 3D object detection[C]//Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE,2020:10529-10538.
[35] [35] YIN T W,ZHOU X Y,KRAHENBUHL P.Center-based 3D object detection and tracking[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Nashville:IEEE,2021:11784-11793.
[38] [38] ZHANG Z P,PENG H W.Deeper and wider Siamese networks for real-time visual tracking[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach:IEEE,2019:4591-4600.
[39] [39] LI B,WU W,WANG Q,et al.SiamRPN++:evolution of Siamese visual tracking with very deep networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach:IEEE, 2019:4282-4291.
Get Citation
Copy Citation Text
OUYANG Quan, ZHANG Yi, MA Yan, XUE Yali, WANG Zhisheng. A Review of UAV Aerial Photography Target Detection and Tracking Methods Based on Deep Learning[J]. Electronics Optics & Control, 2024, 31(3): 1
Category:
Received: Jun. 5, 2023
Accepted: --
Published Online: Jul. 29, 2024
The Author Email: