Electronics Optics & Control, Volume. 31, Issue 2, 52(2024)
A Lightweight Object Detection Algorithm Based on Dynamic Transformer
[1] [1] REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:unifiedreal-time object detection[C]//Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE,2016:779-788.
[4] [4] XIE S N,GIRSHICK R,DOLLR P,et al.Aggregated residual transformations for deep neural networks[C]//Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE,2017:5987-5995.
[5] [5] LUO W J,LI Y J,URTASUN R,et al.Understanding the effective receptive field in deep convolutional neural networks[C]//The 30th International Conference on Neural Information Processing Systems.New York:Curran Associates Inc.,2016:4905-4913.
[6] [6] MINAEE S,KALCHBRENNER N,CAMBRIA E,et al.Deep learning-based text classification:a comprehensive review[J].ACM Computing Surveys,2021,54(3):1-40.
[7] [7] VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[C]//The 31th Conference on Neural Information Processing Systems.New York:Curran Associates Inc.,2017:6000-6010.
[8] [8] SRINIVAS A,LIN T Y,PARMAR N,et al.Bottleneck transformers for visual recognition[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition.Nashville:IEEE,2021:16514-16524.
[9] [9] ZHOU S H,NIE D,ADELI E,et al.Highresolution encoderdecoder networks for lowcontrast medical image segmentation[J].IEEE Transactions on Image Processing,2020,29:461-475.
[10] [10] DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al.An image is worth 16×16 words:transformers for image recognition at scale[C]//The 9th International Conference on Learning Representations.Vienna:IEEE,2021:11929.
[11] [11] REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[12] [12] SELVARAJU R R,COGSWELL M,DAS A,et al.GradCAM:visual explanations from deep networks via gradientbased localization[C]//International Conference on Computer Vision.Venice:IEEE,2017:618-626.
[13] [13] LIN T Y,MAIRE M,BELONGIE S,et al.Microsoft COCO:common objects in context[C]//European Conference on Computer Vision.Zurich:[s.n.],2014:740-755.
[14] [14] YUAN L,CHEN Y P,WANG T,et al.TokenstoToken ViT:training vision transformers from scratch on ImageNet[C]//IEEE/CVF International Conference on Computer Vision.Montreal:IEEE,2021:538-547.
[15] [15] WANG W H,XIE E Z,LI X,et al.Pyramid vision transformer:a versatile backbone for dense prediction without convolutions[C]//IEEE/CVF International Conference on Computer Vision.Montreal:IEEE-2021:548-558.
[16] [16] HAN K,XIAO A,WU E H,et al.Transformer in transformer[C]//The 35th Conference on Neural Information Processing Systems.[S.l.]:[s.n.],2021:15908-15919.
[17] [17] YUAN K,GUO S P,LIU Z W,et al.Incorporating convolution designs into visual transformers[C]//IEEE/CVF International Conference on Computer Vision.Montreal:IEEE,2021:559-568.
[18] [18] CHU X X,TIAN Z,WANG Y Q,et al.Twins:revisiting the design of spatial attention in vision transformers[C]//The 35th International Conference on Neural Information Processing Systems.[S.l.:s.n.],2021:9355-9366.
Get Citation
Copy Citation Text
FANG Sikai, SUN Guangling, LU Xiaofeng, LIU Xuefeng. A Lightweight Object Detection Algorithm Based on Dynamic Transformer[J]. Electronics Optics & Control, 2024, 31(2): 52
Category:
Received: Mar. 12, 2023
Accepted: --
Published Online: Jul. 26, 2024
The Author Email: