A Lightweight Object Detection Algorithm Based on Dynamic Transformer

FANG Sikai, SUN Guangling, LU Xiaofeng, LIU Xuefeng. A Lightweight Object Detection Algorithm Based on Dynamic Transformer[J]. Electronics Optics & Control, 2024, 31(2): 52

Paper Information

Received: Mar. 12, 2023

Published Online: Jul. 26, 2024

DOI:10.3969/j.issn.1671-637x.2024.02.008
