Review of Deep Learning Based Object Detection Methods and Their Mainstream Frameworks

Model	Used method	Disadvantage	Improvement
R-CNN	1) Region proposal (SS);2) extraction feature(ConvNet); 3) classification(SVM); 4) regression(Candidate Bbox)	1) Complex training steps;2) training and testing areslow and take up a lot of diskspace; 3) CNN features arenot learned and updatedduring SVM and regression	1) Refresh mAP of DPMHSC from 34.3% to 66%;2) region proposal andconvolution network are used
Fast R-CNN	1) Region proposal(SS);2) extraction feature(ConvNet);3) classification(softmax);4) Bbox regression(multi-task loss function)	1) RP is still extracted withSS (consuming time of 2-3 s);2) difficult to meetreal-time requirements;3) GPU is utilized,but the region proposalmethod is implemented on CPU	1) mAP is increased by 4% from 66%;2) speeds of training and testing are improved
Faster R-CNN	1) Region proposalnetwork(RPN);2) extraction feature(ConvNet);3) classification(softmax);4) Bbox regression(multi-task loss function)	1) Real-time object detectionis not realized;2) computation ofobtaining region proposaland reclassification isvery large	1) It only takes 10 ms to generate suggestion box by usingconvolution network;2) accuracy and speed of detection are improved; 3) implement end-to-end target detection framework

Tools

Get Citation

Copy Citation Text

Zhongjing Duan, Shaobo Li, Jianjun Hu, Jing Yang, Zheng Wang. Review of Deep Learning Based Object Detection Methods and Their Mainstream Frameworks[J]. Laser & Optoelectronics Progress, 2020, 57(12): 120005

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category: Reviews

Received: Nov. 11, 2019

Accepted: Dec. 6, 2019

Published Online: Jun. 3, 2020

The Author Email: Li Shaobo (lishaobo@gzu.edu.cn)

DOI:10.3788/LOP57.120005

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology