Attention and cross-scale fusion for vehicle and pedestrian detection

Due to the complex environment of the target in road traffic, there exist the problems of the insufficient extraction of key features by the model and the low accuracy of target positioning. The SSD model is used as the basic framework in this paper, and research is conducted on feature extraction methods, key information enhancement, and non-local feature positioning. Firstly, in order to solve the multi-scale problem of targets in road traffic scenarios, a jumping reverse feature pyramid structure is proposed to generate more discriminant features. Secondly, in order to solve the problem that information at different semantic levels has different degrees of contribution to the feature fusion process, an adaptive feature fusion module based on attention mechanism is designed to enhance the key feature expression ability non-priori at the channel level. Finally, the cross-attention module is introduced to improve the position sensitivity of the model to the target. Experimental results indicate that compared with the original model of SSD, in guarantee under the condition of real-time, the average accuracy of the proposed algorithm is improved by 2.6% on PASCAL VOC sub-dataset and 3.9% on homemade road traffic dataset. Taking everything into account, the improved algorithm can be applied widely to the task of detecting vehicles and pedestrians on the road.

Keywords

attention mechanism deep learning feature fusion feature pyramid vehicle and pedestrian detection

Tools

Get Citation

Copy Citation Text

Jian-dong LI, Jia-qi LI, Hai-cheng QU. Attention and cross-scale fusion for vehicle and pedestrian detection[J]. Chinese Journal of Liquid Crystals and Displays, 2023, 38(12): 1707

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category: Research Articles

Received: Feb. 6, 2023

Accepted: --

Published Online: Mar. 7, 2024

The Author Email: Jia-qi LI (dor_emma@163.com)

DOI:10.37188/CJLCD.2023-0037

Topics