Laser & Optoelectronics Progress, Volume. 61, Issue 10, 1037004(2024)

Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention

Guoli Zhang1,2, Shuai Chang1,2、*, Yansong Song1,2, and Tianci Liu1,2
Author Affiliations
  • 1College of Opto-Electronic Engineering, Changchun University of Science and Technology, Changchun 130022, Jilin, China
  • 2Institute of Space Photoelectric Technology, Changchun University of Science and Technology, Changchun 130022, Jilin, China
  • show less
    References(26)

    [1] Wang Z Y, Huang B J, Wang G C et al. Masked face recognition dataset and application[J]. IEEE Transactions on Biometrics, Behavior, and Identity Science, 5, 298-304(2023).

    [2] Karami Z, Kashef R. Smart transportation planning: data, models, and algorithms[J]. Transportation Engineering, 2, 100013(2020).

    [3] Yuan P, Tan Z Y, Zhang X et al. Research on infrared imaging detection and differential spectrum filtering detection methods for industrial gas leakage[J]. Infrared and Laser Engineering, 51, 20210714(2022).

    [4] Li Q, Xue J P, Zhang Q C et al. Three dimensional shape measurement of high reflective elements using camera response curve[J]. Acta Optica Sinica, 42, 0712001(2022).

    [5] Zhao B, Wang C P, Fu Q et al. Multi-scale infrared pedestrian detection based on deep attention mechanism[J]. Acta Optica Sinica, 40, 0504001(2020).

    [6] Zou Z Y, Gai S Y, Da F P et al. Occluded pedestrian detection algorithm based on attention mechanism[J]. Acta Optica Sinica, 41, 1515001(2021).

    [7] Hwang S, Park J, Kim N et al. Multispectral pedestrian detection: benchmark dataset and baseline[C], 1037-1045(2015).

    [8] Dollár P, Appel R, Belongie S et al. Fast feature pyramids for object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36, 1532-1545(2014).

    [9] Wagner J, Fischer V, Herman M et al. Multispectral pedestrian detection using deep fusion convolutional neural networks[C], 509-514(2016).

    [10] Girshick R, Donahue J, Darrell T et al. Region-based convolutional networks for accurate object detection and segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38, 142-158(2016).

    [12] Li C Y, Song D, Tong R F et al. Illumination-aware faster R-CNN for robust multispectral pedestrian detection[J]. Pattern Recognition, 85, 161-171(2019).

    [13] Redmon J, Divvala S, Girshick R et al. You only look once: unified, real-time object detection[C], 779-788(2016).

    [14] Hsia C H, Peng H C, Chan H T. All-weather pedestrian detection based on double-stream multispectral network[J]. Electronics, 12, 2312(2023).

    [16] Wang W H, Xie E Z, Song X G et al. Efficient and accurate arbitrary-shaped text detection with pixel aggregation network[C], 8439-8448(2020).

    [17] Dai J F, Qi H Z, Xiong Y W et al. Deformable convolutional networks[C], 764-773(2017).

    [19] Lin T Y, Dollár P, Girshick R et al. Feature pyramid networks for object detection[C], 936-944(2017).

    [20] Liu S, Qi L, Qin H F et al. Path aggregation network for instance segmentation[J]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 8759-8768(2018).

    [21] Zhang L, Liu Z Y, Zhang S F et al. Cross-modality interactive attention network for multispectral pedestrian detection[J]. Information Fusion, 50, 20-29(2019).

    [22] Zhou K L, Chen L S, Cao X, Vedaldi A, Bischof H, Brox T et al. Improving multispectral pedestrian detection by addressing modality imbalance problems[M]. Computer vision-ECCV 2020. Lecture notes in computer science, 12363, 787-803(2020).

    [23] Liu W, Anguelov D, Erhan D, Leibe B, Matas J, Sebe N et al. SSD: single shot MultiBox detector[M]. Computer vision-ECCV 2016. Lecture notes in computer science, 9905, 21-37(2016).

    [24] Ren S, He K, Girshick R et al. Faster R-CNN: toward real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017).

    [26] Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C], 7464-7475(2023).

    Tools

    Get Citation

    Copy Citation Text

    Guoli Zhang, Shuai Chang, Yansong Song, Tianci Liu. Multi-spectral Pedestrian Detection Based on Deformable Convolution and Multi-Scale Residual Attention[J]. Laser & Optoelectronics Progress, 2024, 61(10): 1037004

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Digital Image Processing

    Received: Sep. 15, 2023

    Accepted: Oct. 20, 2023

    Published Online: Mar. 20, 2024

    The Author Email: Chang Shuai (changshuai@cust.edu.cn)

    DOI:10.3788/LOP232131

    Topics