Acta Photonica Sinica, Vol. 52, Issue 1, 0110002 (2023)
Object Detection Algorithm Based on Dual-modal Fusion Network
Ying SUN, Zhiqiang HOU, Chen YANG, Sugang MA, Jiulun FAN. Object Detection Algorithm Based on Dual-modal Fusion Network[J]. Acta Photonica Sinica, 2023, 52(1): 0110002
Received: Jul. 13, 2022
Accepted: Aug. 2, 2022
Published Online: Feb. 27, 2023
Author Email: Zhiqiang HOU (hou-zhq@sohu.com)