Laser & Optoelectronics Progress, Volume. 60, Issue 16, 1612002(2023)

Multiscale Monocular Three-Dimensional Object Detection Algorithm Incorporating Instance Depth

Fengsui Wang1,2,3、*, Lei Xiong1,2,3, and Yaping Qian1,2,3
Author Affiliations
  • 1School of Electrical Engineering, Anhui Polytechnic University, Wuhu 241000, Anhui, China
  • 2Anhui Key Laboratory of Detection Technology and Energy Saving Devices, Wuhu 241000, Anhui, China
  • 3Key Laboratory of Advanced Perception and Intelligent Control of High-End Equipment, Ministry of Education, Wuhu 241000, Anhui, China
  • show less
    References(36)

    [1] Hu J, Liu H, Xu W C et al. Position detection algorithm of road obstacles based on 3D LiDAR[J]. Chinese Journal of Lasers, 48, 2410001(2021).

    [2] Zhao L, Hu J, Liu H et al. Deep learning based on semantic segmentation for three-dimensional object detection from point clouds[J]. Chinese Journal of Lasers, 48, 1710004(2021).

    [3] Gong W, Shi S, Chen B W et al. Development and application of airborne hyperspectral LiDAR imaging technology[J]. Acta Optica Sinica, 42, 1200002(2022).

    [4] Ma X Z, Liu S N, Xia Z Y et al. Rethinking pseudo-LiDAR representation[M]. Vedaldi A, Bischof H, Brox T, et al. Computer vision-ECCV 2020. Lecture notes in computer science, 12358, 311-327(2020).

    [5] Wang L, Du L, Ye X Q et al. Depth-conditioned dynamic message propagation for monocular 3D object detection[C], 454-463(2021).

    [6] Chen Y J, Tai L, Sun K et al. MonoPair: monocular 3D object detection using pairwise spatial relationships[C], 12090-12099(2020).

    [7] Zhang Y P, Lu J W, Zhou J. Objects are different: flexible monocular 3D object detection[C], 3288-3297(2021).

    [8] Lu Y, Ma X Z, Yang L et al. Geometry uncertainty projection network for monocular 3D object detection[C], 3091-3101(2021).

    [9] Ma X Z, Zhang Y M, Xu D et al. Delving into localization errors for monocular 3D object detection[C], 4719-4728(2021).

    [11] Qin Z Y, Wang J L, Lu Y. MonoGRNet: a general framework for monocular 3D object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 5170-5184(2022).

    [12] Ju M R, Luo J N, Wang Z B et al. Multi-scale target detection algorithm based on attention mechanism[J]. Acta Optica Sinica, 40, 1315002(2020).

    [13] Liu F, Wu Z W, Yang A Z et al. Multi-scale feature fusion based adaptive object detection for UAV[J]. Acta Optica Sinica, 40, 1015002(2020).

    [14] Zhang P Y, Zhong Y X, Li X Q. SlimYOLOv3: narrower, faster and better for real-time UAV applications[C], 37-45(2019).

    [15] He K M, Zhang X Y, Ren S Q et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 1904-1916(2015).

    [17] Chen L C, Papandreou G, Kokkinos I et al. DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 834-848(2018).

    [18] Yu F, Wang D Q, Shelhamer E et al. Deep layer aggregation[C], 2403-2412(2018).

    [19] Geiger A, Lenz P, Stiller C et al. Vision meets robotics: the KITTI dataset[J]. The International Journal of Robotics Research, 32, 1231-1237(2013).

    [20] Geiger A, Lenz P, Urtasun R. Are we ready for autonomous driving? The KITTI vision benchmark suite[C], 3354-3361(2012).

    [21] Chen X Z, Kundu K, Zhang Z Y et al. Monocular 3D object detection for autonomous driving[C], 2147-2156(2016).

    [22] Chen X Z, Kundu K, Zhu Y K et al. 3D object proposals for accurate object class detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 1259-1272(2015).

    [23] Simonelli A, Bulò S R, Porzi L et al. Disentangling monocular 3D object detection[C], 1991-1999(2019).

    [24] Ma X Z, Wang Z H, Li H J et al. Accurate monocular 3D object detection via color-embedded 3D reconstruction for autonomous driving[C], 6850-6859(2019).

    [25] Liu H, Liu H P, Wang Y K et al. Fine-grained multilevel fusion for anti-occlusion monocular 3D object detection[J]. IEEE Transactions on Image Processing, 31, 4050-4061(2022).

    [26] Brazil G, Pons-Moll G, Liu X M et al. Kinematic 3D object detection in monocular video[M]. Vedaldi A, Bischof H, Brox T, et al. Computer vision-ECCV 2020. Lecture notes in computer science, 12368, 135-152(2020).

    [27] Reading C, Harakeh A, Chae J L et al. Categorical depth distribution network for monocular 3D object detection[C], 8551-8560(2021).

    [28] Chen H S, Huang Y Y, Tian W et al. MonoRUn: monocular 3D object detection by reconstruction and uncertainty propagation[C], 10374-10383(2021).

    [29] Zhou D F, Song X B, Fang J et al. Context-aware 3D object detection from a single image in autonomous driving[J]. IEEE Transactions on Intelligent Transportation Systems, 23, 18568-18580(2022).

    [30] Gao T Z, Pan H H, Gao H J. Monocular 3D object detection with sequential feature association and depth hint augmentation[J]. IEEE Transactions on Intelligent Vehicles, 7, 240-250(2022).

    [31] Qin Z Q, Li X. MonoGround: detecting monocular 3D objects from the ground[C], 3783-3792(2022).

    [32] Zhou Y S, He Y, Zhu H Z et al. MonoEF: extrinsic parameter free monocular 3D object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 10114-10128(2022).

    [33] Liu Y X, Yuan Y X, Liu M. Ground-aware monocular 3D object detection for autonomous driving[J]. IEEE Robotics and Automation Letters, 6, 919-926(2021).

    [34] Liu X P, Xue N, Wu T F. Learning auxiliary monocular contexts helps monocular 3D object detection[J]. Proceedings of the AAAI Conference on Artificial Intelligence, 36, 1810-1818(2022).

    [35] Fu H, Gong M M, Wang C H et al. Deep ordinal regression network for monocular depth estimation[C], 2002-2011(2018).

    [36] Zhao H S, Shi J P, Qi X J et al. Pyramid scene parsing network[C], 6230-6239(2017).

    Tools

    Get Citation

    Copy Citation Text

    Fengsui Wang, Lei Xiong, Yaping Qian. Multiscale Monocular Three-Dimensional Object Detection Algorithm Incorporating Instance Depth[J]. Laser & Optoelectronics Progress, 2023, 60(16): 1612002

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Instrumentation, Measurement and Metrology

    Received: Sep. 26, 2022

    Accepted: Oct. 24, 2022

    Published Online: Aug. 15, 2023

    The Author Email: Wang Fengsui (fswang@ahpu.edu.cn)

    DOI:10.3788/LOP222627

    Topics