Acta Optica Sinica (Online), Volume. 2, Issue 13, 1305001(2025)
A Survey of Monocular Depth Estimation Methods Based on Deep Learning
Monocular depth estimation is a critical task that infers scene depth information from a single image. It is widely applied in fields such as autonomous driving, medical imaging, and defense. Deep learning methods have significantly enhanced the representational capacity and prediction accuracy of these models, particularly excelling at handling complex scenes, multi-scale features, and dynamic objects, which are challenging for traditional methods. This paper systematically reviews monocular depth estimation methods based on deep learning, beginning with an introduction to the fundamental technical process of monocular depth estimation. Then, based on the type of supervision, deep learning methods for monocular depth estimation are categorized into three groups: supervised learning methods are reviewed in terms of network structure, auxiliary information, loss functions, and depth discretization; unsupervised methods are summarized based on cues such as image pairs, masking, visual odometry, auxiliary information, and generative adversarial networks; semi-supervised methods are explored with respect to image pairs, semantic information, and generative adversarial networks. Subsequently, the paper outlines the main monocular depth estimation datasets and commonly used evaluation metrics, and lists the quantitative evaluation results of several methods on these datasets. Finally, the paper discusses application examples of monocular depth estimation based on deep learning, and highlights the major challenges and potential development directions for the future.
Get Citation
Copy Citation Text
Yiquan Wu, Haobo Xie. A Survey of Monocular Depth Estimation Methods Based on Deep Learning[J]. Acta Optica Sinica (Online), 2025, 2(13): 1305001
Category: Optical Information Acquisition, Display, and Processing
Received: Apr. 25, 2025
Accepted: May. 15, 2025
Published Online: Jul. 7, 2025
The Author Email: Yiquan Wu (nuaaimage@163.com)
CSTR:32394.14.AOSOL250454