Acta Optica Sinica (Online), Volume. 2, Issue 13, 1305001(2025)

A Survey of Monocular Depth Estimation Methods Based on Deep Learning

Yiquan Wu* and Haobo Xie
Author Affiliations
  • College of Electronic and Information Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, Jiangsu , China
  • show less

    Monocular depth estimation is a critical task that infers scene depth information from a single image. It is widely applied in fields such as autonomous driving, medical imaging, and defense. Deep learning methods have significantly enhanced the representational capacity and prediction accuracy of these models, particularly excelling at handling complex scenes, multi-scale features, and dynamic objects, which are challenging for traditional methods. This paper systematically reviews monocular depth estimation methods based on deep learning, beginning with an introduction to the fundamental technical process of monocular depth estimation. Then, based on the type of supervision, deep learning methods for monocular depth estimation are categorized into three groups: supervised learning methods are reviewed in terms of network structure, auxiliary information, loss functions, and depth discretization; unsupervised methods are summarized based on cues such as image pairs, masking, visual odometry, auxiliary information, and generative adversarial networks; semi-supervised methods are explored with respect to image pairs, semantic information, and generative adversarial networks. Subsequently, the paper outlines the main monocular depth estimation datasets and commonly used evaluation metrics, and lists the quantitative evaluation results of several methods on these datasets. Finally, the paper discusses application examples of monocular depth estimation based on deep learning, and highlights the major challenges and potential development directions for the future.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Yiquan Wu, Haobo Xie. A Survey of Monocular Depth Estimation Methods Based on Deep Learning[J]. Acta Optica Sinica (Online), 2025, 2(13): 1305001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Optical Information Acquisition, Display, and Processing

    Received: Apr. 25, 2025

    Accepted: May. 15, 2025

    Published Online: Jul. 7, 2025

    The Author Email: Yiquan Wu (nuaaimage@163.com)

    DOI:10.3788/AOSOL250454

    CSTR:32394.14.AOSOL250454

    Topics