Acta Optica Sinica, Volume. 42, Issue 19, 1911001(2022)

Monocular Indoor Depth Estimation Method Based on Neural Networks with Constraints on Two-Dimensional Images and Three-Dimensional Geometry

Hao Sha, Yue Liu*, Yongtian Wang, Chenguang Lu, and Mengze Zhao
Author Affiliations
  • Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China
  • show less
    Figures & Tables(14)
    Principle of calculating the normal of nearest neighbor point sampling method
    Feature connection module based on depth channel attention mechanism
    Overall architecture of monocular depth estimation method
    Architecture of encoder and decoder sub-networks. (a) Sub-network structure of encoder; (b)-(d) subnetwork structures of decoder
    Depth prediction results of different methods on NYU Depth v2 dataset
    3D reconstruction results based on monocular depth
    Qualitative results of ablation experiments based on network architecture
    Qualitative results of ablation experiments based on constraints
    Quantitative results of test set in range of different depth values
    Quantitative results of selected images in range of different depth values. (a) 10 images with worst RMSE; (b) 10 images with worst REL; (c) 10 images with worst TH1
    • Table 1. Quantitative comparison between proposed method and other different methods on NYU Depth v2 dataset

      View table

      Table 1. Quantitative comparison between proposed method and other different methods on NYU Depth v2 dataset

      MethodRMSERELδ<1.25δ<1.252δ<1.253
      Ref. [180.9070.2150.6110.8870.971
      Ref. [370.8240.2300.6140.8830.971
      Ref. [380.6200.1490.8060.8830.987
      Ref. [390.6350.1430.7880.9580.991
      Ref. [400.8190.2320.6460.8920.968
      Ref. [240.6410.1580.7690.9500.988
      Ref. [190.5730.1270.8110.9530.988
      Ref. [220.5860.1210.8110.9540.987
      Ref. [260.6000.1440.7910.9600.991
      Ref. [410.5720.1390.8150.9630.991
      Ref. [270.5990.1590.7720.9420.984
      Ref. [370.5550.1260.8430.9680.991
      This paper0.5520.1640.7680.9400.984
    • Table 2. Comparison of running speeds of different methods

      View table

      Table 2. Comparison of running speeds of different methods

      MethodRuning time /msFrame rate /(frame·s-1RMSE
      Ref. [1823430.907
      Ref. [19237100.604
      Ref. [249660.753
      This paper58170.552
    • Table 3. Quantitative results of ablation experiments based on network architecture

      View table

      Table 3. Quantitative results of ablation experiments based on network architecture

      MethodRMSERELδ<1.25δ<1.252δ<1.253
      Without skip connect0.7270.2220.6310.8850.969
      Without SE_Concat_Block0.6040.1770.7310.9220.976
      Baseline0.5860.1780.7380.9320.982
      U-net0.6470.2020.6810.9150.978
      Resnet-1010.6280.1890.7040.9210.981
    • Table 4. Quantitative results of ablation experiments based on constraints

      View table

      Table 4. Quantitative results of ablation experiments based on constraints

      MethodRMSERELδ<1.25δ<1.252δ<1.253
      Baseline0.5940.1770.7400.9260.980
      With L2D0.5860.1780.7380.9320.982
      With L2D and LG0.5610.1650.7610.9350.983
      With L2D,LG, and LL0.5520.1640.7680.9400.984
    Tools

    Get Citation

    Copy Citation Text

    Hao Sha, Yue Liu, Yongtian Wang, Chenguang Lu, Mengze Zhao. Monocular Indoor Depth Estimation Method Based on Neural Networks with Constraints on Two-Dimensional Images and Three-Dimensional Geometry[J]. Acta Optica Sinica, 2022, 42(19): 1911001

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Imaging Systems

    Received: Nov. 22, 2021

    Accepted: Dec. 24, 2021

    Published Online: Oct. 18, 2022

    The Author Email: Liu Yue (liuyue@bit.edu.cn)

    DOI:10.3788/AOS202242.1911001

    Topics