Monocular Indoor Depth Estimation Method Based on Neural Networks with Constraints on Two-Dimensional Images and Three-Dimensional Geometry

Hao Sha; Yue Liu; Yongtian Wang; Chenguang Lu; Mengze Zhao

doi:10.3788/AOS202242.1911001

Acta Optica Sinica, Volume. 42, Issue 19, 1911001(2022)

Monocular Indoor Depth Estimation Method Based on Neural Networks with Constraints on Two-Dimensional Images and Three-Dimensional Geometry

Hao Sha, Yue Liu^*, Yongtian Wang, Chenguang Lu, and Mengze Zhao

Author Affiliations

Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China

show less

Abstract Get PDF(in Chinese)

This paper proposes a deep convolutional neural network with an encoder-to-decoder structure and constrains the network's in-depth learning from the monocular image at both two-dimensional (2D) and three-dimensional (3D) levels. At the 2D image level, an attention mechanism of channels is introduced to connect encoder features with decoder features with weights at the same scale, so as to balance the shallow detail features and deep semantic features extracted by the network. In addition, a scale-invariant loss and a multi-scale edge loss based on image pyramids are designed to obtain a depth map with rich edge detail information. At the 3D geometric level, a global geometric constraint loss and a local geometric constraint loss of depth are designed based on the local and global geometric relationships of coordinate points in space, in a bid to enhance the geometric consistency between point clouds. Furthermore, the results obtained through the proposed method are quantitatively and qualitatively compared with that obtained through other methods from the NYU Depth-v2 dataset, and it is shown that the proposed method can estimate indoor scene depth with higher accuracy and detail representation, obtaining accurate and smooth 3D reconstruction results on a single image.

Keywords

convolutional neural network depth estimation geometric constraint imaging systems monocular three-dimensional reconstruction

Tools

Get Citation

Copy Citation Text

Hao Sha, Yue Liu, Yongtian Wang, Chenguang Lu, Mengze Zhao. Monocular Indoor Depth Estimation Method Based on Neural Networks with Constraints on Two-Dimensional Images and Three-Dimensional Geometry[J]. Acta Optica Sinica, 2022, 42(19): 1911001

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category: Imaging Systems

Received: Nov. 22, 2021

Accepted: Dec. 24, 2021

Published Online: Oct. 18, 2022

The Author Email: Liu Yue (liuyue@bit.edu.cn)

DOI:10.3788/AOS202242.1911001

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology