Equal-scale structure from motion method based on deep learning

Chen Peng; Ren Jinjin; Wang Haixia; Tang Yuesheng; Liang Ronghua

doi:10.12086/oee.2019.190006

Opto-Electronic Engineering, Volume. 46, Issue 12, 190006(2019)

Equal-scale structure from motion method based on deep learning

Chen Peng^*, Ren Jinjin, Wang Haixia, Tang Yuesheng, and Liang Ronghua

Author Affiliations

[in Chinese]

show less

Abstract Get PDF(in Chinese)

Two problems exist in traditional multi-view geometry method to obtain the three-dimensional structure of the scene. First, the mismatching of the feature points caused by the blurred image and low texture, which reduces the accuracy of reconstruction; second, as the information obtained by monocular camera is lack of scale, the reconstruction results can only determine the unknown scale factor, and cannot get accurate scene structure. This paper proposes a method of equal-scale motion restoration structure based on deep learning. First, the convolutional neural network is used to obtain the depth information of the image; then, to restore the scale information of the monocular camera, an inertial measurement unit (IMU) is introduced, and the acceleration and angular velocity acquired by the IMU and the camera position acquired by the ORB-SLAM2 are demonstrated. The pose is coordinated in both time domain and frequency domain, and the scale information from the monocular camera is acquired in the frequency domain; finally, the depth information of the image and the camera pose with the scale factor are merged to reconstruct the three-dimensional structure of the scene. Experiments show that the monocular image depth map obtained by the Depth CNN network solves the problem that the output image of the multi-level convolution pooling operation has low resolution and lacks important feature information, and the absolute value error reaches 0.192, and the accuracy rate is up to 0.959. The multi-sensor fusion method can achieve a scale error of 0.24 m in the frequency domain, which is more accurate than that of the VIORB method in the frequency domain. The error between the reconstructed 3D model and the real size is about 0.2 m, which verifies the effectiveness of the proposed method.

Keywords

3D reconstruction deep learning IMU monocular camera scale factor

Tools

Get Citation

Copy Citation Text

Chen Peng, Ren Jinjin, Wang Haixia, Tang Yuesheng, Liang Ronghua. Equal-scale structure from motion method based on deep learning[J]. Opto-Electronic Engineering, 2019, 46(12): 190006

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category: Article

Received: Jan. 8, 2019

Accepted: --

Published Online: Jan. 9, 2020

The Author Email: Chen Peng (chenpeng@zjut.edu.cn)

DOI:10.12086/oee.2019.190006

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology