Depth Estimation from Monocular Infrared Video Based on Bi-Recursive Convolutional Neural Network

Shouchuan Wu; Haitao Zhao; Shaoyuan Sun

doi:10.3788/AOS201737.1215003

Acta Optica Sinica, Vol. 37, Issue 12, 1215003 (2017)


Shouchuan Wu¹, Haitao Zhao¹,*, and Shaoyuan Sun²

¹ School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China

² School of Information Science and Technology, Donghua University, Shanghai 201620, China


Abstract

A method based on a bi-recursive convolutional neural network (BrCNN) is proposed for depth estimation from monocular infrared video, taking into account both the uniqueness of each frame and the continuity of the video as a whole. BrCNN adds the sequence information transfer mechanism of a recurrent neural network (RNN) on top of the single-frame features extracted by a convolutional neural network (CNN). BrCNN therefore combines two abilities: the feature extraction ability of a CNN, which automatically extracts the local features of each frame in the infrared video, and the sequence modeling ability of an RNN, which automatically extracts the sequence information contained in each frame and transfers it recursively. Because the bi-recursive sequence information transfer mechanism is used to estimate the depth of monocular infrared video, the features extracted from each frame contain context information. The experimental results show that BrCNN extracts more expressive features and estimates depth from infrared video more precisely than a traditional CNN, which estimates depth from the features of a single frame.
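The idea of passing sequence information across per-frame features can be illustrated with a minimal sketch. It is not the authors' architecture, whose layers and parameters are not given in the abstract. It assumes that "bi-recursive" means information flows both forward and backward in time, that each frame has already been reduced to a feature vector (a stand-in for CNN output), and that a simple elementwise tanh recurrence with scalar weights replaces the learned RNN. The names `recurrent_pass`, `bidirectional_features`, `w_in`, and `w_rec` are hypothetical.

```python
import math
from typing import List

Vector = List[float]


def recurrent_pass(frames: List[Vector], w_in: float, w_rec: float) -> List[Vector]:
    """Run an elementwise tanh recurrence over the frames in the given order.

    Scalar weights are an assumption made for brevity; a trained RNN
    would use learned weight matrices instead.
    """
    if not frames:
        return []
    hidden = [0.0] * len(frames[0])  # initial state carries no context
    states = []
    for x in frames:
        # New state mixes the current frame feature with the previous state.
        hidden = [math.tanh(w_in * xi + w_rec * hi) for xi, hi in zip(x, hidden)]
        states.append(hidden)
    return states


def bidirectional_features(
    frames: List[Vector], w_in: float = 1.0, w_rec: float = 0.5
) -> List[Vector]:
    """Concatenate forward-in-time and backward-in-time states per frame."""
    forward = recurrent_pass(frames, w_in, w_rec)
    # Process the reversed sequence, then restore the original frame order.
    backward = recurrent_pass(frames[::-1], w_in, w_rec)[::-1]
    return [f + b for f, b in zip(forward, backward)]


# Hypothetical per-frame features for a three-frame clip.
frame_features = [[0.1, 0.2], [0.3, 0.1], [0.0, 0.4]]
context_features = bidirectional_features(frame_features)
```

In this sketch each output vector is twice as long as the input: the first half depends on the current and earlier frames, the second half on the current and later frames. A depth regressor applied to `context_features` would therefore see context from the whole clip rather than from a single frame.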

Keywords

bi-recursive convolution; deep neural network; depth estimation; machine vision; monocular infrared video

Citation

Shouchuan Wu, Haitao Zhao, Shaoyuan Sun. Depth Estimation from Monocular Infrared Video Based on Bi-Recursive Convolutional Neural Network[J]. Acta Optica Sinica, 2017, 37(12): 1215003


Paper Information

Category: Machine Vision

Received: Jun. 21, 2017

Accepted: --

Published Online: Sep. 6, 2018

The Author Email: Zhao Haitao (haitaozhao@ecust.edu.cn)

