Laser & Optoelectronics Progress, Volume. 58, Issue 20, 2010006(2021)

Visual Odometry Based on Improved Dual-Stream Network Structure

Haidong Zhang, Yiming Xu*, Li Wang, Chunlei Bian, and Fangjie Zhou
Author Affiliations
  • School of Electrical Engineering, Nantong University, Nantong, Jiangsu 226019, China
  • show less

    Because conventional visual odometry (VO) has cumbersome implementation process and complex calculation problems, a VO based on an improved dual-stream network structure is proposed. The proposed VO uses a dual-stream convolutional neural network structure that can simultaneously feed RGB and depth images into the model for training, use the Inception network structure to improve the convolutional layer, and reduce the number of parameters in the convolutional layer. Simultaneously, an attention mechanism is introduced to the convolutional layer to enhance the network’s recognition of image features and the system’s robustness. After being trained and tested on the KITTI dataset, the proposed improved model is compared with the VISO2-M, VISO2-S, and SfMLearner. The results show that the proposed model’s rotation and translation errors are significantly reduced compared with VISO2-M and SfMLearner when using monocular cameras and comparable to VISO2-S when using binocular cameras.

    Tools

    Get Citation

    Copy Citation Text

    Haidong Zhang, Yiming Xu, Li Wang, Chunlei Bian, Fangjie Zhou. Visual Odometry Based on Improved Dual-Stream Network Structure[J]. Laser & Optoelectronics Progress, 2021, 58(20): 2010006

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Nov. 11, 2020

    Accepted: Jan. 2, 2021

    Published Online: Oct. 12, 2021

    The Author Email: Xu Yiming (yimingx@ntu.edu.cn)

    DOI:10.3788/LOP202158.2010006

    Topics