Optics and Precision Engineering, Volume. 29, Issue 12, 2944(2021)

Image caption of space science experiment based on multi-modal learning

Pei-zhuo LI... Xue WAN* and Sheng-yang LI |Show fewer author(s)
Author Affiliations
  • Key Laboratory of Space Utilization, Chinese Academy of Sciences, Technology and Engineering Center for Space Utilization, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Beijing100094, China
  • show less

    In order to enable scientists to quickly locate the key process of the experiment and obtain detailed experimental process information, it is necessary to automatically add descriptive content to space science experiments. Aiming at the problem of small target and small data sample of space science experiment, this paper proposes the image captioning of space science experiment based on multi-modal learning. It is mainly divided into four parts: semantic segmentation model based on improved U-Net, space science experimental vocabulary candidate based on semantic segmentation, general scene image feature vector extraction from bottom-up model and image caption based on multimodal learning. In addition, the dataset of space science experiment is constructed, including semantic masks and image caption annotations. Experimental results demonstrate that: compared with the state-of-the-art image caption model neuraltalk2, the accuracy evaluation of the proposed algorithm is improved by 0.089 for METEOR and 0.174 for SPICE. It solves the difficulty of small objectives and small data samples of space science experiment. It constructs a model of space science experiment image caption based on multi-modal learning, which meets the requirements of describing space science experiment professionally and accurately, and realizes the ability from low-level sense to deep scene understanding.

    Tools

    Get Citation

    Copy Citation Text

    Pei-zhuo LI, Xue WAN, Sheng-yang LI. Image caption of space science experiment based on multi-modal learning[J]. Optics and Precision Engineering, 2021, 29(12): 2944

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Information Sciences

    Received: Apr. 29, 2021

    Accepted: --

    Published Online: Jan. 20, 2022

    The Author Email: WAN Xue (wanxue@csu.ac.cn)

    DOI:10.37188/OPE.2021.0244

    Topics