Image caption of space science experiment based on multi-modal learning

Pei-zhuo LI; Xue WAN; Sheng-yang LI

doi:10.37188/OPE.2021.0244

Optics and Precision Engineering, Volume. 29, Issue 12, 2944(2021)

Image caption of space science experiment based on multi-modal learning

Pei-zhuo LI... Xue WAN* and Sheng-yang LI |Show fewer author(s)

Key Laboratory of Space Utilization， Chinese Academy of Sciences， Technology and Engineering Center for Space Utilization， Chinese Academy of Sciences， University of Chinese Academy of Sciences， Beijing100094， China

show less

Abstract Get PDF(in Chinese)

Figures & Tables(15)

Fig. 1. Framework of the algorithm

Download full size

View in Article

Fig. 2. Comparison of original U-Net and improved U-Net

Download full size

View in Article

Fig. 3. Bottom-up attention and CNN attention results

Download full size

View in Article

Fig. 4. Space science experiment picture categroy quantity chart

Download full size

View in Article

Fig. 5. Samples of space science experiment dataset

Download full size

View in Article

Fig. 6. Samples of semantic segmentation annotations of the dataset

Download full size

View in Article

Fig. 7. Growth experiment of Arabidopsis thaliana and rice in Tiangong-2

Download full size

View in Article

Fig. 8. Loss curves of Arabidopsis thaliana and rice growth experiment on Tiangong-2

Download full size

View in Article

Fig. 9. Comparisons between segmentation results of between Arabidopsis thaliana and rice growth experiment in Tiangong-2

Download full size

View in Article

Fig. 10. Vocabulary candidate extraction for Tiangong-2 Arabidopsis thaliana and rice growth experiment

Download full size

View in Article

Fig. 11. Comparison of image caption between Neuraltalk2 and this paper in space science experiment

Download full size

View in Article

Fig. 12. Comparison results of image caption between this paper and Neuraltalk2 in space droplet experiment

Download full size

View in Article

Table 1. Comparison between original U-Net and advanced U-Net
View table
View in Article
Table 1. Comparison between original U-Net and advanced U-Net
模型损失函数优化器上下采样次数激活函数网络输入
原始U-Net 带权值的交叉熵随机梯度下降 4 softmax 572×572
改进U-Net BCEDice损失函数 RMSProp 6 sigmoid 256×256

Table 2. Comparison table for evaluation of semantic segmentation algorithm in Tiangong-2 Arabidopsis thaliana and rice growth experiment

View table

View in Article

Table 2. Comparison table for evaluation of semantic segmentation algorithm in Tiangong-2 Arabidopsis thaliana and rice growth experiment

指标	Mask R-CNN		本算法
指标	水稻	拟南芥	水稻	拟南芥
$F_{M} ↑$	0.187 4	0.523 4	0.927 2	0.966 4
$F_{O} ↑$	0.216 7	0.7	1	1
$F_{D} ↓$	0.279 1	-0.186 5	-0.091 3	-0.028
$J_{M} ↑$	0.167 5	0.202	0.663 9	0.882 2
$J_{O} ↑$	0.2	0	0.883 3	1
$J_{D} ↓$	0.227 6	-0.105 7	-0.243 6	-0.075

Table 3. Evaluation results of this paper and Neuraltalk2
View table
View in Article
Table 3. Evaluation results of this paper and Neuraltalk2
Experiment Neuraltalk2 本算法
METEOR SPICE METEOR SPICE
T2 0.040 0 0.272 0.230
ZeroG-Flame 0.068 0.003 0.095 0.169
Droplet-Ping-Pong 0.133 0.071 0.11 0.138
ISS-Flame 0.11 0.088 0.228 0.319

Tools

Get Citation

Copy Citation Text

Pei-zhuo LI, Xue WAN, Sheng-yang LI. Image caption of space science experiment based on multi-modal learning[J]. Optics and Precision Engineering, 2021, 29(12): 2944

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites