Acta Optica Sinica, Volume 41, Issue 22, 2228001 (2021)
Remote Sensing Image Caption Method Based on Attention and Reinforcement Learning
[1] Yao Q L, Hu X, Lei H. Object detection in remote sensing images using multiscale convolutional neural networks[J]. Acta Optica Sinica, 39, 1128002(2019).
[2] Zhu M M, Xu Y L, Ma S P et al. Airport detection method with improved region-based convolutional neural network[J]. Acta Optica Sinica, 38, 0728001(2018).
[3] Xu Z J, Ding Y. Ship object detection of remote sensing images based on adaptive rotation region proposal network[J]. Laser & Optoelectronics Progress, 57, 242805(2020).
[4] Dong Y F, Zhang C T, Wang P et al. Airplane detection of optical remote sensing images based on deep learning[J]. Laser & Optoelectronics Progress, 57, 041007(2020).
[5] Zhang M, Wang S C, Yang D F. Air-to-ground target detection algorithm based on attention learning in key areas[J]. Laser & Optoelectronics Progress, 57, 041006(2020).
[6] Zhao J Q, Wang H Z, Zhou Y et al. Remote sensing image description generation method based on attention and multi-scale feature enhancement[J]. Computer Science, 48, 190-196(2021).
[7] Shi Z W, Zou Z X. Can a machine generate humanlike language descriptions for a remote sensing image?[J]. IEEE Transactions on Geoscience and Remote Sensing, 55, 3623-3634(2017).
[8] Qu B, Li X L, Tao D C et al. Deep semantic understanding of high resolution remote sensing image[C]∥2016 International Conference on Computer, Information and Telecommunication Systems (CITS), July 6-8, 2016, Kunming, China, 1-5(2016).
[9] Lu X Q, Wang B Q, Zheng X T et al. Exploring models and data for remote sensing image caption generation[J]. IEEE Transactions on Geoscience and Remote Sensing, 56, 2183-2195(2018).
[10] Rennie S J, Marcheret E, Mroueh Y et al. Self-critical sequence training for image captioning[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA, 1179-1195(2017).
[11] Papineni K, Roukos S, Ward T et al. BLEU: a method for automatic evaluation of machine translation[C]∥Proceedings of the 40th Annual Meeting on Association for Computational Linguistics - ACL '02, July 7-12, 2002, Philadelphia, Pennsylvania. Morristown, NJ, USA: Association for Computational Linguistics, 311-318(2002).
[13] Banerjee S. -06-12)[2021-04-18]. http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=00D9354FBD891E7E5E554DD61609BFE8?doi=10.1.1.61.2290&rep=rep1&type=pdf.(2005).
[14] Vedantam R, Zitnick C L, Parikh D. CIDEr: consensus-based image description evaluation[C]∥2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA. New York: IEEE Press, 4566-4575(2015).
[15] Vinyals O, Toshev A, Bengio S et al. Show and tell: a neural image caption generator[C]∥2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA. New York: IEEE Press, 3156-3164(2015).
[16] Xu K, Ba J L, Kiros R et al. Show, attend and tell: neural image caption generation with visual attention[C]∥ICML'15: Proceedings of the 32nd International Conference on Machine Learning - Volume 37, 2048-2057(2015).
[17] Lu J S, Xiong C M, Parikh D et al. Knowing when to look: adaptive attention via a visual sentinel for image captioning[C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA. New York: IEEE Press, 3242-3250(2017).
Yuanjun Nong, Junjie Wang. Remote Sensing Image Caption Method Based on Attention and Reinforcement Learning[J]. Acta Optica Sinica, 2021, 41(22): 2228001
Category: Remote Sensing and Sensors
Received: Apr. 30, 2021
Accepted: Jun. 3, 2021
Published Online: Nov. 17, 2021
Author Email: Wang Junjie (wjj@ouc.edu.cn)