Laser & Optoelectronics Progress, Volume. 58, Issue 12, 1210030(2021)

Image Semantic Description Algorithm with Integrated Spatial Attention Mechanism

Lie Guo1、*, Tuanshan Zhang1, Weizhen Sun2, and Jielong Guo2
Author Affiliations
  • 1Xi'an Key Laboratory of Modern Intelligent Textile Equipment, College of Mechanical and Electrical Engineering, Xi'an Polytechnic University, Xi'an, Shaanxi 710600, China
  • 2Quanzhou Institute of Equipment Manufacturing, Haixi Institutes, Chinses Academy of Science, Quanzhou, Fujian 362216, China
  • show less

    The image semantic description model usually adopts the encoder-decoder method to realize the image semantic description. The model has problems such as insufficient utilization of image features and insufficient location information extraction of image objects. In response to this problem, an image semantic description algorithm is proposed that integrates the attention mechanism in the encoder part, and the attention weight of different image features is allocated through the context information of the decoder, thereby improving the expressive ability of image semantic description. And verified on the Flickr30k and MSCOCO data sets, the model improves the BLEU-4 evaluation index by 1.9% and 0.8%, respectively. The experiment proves the effectiveness of the proposed algorithm.

    Tools

    Get Citation

    Copy Citation Text

    Lie Guo, Tuanshan Zhang, Weizhen Sun, Jielong Guo. Image Semantic Description Algorithm with Integrated Spatial Attention Mechanism[J]. Laser & Optoelectronics Progress, 2021, 58(12): 1210030

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Sep. 4, 2020

    Accepted: Sep. 30, 2020

    Published Online: Jun. 22, 2021

    The Author Email: Guo Lie (lie_guo@163.com)

    DOI:10.3788/LOP202158.1210030

    Topics