Journal of Optoelectronics · Laser, Volume. 34, Issue 11, 1158(2023)

Natural scene text recognition based on character attention

XIONG Wei1,2,3、*, SUN Peng1, ZHAO Di1, and LIU Yue1
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • 3[in Chinese]
  • show less

    In natural scene text recognition,a fixed size convolution kernel is used to extract visual features,and then character classification is performed.The global modeling ability of this method is weak and it ignores the importance of text semantic modeling.Therefore,this paper proposes a natural scene text recognition method based on character attention.Firstly,a multi-level efficient Swin Transformer network is constructed to extract features,which is different from the convolutional network.This network can make the features of different windows interact with each other.Secondly,the character attention module (CAM) is designed to make the network focus on the features of the character region,so as to extract the visual features with higher recognition ability.Then,the semantic reasoning module (SRM) is designed to model the text sequence according to the context information of characters.And the module can obtain semantic features to correct the indistinguishable or fuzzy characters.At last,visual and semantic features are fused to get the results of character recognition.The experimental results show that the recognition accuracy in this paper reaches 95.2% on the regular text data set IC13 and 85.8% on the irregular curved text data set CUTE.The feasibility of the proposed method is proved by ablative and comparative experiments.

    Tools

    Get Citation

    Copy Citation Text

    XIONG Wei, SUN Peng, ZHAO Di, LIU Yue. Natural scene text recognition based on character attention[J]. Journal of Optoelectronics · Laser, 2023, 34(11): 1158

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Received: Sep. 6, 2022

    Accepted: --

    Published Online: Sep. 25, 2024

    The Author Email: XIONG Wei (xw@mail.hbut.edu.cn)

    DOI:10.16136/j.joel.2023.11.0625

    Topics