Journal of Optoelectronics · Laser, Vol. 33, Issue 5, 479 (2022)

Natural scene text recognition algorithm based on multilevel feature selection

LI Lirong1,2,*, ZHANG Kai1, ZHANG Yunliang1, YUE Ling1, ZHOU Lei1, and GONG Pengcheng1,2
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]

    Paper Information

    Received: Nov. 12, 2021


    Published Online: Oct. 9, 2024

    Author email: LI Lirong (Rongli@hbut.edu.cn)

    DOI:10.16136/j.joel.2022.05.0761
