Journal of Optoelectronics · Laser, Volume. 33, Issue 10, 1038(2022)

Urban street view semantic segmentation based on height-driven effective attention and multi-stage feature fusion

ZHAO Di1, SUN Peng1, CHEN Yibo1, XIONG Wei1,2,3、*, LIU Yue1, and LI Lirong1,2
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • 3[in Chinese]
  • show less
    References(16)

    [1] [1] LONG J,SHELHAMER E,DARRELL T.Fully convolutional networks for semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition,June 7-12,2015,Boston,MA,USA.New York:IEEE,2015:3431-3440.

    [2] [2] BADRINARAYANAN V,KENDALL A,CIPOLLA R.Segnet:a deep convolutional encoder-decoder architecture for image segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(12):2481-2495.

    [3] [3] CHEN L C,PAPANDREOU G,KOKKINOS I,et al.DeepLab: semantic image segmentation with deep convolutional nets,atrous convolution,and fully connected CRFs[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2018,40(4):834-848.

    [4] [4] CHEN L C,PAPANDREOU G,SCHROFF F,et al.Rethinking atrous convolution for semantic image segmentation[EB/OL].(2017-06-17)[2022-01-05].http://arxiv.org/abs/1706.05587.

    [5] [5] CHEN L C,ZHU Y,PAPANDREOU G,et al.Encoder-decoder with atrous separable convolution for semantic image segmentation[C]//European Conference on Computer Vision (ECCV),September 8-14,2018,Munich,Germany.Berlin:Springer,2018:801-818.

    [6] [6] CHOI S,KIM J T,CHOO J.Cars can′t fly up in the sky:improving urban-scene segmentation via height-driven attention networks[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition,June 13-19,2020,Seattle,WA,USA.New York:IEEE,2020:9373-9383.

    [7] [7] HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR),June 27-30,2016,Las Vegas,NV,USA.New York:IEEE,2016:770-778.

    [8] [8] LIN T Y,DOLLAR P,GIRSHICK R,et al.Feature pyramid networks for object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition,July 21-26,2017,Honolulu,HI,USA.New York:IEEE,2017:2117-2125.

    [9] [9] WANG Q,WU B,ZHU P,et al.Eca-net:efficient channel attention for deep convolutional neural networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,June 13-19,2020,Seattle,WA,USA.New York:IEEE,2020:11534-11542.

    [10] [10] VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[C]//Advances in Neural Information Processing Systems,December 4-9,2017,Long Beach,California,USA.Red Hook,NY:Curran Associates Inc.,2017:5998-6008.

    [11] [11] MA N,ZHANG X,ZHENG H T,et al.Shufflenet v2: Practical guidelines for efficient cnn architecture design[C]//European Conference on Computer Vision (ECCV),September 8-14,2018,Munich,Germany.Berlin:Springer,2018:116-131.

    [12] [12] HOWARD A G,ZHU M,CHEN B,et al.Mobilenets:efficient convolutional neural networks for mobile vision applications[EB/OL].(2017-04-17)[2022-01-05].http://arxiv.org/abs/1704.04861.

    [13] [13] PASZKE A,CHAURASIA A,KIM S,et al.Enet:a deep neural network architecture for real-time semantic segmentation[EB/OL].(2016-06-07)[2022-01-05].http://arxiv.org/abs/1606.02147.

    [14] [14] YU C,WANG J,PENG C,et al.BiSeNet:bilateral segmentation network for real-time semantic segmentation[C]//15th European Conference on Computer Vision,ECCV,September 8-14,2018,Munich,Germany.Berlin:Springer,2018:334-349.

    [15] [15] SUN K,XIAO B,LIU D,et al.Deep high-resolution representation learning for human pose estimation[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition,June 15-20,2019,Long Beach,CA,USA.New York:IEEE,2019:5693-5703.

    [16] [16] YANG Z,YU H,FU Q,et al.Ndnet:narrow while deep network for real-time semantic segmentation[J].IEEE Transactions on Intelligent Transportation Systems,2020,22(9):5508-5519.

    Tools

    Get Citation

    Copy Citation Text

    ZHAO Di, SUN Peng, CHEN Yibo, XIONG Wei, LIU Yue, LI Lirong. Urban street view semantic segmentation based on height-driven effective attention and multi-stage feature fusion[J]. Journal of Optoelectronics · Laser, 2022, 33(10): 1038

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Received: Jan. 15, 2022

    Accepted: --

    Published Online: Oct. 9, 2024

    The Author Email: XIONG Wei (xw@mail.hbut.edu.cn)

    DOI:10.16136/j.joel.2022.10.0035

    Topics