Optics and Precision Engineering, Volume 33, Issue 1, 123 (2025)

Improved DeepLabv3+ semantic segmentation incorporating attention mechanisms

He YAN*, Qiuxia LEI, and Xu WANG
Author Affiliations
  • Liangjiang College of Artificial Intelligence, Chongqing University of Technology, Chongqing 401135, China
    References(33)

    [1] REN F L, YANG L, ZHOU H B et al. Real-time semantic segmentation based on improved BiSeNet[J]. Opt. Precision Eng., 31, 1217-1227(2023).

         任凤雷, 杨璐, 周海波. 基于改进BiSeNet的实时图像语义分割[J]. 光学 精密工程, 31, 1217-1227(2023).

    [2] CHEN B K, GONG C, YANG J. Importance-aware semantic segmentation for autonomous vehicles[J]. IEEE Transactions on Intelligent Transportation Systems, 20, 137-148.

    [4] LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[C], 7, 3431-3440(2015).

    [5] RONNEBERGER O, FISCHER P, BROX T. U-Net: Convolutional Networks for Biomedical Image Segmentation[M], 234-241(2015).

    [6] STRUDEL R, GARCIA R, LAPTEV I et al. Segmenter: transformer for semantic segmentation[C], 10, 7262-7272(2021).

    [7] KIRILLOV A, MINTUN E, RAVI N et al. Segment anything[C], 1, 4015-4026(2023).

    [8] 赵为平, 陈雨, 项松. 基于改进的DeepLabv3+图像语义分割算法研究[J]. 系统仿真学报, 35, 2333-2344(2023).

         ZHAO W P, CHEN Y, XIANG S et al. Image semantic segmentation algorithm based on improved DeepLabv3+[J]. Journal of System Simulation, 35, 2333-2344(2023).

    [9] CHEN L C, ZHU Y K, PAPANDREOU G et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[C], 833-851(2018).

    [10] WANG X T, YAN H, LIU J Q et al. A new deeplabv3+ semantic segmentation model of edge gradient interpolation with double branch structure[J]. CAAI Transactions on Intelligent Systems, 18, 604-612(2023).

         王潇棠, 闫河, 刘建骐. 一种边缘梯度插值的双分支deeplabv3+语义分割模型[J]. 智能系统学报, 18, 604-612(2023).

    [11] 周羿, 刘德儿. 融合注意力机制及DenseASPP改进的DeeplabV3+遥感图像分割方法[J]. 遥感信息, 38, 85-92(2023).

         ZHOU Y, LIU D E. A semantic segmentation method for remote sensing image based on fusion attention mechanism and DenseASPP improved DeeplabV3+[J]. Remote Sensing Information, 38, 85-92(2023).

    [12] SANDLER M, HOWARD A, ZHU M L et al. MobileNetV2: inverted residuals and linear bottlenecks[C], 18, 4510-4520(2018).

    [14] YANG L, ZHANG R Y, LI L et al. SimAM: a simple, parameter-free attention module for convolutional neural networks[C], 11863-11874(2021).

    [15] ZHU X Z, CHENG D Z, ZHANG Z et al. An empirical study of spatial attention mechanisms in deep networks[C], 6688-6697(2019).

    [16] HUANG Z L, WANG X G, HUANG L C et al. CCNet: criss-cross attention for semantic segmentation[C], 603-612(2019).

    [17] YUAN Y H, CHEN X L, WANG J D. Object-contextual Representations for Semantic Segmentation[M], 173-190(2020).

    [18] GUO M H, LIU Z N, MU T J et al. Beyond self-attention: external attention using two linear layers for visual tasks[J]. IEEE Trans Pattern Anal Mach Intell, 45, 5436-5447(2023).

    [19] PAN X R, GE C J, LU R et al. On the integration of self-attention and convolution[C], 18, 815-825(2022).

    [21] YU D J, WANG H L, CHEN P Q et al. Mixed Pooling for Convolutional Neural Networks[M]. Rough Sets and Knowledge Technology, 364-375(2014).

    [22] HSIAO T Y, CHANG Y C, CHOU H H et al. Filter-based deep-compression with global average pooling for convolutional networks[J]. Journal of Systems Architecture, 95, 9-18(2019).

    [23] CHENG T H, WANG X G, HUANG L C et al. Boundary-preserving Mask R-CNN[M], 660-676(2020).

    [24] YUAN Y H, XIE J Y, CHEN X L et al. SegFix: Model-agnostic Boundary Refinement for Segmentation[M], 489-506(2020).

    [25] ZHANG X G, DING L Z, LIU Y F et al. FDA-DeepLab semantic segmentation network based on dual attention module[J]. Journal of Southeast University (Natural Science Edition), 52, 1145-1151(2022).

         张小国, 丁立早, 刘亚飞. 基于双注意力模块的FDA-DeepLab语义分割网络[J]. 东南大学学报(自然科学版), 52, 1145-1151(2022).

    [26] XU G P, LIAO W T, ZHANG X et al. Haar wavelet downsampling: a simple but effective downsampling module for semantic segmentation[J]. Pattern Recognition, 143, 109819(2023).

    [27] EVERINGHAM M, MALI ESLAMI S, VAN GOOL L et al. The pascal visual object classes challenge: a retrospective[J]. International Journal of Computer Vision, 111, 98-136(2015).

    [28] WANG Q L, WU B G, ZHU P F et al. ECA-net: efficient channel attention for deep convolutional neural networks[C], 13, 11534-11542(2020).

    CLP Journals

    [1] Hui LIU, Jin LIANG, Meitu YE, Jianying GUO, Leigang LI. Comparison of deviation in aircraft casing deformation measurement based on data mapping optimization[J]. Optics and Precision Engineering, 2023, 31(20): 2930

    [2] Jianying GUO, Jin LIANG, Meitu YE, Mingming WANG, Wei SHI. Mirror-assisted multi-view measurement of 3D deformation for aero-engine casing[J]. Optics and Precision Engineering, 2022, 30(24): 3105

    Citation

    He YAN, Qiuxia LEI, Xu WANG. Improved DeepLabv3+ semantic segmentation incorporating attention mechanisms[J]. Optics and Precision Engineering, 2025, 33(1): 123

    Paper Information

    Category:

    Received: Jun. 28, 2024

    Accepted: --

    Published Online: Apr. 1, 2025

    The Author Email: He YAN (yanhe@cqut.edu.cn)

    DOI:10.37188/OPE.20253301.0123
