Laser & Optoelectronics Progress, Volume. 59, Issue 8, 0810015(2022)

Real-Time Semantic Segmentation Network Based on Octave Convolution

Xin Wang and Kaijun Wu*
Author Affiliations
  • College of Information Technology, Shanghai Ocean University, Shanghai 201306, China
  • show less

    When a convolutional neural network performs real-time image semantic segmentation, processing large blocks of pixels with small color changes leads to computational spatial redundancy. Also, the accuracy of feature extraction using lightweight networks is low. We designed a real-time semantic segmentation network using the improved MobileNet v3 and lightweight OctConv high-frequency (OTCH-L) module to mitigate both problems. First, the hard-swish activation function was used to compensate for the accuracy of the lightweight network MobileNet v3. Then, we proposed an improved MobileNet v3 feature extraction network. Furthermore, we designed the OTCH-L module based on the octave convolution to solve the problem of spatial redundancy and reduce the computational size of the model while ensuring accuracy. The models were trained and verified on the Pascal VOC2012 and VOC2007 datasets, respectively. The experimental results show that the segmentation speed of the proposed model reaches 25.94 frame/s, and the mean intersection over union (MIoU) reaches 70.34%. Compared with the mainstream semantic segmentation models, such as SegNet, PSPNet, and DeepLab v3 plus, our proposed model significantly enhances the segmentation speed while maintaining segmentation accuracy.

    Tools

    Get Citation

    Copy Citation Text

    Xin Wang, Kaijun Wu. Real-Time Semantic Segmentation Network Based on Octave Convolution[J]. Laser & Optoelectronics Progress, 2022, 59(8): 0810015

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Image Processing

    Received: Feb. 23, 2021

    Accepted: May. 6, 2021

    Published Online: Apr. 11, 2022

    The Author Email: Wu Kaijun (walt@shou.edu.cn)

    DOI:10.3788/LOP202259.0810015

    Topics