Semantic Segmentation Method Based on Multiscale Feature Alignment and Aggregation

Zhaozhong Xu; Li Peng; Feifei Dai

doi:10.3788/LOP212814

Laser & Optoelectronics Progress, Volume. 60, Issue 2, 0215004(2023)

Semantic Segmentation Method Based on Multiscale Feature Alignment and Aggregation

Zhaozhong Xu¹, Li Peng^1,2、*, and Feifei Dai³

¹Engineering Research Center of Internet of Things Technology Applications, School of IoT Engineering, Jiangnan University, Wuxi 214122, Jiangsu, China

²Jiangsu Province Internet of Things Application Technology Key Construction Laboratory, Wuxi Taihu College, Wuxi 214122, Jiangsu, China

³Taizhou Product Quality and Safety Monitoring Institute, Taizhou 318000, Zhejiang, China

show less

Abstract Get PDF(in Chinese)

During semantic segmentation of images, a convolutional neural network easily misplaces the high-level features with low-level features after down-sampling and padding operations. To solve the mismatch problem between high- and low-level features and better aggregate the multiscale feature information, this paper proposes a semantic segmentation method with a multiscale feature alignment aggregation (MFAA) module. The MFAA module adopts a learnable interpolation strategy to learn pixel transform migration, thereby alleviating the feature-misalignment problem of feature aggregation at different scales. The module includes an attention mechanism that improves the decoder's ability to recover the important details. Using multiple MFAA modules, the semantic information of high-level features, and the spatial information of low-level features, this method aligns and aggregates the high- and low-level features to refine the semantic segmentation effect. The proposed network structure was validated on PASCAL VOC 2012. Using a ResNet-50 backbone network, the mean intersection-over-union reached 78.4% on the validation set. Experimentally, the proposed method achieved better evaluation indices than several mainstream segmentation methods and effectively improved the image segmentation effect.

Keywords

attention mechanism feature alignment image semantic segmentation machine vision multiscale feature

Tools

Get Citation

Copy Citation Text

Zhaozhong Xu, Li Peng, Feifei Dai. Semantic Segmentation Method Based on Multiscale Feature Alignment and Aggregation[J]. Laser & Optoelectronics Progress, 2023, 60(2): 0215004

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category: Machine Vision

Received: Oct. 26, 2021

Accepted: Nov. 29, 2021

Published Online: Feb. 7, 2023

The Author Email: Peng Li (penglimail2002@163.com)

DOI:10.3788/LOP212814

Topics

laser devices and laser physics

Lasers and Laser Optics

Laser physics

laser manufacturing

Instrumentation, Measurement and Metrology