Chinese Journal of Liquid Crystals and Displays, Volume. 37, Issue 12, 1598(2022)
High resolution scene parsing network based on semantic segmentation
In order to efficiently segment and analyze complex scenes such as urban landscapes, this paper combines the high-resolution network (HRNet) and supplements the global context information through the pyramid pooling module, and proposes a high-resolution scene analysis network. Firstly, HRNet was used as the backbone feature extraction network, and the atrous separable convolution was used to improve its widely used residual module, so as to reduce the amount of parameters and improve the segmentation ability of multi-scale targets. Secondly, the mixed cavity convolution framework was used to design the multi-level cavity rate, which can dense the receptive field and reduce the influence of the grid problem. Then, a multi-stage continuous up-sampling structure was designed to improve the simple post fusion mechanism of HRNetV2. Finally, the improved pyramid pooling module which can adapt to different image resolutions was used to aggregate the context information of different regions to obtain high-quality segmentation images. The accuracy of 83.3% MIOU is achieved with only 16.4 Mbit parameters on the CityScapes urban landscape dataset, and good results are also achieved on the Camvid dataset. A more reliable, accurate, and low-computing scene analysis method based on semantic segmentation has realized.
Get Citation
Copy Citation Text
Jian-feng SHI, Ning XANG, A-chuan WANG. High resolution scene parsing network based on semantic segmentation[J]. Chinese Journal of Liquid Crystals and Displays, 2022, 37(12): 1598
Category: Research Articles
Received: May. 19, 2022
Accepted: --
Published Online: Nov. 30, 2022
The Author Email: A-chuan WANG (wangca1964@126.com)