Electronics Optics & Control, Volume. 27, Issue 12, 84(2020)
RGBD Semantic Segmentation Based on Depth-Sensitive Spatial Pyramid Pooling
The RGBD semantic segmentation model based on the standard 2D convolution kernel mostly takes the depth map as a single channel.Due to the limitation of convolution kernel characteristics, the geometric structure information brought by the depth information cannot be fully exploited.To overcome this defect, this paper constructs depth-sensitive convolution kernels and a pooling layer to make rich mining of depth information,and uses depth-sensitive spatial pyramid pooling to extract multi-scale information, so as to realize the segmentation of objects of different scales.Results of experiment on NYU v2 and SUN RGB-D datasets show that this method effectively improves the overall semantic segmentation accuracy.
Get Citation
Copy Citation Text
YANG Shengjie, QIU Zhenan, GAO Xiaoning, LI Jianxun. RGBD Semantic Segmentation Based on Depth-Sensitive Spatial Pyramid Pooling[J]. Electronics Optics & Control, 2020, 27(12): 84
Category:
Received: Nov. 5, 2019
Accepted: --
Published Online: Jan. 14, 2021
The Author Email: