Laser & Optoelectronics Progress, Vol. 60, Issue 24, 2428005 (2023)
Dual-Stream Feature Aggregation Network for Unmanned Aerial Vehicle Aerial Images Semantic Segmentation
Objects in unmanned aerial vehicle (UAV) aerial images vary greatly in size, which makes it difficult for a single receptive field to segment objects of all sizes well. To address this problem, a dual-stream feature aggregation network (DSFA-Net) is proposed, with two branches that extract low-level and high-level features separately. In the encoder, a low-level information extraction branch built from three serial ConvNeXt modules preserves more low-level features by generating feature maps with more channels. In the deep feature branch, the coordinate attention atrous spatial pyramid pooling (CA-ASPP) module reassigns weights to the feature maps along the channel dimension, so that the module focuses on objects of different sizes and produces deep-level multi-scale features. In the decoder, the bilateral guided aggregation module performs resolution aggregation between the low-level and deep-level features. The method is evaluated on the AeroScapes and Semantic Drone datasets, where it achieves a mean intersection over union of 83.16% and 72.09%, respectively, and a mean pixel accuracy of 90.75% and 80.34%, respectively. Compared with mainstream methods, the proposed method is better at segmenting objects that differ greatly in size, and it is suitable for semantic segmentation of UAV aerial images.
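The two reported metrics, mean intersection over union and mean pixel accuracy, can be illustrated with a minimal sketch. It uses the common confusion-matrix definitions and assumes that both metrics are averaged over the classes that actually occur; the abstract does not state the exact averaging convention used in the evaluation. The function names and the toy labels are hypothetical.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predicted classes."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou_and_pixel_accuracy(cm):
    """Return (mIoU, mPA) from a square confusion matrix.

    Assumption: classes with an empty union (for IoU) or no ground-truth
    pixels (for accuracy) are skipped rather than counted as zero.
    """
    n = len(cm)
    ious, accs = [], []
    for c in range(n):
        tp = cm[c][c]                             # correctly labelled pixels of class c
        gt = sum(cm[c])                           # pixels whose true label is c
        pred = sum(cm[r][c] for r in range(n))    # pixels predicted as c
        union = gt + pred - tp
        if union > 0:
            ious.append(tp / union)
        if gt > 0:
            accs.append(tp / gt)
    miou = sum(ious) / len(ious) if ious else 0.0
    mpa = sum(accs) / len(accs) if accs else 0.0
    return miou, mpa


# Toy example: six pixels, three classes (flattened label maps).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
miou, mpa = mean_iou_and_pixel_accuracy(cm)
print(f"mIoU = {miou:.4f}, mPA = {mpa:.4f}")
```

In this toy case the per-class IoUs are 1/3, 2/3, and 1/2, and the per-class accuracies are 1/2, 1, and 1/2. Because IoU also penalizes false positives, it is usually the lower of the two figures, which matches the pattern in the reported results.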
Runzeng Li, Zaifeng Shi, Fanning Kong, Xiangyang Zhao, Tao Luo. Dual-Stream Feature Aggregation Network for Unmanned Aerial Vehicle Aerial Images Semantic Segmentation[J]. Laser & Optoelectronics Progress, 2023, 60(24): 2428005
Category: Remote Sensing and Sensors
Received: Mar. 27, 2023
Accepted: Apr. 23, 2023
Published Online: Nov. 27, 2023
Author Email: Shi Zaifeng (shizaifeng@tju.edu.cn)