Optoelectronics Letters, Volume. 21, Issue 4, 226(2025)
Multi-scale feature fusion optical remote sensing target detection method
An improved model based on you only look once version 8 (YOLOv8) is proposed to solve the problem of low detection accuracy due to the diversity of object sizes in optical remote sensing images. Firstly, the feature pyramid network (FPN) structure of the original YOLOv8 mode is replaced by the generalized-FPN (GFPN) structure in GiraffeDet to realize the "cross-layer" and "cross-scale" adaptive feature fusion, to enrich the semantic information and spatial information on the feature map to improve the target detection ability of the model. Secondly, a pyramid-pool module of multi atrous spatial pyramid pooling (MASPP) is designed by using the idea of atrous convolution and feature pyramid structure to extract multi-scale features, so as to improve the processing ability of the model for multi-scale objects. The experimental results show that the detection accuracy of the improved YOLOv8 model on DIOR dataset is 92% and mean average precision (mAP) is 87.9%, respectively 3.5% and 1.7% higher than those of the original model. It is proved the detection and classification ability of the proposed model on multi-dimensional optical remote sensing target has been improved.
Get Citation
Copy Citation Text
BAI Liang, DING Xuewen, LIU Ying, CHANG Limei. Multi-scale feature fusion optical remote sensing target detection method[J]. Optoelectronics Letters, 2025, 21(4): 226
Received: Mar. 11, 2024
Accepted: Feb. 28, 2025
Published Online: Feb. 28, 2025
The Author Email: