Semiconductor Optoelectronics, Volume. 45, Issue 3, 501(2024)

Multiscale Target Detection for UAV Aerial Images

JIA Liang, LIN Mingwen, QILijin, and TAN Jin
Author Affiliations
  • [in Chinese]
  • show less

    A multiscale target detection network,VTO-YOLOv8,for unmanned aerial vehicle (UAV) images is proposed to address the low accuracy of existing algorithms caused by complex backgrounds,a high proportion of smalltargets,and uneven distributions.First,wise intersection over union (WIoU) v3 was used as the bounding-box regression loss,and a wise gradient allocation strategy was employed for the network to focus more on regular quality samples and improve localization ability.Second,a four-layer target bi-directional feature pyramid network (T-BiFPN) structurewas designed to strengthen the integration ofshallow and deep features.Furthermore,a faster implementation of CSP bottleneck with diverse branch blocks (C2f-DBB) module was designed to improve the detection performance of the network withoutincreasing computationalcomplexity.In addition,a focalmodulation module wasused to enhance the interaction ofinformation atdifferentscales.The experimentalresults demonstrated thatthe proposed network improved the mean average precision (mAP) and mAP50 by 5.9% and 9.0%,respectively,compared with those of the baseline network on the Visdrone2019 dataset.Moreover,the network parameters were reduced by 22.6%.The proposed method can be applied to targetdetection in UAV aerialphotography.

    Tools

    Get Citation

    Copy Citation Text

    JIA Liang, LIN Mingwen, QILijin, TAN Jin. Multiscale Target Detection for UAV Aerial Images[J]. Semiconductor Optoelectronics, 2024, 45(3): 501

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Jan. 9, 2024

    Accepted: --

    Published Online: Oct. 15, 2024

    The Author Email:

    DOI:10.16818/j.issn1001-5868.2024010903

    Topics