Infrared and Laser Engineering, Volume. 54, Issue 8, 20250209(2025)

Dynamic feature aggregation and multi-level collaboration for UAV infrared target instance segmentation

Zifen HE, Qigang WANG, Yinhui ZHANG*, Ying HUANG, Wei PENG, and Guangchen CHEN
Author Affiliations
  • Mechanical and Electrical Engineering, Kunming University of Science and Technology, Kunming 650500, China
  • show less
    References(20)

    [1] [1] XU K, SONG C, XIE Y, et al. RMTYOLOv9s: An infrared small target detection method based on UAV remote sensing images[J]. IEEE Geoscience Remote Sensing Letters, 2024,21: 15.

    [2] [2] ZHOU G, LIU X, BI H. Recognition of UAVs in infrared images based on YOLOv8[J]. IEEE Access, 2025, (13): 15341545.

    [3] [3] CHENG B, MISRA I, SCHWING A, et al. Maskedattention mask transfmer f universal image segmentation[C]2022 IEEECVF Conference on Computer Vision Pattern Recognition (CVPR), New leans, LA, USA, 2022: 12801289.

    [4] [4] CARION N, MASSA F, SYNNAEVE G, et al. Endtoend object detection with transfmers[C]European Conference on Computer Vision. Cham: Springer International Publishing, 2020: 213229.

    [5] [5] KIRILLOV A, MINTUN E, RAVI N, et al. Segment anything[C]Proceedings of the IEEECVF International Conference on Computer Vision, 2023: 40154026.

    [6] [6] LI L, FANG M, FU F, et al. Instance segmentation based on improved YOLACT[C]2020 International Conference on Virtual Reality Visualization (ICVRV), Recife, Brazil, 2020: 165170.

    [8] [8] XUE Shan, AN Hongyu, LV Qiongying, et al. Image target detection algithm based on YOLOv7tinyin complex background [J]. Infrared Laser Engineering, 2024, 53(1): 20230472. (in Chinese)

    [9] [9] SANG Jiageng, ZHANG Zhijia, XIAO Chuanmin, et al. An improved YOLOv8s method its application in road traffic target detection [J]. Infrared Laser Engineering, 2024, 53(11): 20240256. (in Chinese)

    [10] [10] LI C, ZHOU A, YAO A. Omnidimensional dynamic convolution.[EBOL](2022916)[20250507]. https:arxiv.g abs2209.07947

    [11] [11] YU J, HE Y, ZHANG F, et al. An infrared image stitching method f wind turbine blade using UAV flight data U[J]. IEEE Senss Journal, 2023, 23(8): 87278736.

    [12] [12] WANG C, BOCHKOVSKIY, A, LIAO, et al. Yolov7: Trainable bagoffreebies sets new stateoftheart f realtime object detects. [C]2023 Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition, 2023: 74647475.

    [13] [13] ABOAH A, WANG B, BAGCI U, et al. Realtime multiclass helmet violation detection using fewshot data sampling technique yolov8[C]2023 Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition, 2023: 5349–5357.

    [14] [14] LIU Q, XU Z, WANG S, et al. Aerial mancar dataset. [EBOL](20241020)[20250304]. http:openai.iraytek.com applyAerial_mancar.html

    [15] [15] MA N, ZHANG X, ZHENG H T, et al. Shuffle v2: Practical guidelines f efficient cnn architecture design[C]Proceedings of the European Conference on Computer Vision (ECCV), 2018: 116131.

    [16] [16] KOONCE B. MobileV3[M]Convolutional Neural wks With Swift f Tensflow: Image Recognition Dataset Categization. Berkeley, CA: Apress, 2021.

    [17] [17] HAN K, WANG Y, TIAN Q, et al. Ghostnet: Me features from cheap operations[C]Proceedings of the IEEECVF Conference on Computer Vision Pattern Recognition, 2020: 15801589.

    [18] [18] KOONCE B. Efficient[M]Convolutional Neural wks with Swift f Tensflow: Image Recognition Dataset Categization. Berkeley, CA: Apress, 2021.

    [19] [19] KHANAM R, HUSSAIN M. Yolov11: An overview of the key architectural enhancements. [EBOL](20241023)[20250507]. https:www.arxiv.gabs2410.17725

    [20] [20] TIAN Y, YE Q, DOERMANN D. Yolov12: Attentioncentric realtime object detects.[EBOL](20250218)[20250308]. https:arxiv.gabs2502.12524

    Tools

    Get Citation

    Copy Citation Text

    Zifen HE, Qigang WANG, Yinhui ZHANG, Ying HUANG, Wei PENG, Guangchen CHEN. Dynamic feature aggregation and multi-level collaboration for UAV infrared target instance segmentation[J]. Infrared and Laser Engineering, 2025, 54(8): 20250209

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Optical imaging, display and information processing

    Received: Mar. 4, 2025

    Accepted: --

    Published Online: Aug. 29, 2025

    The Author Email: Yinhui ZHANG (zhangyinhui@kust.edu.cn)

    DOI:10.3788/IRLA20250209

    Topics