Infrared Technology, Volume. 46, Issue 2, 162(2024)

Multi-scale Anchor Construction Method for Object Detection

Yanhua SHAO1,*... Qimeng HUANG1, Yanying MEI1, Xiaoqiang ZHANG1, Hongyu CHU1 and Yadong WU2 |Show fewer author(s)
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • show less

    Object detection is a popular research topic and fundamental task in computer vision. Anchorbased object detection has been widely used in many fields. Current anchor selection methods face two main problems: a fixed size of a priori values based on a specific dataset and a weak generalization ability in different scenarios. The unsupervised K-means algorithm for calculating anchor frames, which is significantly influenced by initial values, generates less variation in anchor points for clustering datasets with a single object size and cannot reflect the multiscale output of the network. In this study, a multiscale anchor (MSA) method that introduces multiscale optimization was developed to address these issues. This method scales and stretches the anchor points generated by clustering according to the dataset characteristics.The optimized anchor points retain the characteristics of the original dataset and reflect the advantages of the multiple scales of the model. In addition, this method was applied to the preprocessing phase of training without increasing the model inference time. Finally, the single-stage mainstream algorithm, You Only Look Once (YOLO), was selected to perform extensive experiments on different scenes of the infrared and industrial scene datasets. The results show that the MSA method can significantly improve the detection accuracy of small-sample scenes.

    Tools

    Get Citation

    Copy Citation Text

    SHAO Yanhua, HUANG Qimeng, MEI Yanying, ZHANG Xiaoqiang, CHU Hongyu, WU Yadong. Multi-scale Anchor Construction Method for Object Detection[J]. Infrared Technology, 2024, 46(2): 162

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Apr. 7, 2022

    Accepted: --

    Published Online: Jul. 31, 2024

    The Author Email: Yanhua SHAO (syh@alu.cqu.edu.cn。)

    DOI:

    CSTR:32186.14.

    Topics