Urban Mass Transit, Volume. 28, Issue 7, 177(2025)

Few-shot Metro Track-wheel Image Segmentation Algorithm Based on Cross-attention Network

CAO Jianxin1、*, ZHANG Yueying2, JIANG Weihao3, and GAO Yunhao2
Author Affiliations
  • 1Hangzhou Hanggang Metro Co., Ltd., 310018, Hangzhou, China
  • 2Zhejiang Testing & Inspection Institute for Mechanical and Electrical Products Quality Co., Ltd., 310018, Hangzhou, China
  • 3Hangzhou Dongshang Intelligent Technology Co., Ltd., 310018, Hangzhou, China
  • show less

    [Objective]The domain adaptation issue among metro track images results in low segmentation accuracy for track-wheel images with high inter-class similarity in existing algorithms. To address this challenge, a few-shot metro track-wheel image segmentation algorithm based on a cross-attention network is proposed.[Method]The computational roadmap and process of the few-shot metro track-wheel image segmentation algorithm based on cross-attention network is elaborated. First, a group of backbone networks with shared weights is employed to map the input track-wheel images from both the support branch and the query branch into a deep feature space. Then, the low-, mid-, and high-level features from the dual-branch mappings are fused across scales. A cross-attention network is used to mine the relational semantics between these fused features, enabling the capture of shared semantic information in the deep space across different metro track-wheel images belonging to the same class. Finally, an average pooling is applied to convert the common features of both branches into class-specific prototypes, and the prototypes are leveraged to guide the segmentation of unannotated track-wheel images in the query images. Comparative and ablation experiments are conducted on a self-constructed metro track-wheel image dataset to verify the accuracy and effectiveness of the algorithm.[Result & Conclusion]Testing shows that the proposed algorithm achieves a mIoU (mean intersection over union) of 66.17% and a foreground-background intersection over union (FB-IoU) of 78.21%. Compared with current mainstream semantic segmentation algorithms, the proposed few-shot metro track-wheel image segmentation algorithm based on cross-attention networks demonstrates significantly improved segmentation performance and shows potential for practical application.

    Tools

    Get Citation

    Copy Citation Text

    CAO Jianxin, ZHANG Yueying, JIANG Weihao, GAO Yunhao. Few-shot Metro Track-wheel Image Segmentation Algorithm Based on Cross-attention Network[J]. Urban Mass Transit, 2025, 28(7): 177

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Nov. 3, 2023

    Accepted: Aug. 21, 2025

    Published Online: Aug. 21, 2025

    The Author Email: CAO Jianxin (317419638@qq.com)

    DOI:10.16037/j.1007-869x.20230947

    Topics