Chinese Journal of Liquid Crystals and Displays, Volume. 40, Issue 5, 740(2025)

Scene recognition based on deep metric learning and semantic segmentation

Xuan JIA1,2, Ye ZHANG1,2、*, Xuling CHANG1,2, and Jianbo SUN1,2
Author Affiliations
  • 1Changchun Institute of Optics,Fine Mechanics and Physics,Chinese Academy of Sciences,Changchun 130033,China
  • 2University of Chinese Academy of Sciences,Beijing 100049,China
  • show less

    To address the issue of low recognition accuracy in scene images caused by subtle inter-class differences and ambiguous intra-class classifications, this paper proposes a novel semantic segmentation framework. By introducing deep metric learning and focusing on the semantic relationships between pixels, the model’?s recognition accuracy can be improved. Firstly, feature extraction is performed through the hollow space pyramid pooling module. Then, in the decoding process, the shallow high-resolution features and deep low resolution features are fused using a structure to better restore the details and boundaries in the image. Secondly, in the deep metric learning module, a well structured pixel semantic embedding space is learned to effectively classify pixels by maximizing the Euclidean distance between pixels of different categories and minimizing the Euclidean distance between pixels of the same category. Finally, a fusion loss function combining weighted focus loss and contrast loss is adopted to balance the importance between different samples, thereby more accurately measuring the performance of the model and improving the accuracy and robustness of scene recognition. The experimental results demonstrate that the average intersection to union ratios of the model on the publicly available datasets ADE20K and Cityscapes are 47.6% and 83.1%, respectively. Compared with the baseline of today?’?s advanced scene recognition methods, the results show that the proposed method is feasible and progressiveness.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Xuan JIA, Ye ZHANG, Xuling CHANG, Jianbo SUN. Scene recognition based on deep metric learning and semantic segmentation[J]. Chinese Journal of Liquid Crystals and Displays, 2025, 40(5): 740

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Sep. 20, 2024

    Accepted: --

    Published Online: Jun. 18, 2025

    The Author Email: Ye ZHANG (yolanda@spirits.ai)

    DOI:10.37188/CJLCD.2024-0288

    Topics