Scene recognition based on deep metric learning and semantic segmentation

To address the issue of low recognition accuracy in scene images caused by subtle inter-class differences and ambiguous intra-class classifications, this paper proposes a novel semantic segmentation framework. By introducing deep metric learning and focusing on the semantic relationships between pixels, the model’?s recognition accuracy can be improved. Firstly, feature extraction is performed through the hollow space pyramid pooling module. Then, in the decoding process, the shallow high-resolution features and deep low resolution features are fused using a structure to better restore the details and boundaries in the image. Secondly, in the deep metric learning module, a well structured pixel semantic embedding space is learned to effectively classify pixels by maximizing the Euclidean distance between pixels of different categories and minimizing the Euclidean distance between pixels of the same category. Finally, a fusion loss function combining weighted focus loss and contrast loss is adopted to balance the importance between different samples, thereby more accurately measuring the performance of the model and improving the accuracy and robustness of scene recognition. The experimental results demonstrate that the average intersection to union ratios of the model on the publicly available datasets ADE20K and Cityscapes are 47.6% and 83.1%, respectively. Compared with the baseline of today?’?s advanced scene recognition methods, the results show that the proposed method is feasible and progressiveness.

AI Video Guide

AI Picture Guide

AI One Sentence

AI Short Abstract

Note: This section is automatically generated by AI . The website and platform operators shall not be liable for any commercial or legal consequences arising from your use of AI generated content on this website. Please be aware of this.

Keywords

class imbalance deep learning deep metric learning scene recognition semantic segmentation

Tools

Get Citation

Copy Citation Text

Xuan JIA, Ye ZHANG, Xuling CHANG, Jianbo SUN. Scene recognition based on deep metric learning and semantic segmentation[J]. Chinese Journal of Liquid Crystals and Displays, 2025, 40(5): 740

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites

Paper Information

Category:

Received: Sep. 20, 2024

Accepted: --

Published Online: Jun. 18, 2025

The Author Email: Ye ZHANG (yolanda@spirits.ai)

DOI:10.37188/CJLCD.2024-0288

Topics