Laser & Optoelectronics Progress, Volume. 61, Issue 10, 1015001(2024)
Cross-Modal Person Re-Identification Based on Mask Reconstruction with Dynamic Attention
Cross-modal person re-identification is a challenging pedestrian retrieval task. Existing research focuses on reducing inter-modal differences by extracting modal shared features, while ignoring the processing of intra-modal differences and background interference. In this regard, a mask reconstruction and dynamic attention (MRDA) network is proposed to eliminate the influence of background clutter by reconstructing the features of human body regions, thereby enhancing the robustness of the network on background changes. In addition, the dynamic attention mechanism is combined to filter irrelevant information, dynamically mine and enhance the discriminating feature representations, and eliminate the influence of intra-modal differences. The experimental results show that the probability the first search result matches successfully (Rank-1) and mean average precision (mAP) in the all-search mode of the SYSU-MM01 dataset reach 70.55% and 63.89%, respectively. The Rank-1 and mAP in the visible-to-infrared retrieval mode of the RegDB dataset reach 91.80% and 82.08%, respectively. The effectiveness of the proposed method is verified on the public datasets.
Get Citation
Copy Citation Text
Kuo Zhang, Xinyue Fan, Jiahui Li, Gan Zhang. Cross-Modal Person Re-Identification Based on Mask Reconstruction with Dynamic Attention[J]. Laser & Optoelectronics Progress, 2024, 61(10): 1015001
Category: Machine Vision
Received: Jul. 17, 2023
Accepted: Oct. 9, 2023
Published Online: Apr. 2, 2024
The Author Email: Zhang Kuo (s210101189@stu.cqupt.edu.cn)