Laser & Optoelectronics Progress, Volume. 62, Issue 12, 1215010(2025)
Cross-Modal Pedestrian Re-Identification Combining Frequency-Domain Attention and Modal Co-Feature Optimization
Existing cross-modal pedestrian re-identification methods disregard noise interference and valuable information for pedestrian identification in modal specific features. Hence, a cross-modal pedestrian re-identification network combining frequency-domain attention and modal co-feature optimization is proposed to effectively suppress noise interference in different modal spaces and deeply mine and utilize the implicit identity-discrimination informations in modal specific features. First, a two-stream network integrated with a frequency-domain attention mechanism is used to effectively filter noise and extract modal shared and specific features. Second, the extracted modal specific features are purified and restored to reduce modal-style differences, while identity-discrimination informations are extracted and strengthened independent of modalities. Thereafter, this implicit discrimination informations are used to guide modal shared features to enhance the model's recognizability. Finally, variance aggregation loss is introduced to minimize the modal differences among the enhanced modal shared features. Based on extensive experimental results, the proposed method demonstrates significant performance improvement on three public datasets. In particular, its Rank-1 accuracy and mean average precision are 82.14% and 81.59%, respectively, in the all-search mode on the SYSU-MM01 dataset.
Get Citation
Copy Citation Text
Taizhe Tan, Mengrou Li, Zhuo Yang, Zhiyuan Gong. Cross-Modal Pedestrian Re-Identification Combining Frequency-Domain Attention and Modal Co-Feature Optimization[J]. Laser & Optoelectronics Progress, 2025, 62(12): 1215010
Category: Machine Vision
Received: Nov. 15, 2024
Accepted: Jan. 6, 2025
Published Online: Jun. 9, 2025
The Author Email: Mengrou Li (2794653908@qq.com)