Multi-Granularity Matching Transformer for Text-Based Person Search


Abstract:

Text-based person search aims to retrieve the most relevant pedestrian images from an image gallery based on textual descriptions. Most existing methods rely on two separate encoders to extract the image and text features, and then elaborately design various schemes to bridge the gap between the image and text modalities. However, the shallow interaction between the two modalities in these methods is still insufficient to eliminate the modality gap. To address this problem, we propose TransTPS, a transformer-based framework that enables deeper interaction between the two modalities through the self-attention mechanism in the transformer, effectively alleviating the modality gap. In addition, because the image modality exhibits small inter-class variance and large intra-class variance, we further develop two techniques to overcome these limitations. Specifically, Cross-modal Multi-Granularity Matching (CMGM) is proposed to address the problem caused by small inter-class variance and to facilitate distinguishing pedestrians with similar appearance. Furthermore, Contrastive Loss with Weakly Positive pairs (CLWP) is introduced to mitigate the impact of large intra-class variance and to help retrieve more target images. Experiments on the CUHK-PEDES and RSTPReID datasets demonstrate that the proposed framework achieves state-of-the-art performance compared with previous methods.
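
The abstract only names the CMGM and CLWP components and does not describe their formulation. Purely as an illustration of the general idea behind a contrastive loss with weakly positive pairs, the following PyTorch-style sketch down-weights, rather than repels, gallery images that share an identity with the query text but are not its annotated match; the function name, the soft-target weighting scheme, and the weak_weight value are assumptions for illustration, not the paper's actual loss.

import torch
import torch.nn.functional as F

def clwp_loss(img_feats, txt_feats, person_ids, tau=0.07, weak_weight=0.3):
    """Illustrative contrastive loss with weakly positive pairs.

    img_feats, txt_feats: (B, D) embeddings, aligned so that row i of each
        tensor comes from the same annotated image-text pair.
    person_ids: (B,) identity label of each pair.
    Pairs at the same row index are strong positives; pairs that share a
    person id but sit at different row indices are treated as weak positives
    and receive a reduced target weight instead of being pushed apart.
    """
    img_feats = F.normalize(img_feats, dim=-1)
    txt_feats = F.normalize(txt_feats, dim=-1)
    logits = txt_feats @ img_feats.t() / tau                     # (B, B) text-to-image similarities

    same_id = person_ids.unsqueeze(0) == person_ids.unsqueeze(1)  # (B, B) same-identity mask
    strong = torch.eye(len(person_ids), dtype=torch.bool, device=logits.device)
    weak = same_id & ~strong

    # Soft target distribution: 1.0 for the matched pair, weak_weight for other
    # images of the same identity, 0 for negatives; each row is normalised.
    targets = strong.float() + weak_weight * weak.float()
    targets = targets / targets.sum(dim=1, keepdim=True)

    log_probs = F.log_softmax(logits, dim=1)
    return -(targets * log_probs).sum(dim=1).mean()

Called with (B, D) image and text features and a (B,) tensor of person identities, this sketch reduces to a standard cross-modal InfoNCE-style objective when weak_weight is set to 0.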
Published in: IEEE Transactions on Multimedia (Volume: 26)
Page(s): 4281 - 4293
Date of Publication: 09 October 2023
