PLIP: Language-Image Pre-training for Person Representation Learning

Language-image pre-training is an effective technique for learning powerful representations in general domains. However, when applied directly to person representation learning, these general-purpose pre-training methods perform unsatisfactorily because they neglect critical person-related characteristics, i.e., fine-grained attributes and identities. To address this issue, we propose PLIP, a novel language-image pre-training framework for person representation learning. Specifically, we design three pretext tasks: 1) Text-guided Image Colorization, which establishes the correspondence between person-related image regions and fine-grained color-part textual phrases; 2) Image-guided Attributes Prediction, which mines fine-grained attribute information of the person body in the image; and 3) Identity-based Vision-Language Contrast, which correlates the cross-modal representations at the identity level rather than the instance level. Moreover, to support our pre-training framework, we construct SYNTH-PEDES, a large-scale person dataset of image-text pairs built by automatically generating textual annotations. We pre-train PLIP on SYNTH-PEDES and evaluate the resulting models on a range of downstream person-centric tasks. PLIP not only significantly improves existing methods on all of these tasks, but also shows strong ability in zero-shot and domain-generalization settings. The code, dataset, and weights will be released at https://github.com/Zplusdragon/PLIP
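
As a rough illustration of the third pretext task, the sketch below shows one possible PyTorch formulation of an identity-level image-text contrastive loss, in which every pair sharing a person identity is treated as a positive rather than only the matched instance pair. The function name, tensor shapes, and temperature value are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def identity_contrastive_loss(img_feats, txt_feats, identity_ids, temperature=0.07):
    # img_feats, txt_feats: (B, D) L2-normalized embeddings from the two encoders.
    # identity_ids: (B,) integer person-identity labels for each image-text pair.
    logits = img_feats @ txt_feats.t() / temperature            # (B, B) cross-modal similarities
    # Identity-level positives: any pair that shares a person ID counts as a match,
    # not just the i-th image with the i-th caption (instance-level contrast).
    pos_mask = (identity_ids.unsqueeze(0) == identity_ids.unsqueeze(1)).float()
    targets = pos_mask / pos_mask.sum(dim=1, keepdim=True)      # soft targets over all positives
    loss_i2t = -(targets * F.log_softmax(logits, dim=1)).sum(dim=1).mean()
    loss_t2i = -(targets * F.log_softmax(logits.t(), dim=1)).sum(dim=1).mean()
    return 0.5 * (loss_i2t + loss_t2i)

# Toy usage with random features and four identities in a batch of eight.
B, D = 8, 256
img = F.normalize(torch.randn(B, D), dim=1)
txt = F.normalize(torch.randn(B, D), dim=1)
ids = torch.tensor([0, 0, 1, 1, 2, 2, 3, 3])
print(identity_contrastive_loss(img, txt, ids))
```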

Results

Task                          Dataset         Model              Metric   Value   Global Rank
Text based Person Retrieval   CUHK-PEDES      PLIP-RN50          R@1      69.23   #7
Text based Person Retrieval   CUHK-PEDES      PLIP-RN50          R@5      85.84   #7
Text based Person Retrieval   CUHK-PEDES      PLIP-RN50          R@10     91.16   #7
Person Re-Identification      DukeMTMC-reID   PLIP-RN50-MGN      mAP      81.7    #32
Text based Person Retrieval   ICFG-PEDES      PLIP-RN50          R@1      64.25   #5
Text based Person Retrieval   ICFG-PEDES      PLIP-RN50          R@5      80.88   #2
Text based Person Retrieval   ICFG-PEDES      PLIP-RN50          R@10     86.32   #2
Person Re-Identification      Market-1501     PLIP-RN50-ABDNet   mAP      91.2    #29
