Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association

Chen, Dapeng; Li, Hongsheng; Liu, Xihui; Shen, Yantao; Yuan, Zejian; Wang, Xiaogang

Computer Science > Computer Vision and Pattern Recognition

arXiv:1808.01571 (cs)

[Submitted on 5 Aug 2018]

Title:Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association

Authors:Dapeng Chen, Hongsheng Li, Xihui Liu, Yantao Shen, Zejian Yuan, Xiaogang Wang

View PDF

Abstract:Person re-identification is an important task that requires learning discriminative visual features for distinguishing different person identities. Diverse auxiliary information has been utilized to improve the visual feature learning. In this paper, we propose to exploit natural language description as additional training supervisions for effective visual features. Compared with other auxiliary information, language can describe a specific person from more compact and semantic visual aspects, thus is complementary to the pixel-level image data. Our method not only learns better global visual feature with the supervision of the overall description but also enforces semantic consistencies between local visual and linguistic features, which is achieved by building global and local image-language associations. The global image-language association is established according to the identity labels, while the local association is based upon the implicit correspondences between image regions and noun phrases. Extensive experiments demonstrate the effectiveness of employing language as training supervisions with the two association schemes. Our method achieves state-of-the-art performance without utilizing any auxiliary information during testing and shows better performance than other joint embedding methods for the image-language association.

Comments:	ECCV
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1808.01571 [cs.CV]
	(or arXiv:1808.01571v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1808.01571

Submission history

From: Dapeng Chen [view email]
[v1] Sun, 5 Aug 2018 07:19:24 UTC (9,281 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators