Cross-Modal Learning via Pairwise Constraints

He, Ran; Zhang, Man; Wang, Liang; Ji, Ye; Yin, Qiyue

doi:10.1109/TIP.2015.2466106

Computer Science > Computer Vision and Pattern Recognition

arXiv:1411.7798 (cs)

[Submitted on 28 Nov 2014]

Title:Cross-Modal Learning via Pairwise Constraints

Authors:Ran He, Man Zhang, Liang Wang, Ye Ji, Qiyue Yin

View PDF

Abstract:In multimedia applications, the text and image components in a web document form a pairwise constraint that potentially indicates the same semantic concept. This paper studies cross-modal learning via the pairwise constraint, and aims to find the common structure hidden in different modalities. We first propose a compound regularization framework to deal with the pairwise constraint, which can be used as a general platform for developing cross-modal algorithms. For unsupervised learning, we propose a cross-modal subspace clustering method to learn a common structure for different modalities. For supervised learning, to reduce the semantic gap and the outliers in pairwise constraints, we propose a cross-modal matching method based on compound ?21 regularization along with an iteratively reweighted algorithm to find the global optimum. Extensive experiments demonstrate the benefits of joint text and image modeling with semantically induced pairwise constraints, and show that the proposed cross-modal methods can further reduce the semantic gap between different modalities and improve the clustering/retrieval accuracy.

Comments:	12 pages, 5 figures, 70 references
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1411.7798 [cs.CV]
	(or arXiv:1411.7798v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1411.7798
Related DOI:	https://doi.org/10.1109/TIP.2015.2466106

Submission history

From: Ran He [view email]
[v1] Fri, 28 Nov 2014 10:11:03 UTC (910 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2014-11

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ran He
Man Zhang
Liang Wang
Ye Ji
Qiyue Yin

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-Modal Learning via Pairwise Constraints

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-Modal Learning via Pairwise Constraints

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators