Seeing the Big Picture: Deep Embedding with Contextual Evidences

Zheng, Liang; Wang, Shengjin; He, Fei; Tian, Qi

Abstract:In the Bag-of-Words (BoW) model based image retrieval task, the precision of visual matching plays a critical role in improving retrieval performance. Conventionally, local cues of a keypoint are employed. However, such strategy does not consider the contextual evidences of a keypoint, a problem which would lead to the prevalence of false matches. To address this problem, this paper defines "true match" as a pair of keypoints which are similar on three levels, i.e., local, regional, and global. Then, a principled probabilistic framework is established, which is capable of implicitly integrating discriminative cues from all these feature levels.
Specifically, the Convolutional Neural Network (CNN) is employed to extract features from regional and global patches, leading to the so-called "Deep Embedding" framework. CNN has been shown to produce excellent performance on a dozen computer vision tasks such as image classification and detection, but few works have been done on BoW based image retrieval. In this paper, firstly we show that proper pre-processing techniques are necessary for effective usage of CNN feature. Then, in the attempt to fit it into our model, a novel indexing structure called "Deep Indexing" is introduced, which dramatically reduces memory usage.
Extensive experiments on three benchmark datasets demonstrate that, the proposed Deep Embedding method greatly promotes the retrieval accuracy when CNN feature is integrated. We show that our method is efficient in terms of both memory and time cost, and compares favorably with the state-of-the-art methods.

Comments:	10 pages, 13 figures, 7 tables, submitted to ACM Multimedia 2014
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1406.0132 [cs.CV]
	(or arXiv:1406.0132v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1406.0132

Computer Science > Computer Vision and Pattern Recognition

Title:Seeing the Big Picture: Deep Embedding with Contextual Evidences

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators