Computer Science > Computer Vision and Pattern Recognition
[Submitted on 21 Apr 2016 (this version), latest version 30 Jul 2016 (v2)]
Title: LOH and behold: Web-scale visual search, recommendation and clustering using Locally Optimized Hashing
Abstract: We present a multimedia system based on a novel matching signature able to perform de-duplication, search, clustering and visual recommendations in a way that is easily implemented in generic distributed computing environments. Starting from a state-of-the-art algorithm, we propose a novel hashing-based matching system that allows for fast search and is easily implemented in distributed system languages like PIG, as it only requires set intersections and summations to compute. We make the following contributions: a) we propose a novel hashing method for visual search using locally optimized codes that performs on par with other state-of-the-art hashing approaches but offers more flexibility in terms of ranking, b) we extend our matching framework to multiple image queries and provide a simple and scalable solution that can efficiently produce visual recommendations for query sets of thousands of images and cluster collections of hundreds of millions of images, c) we show that this same representation can be used for efficient de-duplication of image search results, performing better than traditional hashing approaches, while still requiring only a few milliseconds to run. In this paper we report results on datasets of up to 100 million images, but in practice our system can find and rank similar images for millions of users from a search set of hundreds of millions of images in a runtime on the order of one hour on a large Hadoop cluster.
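The abstract states that matching requires only set intersections and summations. The sketch below illustrates that general idea and is not the paper's LOH signature: it assumes, hypothetically, that each image is already described by one integer code per subspace, and the names `signature`, `similarity` and `rank` are invented for illustration. Scoring a multi-image query is taken here to be a plain sum of per-query scores, which is likewise an assumption.

```python
from collections import defaultdict


def signature(codes):
    """Turn a list of per-subspace codes into a set of (subspace, code) keys."""
    return {(i, c) for i, c in enumerate(codes)}


def similarity(sig_a, sig_b, weights=None):
    """Sum the weights of the keys two signatures share.

    With no weights, every shared key counts as 1.0 (assumed default).
    """
    shared = sig_a & sig_b  # set intersection
    if weights is None:
        return float(len(shared))
    return sum(weights.get(key, 1.0) for key in shared)  # summation


def rank(query_sigs, index):
    """Score indexed images against a set of query images.

    Scores from the individual queries are summed; images that share
    no key with any query are left out of the result.
    """
    scores = defaultdict(float)
    for query in query_sigs:
        for image_id, sig in index.items():
            score = similarity(query, sig)
            if score > 0:
                scores[image_id] += score
    # Highest score first; ties broken by image id for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


index = {
    "a": signature([3, 7, 1, 4]),
    "b": signature([3, 7, 9, 4]),
    "c": signature([0, 2, 5, 6]),
}
print(rank([signature([3, 7, 1, 4])], index))
```

Because each score is a sum over shared keys, the same computation could be expressed as a join on keys followed by a grouped sum, which is the kind of operation the abstract says maps well onto distributed languages such as PIG.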
Submission history
From: Yannis Kalantidis
[v1] Thu, 21 Apr 2016 20:23:55 UTC (26,970 KB)
[v2] Sat, 30 Jul 2016 02:34:52 UTC (7,222 KB)