Tag Prediction at Flickr: a View from the Darkroom

Kofi Boakye; Sachin Farfade; Hamid Izadinia; Yannis Kalantidis; and
  Pierre Garrigues

by Kofi Boakye, Sachin Farfade, Hamid Izadinia, Yannis Kalantidis, and Pierre Garrigues

Released as a article .

2016

Abstract

Automated photo tagging has established itself as one of the most compelling applications of deep learning. While deep convolutional neural networks have repeatedly demonstrated top performance on standard datasets for classification, there are a number of often overlooked but important considerations when deploying this technology in a real-world scenario. In this paper, we present our efforts in developing a large-scale photo tagging system for Flickr photo search. We discuss topics including how to 1) select the tags that matter most to our users; 2) develop lightweight, high-performance models for tag prediction; and 3) leverage the power of large amounts of noisy data for training. Our results demonstrate that, for real-world datasets, training exclusively with this noisy data yields performance on par with the standard paradigm of first pre-training on clean data and then fine-tuning. In addition, we observe that the models trained with user-generated data can yield better fine-tuning results when a small amount of clean data is available. As such, we advocate for the approach of harnessing user-generated data in large-scale systems.
In text/plain format

Archived Files and Locations

application/pdf 115.9 kB
file_qbn2bvulincx5bckgngudx6hl4 arxiv.org (repository)
web.archive.org (webarchive)

Read Archived PDF

Preserved and Accessible

Type article
Stage

submitted

Date 2016-12-07
Version v2
Language en ^?

arXiv 1612.01922v2

Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)

Cite This

BibTeX
CSL-JSON
MLA
Harvard

Lookup Links

Worldcat
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar

Catalog Record
Revision: 54a99f99-1c5b-452d-b4d1-e549bf4c5617
API URL: JSON

Edit Metadata View History

Tag Prediction at Flickr: a View from the Darkroom release_smqo32e325d2ff2hswrz4nyb3u

Abstract

Archived Files and Locations

Tag Prediction at Flickr: a View from the Darkroom `release_smqo32e325d2ff2hswrz4nyb3u`