Tag Prediction at Flickr: a View from the Darkroom

Boakye, Kofi; Farfade, Sachin; Izadinia, Hamid; Kalantidis, Yannis; Garrigues, Pierre

Computer Science > Computer Vision and Pattern Recognition

arXiv:1612.01922 (cs)

[Submitted on 6 Dec 2016 (v1), last revised 19 Dec 2017 (this version, v3)]

Title:Tag Prediction at Flickr: a View from the Darkroom

Authors:Kofi Boakye, Sachin Farfade, Hamid Izadinia, Yannis Kalantidis, Pierre Garrigues

View PDF

Abstract:Automated photo tagging has established itself as one of the most compelling applications of deep learning. While deep convolutional neural networks have repeatedly demonstrated top performance on standard datasets for classification, there are a number of often overlooked but important considerations when deploying this technology in a real-world scenario. In this paper, we present our efforts in developing a large-scale photo tagging system for Flickr photo search. We discuss topics including how to 1) select the tags that matter most to our users; 2) develop lightweight, high-performance models for tag prediction; and 3) leverage the power of large amounts of noisy data for training. Our results demonstrate that, for real-world datasets, training exclusively with this noisy data yields performance on par with the standard paradigm of first pre-training on clean data and then fine-tuning. In addition, we observe that the models trained with user-generated data can yield better fine-tuning results when a small amount of clean data is available. As such, we advocate for the approach of harnessing user-generated data in large-scale systems.

Comments:	Presented at the ACM Multimedia Thematic Workshops, 2017
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1612.01922 [cs.CV]
	(or arXiv:1612.01922v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1612.01922

Submission history

From: Kofi Boakye [view email]
[v1] Tue, 6 Dec 2016 17:39:49 UTC (35 KB)
[v2] Wed, 7 Dec 2016 21:06:18 UTC (50 KB)
[v3] Tue, 19 Dec 2017 23:37:04 UTC (8,213 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Tag Prediction at Flickr: a View from the Darkroom

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Tag Prediction at Flickr: a View from the Darkroom

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators