Master's Thesis : Deep Learning for Visual Recognition

Cadène, Rémi; Thome, Nicolas; Cord, Matthieu

Computer Science > Computer Vision and Pattern Recognition

arXiv:1610.05567 (cs)

[Submitted on 18 Oct 2016]

Title:Master's Thesis : Deep Learning for Visual Recognition

Authors:Rémi Cadène, Nicolas Thome, Matthieu Cord

View PDF

Abstract:The goal of our research is to develop methods advancing automatic visual recognition. In order to predict the unique or multiple labels associated to an image, we study different kind of Deep Neural Networks architectures and methods for supervised features learning. We first draw up a state-of-the-art review of the Convolutional Neural Networks aiming to understand the history behind this family of statistical models, the limit of modern architectures and the novel techniques currently used to train deep CNNs. The originality of our work lies in our approach focusing on tasks with a low amount of data. We introduce different models and techniques to achieve the best accuracy on several kind of datasets, such as a medium dataset of food recipes (100k images) for building a web API, or a small dataset of satellite images (6,000) for the DSG online challenge that we've won. We also draw up the state-of-the-art in Weakly Supervised Learning, introducing different kind of CNNs able to localize regions of interest. Our last contribution is a framework, build on top of Torch7, for training and testing deep models on any visual recognition tasks and on datasets of any scale.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1610.05567 [cs.CV]
	(or arXiv:1610.05567v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1610.05567

Submission history

From: Rémi Cadène [view email]
[v1] Tue, 18 Oct 2016 12:26:49 UTC (6,959 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2016-10

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Rémi Cadène
Nicolas Thome
Matthieu Cord

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Master's Thesis : Deep Learning for Visual Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Master's Thesis : Deep Learning for Visual Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators