Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Thesis icon

Thesis

Self-supervised learning using motion and visualizing convolutional neural networks

Abstract:

We propose a novel method for learning convolutional image representations without manual supervision. We use motion in the form of optical-flow, to supervise representations of static images. Training a network to predict flow from a single image can be needlessly difficult due to intrinsic ambiguities in this prediction task. We instead propose two simpler learning goals: (a) embed pixels such that the similarity between their embeddings matches that between their optical-flow vectors (C...

Expand abstract

Actions


Access Document


Authors


More by this author
Division:
MPLS
Department:
Engineering Science
Department:
University of Oxford
Role:
Author

Contributors

Department:
University of Oxford
Role:
Supervisor
Department:
University of Oxford
Role:
Examiner
Department:
Universität Bern
Role:
Examiner
DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford
Language:
English
Keywords:
Subjects:
UUID:
uuid:05ef7004-0bb1-4852-be1f-892daf694430
Deposit date:
2019-02-23

Salient Deconvolutional Networks
Description:

Deconvolutional networks are a popular architecture. However, when used to reverse an existing model for visualization purposes, the process involves a few heuristic choices that cannot be immediately interpreted. In this paper, we introduce a family of reversed networks which include deconvolution, backpropagation and network saliency as special but important cases. We use this construction to thoroughly investigate the different architectures and compare them in terms of quality of the prod...

Expand description

Self-Supervised Segmentation by Grouping Optical-Flow
Description:
We propose to self-supervise a convolutional neural network operating on images using temporal information from videos. The task is to learn a representation of single images and the supervision for this is obtained by learning to group image pixels in such a way that their collective motion is “coherent”. This learning by grouping approach is used as a pre-training as well as segmentation strategy. Preliminary results suggest that the segments obtained are reasonable and the representation learned transfers well for classification.

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP