Constraining Implicit Space with MDL: Regularity Normalization as Unsupervised Attention

by Baihan Lin

Released as a article .

2020

Abstract

Inspired by the adaptation phenomenon of neuronal firing, we propose the regularity normalization (RN) as an unsupervised attention mechanism (UAM) which computes the statistical regularity in the implicit space of neural networks under the Minimum Description Length (MDL) principle. Treating the neural network optimization process as a partially observable model selection problem, UAM constrained the implicit space by a normalization factor, the universal code length. We compute this universal code incrementally across neural network layers and demonstrated the flexibility to include data priors such as top-down attention and other oracle information. Empirically, our approach outperforms existing normalization methods in tackling limited, imbalanced and non-stationary input distribution in computer vision and reinforcement learning tasks. Lastly, UAM tracks dependency and critical learning stages across layers and recurrent time steps of deep networks.
In text/plain format

Type article
Stage

submitted

Date 2020-05-26
Version v10
Language en ^?

arXiv 1902.10658v10

Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)

Lookup Links

Worldcat
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar

Revision

This is a specific, static metadata record, not necessarily linked to any current entity in the catalog.

Catalog Record
Revision: 85b249ae-dc03-46c9-8336-6e33a45e58b7
API URL: JSON

Constraining Implicit Space with MDL: Regularity Normalization as Unsupervised Attention release_rev_85b249ae-dc03-46c9-8336-6e33a45e58b7

Abstract

Constraining Implicit Space with MDL: Regularity Normalization as Unsupervised Attention `release_rev_85b249ae-dc03-46c9-8336-6e33a45e58b7`