Learning Neural Networks with Adaptive Regularization

Zhao, Han; Tsai, Yao-Hung Hubert; Salakhutdinov, Ruslan; Gordon, Geoffrey J.

arXiv:1907.06288 (cs)

[Submitted on 14 Jul 2019 (v1), last revised 23 Oct 2019 (this version, v2)]

Abstract: Feed-forward neural networks can be understood as a combination of an intermediate representation and a linear hypothesis. While most previous works aim to diversify the representations, we explore the complementary direction by performing an adaptive and data-dependent regularization motivated by the empirical Bayes method. Specifically, we propose to construct a matrix-variate normal prior (on weights) whose covariance matrix has a Kronecker product structure. This structure is designed to capture the correlations among neurons through backpropagation. Under the assumption of this Kronecker factorization, the prior encourages neurons to borrow statistical strength from one another. Hence, it leads to an adaptive and data-dependent regularization when training networks on small datasets. To optimize the model, we present an efficient block coordinate descent algorithm with analytical solutions. Empirically, we demonstrate that the proposed method helps networks converge to local optima with smaller stable ranks and spectral norms. These properties suggest better generalization, and we present empirical results to support this expectation. We also verify the effectiveness of the approach on multiclass classification and multitask regression problems with various network structures.
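To make the Kronecker-structured prior concrete, here is a minimal NumPy sketch of the quadratic penalty that a zero-mean matrix-variate normal prior places on a weight matrix W. It is an illustration of the general identity tr(U^{-1} W V^{-1} W^T) = vec(W)^T (V ⊗ U)^{-1} vec(W), not the paper's training procedure: the function names, the matrix sizes, and the randomly generated row and column covariances are all assumptions made for the example, and the block coordinate descent updates that learn the covariances are not reproduced.

```python
import numpy as np


def random_spd(dim, rng):
    """Return a random symmetric positive-definite matrix (illustrative stand-in)."""
    a = rng.standard_normal((dim, dim))
    return a @ a.T + dim * np.eye(dim)


def matrix_normal_penalty(w, row_cov, col_cov):
    """Quadratic term tr(row_cov^{-1} W col_cov^{-1} W^T) of a zero-mean
    matrix-variate normal, computed without forming the Kronecker product."""
    left = np.linalg.solve(row_cov, w)      # row_cov^{-1} W, shape (n_rows, n_cols)
    right = np.linalg.solve(col_cov, w.T)   # col_cov^{-1} W^T, shape (n_cols, n_rows)
    return float(np.trace(left @ right))


def kronecker_penalty(w, row_cov, col_cov):
    """Same quantity via the dense covariance kron(col_cov, row_cov) of vec(W)."""
    vec_w = w.flatten(order="F")            # column-stacking vec(W)
    full_cov = np.kron(col_cov, row_cov)    # shape (n_rows*n_cols, n_rows*n_cols)
    return float(vec_w @ np.linalg.solve(full_cov, vec_w))


# Hypothetical sizes chosen only for the demonstration.
rng = np.random.default_rng(0)
n_rows, n_cols = 4, 3
w = rng.standard_normal((n_rows, n_cols))
row_cov = random_spd(n_rows, rng)
col_cov = random_spd(n_cols, rng)

structured = matrix_normal_penalty(w, row_cov, col_cov)
dense = kronecker_penalty(w, row_cov, col_cov)
print(np.isclose(structured, dense))
```

With both covariances set to the identity, the penalty reduces to the squared Frobenius norm of W, which is ordinary weight decay. Non-identity covariances instead couple rows and columns of W, which is one way to read the abstract's statement that neurons borrow statistical strength from one another.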

Comments:	Camera ready version
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1907.06288 [cs.LG]
	(or arXiv:1907.06288v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1907.06288

Submission history

From: Yao-Hung Tsai
[v1] Sun, 14 Jul 2019 22:07:15 UTC (1,303 KB)
[v2] Wed, 23 Oct 2019 04:17:02 UTC (1,303 KB)