How does Weight Correlation Affect the Generalisation Ability of Deep Neural Networks

Jin, Gaojie; Yi, Xinping; Zhang, Liang; Zhang, Lijun; Schewe, Sven; Huang, Xiaowei

arXiv:2010.05983v3 (cs)

[Submitted on 12 Oct 2020 (v1), last revised 17 Oct 2020 (this version, v3)]

Abstract: This paper studies the novel concept of weight correlation in deep neural networks and discusses its impact on the networks' generalisation ability. For fully-connected layers, the weight correlation is defined as the average cosine similarity between weight vectors of neurons, and for convolutional layers, the weight correlation is defined as the cosine similarity between filter matrices. Theoretically, we show that weight correlation can, and should, be incorporated into the PAC Bayesian framework for the generalisation of neural networks, and that the resulting generalisation bound is monotonic with respect to the weight correlation. We formulate a new complexity measure, which lifts the PAC Bayes measure with weight correlation, and experimentally confirm that it ranks the generalisation errors of a set of networks more precisely than existing measures. More importantly, we develop a new regulariser for training and provide extensive experiments showing that our approach can greatly reduce the generalisation error.
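
To make the fully-connected definition concrete, the following is a minimal sketch of an average pairwise cosine similarity over the neurons of one layer. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the layer is laid out, which pairs are averaged, or whether the sign of the similarity is kept. Here each row is assumed to be one neuron's weight vector, the average runs over distinct pairs, and the hypothetical `use_abs` flag controls whether absolute values are taken. The names `cosine_similarity` and `weight_correlation` are also hypothetical.

```python
import math
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def weight_correlation(weights, use_abs=True):
    """Average pairwise cosine similarity between the rows of `weights`.

    Assumptions (not stated in the abstract): each row is one neuron's
    weight vector, only pairs of distinct neurons are averaged, and
    `use_abs` takes absolute values so that opposite directions also
    count as correlated.
    """
    pairs = list(combinations(weights, 2))
    if not pairs:
        raise ValueError("need at least two neurons")
    sims = [cosine_similarity(u, v) for u, v in pairs]
    if use_abs:
        sims = [abs(s) for s in sims]
    return sum(sims) / len(sims)


# Two toy layers: orthogonal neurons versus parallel neurons.
orthogonal = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[1.0, 2.0], [2.0, 4.0]]
print(weight_correlation(orthogonal))  # no shared direction
print(weight_correlation(aligned))     # same direction, close to 1
```

Under these assumptions the value lies between 0 and 1 when `use_abs` is true, with orthogonal neurons at the low end and parallel neurons at the high end. A zero weight vector would cause a division by zero, so a practical version would need to guard against that case.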

Comments:	Accepted by NeurIPS 2020 conference
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2010.05983 [cs.LG]
	(or arXiv:2010.05983v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2010.05983

Submission history

From: Gaojie Jin
[v1] Mon, 12 Oct 2020 19:18:27 UTC (4,069 KB)
[v2] Thu, 15 Oct 2020 09:33:10 UTC (4,068 KB)
[v3] Sat, 17 Oct 2020 22:38:21 UTC (4,068 KB)
