Compression based bound for non-compressed network: unified generalization error analysis of large compressible deep neural network

Suzuki, Taiji; Abe, Hiroshi; Nishimura, Tomoaki

Computer Science > Machine Learning

arXiv:1909.11274 (cs)

[Submitted on 25 Sep 2019 (v1), last revised 21 Jun 2020 (this version, v3)]

Title:Compression based bound for non-compressed network: unified generalization error analysis of large compressible deep neural network

Authors:Taiji Suzuki, Hiroshi Abe, Tomoaki Nishimura

View PDF

Abstract:One of the biggest issues in deep learning theory is the generalization ability of networks with huge model size. The classical learning theory suggests that overparameterized models cause overfitting. However, practically used large deep models avoid overfitting, which is not well explained by the classical approaches. To resolve this issue, several attempts have been made. Among them, the compression based bound is one of the promising approaches. However, the compression based bound can be applied only to a compressed network, and it is not applicable to the non-compressed original network. In this paper, we give a unified frame-work that can convert compression based bounds to those for non-compressed original networks. The bound gives even better rate than the one for the compressed network by improving the bias term. By establishing the unified frame-work, we can obtain a data dependent generalization error bound which gives a tighter evaluation than the data independent ones.

Comments:	published in ICLR2020
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1909.11274 [cs.LG]
	(or arXiv:1909.11274v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1909.11274

Submission history

From: Taiji Suzuki [view email]
[v1] Wed, 25 Sep 2019 03:43:14 UTC (705 KB)
[v2] Thu, 26 Sep 2019 05:40:09 UTC (131 KB)
[v3] Sun, 21 Jun 2020 16:39:16 UTC (107 KB)

Computer Science > Machine Learning

Title:Compression based bound for non-compressed network: unified generalization error analysis of large compressible deep neural network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Compression based bound for non-compressed network: unified generalization error analysis of large compressible deep neural network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators