Generalized Negative Correlation Learning for Deep Ensembling

Buschjäger, Sebastian; Pfahler, Lukas; Morik, Katharina

Computer Science > Machine Learning

arXiv:2011.02952 (cs)

[Submitted on 5 Nov 2020 (v1), last revised 9 Dec 2020 (this version, v2)]

Title:Generalized Negative Correlation Learning for Deep Ensembling

Authors:Sebastian Buschjäger, Lukas Pfahler, Katharina Morik

View PDF

Abstract:Ensemble algorithms offer state of the art performance in many machine learning applications. A common explanation for their excellent performance is due to the bias-variance decomposition of the mean squared error which shows that the algorithm's error can be decomposed into its bias and variance. Both quantities are often opposed to each other and ensembles offer an effective way to manage them as they reduce the variance through a diverse set of base learners while keeping the bias low at the same time. Even though there have been numerous works on decomposing other loss functions, the exact mathematical connection is rarely exploited explicitly for ensembling, but merely used as a guiding principle. In this paper, we formulate a generalized bias-variance decomposition for arbitrary twice differentiable loss functions and study it in the context of Deep Learning. We use this decomposition to derive a Generalized Negative Correlation Learning (GNCL) algorithm which offers explicit control over the ensemble's diversity and smoothly interpolates between the two extremes of independent training and the joint training of the ensemble. We show how GNCL encapsulates many previous works and discuss under which circumstances training of an ensemble of Neural Networks might fail and what ensembling method should be favored depending on the choice of the individual networks. We make our code publicly available under this https URL

Comments:	12 (+8) pages, 1(+1) figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2011.02952 [cs.LG]
	(or arXiv:2011.02952v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2011.02952

Submission history

From: Sebastian Buschjäger [view email]
[v1] Thu, 5 Nov 2020 16:29:22 UTC (2,915 KB)
[v2] Wed, 9 Dec 2020 08:19:49 UTC (239 KB)

Computer Science > Machine Learning

Title:Generalized Negative Correlation Learning for Deep Ensembling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generalized Negative Correlation Learning for Deep Ensembling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators