A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models

Gruber, Sebastian G.; Buettner, Florian

arXiv:2310.05833v1 (cs)

[Submitted on 9 Oct 2023 (this version), latest version 10 Jul 2024 (v2)]

Abstract: Generative models, such as large language models, are becoming increasingly relevant in our daily lives, yet a theoretical framework to assess their generalization behavior and uncertainty does not exist. In particular, uncertainty estimation is commonly handled in an ad hoc, task-dependent manner; for example, approaches developed for natural language cannot be transferred to image generation. In this paper, we introduce the first bias-variance-covariance decomposition for kernel scores and their associated entropy. We propose unbiased and consistent estimators for each quantity that require only generated samples, not the underlying model itself. As an application, we offer a generalization evaluation of diffusion models and find that mode collapse of minority groups is a phenomenon contrary to overfitting. Further, we demonstrate that variance and predictive kernel entropy are viable measures of uncertainty for image, audio, and language generation. Specifically, our approach to uncertainty estimation is more predictive of performance on the CoQA and TriviaQA question answering datasets than existing baselines and can also be applied to closed-source models.
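The abstract gives no formulas, so the following is only an illustrative sketch of how a sample-only estimator of this kind might look, not the authors' implementation. It assumes one common sign convention, in which the kernel score of a distribution P at a target y is E[k(X, X')] - 2 E[k(X, y)] and the associated entropy is -E[k(X, X')], with X, X' drawn independently from P. The Gaussian RBF kernel, the vector representation of samples, and all function names are hypothetical choices made for the example.

```python
import math
from itertools import combinations


def rbf_kernel(x, y, bandwidth=1.0):
    """Gaussian RBF kernel on equal-length numeric vectors (an assumed choice)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mean_pairwise_kernel(samples, kernel=rbf_kernel):
    """Unbiased U-statistic estimate of E[k(X, X')] for independent X, X'.

    Averages the kernel over all pairs of distinct samples; the kernel is
    assumed to be symmetric, so each unordered pair is evaluated once.
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    total = sum(kernel(a, b) for a, b in combinations(samples, 2))
    return 2.0 * total / (n * (n - 1))


def kernel_entropy_estimate(samples, kernel=rbf_kernel):
    """Estimate of -E[k(X, X')] (assumed convention); larger means more spread."""
    return -mean_pairwise_kernel(samples, kernel)


def kernel_score_estimate(samples, target, kernel=rbf_kernel):
    """Estimate of E[k(X, X')] - 2 E[k(X, y)] (assumed convention); lower is better."""
    cross = sum(kernel(x, target) for x in samples) / len(samples)
    return mean_pairwise_kernel(samples, kernel) - 2.0 * cross


# Samples that agree with each other give low entropy; scattered ones give high entropy.
concentrated = [[0.0], [0.0], [0.0]]
scattered = [[0.0], [100.0]]
print(kernel_entropy_estimate(concentrated) < kernel_entropy_estimate(scattered))
```

Only generated samples enter these functions, which is the property the abstract highlights. For text or images, the samples would first need some vector representation or a kernel defined directly on the outputs; that choice is not specified here.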

Comments:	Preprint
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2310.05833 [cs.LG]
	(or arXiv:2310.05833v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.05833

Submission history

From: Sebastian Gregor Gruber
[v1] Mon, 9 Oct 2023 16:22:11 UTC (1,194 KB)
[v2] Wed, 10 Jul 2024 14:37:50 UTC (3,846 KB)
