Variational Gibbs Inference for Statistical Model Estimation from Incomplete Data

Simkus, Vaidotas; Rhodes, Benjamin; Gutmann, Michael U.

arXiv:2111.13180 (cs)

[Submitted on 25 Nov 2021 (v1), last revised 15 Aug 2023 (this version, v4)]

Abstract: Statistical models are central to machine learning, with broad applicability across a range of downstream tasks. The models are controlled by free parameters that are typically estimated from data by maximum-likelihood estimation or approximations thereof. However, when faced with real-world data sets, many of the models run into a critical issue: they are formulated in terms of fully observed data, whereas in practice the data sets are plagued with missing data. The theory of statistical model estimation from incomplete data is conceptually similar to the estimation of latent-variable models, where powerful tools such as variational inference (VI) exist. However, in contrast to standard latent-variable models, parameter estimation with incomplete data often requires estimating exponentially many conditional distributions of the missing variables, making standard VI methods intractable. We address this gap by introducing variational Gibbs inference (VGI), a new general-purpose method to estimate the parameters of statistical models from incomplete data. We validate VGI on a set of synthetic and real-world estimation tasks, estimating important machine learning models such as variational autoencoders and normalising flows from incomplete data. The proposed method, whilst general-purpose, achieves competitive or better performance than existing model-specific estimation methods.
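The scaling issue mentioned in the abstract can be illustrated with a small counting sketch. It is not taken from the paper: `missingness_patterns` is a hypothetical helper, and the only assumption is that each distinct set of missing variables calls for its own conditional distribution of the missing values given the observed ones. With d variables there are 2^d - 1 non-empty sets of missing indices.

```python
from itertools import combinations


def missingness_patterns(d):
    """Return every non-empty set of missing indices among d variables.

    Hypothetical helper for illustration only; each returned set stands
    for one conditional distribution p(x_missing | x_observed).
    """
    indices = range(d)
    return [
        frozenset(subset)
        for size in range(1, d + 1)
        for subset in combinations(indices, size)
    ]


if __name__ == "__main__":
    # Enumerate the patterns explicitly for a tiny example.
    for pattern in missingness_patterns(3):
        print(sorted(pattern))

    # The count grows as 2**d - 1, so enumeration quickly becomes infeasible.
    for d in (5, 10, 20):
        print(d, 2**d - 1)
```

Fitting a separate variational distribution per pattern therefore stops being practical even for moderate d, which is the gap the abstract says VGI is designed to address.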

Comments:	Published at Journal of Machine Learning Research (JMLR)
Subjects:	Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
MSC classes:	62D10
ACM classes:	I.2.6; G.3
Cite as:	arXiv:2111.13180 [cs.LG]
	(or arXiv:2111.13180v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.13180
Journal reference:	Journal of Machine Learning Research, 24(196), 1-72, 2023

Submission history

From: Vaidotas Simkus
[v1] Thu, 25 Nov 2021 17:22:22 UTC (6,752 KB)
[v2] Mon, 9 May 2022 09:04:05 UTC (6,742 KB)
[v3] Thu, 2 Mar 2023 14:31:28 UTC (6,587 KB)
[v4] Tue, 15 Aug 2023 08:57:59 UTC (6,587 KB)
