Beyond Normal: On the Evaluation of Mutual Information Estimators

Czyż, Paweł; Grabowski, Frederic; Vogt, Julia E.; Beerenwinkel, Niko; Marx, Alexander

Statistics > Machine Learning

arXiv:2306.11078 (stat)

[Submitted on 19 Jun 2023 (v1), last revised 16 Oct 2023 (this version, v2)]

Title:Beyond Normal: On the Evaluation of Mutual Information Estimators

Authors:Paweł Czyż, Frederic Grabowski, Julia E. Vogt, Niko Beerenwinkel, Alexander Marx

View PDF

Abstract:Mutual information is a general statistical dependency measure which has found applications in representation learning, causality, domain generalization and computational biology. However, mutual information estimators are typically evaluated on simple families of probability distributions, namely multivariate normal distribution and selected distributions with one-dimensional random variables. In this paper, we show how to construct a diverse family of distributions with known ground-truth mutual information and propose a language-independent benchmarking platform for mutual information estimators. We discuss the general applicability and limitations of classical and neural estimators in settings involving high dimensions, sparse interactions, long-tailed distributions, and high mutual information. Finally, we provide guidelines for practitioners on how to select appropriate estimator adapted to the difficulty of problem considered and issues one needs to consider when applying an estimator to a new data set.

Comments:	Accepted at NeurIPS 2023. Code available at this https URL
Subjects:	Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:2306.11078 [stat.ML]
	(or arXiv:2306.11078v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2306.11078

Submission history

From: Paweł Czyż [view email]
[v1] Mon, 19 Jun 2023 17:26:34 UTC (13,477 KB)
[v2] Mon, 16 Oct 2023 13:17:17 UTC (36,399 KB)

Statistics > Machine Learning

Title:Beyond Normal: On the Evaluation of Mutual Information Estimators

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Beyond Normal: On the Evaluation of Mutual Information Estimators

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators