Semi-Supervised Learning, Causality and the Conditional Cluster Assumption

von Kügelgen, Julius; Mey, Alexander; Loog, Marco; Schölkopf, Bernhard

Statistics > Machine Learning

arXiv:1905.12081v3 (stat)

[Submitted on 28 May 2019 (v1), revised 15 May 2020 (this version, v3), latest version 24 Jun 2020 (v4)]

Title:Semi-Supervised Learning, Causality and the Conditional Cluster Assumption

Authors:Julius von Kügelgen, Alexander Mey, Marco Loog, Bernhard Schölkopf

View PDF

Abstract:While the success of semi-supervised learning (SSL) is still not fully understood, Schölkopf et al. (2012) have established a link to the principle of independent causal mechanisms. They conclude that SSL should be impossible when predicting a target variable from its causes, but possible when predicting it from its effects. Since both these cases are somewhat restrictive, we extend their work by considering classification using cause and effect features at the same time, such as predicting disease from both risk factors and symptoms. While standard SSL exploits information contained in the marginal distribution of all inputs (to improve the estimate of the conditional distribution of the target given inputs), we argue that in our more general setting we should use information in the conditional distribution of effect features given causal features. We explore how this insight generalises the previous understanding, and how it relates to and can be exploited algorithmically for SSL.

Comments:	36th Conference on Uncertainty in Artificial Intelligence (2020) (Previously presented at the NeurIPS 2019 workshop "Do the right thing": machine learning and causal inference for improved decision making, Vancouver, Canada.)
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Other Statistics (stat.OT)
Cite as:	arXiv:1905.12081 [stat.ML]
	(or arXiv:1905.12081v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1905.12081

Submission history

From: Julius von Kügelgen [view email]
[v1] Tue, 28 May 2019 20:53:56 UTC (116 KB)
[v2] Wed, 9 Oct 2019 13:35:22 UTC (127 KB)
[v3] Fri, 15 May 2020 09:32:52 UTC (129 KB)
[v4] Wed, 24 Jun 2020 10:40:15 UTC (132 KB)

Statistics > Machine Learning

Title:Semi-Supervised Learning, Causality and the Conditional Cluster Assumption

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Semi-Supervised Learning, Causality and the Conditional Cluster Assumption

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators