Multi-source Deep Gaussian Process Kernel Learning

Lu, Chi-Ken; Shafto, Patrick

Computer Science > Machine Learning

arXiv:2002.02826v1 (cs)

[Submitted on 7 Feb 2020 (this version), latest version 1 Oct 2021 (v3)]

Title:Multi-source Deep Gaussian Process Kernel Learning

Authors:Chi-Ken Lu, Patrick Shafto

View PDF

Abstract:For many problems, relevant data are plentiful but explicit knowledge is not. Predictions about target variables may be informed by data sources that are noisy but plentiful, or data which the target variable is merely some function of. Intrepretable and flexible machine learning methods capable of fusing data across sources are lacking. We generalize the Deep Gaussian Processes so that GPs in intermediate layers can represent the posterior distribution summarizing the data from a related source. We model the prior-posterior stacking DGP with a single GP. The exact second moment of DGP is calculated analytically, and is taken as the kernel function for GP. The result is a kernel that captures effective correlation through function composition, reflects the structure of the observations from other data sources, and can be used to inform prediction based on limited direct observations. Therefore, the approximation of prior-posterior DGP can be considered a novel kernel composition which blends the kernels in different layers and have explicit dependence on the data. We consider two synthetic multi-source prediction problems: a) predicting a target variable that is merely a function of the source data and b) predicting noise-free data using a kernel trained on noisy data. Our method produces better prediction and tighter uncertainty on the synthetic data when comparing with standard GP and other DGP method, suggesting that our data-informed approximate DGPs are a powerful tool for integrating data across sources.

Comments:	13 pages in current format
Subjects:	Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (stat.ML)
Cite as:	arXiv:2002.02826 [cs.LG]
	(or arXiv:2002.02826v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2002.02826

Submission history

From: Chi-Ken Lu [view email]
[v1] Fri, 7 Feb 2020 14:56:11 UTC (602 KB)
[v2] Wed, 2 Dec 2020 18:07:01 UTC (484 KB)
[v3] Fri, 1 Oct 2021 18:03:07 UTC (1,183 KB)

Computer Science > Machine Learning

Title:Multi-source Deep Gaussian Process Kernel Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Multi-source Deep Gaussian Process Kernel Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators