Abstract
Gaussian process models are a commonly used tool for model-based analysis of time series data. With growing database size, the difficulty to identify the most interesting insights in order to gain a deeper understanding of the data’s underlying behavior increases. To address this issue, we propose a novel approach for finding frequent kernel components efficiently. In this way, data scientists are empowered to focus their investigations on the most common parts hidden in a set of Gaussian process models. We show how to solve this task by means of frequent item set mining methods, which are capable of analyzing large databases efficiently. We provide evidence of our proposal with a first series of experiments, indicating that our method is capable of detecting frequent kernel components from Gaussian process models. Though this short paper can be thought of as a first preliminary approach towards analyzing Gaussian processes with conventional data mining methods, it simultaneously opens a novel research direction of Gaussian process mining at the intersection between machine learning and database research.
This research was supported by the research training group “Dataninja” (Trustworthy AI for Seamless Problem Solving: Next Generation Intelligence Joins Robust Data Analysis) funded by the German federal state of North Rhine-Westphalia.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Berns, F., Hüwel, J., Beecks, C.: Automated model inference for Gaussian processes: an overview of state-of-the-art methods and algorithms. SN Comput. Sci. 3(4), 300 (2022). https://doi.org/10.1007/s42979-022-01186-x
Berns, F., Schmidt, K., Bracht, I., Beecks, C.: 3CS algorithm for efficient Gaussian process model retrieval. In: 25th International Conference on Pattern Recognition (ICPR), pp. 1773–1780. IEEE (2021)
Duvenaud, D.: Automatic model construction with Gaussian processes. Ph.D. thesis, University of Cambridge (2014)
Duvenaud, D., Lloyd, J., Grosse, R., Tenenbaum, J., Zoubin, G.: Structure discovery in nonparametric regression through compositional kernel search. In: International Conference on Machine Learning, pp. 1166–1174. PMLR (2013)
Fournier-Viger, P., Lin, J.C.W., Vo, B., Chi, T.T., Zhang, J., Le, H.B.: A survey of itemset mining. Wiley Interdisc. Rev. Data Min. Knowl. Discov. 7(4), e1207 (2017)
Hüwel, J.D., Haselbeck, F., Grimm, D.G., Beecks, C.: Dynamically self-adjusting Gaussian processes for data stream modelling. In: Bergmann, R., Malburg, L., Rodermund, S.C., Timm, I.J. (eds.) KI 2022. LNCS, vol. 13404, pp. 96–114. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-15791-2_10
Lloyd, J.R., Duvenaud, D., Grosse, R.B., Tenenbaum, J.B., Ghahramani, Z.: Automatic construction and natural-language description of nonparametric regression models. In: AAAI, pp. 1242–1250. AAAI Press (2014)
Mueen, A., Keogh, E., Zhu, Q., Cash, S., Westover, B.: Exact discovery of time series motifs. In: Proceedings of the 2009 SIAM International Conference on Data Mining, pp. 473–484. SIAM (2009)
Thoning, K.W., Tans, P.P., Komhyr, W.D.: Atmospheric carbon dioxide at Mauna Loa Observatory: 2. Analysis of the NOAA GMCC data, 1974–1985. J. Geophys. Res. Atmos. 94(D6), 8549–8565 (1989)
Williams, C.K., Rasmussen, C.E.: Gaussian Processes for Machine Learning, vol. 2. MIT Press, Cambridge (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hüwel, J.D., Beecks, C. (2023). Gaussian Process Component Mining with the Apriori Algorithm. In: Strauss, C., Amagasa, T., Kotsis, G., Tjoa, A.M., Khalil, I. (eds) Database and Expert Systems Applications. DEXA 2023. Lecture Notes in Computer Science, vol 14147. Springer, Cham. https://doi.org/10.1007/978-3-031-39821-6_34
Download citation
DOI: https://doi.org/10.1007/978-3-031-39821-6_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-39820-9
Online ISBN: 978-3-031-39821-6
eBook Packages: Computer ScienceComputer Science (R0)