Gaussian Process Component Mining with the Apriori Algorithm

  • Conference paper
Database and Expert Systems Applications (DEXA 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14147))

Abstract

Gaussian process models are a commonly used tool for model-based analysis of time series data. As databases grow, it becomes increasingly difficult to identify the most interesting insights and thereby gain a deeper understanding of the data’s underlying behavior. To address this issue, we propose a novel approach for finding frequent kernel components efficiently. In this way, data scientists can focus their investigations on the most common components hidden in a set of Gaussian process models. We show how to solve this task by means of frequent item set mining methods, which are capable of analyzing large databases efficiently. We support our proposal with a first series of experiments, indicating that our method is capable of detecting frequent kernel components from Gaussian process models. Although this short paper is only a first, preliminary approach towards analyzing Gaussian processes with conventional data mining methods, it also opens a novel research direction, Gaussian process mining, at the intersection of machine learning and database research.
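
To make the idea of frequent item set mining over kernel components concrete, here is a minimal sketch of the textbook Apriori algorithm in Python. It is an illustration under stated assumptions, not the authors' implementation: it assumes each Gaussian process model has already been reduced to a set of component labels, and the labels `SE`, `PER`, `LIN`, and `WN`, the function name `apriori`, and the example data are all hypothetical.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support count} for every itemset that occurs in
    at least `min_support` transactions (textbook Apriori)."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count how many transactions contain each single item.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(frequent)

    k = 2
    while frequent:
        prev = set(frequent)
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
        # Count support of the surviving candidates.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        frequent = {s: c for s, c in counts.items() if c >= min_support}
        result.update(frequent)
        k += 1
    return result


# Hypothetical data: each set lists the kernel components of one model.
models = [
    {"SE", "PER", "LIN"},
    {"SE", "PER"},
    {"SE", "LIN"},
    {"PER", "WN"},
]
frequent_components = apriori(models, min_support=2)
for itemset, support in sorted(frequent_components.items(),
                               key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), support)
```

The prune step is what keeps Apriori efficient on large collections: a candidate set is only counted if all of its smaller subsets were already found to be frequent. How kernel expressions are actually decomposed into items is a design choice that this sketch leaves open.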

This research was supported by the research training group “Dataninja” (Trustworthy AI for Seamless Problem Solving: Next Generation Intelligence Joins Robust Data Analysis) funded by the German federal state of North Rhine-Westphalia.

Author information

Corresponding author

Correspondence to Jan David Hüwel.

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Hüwel, J.D., Beecks, C. (2023). Gaussian Process Component Mining with the Apriori Algorithm. In: Strauss, C., Amagasa, T., Kotsis, G., Tjoa, A.M., Khalil, I. (eds) Database and Expert Systems Applications. DEXA 2023. Lecture Notes in Computer Science, vol 14147. Springer, Cham. https://doi.org/10.1007/978-3-031-39821-6_34

  • DOI: https://doi.org/10.1007/978-3-031-39821-6_34

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-39820-9

  • Online ISBN: 978-3-031-39821-6

  • eBook Packages: Computer Science, Computer Science (R0)
