Google Scholar

Dealing with categorical and integer-valued variables in bayesian optimization with gaussian processes

EC Garrido-Merchán, D Hernández-Lobato - Neurocomputing, 2020 - Elsevier

Neurocomputing, 2020•Elsevier

Abstract

Some optimization problems are characterized by an objective that is very expensive, that lacks an analytical expression, and whose evaluations can be contaminated by noise. Bayesian Optimization (BO) methods can be used to solve these problems efficiently. BO relies on a probabilistic model of the objective, which is typically a Gaussian process (GP). This model is used to compute an acquisition function that estimates the expected utility (for solving the optimization problem) of evaluating the objective at each potential new point. A problem with GPs is, however, that they assume real-valued input variables and cannot easily deal with categorical or integer-valued values. Common methods to account for these variables, before evaluating the objective, include assuming they are real and then using a one-hot encoding, for categorical variables, or rounding to the closest integer, for integer-valued variables. We show that this leads to suboptimal results and introduce a novel approach to tackle categorical or integer-valued input variables within the context of BO with GPs. Several synthetic and real-world experiments support our hypotheses and show that our approach outperforms the results of standard BO using GPs on problems with categorical or integer-valued input variables.

Elsevier

Show moreShow less

Save Cite Cited by 290 Related articles All 11 versions

Cite

Advanced search

Saved to My library

Dealing with categorical and integer-valued variables in bayesian optimization with gaussian processes