Dealing with categorical and integer-valued variables in bayesian optimization with gaussian processes
EC Garrido-Merchán, D Hernández-Lobato - Neurocomputing, 2020 - Elsevier
Neurocomputing, 2020•Elsevier
Some optimization problems are characterized by an objective that is very expensive, that
lacks an analytical expression, and whose evaluations can be contaminated by noise.
Bayesian Optimization (BO) methods can be used to solve these problems efficiently. BO
relies on a probabilistic model of the objective, which is typically a Gaussian process (GP).
This model is used to compute an acquisition function that estimates the expected utility (for
solving the optimization problem) of evaluating the objective at each potential new point. A …
lacks an analytical expression, and whose evaluations can be contaminated by noise.
Bayesian Optimization (BO) methods can be used to solve these problems efficiently. BO
relies on a probabilistic model of the objective, which is typically a Gaussian process (GP).
This model is used to compute an acquisition function that estimates the expected utility (for
solving the optimization problem) of evaluating the objective at each potential new point. A …
Abstract
Some optimization problems are characterized by an objective that is very expensive, that lacks an analytical expression, and whose evaluations can be contaminated by noise. Bayesian Optimization (BO) methods can be used to solve these problems efficiently. BO relies on a probabilistic model of the objective, which is typically a Gaussian process (GP). This model is used to compute an acquisition function that estimates the expected utility (for solving the optimization problem) of evaluating the objective at each potential new point. A problem with GPs is, however, that they assume real-valued input variables and cannot easily deal with categorical or integer-valued values. Common methods to account for these variables, before evaluating the objective, include assuming they are real and then using a one-hot encoding, for categorical variables, or rounding to the closest integer, for integer-valued variables. We show that this leads to suboptimal results and introduce a novel approach to tackle categorical or integer-valued input variables within the context of BO with GPs. Several synthetic and real-world experiments support our hypotheses and show that our approach outperforms the results of standard BO using GPs on problems with categorical or integer-valued input variables.
Elsevier