Authors
Gonzalo I Diaz, Achille Fokoue-Nkoutche, Giacomo Nannicini, Horst Samulowitz
Publication date
2017/9/8
Journal
IBM Journal of Research and Development
Volume
61
Issue
4/5
Pages
9: 1-9: 11
Publisher
IBM
Description
A major challenge in designing neural network (NN) systems is to determine the best structure and parameters for the network given the data for the machine learning problem at hand. Examples of parameters are the number of layers and nodes, the learning rates, and the dropout rates. Typically, these parameters are chosen based on heuristic rules and are manually fine-tuned, which may be very time-consuming, because evaluating the performance of a single parametrization of the NN may require several hours. In this paper, we address the problem of choosing appropriate parameters for the NN by formulating it as a box-constrained mathematical optimization problem and applying a derivative-free optimization tool that automatically and effectively searches the parameter space. The optimization tool employs a radial basis function model of the objective function (the prediction accuracy of the NN) to …
Total citations
20172018201920202021202220232024412202929294232
Scholar articles
GI Diaz, A Fokoue-Nkoutche, G Nannicini… - IBM Journal of Research and Development, 2017