Abstract
Global optimization problems where the evaluation of the objective function is an expensive operation arise frequently in engineering, decision making, optimal control, etc. There exist two huge but almost completely disjoint communities (they have different journals, different conferences, different test functions, etc.) solving these problems: a broad community of practitioners using stochastic nature-inspired metaheuristics and people from academia studying deterministic mathematical programming methods. In order to bridge the gap between these communities, we propose a visual technique for a systematic comparison of global optimization algorithms having a different nature. The results of more than 800,000 runs on 800 randomly generated tests show that both stochastic nature-inspired metaheuristics and deterministic global optimization methods are competitive and surpass one another depending on the available budget of function evaluations.
Introduction
Continuous global optimization problems arise frequently in many real-life applications1,2,3,4,5,6,7: in engineering, statistics, decision making, optimal control, machine learning, etc. A general global optimization problem requires finding a point x* and the value f(x*) being the global (i.e., the deepest) minimum of a function f(x) over an N-dimensional domain D, where f(x) can be non-differentiable, multiextremal, hard to evaluate even at one point (evaluations of f(x) are expensive), and given as a "black box". Therefore, traditional local optimization methods8,9 cannot be used in this situation. Among existing derivative-free global optimization methods, two classes of algorithms can be marked out: stochastic metaheuristic algorithms (see, e.g.4,6,10,11,12,13,14,15) and deterministic mathematical programming methods1,2,3,5,6,7,16,17,18. The former, due to their simplicity and attractive nature-inspired interpretations (genetic algorithms6,10,12, particle swarm optimization15, firefly algorithms12,13, etc.), are used by a broad community of engineers and practitioners to solve real-life problems, whereas the latter are actively studied in academia due to their interesting theoretical properties, including guaranteed convergence. Historically, these two communities are almost completely disjoint: they have different journals, different conferences, and different test functions. Due to the hardness of global optimization problems and the different nature of the methods from these two groups, the problem of their comparison is very difficult, and methods are usually collated on a few dozen test functions1,2,15,16,19,20, thus giving poor and unreliable information. In order to bridge the gap between the two communities we propose a new efficient visual technique for a systematic comparison of global optimization algorithms having a different nature.
More than 800,000 runs on 800 randomly generated multidimensional test problems have been performed to compare five popular stochastic metaheuristics and three deterministic methods, providing a new level of understanding of the tested algorithms. The test problems21 have been chosen because, after they have been randomly generated, the optimizer is provided with the locations of the global minimum and of all local minimizers (this property has made the generator of these test problems very popular: it is used nowadays in more than 40 countries of the world). The knowledge of the global solution makes it possible to check whether the tested method has found the global optimum. Since in practical problems the global solution is unknown and, therefore, it is not possible to check the quality of the obtained solution, it is very important to see how close different methods come to the global solution after their stopping rule has been satisfied.
In global minimization, problems where the objective function f(x) can have many local minima are considered and it is required to find the global minimizer (also called the global solution or global optimum) x* and the corresponding value f* such that

f* = f(x*) = min{ f(x) : x ∈ D },     (1)
where D is the search region. In other words, among all the local minima (called local solutions) it is necessary to find the deepest minimum f* and its coordinates x*. It is well known that the general continuous global optimization problem (1) is NP-hard22,23,24. This is also true, in particular, for problems (1) where the objective function f(x) satisfies the Lipschitz condition

|f(x′) − f(x″)| ≤ L ||x′ − x″||,   x′, x″ ∈ D,     (2)
for a norm ||·|| with an unknown Lipschitz constant L. This condition means that any limited change in the parameters yields some limited change in the values of the objective function. The assumption (2) can be justified by the fact that in technical systems the energy of change is always limited. In fact, problems of this kind are met very frequently in practice (see1,2,3,5,7,18), in particular, in many engineering applications in which observations of the produced values of f(x) can be made, but analytical expressions of the functions are not available. For example, the values of the objective function f(x) can be obtained by running some computationally expensive numerical models, by performing a set of experiments, and so on. One may refer, for instance, to various decision-making problems in automatic control and robotics, structural optimization, engineering design, etc. The continuous global minimization problem (1) where f(x) satisfies (2) and can be non-differentiable, multiextremal, hard to evaluate even at one point, and given as a "black box" is studied in this paper.
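As a small illustration (ours, not from the paper) of how condition (2) is exploited: each already evaluated point x_i yields the bound f(x) ≥ f(x_i) − L||x − x_i||, so the best available lower bound at any x is the maximum of these cones. A minimal Python sketch, assuming the Euclidean norm and a known overestimate of L:

```python
import math

def lipschitz_lower_bound(x, evaluated, L):
    """Largest lower bound on f(x) implied by the evaluated points.

    evaluated: list of (point, value) pairs, points given as tuples.
    L: a (possibly overestimated) Lipschitz constant.
    """
    def dist(a, b):
        return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))
    # each evaluated point contributes the cone f(x_i) - L * ||x - x_i||
    return max(v - L * dist(x, xi) for xi, v in evaluated)

# Example: f(x) = sin(x) on [0, 6]; L = 1 is valid since |cos| <= 1.
pts = [((t,), math.sin(t)) for t in (0.0, 2.0, 4.0, 6.0)]
lb = lipschitz_lower_bound((3.0,), pts, L=1.0)
assert lb <= math.sin(3.0)  # the bound never exceeds the true value
```

Lipschitz global optimization methods refine exactly this kind of bound to decide where to evaluate next and when a region cannot contain the global minimum.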
In traditional local optimization9, where strong assumptions on the structure of the objective function (such as convexity, continuity, differentiability, etc.) are made, these suppositions play a crucial role in the construction of any efficient search algorithm. In these cases, the dimensionality of the solved problem is often a measure of the goodness of optimization algorithms. In contrast, as was proved in24, if the only information about the objective function f(x) from the global optimization problem (1) and (2) is that it belongs to the class of Lipschitz functions and the Lipschitz constant L is unknown, there does not exist any deterministic or stochastic algorithm that, after a finite number of function evaluations, is able to provide an accurate ε-estimate of the global minimum f*. That is why, in this case, instead of the theoretical statement (P1) "Construct an algorithm able to stop in a given time and to provide an ε-approximation of the global minimum f*", the more practical statement (P2) "Construct an algorithm able to stop after a fixed number M of evaluations of f(x) and to return the lowest found value of f(x)" is used. Under the latter statement, not the dimension of the problem (which is important in local optimization) but the number of allowed function evaluations (often called the budget) becomes critical. In other words, when one has the possibility to evaluate f(x) M times (these evaluations are called trials hereinafter) in a global optimization problem of dimension 5, 10 or 100, then the quality of the solution found after M evaluations is crucial, not the dimensionality of f(x). This happens because it is not possible to adequately explore the multi-dimensional search region D with this limited budget of expensive evaluations of f(x). For instance, if D ⊂ ℝ^20 is a hypercube, then it has 2^20 vertices.
This means that one million trials is not sufficient even to evaluate f(x) at all vertices of D, let alone to explore the whole region D well. Thus, the statement (P2) makes sense both because in practice the budget is always limited and because the problem under consideration is NP-hard.
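Statement (P2) can be made concrete with a toy budget-limited search (a hypothetical illustration, not one of the methods compared in the paper): exactly M trials are spent, and the lowest observed value is returned, whatever its quality.

```python
import random

def budget_limited_search(f, bounds, M, seed=0):
    """Spend exactly M evaluations of f on random points in the box `bounds`
    and return the best point and value found: the (P2) problem statement."""
    rng = random.Random(seed)
    best_x, best_f = None, float("inf")
    for _ in range(M):  # the budget, not an accuracy target, stops the search
        x = tuple(rng.uniform(lo, hi) for lo, hi in bounds)
        fx = f(x)
        if fx < best_f:
            best_x, best_f = x, fx
    return best_x, best_f

# Even 10**6 trials cannot visit the 2**20 vertices of a 20-dimensional box:
assert 2 ** 20 > 10 ** 6
```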
As a result, the goal of global optimization methods is often to obtain a better estimate of f* and x* given a fixed limited budget of evaluations of f(x). In fact, in global optimization the words "a method has solved a global optimization problem" very often do not mean that the global solution f* has been found. They mean just that the found solution was better than the solutions found by other competitors (this is especially true for high-dimensional global optimization problems where the global solutions are unknown). That is why the possibility to compare the found solutions with the known global optimum offered by the generator of classes of test functions21 is particularly valuable. It allows us not only to see that a solution A found by one method is better than a solution B found by another method, but also to check whether these solutions are in a prefixed ε-neighborhood of the global optimum, i.e., to consider (P1) instead of (P2).
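The success check enabled by known optima can be sketched as follows (a simplified form of the idea; the exact criterion used for the GKLS classes is given in the paper's supplementary materials):

```python
import math

def solved(trial_points, x_star, eps):
    """True if some trial lies within Euclidean distance eps of the known
    global minimizer x_star: the (P1)-style success check."""
    return any(
        math.dist(x, x_star) <= eps  # math.dist requires Python 3.8+
        for x in trial_points
    )

trials = [(0.5, 0.5), (0.91, 0.12)]
assert solved(trials, x_star=(0.9, 0.1), eps=0.05)
assert not solved(trials, x_star=(0.0, 0.0), eps=0.05)
```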
Let us now describe the two groups of methods used in the different communities and studied here. Metaheuristic algorithms, widely used to solve real-life global optimization problems (in the sense of the statement (P2) discussed above), have a number of attractive properties that have ensured their success among engineers and practitioners. First, they have clear nature-inspired interpretations explaining how these algorithms simulate the behavior of populations of individuals. The algorithms of this type studied here are: Particle Swarm Optimization (PSO) simulating fish schools11, the Firefly Algorithm (FA) simulating the flashing behavior of fireflies13, Artificial Bee Colony (ABC) representing a colony of bees searching for food sources14, and Differential Evolution (DE) and Genetic Algorithms (GA) simulating evolution at the phenotype and genotype level, respectively4,6. Other reasons that have led to the wide spread of metaheuristics are the following: high-level mathematical preparation is not required to understand them; their implementation is usually simple and many codes are available for free; finally, they do not need much memory, working at each moment with only a limited population of points in the search domain. On the other hand, metaheuristics have some drawbacks, including a usually high number of parameters to tune and the absence of rigorously proved global convergence conditions ensuring that the sequences of trial points generated by these methods always converge to the global solution x*. In fact, the populations used by these methods can degenerate prematurely, returning only a locally optimal solution instead of the global one, or even a point that is not locally optimal if it was obtained at one of the last evaluations of f(x) and the budget of M evaluations did not allow the method to proceed with an improvement of the obtained solution.
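To illustrate the simplicity that makes such methods popular, here is a deliberately minimal Differential Evolution (DE/rand/1/bin) sketch; the parameter values are common textbook defaults, not the settings used in the experiments reported here:

```python
import random

def de(f, bounds, pop_size=20, F=0.8, CR=0.9, budget=2000, seed=1):
    """Minimal DE/rand/1/bin on a box; stops after `budget` trials (P2)."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    vals = [f(x) for x in pop]
    evals = pop_size
    while evals < budget:
        for i in range(pop_size):
            if evals >= budget:
                break
            # mutate: three distinct population members other than i
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # at least one mutated coordinate
            trial = [
                pop[a][k] + F * (pop[b][k] - pop[c][k])
                if (rng.random() < CR or k == j_rand) else pop[i][k]
                for k in range(dim)
            ]
            # keep the trial inside the box
            trial = [min(max(t, lo), hi) for t, (lo, hi) in zip(trial, bounds)]
            ft = f(trial)
            evals += 1
            if ft <= vals[i]:  # greedy one-to-one selection
                pop[i], vals[i] = trial, ft
    best = min(range(pop_size), key=vals.__getitem__)
    return pop[best], vals[best]

# Sphere function: the global minimum is 0 at the origin.
x, fx = de(lambda p: sum(t * t for t in p), [(-5.0, 5.0)] * 2)
assert fx < 0.1
```

Note that nothing in this scheme guarantees convergence to x*: if the population collapses around a local minimum, additional trials stop helping, which is exactly the degeneration discussed above.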
The deterministic algorithms belonging to the second group of methods studied here are based on the knowledge that the objective function f(x) satisfies the Lipschitz condition (2). Lipschitz global optimization algorithms are a well-studied class of deterministic methods1,2,3,5,7,18. These methods are usually technically more sophisticated than metaheuristics, their implementation is not so easy, they require more memory, and more mathematical preparation is necessary to understand and use them. Commonly, they have a strong theory ensuring convergence to the global solution and a small number of control parameters, thus allowing their users to configure the search easily. Even though the Lipschitz constant L can be unknown, there exist several strategies for its estimation2,3,5,7,18, and one of the most frequently used techniques16 works with all possible values of L from zero to infinity simultaneously. All the deterministic algorithms considered here use it. They are: the DIRECT method16, its locally-biased version DIRECT-L17, and the algorithm18 based on adaptive diagonal curves (called ADC hereinafter).
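The tested algorithms are considerably more involved, but the core idea of a Lipschitz-driven deterministic search can be shown with the classical one-dimensional Piyavskii-Shubert scheme (a simpler relative of the tested methods, sketched here under the assumption that an overestimate of L is known; DIRECT-type methods remove exactly this assumption):

```python
import math

def piyavskii(f, a, b, L, budget=30):
    """1-D Lipschitz minimization: repeatedly evaluate f where the piecewise
    lower envelope max_i (f(x_i) - L|x - x_i|) attains its smallest value."""
    xs = [a, b]
    fs = [f(a), f(b)]
    for _ in range(budget - 2):
        best_x, best_lb = None, float("inf")
        pts = sorted(zip(xs, fs))
        # for each pair of neighbors, the two cones intersect at xc with
        # envelope value lb; pick the interval with the lowest lb
        for (x1, f1), (x2, f2) in zip(pts, pts[1:]):
            xc = 0.5 * (x1 + x2) + (f1 - f2) / (2 * L)
            lb = 0.5 * (f1 + f2) - 0.5 * L * (x2 - x1)
            if lb < best_lb:
                best_x, best_lb = xc, lb
        xs.append(best_x)
        fs.append(f(best_x))
    i = min(range(len(fs)), key=fs.__getitem__)
    return xs[i], fs[i]

# cos on [0, 2*pi] with L = 1 (|sin| <= 1): global minimum -1 at pi.
x, fx = piyavskii(math.cos, 0.0, 2 * math.pi, L=1.0)
assert abs(fx + 1.0) < 1e-12
```

Unlike a metaheuristic run, repeating this procedure always produces the same trial sequence, and the lower envelope certifies how far the record value can be from f*.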
How can one compare these two groups of methods? On the one hand, there exist several approaches for a visual comparison of deterministic algorithms (see, e.g., operational characteristics25, performance profiles26, data profiles8, etc.). However, they do not allow one to compare stochastic methods. On the other hand, the comparison of metaheuristics is often performed on different collections of single benchmark problems15,20,27. As a result, the difficulty of the test problems in such collections can vary significantly, sometimes leading to inhomogeneous and, as a consequence, unreliable results. An additional difficulty consists of the fact that, due to the stochastic nature of metaheuristics, the obtained results cannot be repeated exactly and have the character of averages. Thus, the difficulties in performing a reliable comparison of these two groups of methods constitute a serious gap between the respective communities. The goal of this paper is to start a dialog between them by proposing a methodology allowing one to numerically compare deterministic algorithms and stochastic metaheuristics using the problem statement (P1).
Instead of traditional comparisons executed on just a few dozen tests1,2,15,16,19,20, in this contribution more than 800,000 runs on 800 randomly generated test problems21 have been performed for a systematic comparison of the methods. In order to make this comparison more reliable, the parameters of all tested algorithms were fixed following the recommendations of their authors and then used in all the experiments. One known and two novel methodologies for comparing global optimization algorithms are applied here: operational characteristics25 for comparing deterministic algorithms, and the new operational zones and aggregated operational zones generalizing the ideas of operational characteristics to collate multidimensional stochastic algorithms.
Results
An operational characteristic25 constructed on a class of 100 randomly generated test functions is a graph showing the number of solved problems as a function of the number of executed evaluations of the objective function f(x). To construct the classes of test functions required to build operational characteristics, the popular GKLS generator21 of multidimensional, multiextremal test functions was used. This generator allows one to randomly generate classes of 100 test problems having the same dimension, number of local minima, and difficulty. The property making this generator especially attractive consists of the fact that for each function complete information on the coordinates and values of all local minima (including the global one) is provided. Here, 8 different classes from18 were used (see supplementary materials for their description and for the definition of what it means for a problem to be solved). These classes and the respective search accuracies have been taken since they represent a well-established tool used frequently to compare deterministic global optimization algorithms18,28,29,30,31. Fig. 1(a) shows operational characteristics for the methods DIRECT, DIRECT-L, and ADC. The higher the characteristic of a method with respect to the characteristics of its competitors, the better the behavior of this method. Operational characteristics also allow us to see the best performers depending on the available budget of evaluations of f(x). For instance, it can be seen from Fig. 1(a) that if the search budget is less than 14,000 possible trials then the DIRECT method shows the best performance, whereas for a budget superior to 14,000 the best method is ADC.
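In code, an operational characteristic reduces to a simple count (a sketch in our notation, with toy data): given, for each problem in the class, the number of trials the method needed (or a failure marker), the characteristic at budget k is the number of problems solved within k trials.

```python
def operational_characteristic(trials_to_solve, k):
    """Number of problems solved within k trials.

    trials_to_solve: one entry per test problem, either the number of trials
    the method needed or None if the method failed on that problem.
    """
    return sum(1 for t in trials_to_solve if t is not None and t <= k)

runs = [120, 3500, None, 800, 15000]  # toy data for a class of 5 problems
assert operational_characteristic(runs, 1000) == 2
assert operational_characteristic(runs, 20000) == 4
```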
Construction of operational characteristics for deterministic methods and of the operational zone for the metaheuristic Firefly Algorithm (FA) built on the hard 5-dimensional class of 100 GKLS test functions. (a) Operational characteristics for the methods DIRECT, DIRECT-L, and ADC. After executing 34,000 trials DIRECT-L has solved 31 problems, DIRECT 42 problems, and ADC 65 problems; after performing 94,000 trials DIRECT-L has solved 46 problems, DIRECT 58 problems, and ADC all 100 problems. (b) The operational zone built using 10,000 runs performed by FA (100 runs for each of the 100 problems). The upper and the lower boundaries of the zone are shown as dark blue curves.
Since operational characteristics cannot be used to compare stochastic methods, we propose in this paper a new methodology called operational zones that can be used for collating stochastic algorithms. To build a zone, a tested stochastic method should be launched K times (in our experiments each metaheuristic was launched K = 100 times for each of the 100 test problems from each of the 8 classes) with different randomly chosen initial populations (see supplementary materials for a detailed description of the parameters of the 5 tested metaheuristics) and a maximum number of trials N_max (in our experiments, N_max = 10^6). Then, each run of a tested metaheuristic was considered as a particular method and its operational characteristic was constructed. The totality of all 100 operational characteristics forms the respective operational zone (see Fig. 1(b) for the operational zone obtained by FA). Then, the upper and lower boundaries of the zone (shown in Fig. 1(b) as dark blue curves) can be outlined (notice that they can contain parts of several characteristics), representing the best and the worst performances of the tested method, respectively. The graph of the average performance within the zone can also be depicted (see Fig. 2(b), where the average performance of FA is shown as a continuous black line inside the yellow operational zone).
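The boundaries of an operational zone can be sketched as follows (our notation; the K per-run characteristics are assumed to be sampled at common budget checkpoints, so the upper and lower boundaries at each checkpoint are simply the maximum and minimum over the runs):

```python
def zone_boundaries(characteristics):
    """Lower and upper boundaries of an operational zone.

    characteristics: K lists of equal length, each giving the number of
    problems solved at successive budget checkpoints for one run.
    """
    upper = [max(col) for col in zip(*characteristics)]
    lower = [min(col) for col in zip(*characteristics)]
    return lower, upper

chars = [
    [0, 1, 3, 5],  # run 1: problems solved at each budget checkpoint
    [0, 2, 2, 4],  # run 2
    [0, 0, 4, 5],  # run 3
]
lower, upper = zone_boundaries(chars)
assert lower == [0, 0, 2, 4]
assert upper == [0, 2, 4, 5]
```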
Operational characteristics built on two 5-dimensional classes of 100 GKLS test functions for deterministic methods DIRECT, DIRECT-L, and ADC and operational zones for stochastic metaheuristics Firefly Algorithm (FA), Genetic Algorithm (GA) and Artificial Bee Colony (ABC). (a) Operational characteristics for deterministic methods and the operational zone for FA for the simple 5-dimensional class. (b) The same as (a) for the hard class. (c) Operational characteristics for deterministic methods and the operational zone for GA for the simple 5-dimensional class. (d) The same as (c) for the hard class. (e) Operational characteristics for deterministic methods and the operational zone for ABC for the simple 5-dimensional class. (f) The same as (e) for the hard class.
Figure 2 shows the results on the 5-dimensional simple and hard classes for the metaheuristics FA, GA, and ABC (figures for these methods for other test classes, as well as results for the metaheuristics PSO and DE, are given in the supplementary materials). Figure 2(a) and (b) compare, respectively, the performance of the three deterministic methods and FA on the simple (with N_max = 2·10^4) and the hard (with N_max = 10^5) classes. The joint representation of operational zones together with characteristics offers a lot of visual information. It can be seen, for example, in Fig. 2(a) that the operational characteristics of DIRECT and ADC are higher than the upper boundary of the zone of FA and, therefore, on this class the deterministic methods have a better performance. Figure 2(b) shows that the lower boundary of the FA zone is higher than the characteristics of DIRECT and DIRECT-L and, therefore, FA outperforms these competitors. If the budget is less than 30,000 trials (see Fig. 2(b)), then on average FA is better than ADC as well. If the budget is higher than 40,000 trials, then ADC behaves better since its characteristic is higher than the upper boundary of the FA zone. Notice also that Fig. 2(b) shows that after 10^5 trials only the method ADC was able to solve all 100 test problems of the class. For the same two test classes, Fig. 2 presents operational zones for the metaheuristics GA and ABC and for the three deterministic methods.
One can also see that in many runs the metaheuristics got trapped in local minima and were not able to exit from their attraction regions, thus producing operational zones with long horizontal parts (see, e.g., Fig. 2(d), where the metaheuristic GA works significantly better than DIRECT and DIRECT-L if the budget is less than 40,000 trials and then almost does not improve the number of solved problems, remaining, however, always better than the two deterministic methods). This means that increasing the number of trials does not improve results in this case and it is necessary to restart the metaheuristics. The aggregated operational zones proposed in this paper show what happens in this case. They are constructed as follows.
First, an algorithm is launched K times (K = 100 was used again here in order to have the same computational resources available as for constructing operational zones) with an allowed number of trials n_max < N_max (in our experiments n_max = 50,000 and N_max = 10^6 for each metaheuristic). Then, for unsolved problems the algorithm is launched again with the same number n_max of allowed trials. Thus, if the algorithm did not solve a problem p in the first n, 1 ≤ n < T, T = ⌊N_max/n_max⌋, runs but solved it in the (n+1)-th run in t trials, 1 ≤ t ≤ n_max, then the number of trials to solve problem p is set equal to n·n_max + t. Otherwise, if the algorithm did not solve problem p in T runs, then the number of executed trials for problem p is set equal to the maximal allowed number N_max (the mark ">10^6" is used in Table 1 as a reminder that more than N_max trials are required to solve this problem). In this way, T runs are executed to complete one aggregated characteristic. Finally, k = K/T aggregated operational characteristics are used to build the aggregated operational zone in the same way as operational characteristics are used to construct an operational zone. The lower and upper boundaries are defined analogously. Fig. 3 shows the results of experiments for the three deterministic methods and the metaheuristics FA, GA, and ABC.
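The aggregated trial count described above can be sketched as follows (our notation): the method is restarted with n_max trials per run until the problem is solved or T = ⌊N_max/n_max⌋ restarts have been spent.

```python
def aggregated_trials(runs, n_max, N_max):
    """Aggregated number of trials for one problem.

    runs: per-restart outcome, each entry either the number of trials used to
    solve the problem in that restart (<= n_max) or None if it failed.
    """
    T = N_max // n_max
    for n, t in enumerate(runs[:T]):
        if t is not None:
            return n * n_max + t  # n failed restarts, then success after t trials
    return N_max                   # ">N_max": unsolved within T restarts

# Solved on the 3rd restart after 12,000 trials, with n_max = 50,000:
assert aggregated_trials([None, None, 12000], 50_000, 1_000_000) == 112_000
# Never solved within T = 20 restarts:
assert aggregated_trials([None] * 20, 50_000, 1_000_000) == 1_000_000
```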
Aggregated operational zones for stochastic metaheuristics Firefly Algorithm (FA), Genetic Algorithm (GA), and Artificial Bee Colony (ABC) and operational characteristics for deterministic methods DIRECT, DIRECT-L, and ADC built on two 5-dimensional classes of 100 GKLS test functions. (a) Operational characteristics for deterministic methods and the aggregated operational zone for FA for the simple 5-dimensional class. (b) The same as (a) for the hard class. (c) Operational characteristics for deterministic methods and the aggregated operational zone for GA for the simple 5-dimensional class. (d) The same as (c) for the hard class. (e) Operational characteristics for deterministic methods and the aggregated operational zone for ABC for the simple 5-dimensional class. (f) The same as (e) for the hard class.
It should be stressed that the aggregated operational zones allow one to better emphasize the potential of nature-inspired metaheuristics. In fact, the advantage of the aggregated zones with respect to the operational zones can be illustrated, for example, by the situations shown in Figs 2(f) and 3(f). It can be seen from Fig. 2(f) that the operational characteristics of the deterministic methods DIRECT and DIRECT-L are located inside the zone of the metaheuristic ABC and, therefore, it is not possible to determine which of the three methods behaves better. In contrast, the aggregated zone of ABC is higher than the characteristics of both deterministic methods, i.e., it can be concluded that ABC outperforms them.
In order to see the advantages of the proposed methodologies for comparing methods, Table 1, constructed in a traditional way, is shown. Due to the huge amount of data, only average results can be considered and included in Table 1. Notice that for the deterministic methods and the metaheuristics, due to the stochastic nature of the latter, different averages should be used: for the metaheuristics the results of 10,000 runs for each class are used, whereas for the deterministic algorithms the results of 100 runs (one run for each of the 100 functions). This creates difficulties in comparing. For instance, which method is better on the 5-dimensional simple class: DIRECT or FA? On the one hand, DIRECT did not solve only one problem in 100 runs, thus demonstrating a success rate of 99%, whereas FA did not solve 16 problems in 10,000 runs, i.e., a success rate of 99.84%. On the other hand, the average number of trials for DIRECT was only 16,057.5, while FA required 47,203.1 trials on average. Moreover, Table 1 cannot give results for 50% or 75% of solved test problems, which can also be important. To see the detailed results, larger tables with hundreds of rows and columns would be needed, thus complicating the visual analysis of the results.
In contrast, operational zones give a clear visual presentation of the performance of the tested methods, showing the entire panorama of their behavior for different budgets. For instance, it can be seen from Figs 2 and 3 that the metaheuristics perform very well on small budgets, showing better results than the deterministic algorithms, whereas the best algorithm for higher budgets on the used test classes is ADC, since it was able to solve all 100 test problems faster than the other methods on both classes. Even though this result can also be obtained from Table 1, the operational zones allow us to observe the performance of the methods at all stages of the search for each class. The average, the best, and the worst cases for each metaheuristic can be easily obtained from the graphs for any chosen number of trials. Moreover, the number of trials required to solve 50% (or 75%, 90%, etc.) of the problems can be easily obtained and the performance of the methods is visualized clearly.
Let us now consider another way to statistically compare the two groups of algorithms using the same data. Let X_A^C be a random variable describing the percentage of the computational budget N_max consumed by an algorithm A to solve a problem from the test class C. Let us consider the sample x_A^C of 100 realizations of X_A^C for A ∈ {ADC, DIRECT, DIRECT-L} and 100 × 100 = 10,000 realizations of X_A^C for A ∈ {FA, GA, ABC, PSO, DE}; for instance, if the algorithm ADC solved the 2-dimensional hard test problem number 5 after 574 trials, then x_ADC^{2-hard} = 574/10^6 × 100% = 0.0574%. Then, after the construction of the cumulative distribution functions F_{X_A^C}(x), one can obtain the sampled distribution quantiles of X_A^C. For instance, in Tables 2 and 3, the sampled 25%, 50%, 75%, and 90% quantiles are presented for the simple and hard classes, respectively. These results can be interpreted as follows. The 90% quantile for FA on the 5-dimensional simple class is 14.11%, whereas the same quantile for ADC is 1.02%. This means that with probability 90% FA will consume no more than 14.11% of the computational budget (i.e., no more than 141,100 trials), while ADC will consume no more than 1.02% of the computational budget (i.e., no more than 10,200 trials) to solve the test problem successfully. As can be seen from Table 2, GA for the same test class and the same confidence level will consume 100% of the computational budget. This means that it cannot be claimed with probability 90% that GA will solve the problem within the selected computational budget. However, it should be noted that with probability 75% GA will solve a test problem of the same class using no more than 9.24% of the computational budget (i.e., no more than 92,400 trials).
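The sampled quantiles can be computed as follows (a sketch with toy data; a simple empirical-quantile rule is assumed here, which may differ in detail from the one used for Tables 2 and 3):

```python
import math

def sample_quantile(sample, q):
    """q-quantile of a sample read from the sorted values: the smallest
    observation s[i] such that at least a fraction q of the sample is <= s[i]."""
    s = sorted(sample)
    idx = min(math.ceil(q * len(s)) - 1, len(s) - 1)
    return s[max(idx, 0)]

# Toy percentages of N_max consumed on 10 problems (100.0 marks "unsolved"):
consumed = [0.06, 0.3, 1.1, 2.5, 4.0, 9.2, 14.1, 100.0, 100.0, 100.0]
assert sample_quantile(consumed, 0.50) == 4.0
assert sample_quantile(consumed, 0.90) == 100.0
```

A 90% quantile of 100.0 reproduces the situation described above for GA: with 10% of the probability mass on unsolved runs, one cannot claim a 90%-probability success within the budget.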
One can see that the presented quantiles correspond to the results presented in Figs 1–3. In particular, the results presented in Tables 2 and 3 correspond to the average operational zones for each metaheuristic algorithm presented in Figs 2 and 3 (see also the additional figures for the remaining test classes and the algorithms PSO and DE in the supplementary materials).
In conclusion, the proposed operational zones and aggregated operational zones allow one to effectively compare deterministic and stochastic global optimization algorithms having a different nature and give a handy visual representation of this comparison for different computational budgets. Nature-inspired metaheuristics and deterministic Lipschitz algorithms have been compared on 800 tests, thus giving a new understanding of both classes of methods and opening a dialog between the two communities. It can be seen that both classes of algorithms are competitive and surpass one another depending on the available budget of function evaluations.
References
Horst, R. & Pardalos, P. M. (eds) Handbook of Global Optimization, vol. 1 (Kluwer Academic Publishers, Dordrecht, 1995).
Pintér, J. D. Global Optimization in Action (Continuous and Lipschitz Optimization: Algorithms, Implementations and Applications). (Kluwer Academic Publishers, Dordrecht, 1996).
Sergeyev, Y. D. & Kvasov, D. E. Deterministic Global Optimization: An Introduction to the Diagonal Approach. (Springer, New York, 2017).
Price, K., Storn, R. M. & Lampinen, J. A. Differential Evolution: A Practical Approach to Global Optimization. Natural Computing Series. (Springer, New York, 2005).
Sergeyev, Y. D., Strongin, R. G. & Lera, D. Introduction to Global Optimization Exploiting Space-Filling Curves. (Springer, New York, 2013).
Holland, J. H. Adaptation in Natural and Artificial Systems: an Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence (University of Michigan Press 1975).
Strongin, R. G. & Sergeyev, Y. D. Global Optimization with Non-Convex Constraints: Sequential and Parallel Algorithms. (Kluwer Academic Publishers, Dordrecht, 2000).
Moré, J. & Wild, S. Benchmarking derivative free optimization algorithms. SIAM Journal on Optimization 20, 172–191 (2009).
Nocedal, J. & Wright, S. J. Numerical Optimization, 2nd ed. (Springer, New York, 2006).
Deb, K. & Kumar, A. Real-coded genetic algorithms with simulated binary crossover: Studies on multimodal and multiobjective problems. Complex Systems 9, 431–454 (1995).
Kennedy, J., Eberhart, R. C. & Shi, Y. Swarm Intelligence. The Morgan Kaufmann Series in Evolutionary Computation (Morgan Kaufmann, San Francisco, USA, 2001).
Yang, X.-S. Nature-Inspired Metaheuristic Algorithms. (Luniver Press, Frome, 2008).
Yang, X.-S. & He, X. Firefly algorithm: Recent advances and applications. International Journal of Swarm Intelligence 1, 36–50 (2013).
Karaboga, D. & Akay, B. A comparative study of Artificial Bee Colony algorithm. Applied Mathematics and Computation 214, 108–132 (2009).
Gao, Y., Du, W. & Yan, G. Selectively-informed particle swarm optimization. Scientific Reports 5, 9295 (2015).
Jones, D. R., Perttunen, C. D. & Stuckman, B. E. Lipschitzian optimization without the Lipschitz constant. Journal of Optimization Theory and Applications 79, 157–181, https://doi.org/10.1007/BF00941892 (1993).
Gablonsky, J. M. & Kelley, C. T. A locally-biased form of the DIRECT algorithm. Journal of Global Optimization 21, 27–37, https://doi.org/10.1023/A:1017930332101 (2001).
Sergeyev, Y. D. & Kvasov, D. E. Global search based on efficient diagonal partitions and a set of Lipschitz constants. SIAM Journal on Optimization 16, 910–937 (2006).
Floudas, C. A. et al. Handbook of Test Problems in Local and Global Optimization. (Kluwer Academic Publishers, Dordrecht, 1999).
Digalakis, J. G. & Margaritis, K. G. On benchmarking functions for genetic algorithms. International Journal of Computer Mathematics 77(4), 481–506 (2001).
Gaviano, M., Kvasov, D. E., Lera, D. & Sergeyev, Y. D. Algorithm 829: Software for generation of classes of test functions with known local and global minima for global optimization. ACM Trans. Math. Software 29, 469–480 (2003).
Pardalos, P. M. (ed.). Approximation and Complexity in Numerical Optimization: Continuous and Discrete Problems. (Kluwer Academic Publishers, Dordrecht, 2000).
Pardalos, P. M. & Vavasis, S. A. Open questions in complexity theory for numerical optimization. Mathematical Programming 57, 337–339 (1992).
Stephens, C. P. & Baritompa, W. Global optimization requires global information. J. Optim. Theory Appl. 96, 575–588 (1998).
Grishagin, V. A. Operational characteristics of some global search algorithms. Problems of Stochastic Search 7, 198–206 (1978).
Dolan, E. & Moré, J. Benchmarking optimization software with performance profiles. Mathematical Programming 91, 201–213 (2002).
Rios, L. M. & Sahinidis, N. V. Derivative-free optimization: a review of algorithms and comparison of software implementations. Journal of Global Optimization 56, 1247–1293 (2013).
Barkalov, K. & Gergel, V. Parallel global optimization on GPU. Journal of Global Optimization 66, 3–20 (2016).
Gimbutas, A. & Žilinskas, A. An algorithm of simplicial Lipschitz optimization with the bi-criteria selection of simplices for the bi-section. Journal of Global Optimization. https://doi.org/10.1007/s10898-017-0550-9 (2017).
Liu, H., Xu, S., Ma, Y. & Wang, X. Global optimization of expensive black box functions using potential Lipschitz constants and response surfaces. Journal of Global Optimization 63, 229–251 (2015).
Paulavičius, R., Sergeyev, Y. D., Kvasov, D. E. & Žilinskas, J. Globally-biased DISIMPL algorithm for expensive global optimization. Journal of Global Optimization 59, 545–567 (2014).
Acknowledgements
This research was supported by the Russian Science Foundation, project No. 15-11-30022 "Global optimization, supercomputing computations, and applications".
Author information
Authors and Affiliations
Contributions
Ya.D.S. supervised the work, designed the experiments, analyzed and interpreted data, and wrote the paper. D.E.K. and M.S.M. designed and performed the experiments, analyzed and interpreted data, and wrote the paper.
Corresponding author
Ethics declarations
Competing Interests
The authors declare that they have no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Sergeyev, Y.D., Kvasov, D.E. & Mukhametzhanov, M.S. On the efficiency of nature-inspired metaheuristics in expensive global optimization with limited budget. Sci Rep 8, 453 (2018). https://doi.org/10.1038/s41598-017-18940-4