3.2. Sample Optimization and Consideration of Constraint Handling
To first demonstrate how a typical optimizer might perform for the particular model and design problem, problem P1a was solved using the high- and low-fidelity aeroelastic models. The SLSQP algorithm in OpenMDAO’s SciPy wrapper was used as an optimizer.
Figure 9 shows the convergence histories of the objective function and maximum constraint value. The low-fidelity run required 8 gradient calls and 23 non-gradient calls. However, the final load factor constraint was not active as expected, which could indicate issues with gradient accuracy or possibly numerical noise (which, to some extent, would also appear in the high-fidelity design space due to the regeneration of unstructured meshes). The high-fidelity run used 23 gradient calls and 62 non-gradient calls. Interestingly, the low-fidelity solution yielded a somewhat larger fuel burn (84,747 kg compared to 64,016), which more closely matches the solution reported by Brooks et al. [
5] (84,072 kg). Presumably, a RANS solution would more closely match that of Brooks and the VLM method, due to the presence of viscous drag.
Because the BFGS-based optimizer considered below does not handle inequality constraints directly and thus relies on penalty constraints, the high-fidelity SLSQP case was also run using the same penalty constraint formulation, taking 20 gradient calls and 53 non-gradient calls. Unsurprisingly, the case ultimately required more iterations to converge, with the objective converging much more slowly, although the feasible design space was located more quickly. Unfortunately, the final solution found did not match that of the direct constraint handling case. This could be due to noise from remeshing or simply indicate a complex design space with local minima or very flat regions.
For reference,
Figure 10 shows the resulting airfoil shapes at the Yehudi and tip wing sections. At the tip section, the baseline and directly constrained high-fidelity results appear to be quite similar, while larger differences can be seen in the low-fidelity and penalty-constrained high-fidelity results. While the low-fidelity result (which has higher twist and less camber) could simply be due to the lower fidelity physics (in combination with moment constraints, which would effectively influence chordwise and spanwise lift distributions), the penalty-constrained high-fidelity result (which has significantly different trailing edge camber) may be due more to numerical difficulties introduced by the penalty formulation and weights. As seen in the convergence histories, it is evident that this result followed a significantly different optimization trajectory than the directly constrained result, with the optimizer being strongly led towards a feasible design space with less emphasis on objective minimization. Traversing to the directly constrained optimum may then be difficult depending on the design space curvature; while moment constraints likely play a role in this, shock formations between the linearly interpolated Yehudi and tip sections may also be influencing lift and drag. Nonetheless, baseline and high-fidelity optimized results at the Yehudi section are fairly similar, with larger differences seen in the low-fidelity result. Here, the VLM geometry appears to be much thicker with significant differences in leading edge curvature. Unfortunately, it is unclear whether this is due more to physics or inaccuracies in shape derivatives (which has primarily been attributed to imprecise geometric sensitivities).
3.3. Airfoil Design
Figure 11 shows the single- and multi-fidelity convergence histories of the BFGS-based optimizers for the airfoil optimization cases (labeled problems P1a and P1b in
Table 6) using rigid and aeroelastic analyses.
(The penalty-constrained, single-fidelity BFGS was selected as an algorithmically similar reference for MF-BFGS rather than SLSQP.) In particular,
Figure 11a shows the penalty-constrained objective function versus the number of high-fidelity evaluations, with solid and dotted lines representing single- and multi-fidelity optimizers, respectively. This is the quantity directly minimized by the unconstrained optimizers. To illustrate how well the inequality constraints are satisfied through the penalty functions,
Figure 11b shows the maximum constraint value for each case, which generally drops by roughly two orders of magnitude throughout the optimizations.
Figure 11c shows the reduction in the penalty-constrained objective as a percentage of each case’s initial value. As a way to quantify the performance of using multiple fidelities,
Figure 11d shows the difference of reductions (multi-fidelity minus single-fidelity). At any given design evaluation, a positive value of this metric indicates an advantage of using multiple fidelities over the single-fidelity counterpart, since it means the high-fidelity design from the multi-fidelity optimizer is better than that of the single-fidelity for the same computational cost (assuming negligible low-fidelity model cost). Note that it will approach zero if the single- and multi-fidelity optimizers both reach the same solution.
In general, all four cases (rigid and aeroelastic P1a and P1b) showed some benefits early in the optimization process (i.e., for around 3–10 high-fidelity evaluations). For example, at around five evaluations, three out of four cases were roughly 5–10% ahead of the single-fidelity optimizer. By 20 evaluations, the metric for some of the cases dipped below zero slightly, but generally approached zero (indicating both optimizers were approaching the same solution), with the exception of the higher-dimensional rigid case. Unfortunately, for this particular case, the multi-fidelity optimizer stopped at an optimum that was roughly 1.8% worse than that of the single-fidelity. It is unclear if this was due to errors in gradients, design space complexity (and constraint handling through penalty functions), or numerical noise due to the regeneration of unstructured meshes. Although this stalling could be algorithmic, it was presumed to be unlikely as the penalty-constrained SLSQP result in the previous section also appeared to reach a worse solution than SLSQP with direct constraint handling.
3.6. Summary and Comparison
For reference,
Table 7 lists the initial and final fuel burns and constraint values for the rigid cases (best observed design from either BFGS-based optimizer), while
Table 8 lists them for the aeroelastic cases. Note that the initial designs are not trimmed and could therefore predict significantly different fuel burns when evaluated at level cruise. In particular, the trimmed rigid baseline designs yield fuel burns of 134,318 and 95,035 kg for the 7- and 17-DoF rigid cases, with the corresponding aeroelastic values being 159,717 and 84,744 kg. Among the rigid cases, the higher-dimensional airfoil parameterization yielded slightly lower initial and final fuel burn values when compared to the lower dimensional one. This is to be expected, as the baseline 17-DoF geometry yielded slightly better aerodynamic performance; among optimized geometries, it was able to more finely tune the geometry to improve performance.
The same trend was observed among aeroelastic shape-only cases. For comparison, the SLSQP P1a solution shown in
Section 3.2 (via direct constraint handling) yielded a final fuel burn of 64,015.9 kg with the constraints being [
,
,
]
T, which equates to roughly 2% less than the BFGS optimizers. However, the penalty-constrained SLSQP, which is perhaps a fairer comparison to the BFGS-based optimizers, yielded a fuel burn of 72,519.3 kg with final constraints of [
,
,
]
T, an 11% difference. The same trend was observed to a greater extent for the sizing-only cases, with the higher dimensional airfoil producing a significantly lower fuel burn (although it is possible the P2a result stalled).
Among simultaneous shape and sizing cases (P3a and P3b), the same trend was unfortunately not observed. While P3a yielded a large decrease in fuel burn from P2a, the solution to P3b was in fact worse than P2b (same problem, but lacking shape design variables), as well as P3a (same problem, but with lower-dimensional airfoils). This seems to indicate either a local minimum or, perhaps more likely, simply a difficult problem for the optimizers to solve. Potentially, numerical noise from remeshing, as well as neglecting structural grid velocities (which may be important for the stress gradients) could be contributing factors. Nonetheless, all cases were at least successful in approximately satisfying the constraints and reducing fuel burn compared to the baseline geometry. Additionally, the single- and multi-fidelity all yielded roughly the same solution, with the exception of rigid P1b, where the multi-fidelity optimizer found a slightly worse solution.
For reference,
Figure 14 illustrates the baseline and optimized airfoil shapes for all cases including shape variables, where undeflected jig shapes are shown for the aeroelastic cases. In particular, the top and bottom rows show undeflected airfoil shapes at the Yehudi and tip sections, respectively. Columns from left to right show results for rigid, aeroelastic shape-only, and aeroelastic with sizing cases.
The rigid Yehudi geometries were largely unchanged, although minor shape changes were likely needed to reduce the shock strength. Rigid tip sections varied significantly in twist angle, however. This is to be expected, as aeroelastic deflection tends to cause downward wing twist, so with deflection, optimized rigid and aeroelastic geometries are likely similar in the twist angle. Visually, the aeroelastic optima with and without sizing are difficult to compare; however, both visually and in terms of final fuel burn, a higher-dimensional airfoil section (or an even higher one) appears to be necessary to obtain the “true” optimized geometry.
For reference,
Figure 15 and
Figure 16 show a sample solution to P3a, which yielded the lowest fuel burn among the shape and sizing cases. In particular,
Figure 15 shows the undeflected geometry, as well as the deflected shapes during the cruise and maneuver scenarios. Shown also for reference are the trimmed baseline geometries, which had significantly larger deflections due to the lower aerodynamic efficiency, thus requiring a larger fuel mass to achieve the prescribed cruise range (and more lift to meet load factor constraints, thus leading to more induced and likely wave drag).
Figure 16a shows the thickness distributions of the structure, with the top, middle, and bottom showing top skins, ribs and spars, and bottom skins, respectively.
Figure 16b shows the corresponding stress contours (von Mises stress normalized by yield stress) in the maneuver scenario. For reference, the stress constraint used in the optimization enforces the normalized stress to not exceed 2/3, which is the inverse of the factor of safety (1.5).
To achieve minimum structural mass while remaining feasible, the optimal stress contours would yield a maximum value of 2/3 in each structural component. This appears to be the main driver for much of the wing skins, although a handful of skins still appeared thicker than they needed to be. Elsewhere in the wing, the aerodynamic performance and trim constraints (as well as the KS aggregation) appeared to be influencing the thickness distributions as well. For instance, while the tip skin thicknesses seemed to be approaching the lower bound of 0.005 m, the tip rib and spar thicknesses remained relatively thick in spite of the low stresses in the faces. This could mean that more torsional or bending stiffness was needed to ensure an adequate lift or moment.
In general, the stress and thickness contours compared reasonably well to those of Brooks et al. [
5], although the thicknesses were somewhat larger, leading to a somewhat heavier structure. Aside from optimization performance, one potential cause is the lack of stiffeners modeled in the structure, which may allow for thinner ribs and spars. The skins that intersect the wingbox trailing edge and wing–fuselage junction ribs were also quite thick and may also benefit from modeling stiffeners. There also appeared to be a stress concentration at the Yehudi on the bottom surface. This is due to the shape parameterization, as evidenced in the stress contours. Thus, more designable sections across the wing (as well as variable dihedral) may potentially prevent this.
In order to quantify the benefits of the multi-fidelity optimizer compared to its single-fidelity counterpart,
Table 9 lists the optimization costs and savings for each shape-only case. To achieve this, the best optimum from either optimizer is first identified to serve as the “true” optimum. This yields a total percent reduction in penalty-constrained objective when compared to the starting design. Specific percentages of this total reduction, ranging from 50 to 99%, are then chosen. For each optimizer, the number of high-fidelity function (and gradient) evaluations required to reach each reduction is then identified (for example, for rigid P1a, each optimizer required three function calls to achieve 50% of the total objective reduction). The cost savings associated with using the multi-fidelity optimizer compared to single-fidelity can then be computed as a percentage of the single-fidelity optimizer cost.
For the rigid P1a case, the multi-fidelity optimizer provided cost savings from 75 to 98% of total reduction, ranging from 20 to roughly 40%. However, by the end of the optimization, roughly where a 99% reduction was met, there were no longer any cost benefits. This is generally consistent with the metrics shown in
Figure 11d,
Figure 12d and
Figure 13d, which suggest benefits early in the optimization process, but less as the optimization progresses. Depending on the trust region size, it may also suggest that the minimum trust region size could be increased so that the optimizer more quickly switches to high-fidelity only. The other rigid case, P1b, also showed cost savings for smaller thresholds, but was ultimately unable to achieve the 98 and 99% thresholds. Interestingly, the aeroelastic cases differed in that the savings appeared for somewhat larger thresholds. Potentially, this could mean that the different low-fidelity model is better correlated near the optimum, where shocks in the flow would be weaker.
Figure 17 illustrates representative characteristics of the multi-fidelity optimization for airfoil design. Considering the comparison to the single-fidelity optimization in
Figure 11 and
Table 9, the multi-fidelity approach provided a cost benefit. The multi-fidelity optimizer was able to achieve most of the objective reduction within the first several high-fidelity evaluations. In the rigid P1a case (7-DoF Kulfan), after accepting the first design step, the multi-fidelity optimizer rejected the second step as the corrected low-fidelity model led to a design that increased rather than decreased the high-fidelity objective. Consequently, the trust region was halved. (The initial trust region covered the entire design domain.) The optimizer accepted the majority of subsequent steps, though the trust region steadily decreased to a size near 0.0625–0.125 (in the normalized design space
). This resulted from the expected improvement ratio between the multi-fidelity and high-fidelity objectives being near enough to one to indicate that the corrected low-fidelity model was sufficient to make progress, but not accurate enough to warrant trusting over a larger space. However, after about 25 high-fidelity evaluations, the search entered a region of the design space where the multi-fidelity model was not accurate, and the optimizer rejected a series of design steps that increased the high-fidelity objective and decreased the trust region in an attempt to find a step size where the multi-fidelity model was accurate.
Of the airfoil design cases, the aeroelastic case P1a exhibited the least benefit from the multi-fidelity approach. For the first ten evaluations, the optimizer once again found the multi-fidelity model to be sufficiently accurate and maintained the trust region over the entire design domain. After that point, the multi-fidelity model did not accurately predict the high-fidelity objective, and the trust region was quickly reduced. Once the trust region had fallen below 0.001 for three consecutive iterations (a selectable parameter), the optimization reverted to single-fidelity after approximately 25 evaluations, making use of the approximate Hessian already built using high-fidelity information. Despite reverting to high-fidelity optimization, there was no discernible reduction in the objective function, suggesting that the multi-fidelity optimizer approximately found the minimum.
For reference,
Table 10 lists the optimization costs for the cases with sizing. As discussed previously, the results were generally mixed, with multi-fidelity at times providing some cost savings, but often times not. Interestingly, while the cases could likely be converged more deeply, both optimizers appeared to reach a 99% reduction within 20 high-fidelity evaluations. This is likely dependent on the scale factors of the penalty functions, as the objective is generally around 0.6–2.0, with the penalty functions increasing the penalty-constrained objective by one to two orders of magnitude. Thus, much of the penalty-constrained objective reduction was achieved simply by locating a feasible design space, rather than simply reducing the fuel burn objective.
Figure 18 shows the behavior of the multi-fidelity optimizer for combined structural sizing and shape optimization. Compared to shape optimization alone (rigid and aeroelastic without sizing), the multi-fidelity optimization provided an inconsistent benefit in the 17-DoF airfoil case (P3b) and a cost penalty in the 7-DoF airfoil case (P3a). In case P3a, the multi-fidelity model had acceptable quality to maintain the trust region around 1.0 for approximately 15 evaluations, briefly reducing to 0.25. Then, the trust region was steadily reduced, though a few points were rejected due to increasing the objective function. After approximately 25 evaluations, most of the design steps were rejected and the multi-fidelity optimizer made no significant progress. While case P3b rejected fewer design steps in the first 15 evaluations, the trust region grew steadily smaller as the optimizer rejected almost all subsequent steps. Again, this was due to the expected improvement ratio being negative for both cases. With the poor agreement between high- and multi-fidelity models, the multi-fidelity optimization convergence was slow compared to single-fidelity, leading to the minimal cost benefit or penalty.