Solution Methods in CFD T. H
Solution Methods in CFD T. H
Solution Methods in CFD T. H
T. H. Pulliam∗
NASA Ames Research Center, MS T27B-1 Moffett Field CA, 94035
E-mail: tpulliam@mail.arc.nasa.gov
Abstract
Implicit finite difference schemes for solving two dimensional and three dimensional
Euler and Navier-Stokes equations will be addressed. The methods are demonstrated in
fully vectorized codes for a CRAY type architecture. We shall concentrate on the Beam
and Warming implicit approximate factorization algorithm in generalized coordinates.
The methods are either time accurate or accelerated non-time accurate steady state
schemes. Various acceleration and efficiency modifications such as matrix reduction,
diagonalization and flux split schemes will be presented. Examples for 2-D inviscid and
viscous calculations (e.g. airfoils with a deflected spoiler, circulation control airfoils and
unsteady buffeting) and also 3-D viscous flow are included.
Contents
1. Introduction 5
5. Numerical Algorithm 14
5.1. Implicit Time Differencing . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
5.2. Local Time Linearizations . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
∗
NASA Ames Research Center, USA
1
2 “Solution Methods In Computational Fluid Dynamics”
9. Boundary Conditions 45
9.1. Characteristic Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
9.2. Well Posedness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
9.3. Computational Mapping of Boundaries . . . . . . . . . . . . . . . . . . . . 48
9.3.1. Body Surfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
9.3.2. Free Surfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
9.3.3. Far Field Circulation Correction . . . . . . . . . . . . . . . . . . . . 51
13. Summary 76
4 “Solution Methods In Computational Fluid Dynamics”
Commentary, 1992
These notes were developed and put together in 1984-5 for a lecture series entitled “von
Kármán Institute For Fluid Dynamics Lecture Series : Numerical Techniques for Viscous
Flow Computation In Turbomachinery Bladings”, von Kármán Institute, Rhode-St-Genese,
Belgium , 1985. They may be a little dated today, but they still represent current algorithms
and codes being used on a day to day basis for both research and development. I will update
and correct some of the material, but I will not be attempting to completely modernize these
lectures. Newer topics, e.g. TVD, ENO, are handled better in other forums and I will have
to make my attempt at them some other time.
Finally, I would like to recognized the influence of one of my best friends and my
mentor, Dr. Joseph L. Steger who passed away this year. I first met Joe in the early
days of CFD at NASA Ames (circa 1974) when I started work on my thesis and Joe
became my advisor. I consider myself Joe’s first student and although Joe went on to
teach at Stanford and U.C. Davis and produced many fine CFD researchers, I think my
years with Joe will always be special since they were his and my first experiences as friends
and teacher/student. Joseph Steger was a real pioneer for CFD, he did much of the ground
breaking work in transonics and Euler/Navier-Stokes algorithms. I don’t think he gets much
credit for his transonic work, but if it wasn’t for Joe many of the important advances made
here at NASA Ames would never have happened. We always refer to the “Beam-Warming
algorithm”, but possibly it should be called the “Steger algorithm”. Although Beam and
Warming can be credited with the initial development, Steger had much to do with the
final developments and analysis. More importantly though, is the contribution Joe made in
making the algorithm practical and popular. In 1976, Joe wrote what was then call AIR2D,
based on the Beam-Warming algorithm and generalized coordinate transformations. That
one effort has blossomed into the ARC2D and ARC3D codes and their subsequent impact
on CFD today. Those kernels (gems of ideas) form the basis of most of the practical and
useful codes in existence today. One only has to look anywhere in the literature to see
that impact. Joe’s other developments were equally as important to the whole picture.
He developed elliptic grid generators (GRAPE) and hyperbolic grid generating algorithms.
Joe can also be credited with producing the first practical Parabolized Navier-Stokes (PNS)
codes and Incompressible Navier-Stokes (INS) based on psuedo-compressibility. His work on
the Chimera approach for complicated geometry is currently the workhorse of computation
here at NASA Ames and has been a significant part of the overall CFD effort. But besides
all that, we really have suffered a great loss in losing Joe Steger, he always will be in my
heart, his laugh and friendship are dear to me and I’m sure to all who knew him.
T.H. Pulliam, NASA Ames 5
1. Introduction
Computational fluid dynamics is a mature technology. Even though there is still a
substantial amount of theoretical development necessary before it becomes a consistent
engineering tool, we can produce research codes which can be applied to relevant physical
problems. At present full potential codes and panel methods have been the most widely
used tools in the design cycle. Those methods are computationally less expensive than the
Euler and Navier - Stokes codes and in general more robust and accurate ( mainly due to
the ability to employ a large number of grid points or panels). Euler and Navier - Stokes
codes require more storage and computational work per solution than the classical methods.
Even with the present class of super computers we still have not reached a stage where the
restrictions of computational speed and storage can be ignored. At this stage the Euler and
Navier - Stokes codes available should be considered to be research codes. At first we strive
to demonstrate the feasibility of the numerical technique used, then we should go on to
establish the accuracy, efficiency and robustness of a developed code. These are the areas
in code development and application which require careful consideration.
A wide variety of numerical techniques are in use today. Some have developed to a
high enough level to be used in production codes (see Refs. [1], [2], [3], [4], [5], and [6]),
while other techniques (for example TVD schemes, Ref. [7], [8], and [9]) are just now en-
tering the research code realm. In this presentation we shall concentrate on methods and
techniques which have been applied to various computational fluid flow problems. These
include, implicit finite differences, central space differencing, upwind differencing, approxi-
mate factorization, nonlinear dissipation models, characteristic boundary procedures, grid
refinement - reclustering algorithms and various acceleration techniques for steady state
and time accurate computations. Most of the applications are for external flows, but the
methods have been and are easily extended to internal flows. A lot of the development will
be in 2-D, with the extension to 3-D relatively straightforward.
A series of computer codes that have been developed at NASA Ames Research Center
based on the implicit approximate factorization algorithm of Beam and Warming [1] will be
used for demonstration. A particular application in two dimensions was first presented by
Steger [2] and for three dimensions by Pulliam and Steger [3]. Concurrent with this work
has been the paralleled development and application of MacCormacks method [4]. I shall
concentrate here on the theoretical development, application and assessment of the implicit
algorithm which at this stage has produced two codes, ARC2D a two dimensional version
and ARC3D the three dimensional code. The original development of these codes was more
in the spirit of a demonstration effort, where we were more concerned with demonstrating
the feasibility of the algorithm for general geometries and varying flow cases. A number
of applications appeared over the years in the literature. More recently we have improved
the accuracy, efficiency and robustness of the codes. I shall present below some details of
the current versions of the implicit codes, ARC2D and ARC3D. Notable exceptions will
be discussed. I shall not concentrate on the idiosyncrasies of the coding, I/O or other
programming aspects except where they affect the algorithm application.
6 “Solution Methods In Computational Fluid Dynamics”
where
ρ ρu ρv
ρu ρu2 + p ρuv
Q= , E = , F = , (2.2)
ρv ρuv ρv 2 + p
e u(e + p) v(e + p)
0 0
τxx τxy
Ev = , Fv = (2.3)
τxy τyy
f4 g4
with
τ = t
ξ = ξ(x, y, t)
η = η(x, y, t) (3.1)
The coordinate transformation introduced here follows the development of Viviand [10]
and Vinokur [11]. Curvilinear coordinates are a representation of n-space in which arbri-
tary vectors are represented by two sets of basis vectors (not necessarily orthogonal). An
arbritary vector V (here demonstrated in 2 dimensions) is defined as
V = v 1 e1 + v 2 e2
with ei covariant basis vectors and v i the contravariant components of V. Since we don’t
require the basis vectors to be orthogonal, another set of component extracting basis vectors
are required, the contravariant basis vectors ei . The basis vectors satisfy the relationship
ei · ej = δi,j . So that, contravariant component v i = ei · V. Another representation of V is
in terms of covariant basis vectors ei with
V = v 1 e1 + v 2 e2
The transformations are chosen so that the grid spacing in the curvilinear space is
uniform and of unit length, see Figure 1. This produces a computational space ξ and η
which is a rectangular domain and which has a regular uniform mesh so that standard
unweighted differencing schemes can be used in the numerical formulation. The original
Cartesian space will be referred to as the physical domain. Typically there will be a one to
one correspondence between a physical point in space and a computational point, except
for regions where there are singularities or cuts due to the topology. In those cases it may
be necessary to map one physical point to many computational points (this usually occurs
at computational boundaries). With this construction we can produce one computational
code for a wide variety of physical geometries and grid systems.
Chain rule expansions are used to represent the Cartesian derivatives ∂x and ∂y of
Equation 2.1 in terms of the curvilinear derivatives where in matrix form
∂t 1 ξt ηt ∂τ
∂ =
x 0 ξx η x ∂ξ (3.2)
∂y 0 ξy ηy ∂η
∂τ Q + ξt ∂ξ Q + ηt ∂η Q + ξx ∂ξ E + ηx ∂η E + ξy ∂ξ F + ηy ∂η F =
Re−1 (ξx ∂ξ Ev + ηx ∂η Ev + ξy ∂ξ Fv + ηy ∂η Fv ) (3.3)
T.H. Pulliam, NASA Ames 9
∂τ = ∂t + xτ ∂x + yτ ∂y , ∂ξ = xξ ∂x + yξ ∂y , ∂η = xη ∂x + yη ∂y (3.4)
Solving Equation 3.5 for the curvilinear derivatives in terms of the Cartesian derivatives
yields
∂t (xξ yη − yξ xη ) (−xτ yη + yτ xη ) (xτ yξ − yτ xξ ) ∂τ
∂x = J 0 yη −yξ ∂ξ (3.6)
∂y 0 −xη xξ ∂η
where J −1 = (xξ yη − xη yξ ). Evaluating Equation 3.6 for the metric terms by comparing to
the matrix of Equation 3.2 we find that
For simplicity, we examine only the inviscid terms, the derivation for the viscous terms
is similar. Collecting all the terms into two groups,
T erm1 + T erm2 = 0
where
If T erm2 is eliminated then the strong conservation law form of the equations results,
T erm1 = 0. Assuming solutions such that Q 6= 0, E 6= 0, and F 6= 0, the expressions, from
T erm2 ,
are defined as invariants of the transformation and will be shown to be analytically zero.
Substituting the metric definitions, Equation 3.7, into the invariants, Equation 3.10 we have
Now analytically differentiation is commutative and each of the above terms then sums
to zero. This eliminates T erm2 of Equation 3.10 and the resulting equations are in strong
conservation law form.
There is an important problem associated with these invariants which can be discussed
now. If T erm1 is evaluated for uniform flow,
ρ = 1,
u = M∞ ,
v = 0,
1 1 2
e = + M∞ (3.12)
γ(γ − 1) 2
then the resulting equations which must sum to zero (if we require that our equations
satisfy free stream) are exactly composed of the invariants, Equation 3.10. Now when
numerical differencing is applied to these equations (as developed in the Section 5.) then
the numerical difference formulas used to evaluate the spatial differences of the fluxes and
T.H. Pulliam, NASA Ames 11
the finite difference forms used to calculate the metrics must satisfy the commutative law.
It is not true in general that finite difference derivatives are commutative, (second order
central differences are, but mixing second order and fourth order formulas is not). As we
shall see, the central difference formulas used in two-dimensions can produce consistent
invariants, but in three-dimensions it is not a straightforward procedure.
It should be at least a minimum requirement of any finite difference formulation that
the finite difference equations satisfy free stream flow. Care must be taken to insure that
the finite difference formulation is consistent in this area or at least we should recognize and
correct as much as possible any errors of this type. Hindman [13], Pulliam and Steger [3] and
Flores et. al. [14] have investigated this area for a variety of finite difference formulations.
The Navier-Stokes equations written in generalized curvilinear coordinates are
∂τ Q bv + ∂η Fbv ]
b + ∂η Fb = Re−1 [∂ξ E
b + ∂ξ E (3.13)
where
ρ ρU ρV
ρu ρuU + ξx p ρuV + ηx p
b = J −1 b = J −1
Q , E , Fb = J −1 (3.14)
ρv ρvU + ξy p ρvV + ηy p
e U (e + p) − ξt p V (e + p) − ηt p
with
U = ξt + ξx u + ξy v, V = ηt + ηx u + ηy v (3.15)
rigid surfaces. The resulting grid systems usually have fine grid spacing in directions nearly
normal to the surfaces and coarse grid spacing along the surface, see Figure 2.
Even though we may program the full Navier-Stokes equations, the viscous terms asso-
ciated with derivatives along the body will not be resolved and in most cases for attached
and mildly separated flows these terms are negligible. The terms in the near normal will
be resolved for sufficiently fine grid spacing and these are substantial terms.
In boundary layer theory, appropriate scaling arguments show that streamwise compo-
nents of the viscous terms can be neglected relative to the normal terms. We rely upon
similar arguments as a justification for the thin layer approximation.
The thin layer approximation requires that
1. All body surfaces be mapped onto coordinate surfaces. For example, η = constant
coordinate surfaces, see Figure 1.
2. Grid spacing is clustered to the body surfaces such that sufficient resolution for a
particular Reynolds number is obtained. ( At least one or two grid points in the
sublayer).
3. All the viscous derivatives in the ξ direction are neglected, while the terms in the η
direction are retained. All of the inviscid terms are used.
The thin layer approximation is similar in philosophy but not the same as the boundary
layer theory. The normal momentum equation is solved and pressure can vary through the
boundary layer.
T.H. Pulliam, NASA Ames 13
The thin layer approximation can break down for low Reynolds numbers and in regions
of massive flow separation. It is not a necessary step into the development of the equations
and numerical algorithm. The full Navier-Stokes equations are incorporated in cases where
sufficient resolution was provided and the physical situation warranted it. The thin layer
Navier-Stokes equations have been widely used for a variety of applications.
4.1. Thin - Layer Equations
Applying the thin layer approximation to Equations 3.1, 3.14, 3.15 and Equation 3.16,
where all the viscous terms associated with ξ derivatives are neglected we obtain
∂τ Q b + ∂η Fb = Re−1 ∂η Sb
b + ∂ξ E (4.1)
where
0
ηx m1 + ηy m2
Sb = J −1 (4.2)
ηx m2 + ηy m3
ηx (um1 + vm2 + m4 ) + ηy (um2 + vm3 + m5 )
with
flows. A wide variety of two equation turbulence models are available, and as one quickly
finds out in this area, no single model seems universal or completely adequate. One aspect
of using turbulence models which is often overlooked is that adequate resolution is always
required to get reasonable results reguardless of the turbulence model employed. Typically,
the inaccuracy or inadequacy of a solution is not the fault of the turbulence model, but
rather a lack of proper resolution in the viscous and even inviscid regions of the flowfield.
5. Numerical Algorithm
There are a number of considerations to weigh when choosing a numerical algorithm
to apply to a set of partial differential equations. If we restrict ourselves to finite differ-
ence schemes then the possibilities are narrowed somewhat to the two classical approaches
for time integration, explicit and implicit techniques. The merits of either of these two
have been extensively discussed in the literature. Explicit methods typically require less
computational work and are simpler both in derivation and application. Implicit methods,
while computationally expensive, have less stringent stability bounds (classical stability
analysis shows unconditional stability but in practice on nonlinear problems bounds are
encountered).
Implicit numerical schemes are usually chosen because we wish to obtain solutions which
require fine grid spacing for numerical resolution, and we do not want to limit the time steps
by employing a conditionally stable explicit scheme. Explicit schemes are very useful and
schemes such as MacCormack’s explicit algorithm [4] have seen a lot of use and are even
in wide use today. The extra work required for an implicit scheme is usually offset by the
advantages obtained by the increased stability limits, and in general implicit schemes have
been very useful and successful for a variety of inviscid and viscous flowfield calculations.
With the advent of high speed vector and parallel processors one must also consider
the degree to which a certain algorithm can be vectorized/parallelized when choosing a
scheme. As a rule explicit schemes are more easily vectorized/parallelized than implicit
schemes. But implicit schemes can be fully vectorized and have been sucessfully employed
on parallel machines. This requires though a substantial amount of temporary storage and
a commitment to the details of data management, see for instance, Lomax and Pulliam [18].
Another consideration is the question of time accuracy verses non-time-accurate steady
state iteration. For unsteady, transient problems we wish to employ time accurate methods,
initialize the flow with some realizable state and integrate forward in time with time steps
commensurate with the unsteady phenomena which is being calculated. Both implicit and
explicit methods are capable of computing time accurately. In steady state calculation
we wish to integrate from some arbitrary state to the asymptotic solution in any manner
which will get us there in the least amount of computational work. Non-time-accurate
techniques (for instance relaxation methods, variable time steps, matrix preconditioning,
large time steps) can be employed as long as they are convergent and do not distort the
steady state equations so as to produce inaccurate results. The methods presented below
can be employed either for time accurate calculations or for steady state rapidly convergent
T.H. Pulliam, NASA Ames 15
solutions.
The algorithm to be presented is an implicit approximate factorization finite difference
scheme which can be either first or second order accurate in time. Local time linearizations
are applied to the nonlinear terms and an approximate factorization of the two-dimensional
implicit operator is used to produce locally one-dimensional operators. This results in block
tridiagonal matrices, which are easy to solve. The spatial derivative terms are approximated
with second order central differences. Explicit and implicit artificial dissipation terms are
added to achieve nonlinear stability. A spatially variable time step is used to accelerate
convergence for steady-state calculations. A diagonal form of the algorithm is also discussed,
which produces a computationally efficient modification of the standard algorithm where
the diagonalization results in scalar tridiagonal or pentadiagonal operators in place of the
block operators. This diagonal form of the algorithm produces a robust, rapid and versatile
scheme for steady state calculations. We also discuss the details of a matrix reduction
scheme, due to Barth and Steger [19] where the block matrices of the standard implicit
scheme are reduced to sets of lower rank matrices (e.g. two scalars and a 2 × 2 in 2-D).
5.1. Implicit Time Differencing
Consider Equation 4.1 (the derivation will be done for the thin layer equations but is
easily extended to the full Navier-Stokes) and apply an implicit three point time differencing
scheme of the form, Warming and Beam [20]
³ ´ · ¸
b n = ϑ∆t ∂ ∆Q
∆Q bn + ∆t ∂ b n
Q +
ϕ b n−1 + O (ϑ − 1 − ϕ)∆t2 + ∆t3 (5.1)
∆Q
1 + ϕ ∂t 1 + ϕ ∂t 1+ϕ 2
where ∆Qbn = Q b n+1 − Q
b n and Qb n = Q(n∆t).
b The parameters ϑ and ϕ can be chosen to
produce different schemes of either first or second order accuracy in time.
For ϑ = 1 and ϕ = 0, we have the first order Euler implicit scheme, and for ϑ = 1 and
ϕ = 1/2, the three point implicit scheme.
Let us restrict ourselves to the first order in time scheme (although all of the subsequent
development can easily be extended to any second order scheme formed from Equation 5.1).
Applying Equation 5.1 to Equation 4.1, results in
³ ´
b n+1 − Q
Q bn + h E
b n+1 + Fb n+1 − Re−1 Sbn+1 = 0 (5.2)
ξ η η
with h = ∆t.
5.2. Local Time Linearizations
We wish to solve Equation 5.2 for Q b n+1 given Q
b n . The flux vectors E b ,Fb and Sb are
b
nonlinear functions of Q and therefore Equation 5.2 is nonlinear in Q b n+1 . The nonlinear
b n by a Taylor Series such that
terms are linearized in time about Q
b n+1 = E
E bn + A
bn ∆Q
b n + O(h2 )
16 “Solution Methods In Computational Fluid Dynamics”
Fb n+1 = Fb n + B
b n ∆Q
b n + O(h2 )
h i
Re−1 Sbn+1 = Re−1 Sbn + M
cn ∆Q
b n ) + O(h2 ) (5.3)
where Ab = ∂ E/∂
b Q b , B
b = ∂ Fb /∂ Q
b and M
c = ∂ S/∂
b Q b are the flux Jacobians and ∆Q
b n is
O(h).
Note that the linearizations are second order accurate and so if a second order time
scheme had been chosen the linearizations would not degrade the time accuracy.
The Jacobian matrices are Ab or B
b=
κt κx κy 0
−uθ + κx φ2 κt + θ − (γ − 2)κx u κy u − (γ − 1)κx v (γ − 1)κx
(5.4)
−vθ + κy φ2 κx v − (γ − 1)κy u κt + θ − (γ − 2)κy v (γ − 1)κy
2
θ[φ − a1 ] κx a1 − (γ − 1)uθ κy a1 − (γ − 1)vθ γθ + κt
This is the unfactored form of the block algorithm. We shall call the right hand side of
Equation 5.7 the “explicit” part and the left hand side the “implicit” part of the algorithm.
T.H. Pulliam, NASA Ames 17
∆ξ = 1 and ∆η = 1 (5.9)
Second order central difference operators can be used where for example,
The choice of the type and order of the spatial differencing is important both in terms
of accuracy and stability. In most applications second order accuracy has proven to be
sufficient provided the grid resolution is reasonable. The choices for differencing type include
central and upwind operators. These choices are dictated by stability, and in the next section
we discuss what motivates certain choices.
5.4. Stability Analysis of Difference Forms
The choice of the type of difference forms to use for the Euler equations can be justified
by a linear stability analysis. For simplicity, let us examine a one dimensional coupled
system of linear equations of the form
Qt + AQx = 0 (5.13)
where A is analogous to the flux Jacobian matrix. Assume that A has a complete set of
real eigenvalues and eigenvectors (a property that the Euler flux Jacobians have) then
Λ = X −1 AX (5.14)
Multiplying Equation 5.13 by X −1 and combining terms using Equation 5.14 we have
with W = X −1 Q. Since A is linear and constant the eigenvector matrix X −1 can be brought
through the derivatives.
The resulting system is now uncoupled and we can examine the representative model
equation
wt + λwx = 0 (5.16)
where λ represents an eigenvalue of A.
We shall apply different finite difference approximations for the spatial derivative and
use Fourier analysis to determine conditions on λ for stability.
If the second order central difference operator is applied to the model equation one gets
(wj )t + λ (wj+1 − wj−1 ) /(2∆x) = 0 (5.17)
where j is the spatial index. This is the ODE (ordinary differential equation) approach to
the analysis, since now we are dealing with a system of ODE’s.
Classical Fourier analysis can be performed by assuming periodic boundary conditions
and a solution of the form
w(xj , t) = eαt eiβj∆x (5.18)
√
with i = −1 and x = j∆x.
Substituting this into Equation 5.16 yields
³ ´
αeαt eiβj∆x + λ eαt eiβ(j+1)∆x − eαt eiβ(j−1)∆x /(2∆x) = 0 (5.19)
The stability of the ODE is dependent on the sign of <(α) (the real part). Obviously,
if <(α) > 0 then w(x, t) will grow unboundedly with time.
For Equation 5.19
³ ´
α = −λ eiβ∆x − e−iβ∆x /(2∆x) = −λi sin(β∆x)/∆x (5.20)
Since α is pure imaginary (<(α) = 0) the scheme is stable in the ODE sense independent
of the sign of λ.
If one-sided difference formulas are employed, conditions on λ arise. For simplicity, let
us consider first order one-sided differences.
Applying forward differencing to the model Equation 5.16 gives
(wj )t + λ (wj+1 − wj ) /∆x = 0 (5.21)
so that,
³ ´
α = λ 1 − eiβ∆x /∆x = λ [1 − cos (β∆x) + i sin (β∆x)] /∆x (5.23)
Since cos (β∆x) is bounded by 1, <(α) will be less than zero if λ < 0. So for forward
spatial differencing λ must be less than zero for stability. A similar argument for first order
backward differencing shows that λ > 0 for stability. It can be shown that for higher order
central and one sided differences the stability requirements on λ remain the same.
These results have a direct application to the choice of differencing for the Euler equa-
tions. As we shall see below the inviscid flux Jacobians have eigenvalues (equivalent to λ)
with both positive and negative sign. In their basic form the only stable spatial differencing
is central differencing, but as we shall see when flux splitting is used or when the eigenvalues
can be restricted to one sign then upwind differencing can be employed. A class of upwind
schemes shall be discussed in Section 6.2..
5.5. Matrix Form of Unfactored Algorithm
We now turn to examining the matrices we get when difference formulas are applied
to the implicit algorithm. It is always instructive to examine the matrix structure of any
finite difference equation. With the application of central differences to Equation 5.7 it
is easy to show that the implicit algorithm produces a large banded system of algebraic
equations. Let the mesh size in ξ be Jmax and in η Kmax. Then the banded matrix is a
(Jmax · Kmax · 4) × (Jmax · Kmax · 4) square matrix of the form
h i
I + h∂ξ Abn + h∂η B
b n − Re−1 h∂η M
c ⇒
I A B
−A I A B
−A I A B
−A I A B
−B −A I A B
.. .. .. .. ..
. . . . .
(5.24)
−B −A I A B
−B −A I A B
−B −A I A
.. .. .. ..
. . . .
−B −A I A
−B −A I
where the variables have been ordered with j running first and then k.
The matrix is sparse but it would be very expensive (computationally) to solve the
algebraic system. For instance, for a reasonable two-dimensional calculation of transonic
20 “Solution Methods In Computational Fluid Dynamics”
flow past an airfoil we could use approximately 80 points in the ξ direction and 40 points in
the η direction. The resulting algebraic system is a 12,800 × 12,800 matrix problem to be
solved and although we could take advantage of its banded sparse structure it would still
be very costly in terms of both CPU time and storage.
5.6. Approximate Factorization
As we have seen, the integration of the full two-dimensional operator can be very expen-
sive. One way to simplify the solution process is to introduce an approximate factorization
of the two-dimensional operator into two one-dimensional operators. The implicit side of
Equation 5.7 can be written as
h i
cn ∆Q
b n − hRe−1 δη M
I + hδξ Abn + hδη B bn =
h i h i
I + hδξ Abn b n − hRe−1 δη M
I + hδη B cn ∆Q
bn
b n ∆Q
−h2 δξ Abn δη B b n + h2 Re−1 δξ A cn ∆Q
bn δη M bn (5.25)
Noting that ∆Qb n is O(h), one sees that the cross terms (h2 terms) are second order in
time and can be neglected without lowereing the time accuracy below second order.
The resulting factored form of the algorithm is
h ih i h i
I + hδξ Abn b n − hRe−1 δη M
I + hδη B cn ∆Q
b n = −h δξ E
b n + δη Fb n − Re−1 δη Sbn (5.26)
We now have two implicit operators each of which is block tridiagonal. The structure
of the block tridiagonal matrix is
I A
−A I A
−A I A
h i .. .. ..
bn
I + hδξ A ⇒
. . .
(5.27)
−A I A
−A I A
−A I A
−A I
Note that the thin layer implicit viscous term M c is kept with the η factor. Since it is a
three point stencil, it will not affect the tridiagonal structure. Also when vectorization and
parallization issues are considered the one dimensional form of the factored algorithm will
be advantageous.
The solution algorithm now consists of two one-dimensional sweeps, one in the ξ and
one in the η direction. The block matrix size is now at most (max[Jmax, Kmax] · 4) ×
(max[Jmax, Kmax] · 4). Each step requires the solution of a linear system involving a
T.H. Pulliam, NASA Ames 21
block tridiagonal which is solved by block LUD (lower-upper decomposition). The result-
ing solution process is much more economical than the unfactored algorithm in terms of
computer storage and CPU time.
5.7. Reduced Forms of The Implicit Algorithm
Even though the factorization has improved the efficiency of the block implicit algorithm
the major expense of the implicit scheme still resides in the block tridiagonal inversions.
Compared to standard explicit algorithms the implicit scheme is still computationally ex-
pensive. The increased stability bounds of the implicit scheme offsets some of this disad-
vantage especially for problems where refined grids are used. In general, this holds true for
time accurate applications where mesh refinement would unduly restrict the time steps for
explicit schemes, but developments in multigrid techniques, see Jespersen [21] for a review,
applied to steady state problems requires us to reexamine the implicit schemes. One way to
capture back the advantage is to make the implicit scheme less computationally expensive,
we will discuss other ways, such as accelerated convergence and improved accuracy, in later
sections.
To improve the efficiency of a numerical scheme we can modify or simplify the algorithm
so that the computational work is decreased. Most of the computational work for the
implicit algorithm is tied to the block tridiagonal solution process. One way to reduce that
work would be to reduce the block size for the tridiagonals. This can be accomplished by
reducing the equation set from four variables (density, x-momentum, y-momentum, and
energy) to three variables (density and the two momentum) by assuming constant total
enthalpy, H = (e + p)/ρ = H0 or similar thermodynamic approximations. The energy
equation is then replaced by the thermodynamic relation and the simplified set of equations
can be solved. Such approximations can be restrictive in terms of the physical situations
where they can be applied.
5.7.1. Diagonal Form
The computational work can also be decreased by introducing a diagonalization of the
blocks in the implicit operators as developed by Pulliam and Chaussee [22]. The eigensystem
of the flux Jacobians Ab and B b are used in this construction. For now let us again restrict
ourselves just to the Euler equations, the application to the Navier-Stokes is discussed later.
The flux Jacobians Ab and B
b each have real eigenvalues and a complete set of eigenvectors.
Therefore, the Jacobian matrices can be diagonalized, see Warming, Beam and Hyett [23],
b ξ
Λξ = Tξ−1 AT b η
and Λη = Tη−1 BT (5.28)
with Tξ the matrix whose columns are the eigenvectors of Ab and Tη the corresponding
b They are written out in the Appendix.
eigenvector matrix for B.
Here we take the factored algorithm in delta form, Equation 5.26 and replace Ab and B
b
22 “Solution Methods In Computational Fluid Dynamics”
At this point Equation 5.26 and 5.29 are equivalent. A modified form of Equation 5.29
can be obtained by factoring the Tξ and Tη eigenvector matrices outside the spatial derivative
terms δξ and δη . The eigenvector matrices are functions of ξ and η and therefore this
modification reduces the time accuracy to at most first order in time, as shown in [22]. The
resulting equations are
b [I + h δη Λη ] T −1 ∆Q
Tξ [I + h δξ Λξ ] N bn = R
bn (5.30)
η
b = T −1 Tη , see Appendix.
where N ξ
The explicit side of the diagonal algorithm (the steady-state finite difference equations)
is exactly the same as in the original algorithm, Equation 5.26. The modifications are
restricted to the implicit side and so if the diagonal algorithm converges, the steady-state
solution will be identical to one obtained with the unmodified algorithm. In fact, linear
stability analysis would show that the diagonal algorithm has exactly the same unconditional
stability as the original algorithm. (This is because the linear stability analysis assumes
constant coefficients and diagonalizes the blocks to scalars, the diagonal algorithm then
reduces to the unmodified algorithm.) The modification (pulling the eigenvector matrices
outside the spatial derivatives) of the implicit operator does affect the time accuracy of the
algorithm. It reduces the scheme to at most first order in time and also gives time accurate
shock calculations a nonconservative feature, i.e., errors in shock speeds and shock jumps,
see [22]. The steady-state is in full conservation law form since the steady-state equations
are unmodified. Also, computational experiments by Pulliam and Chaussee [22] have shown
that the convergence and stability limits of the diagonal algorithm are similar to that of
the unmodified algorithm.
The diagonal algorithm reduces the block tridiagonal inversion to 4 × 4 matrix multi-
plies and scalar tridiagonal inversions. The operation count associated with the implicit side
of the full block algorithm is 410 multiplies, 356 adds, and 10 divides, a total of 776 opera-
tions, while the diagonal algorithm requires 233 multiplies, 125 adds, and 26 divides or 384
operations. Adding in the explicit side and other overhead such as I/O (input/output) and
initialization, the overall savings in computational work can be as high as 40%. In fact the
computational work can be further decreased by noting that the first two eigenvalues of the
system are identical (see Appendix). This allows us to combine the coefficient calculations
and part of the inversion work for the first two scalar operators.
The diagonal algorithm as presented above is really only rigorously valid for the Euler
equations. This is because we have neglected the implicit linearization of the viscous flux
Sbn in the implicit operator for the η direction. The viscous flux Jacobian M cn is not
T.H. Pulliam, NASA Ames 23
simultaneously diagonalizable with the flux Jacobian B b n and therefore to retain the full
diagonalization we neglect it. For viscous flows we have investigated four options. One
possibility is to use the block tridiagonal algorithm in the η direction and include the
viscous Jacobian M cn . This increases the computational work and restricts us from using
some of the convergence acceleration techniques which will be discussed below. Another
option is to introduce a third factor to the implicit side of Equation 5.26 where we use
h i
cn
I − hRe−1 δη M (5.31)
This again increases the computational work since we now have an added block tridiagonal
inversion. We take these measures though because of the questionable stability of just
completely neglecting the implicit viscous terms. The third option is to throw caution to the
wind and actually neglect the viscous Jacobian, thereby gaining the increased efficiency of
the diagonal algorithm. As long as the algorithm remains stable and convergent, the steady
state obtained is identical for all three options since the explicit side is unchanged. The
fourth option is to include a diagonal term on the implicit side which is an approximation
to the viscous Jacobian eigenvalues. The estimate (taken from an examination of the terms
of the Jacobian Mc) currently used is
³ ´
λv (ξ) = µRe−1 J −1 ξx2 + ξy2 Jρ−1
³ ´
λv (η) = µRe−1 J −1 ηx2 + ηy2 Jρ−1 (5.32)
which are added to the appropriate implicit operators in Equation 5.30 with a differencing
stencil taken from Equation 5.11. The new form of the diagonal allgorithm is given as
b [I + h δη Λη − h I δηη λv (η)] T −1 ∆Q
Tξ [I + h δξ Λξ − h I δξξ λv (ξ)] N bn = R
bn (5.33)
η
The terms in Equation 5.32 which are contained under the overbar are distinguished from
the Jρ−1 because the application to the difference forms require those terms to be averaged
as in Equation 5.12. The ξ term is not added if the thin layer approximation is used. In all
the above cases the explicit viscous operator is unchanged from the standard algorithm.
We have compared the four options for a number of test cases. For the first option, block
tridiagonal with second order implicit dissipation the convergence rate was the slowest. For
the second option, the third factor, fast convergence rates and stability were obtained at
the expense of more computation. The third option, neglecting the viscous flux Jacobian,
produced identical stability and convergence as the second option in most cases but re-
quired less computational work. In the fourth option (which is the recommended form) the
convergence rates are typically the best and the overall robustness of a numerical code is
improved. In all cases the converged solutions are identical.
5.7.2. Pressure–Velocity Splitting
Another way is to reduce the block size by similarity transformations as proposed by
Steger [24]. This was originally restricted to Cartesian variables. Barth and Steger [19] have
24 “Solution Methods In Computational Fluid Dynamics”
removed some of this restriction and developed an algorithm where two scalar tridiagonals
and one block two by two tridiagonal inversion is required. The basic concept can be
demonstrated in two-dimensional Cartesian coordinates, see Barth and Steger [19] for the
extension to generalized coordinates.
The development of the sound speed - velocity splitting begins by considering the non-
conservative form of the Euler equations
∂t R + M ∂x R + N ∂y R = 0 (5.34)
where
ρ u ρ 0 0 v 0 ρ 0
u 0 u 0 ρ−1 0 v 0 0
R= , M = , N = −1 (5.35)
v 0 0 u 0 0 0 v ρ
p 0 γp 0 u 0 0 γp v
The eigenvalues of coefficient matrices, M and N, are the usual characteristic speeds
These coefficient matrices can each be split into two submatrices: one derived from the
velocity part of the eigenvalues and the other from the sound speed part of the eigenvalues. A
particular matrix splitting (there are many possibilities) was chosen to satisfy the following
conditions
Λ(M ) = Λ(Mu ) + Λ(Mc ), Λ(Mu ) = (u, u, u, u), Λ(Mc ) = (0, 0, c, −c) (5.37)
Λ(N ) = Λ(Nv ) + Λ(Nc ), Λ(Nv ) = (v, v, v, v), Λ(Nc ) = (0, 0, c, −c) (5.38)
v 0 ρ 0 0 0 0 0
0 v 0 0 0 0 0 0
N = Nv + Nc = + (5.40)
0 0 v 0 0 0 0 ρ−1
0 0 0 v 0 0 γp 0
Given the coefficient matrices M and N, a similarity transformation exists that trans-
forms these matrices into their conservative counterpart, the flux Jacobians A and B.
T.H. Pulliam, NASA Ames 25
0 0 0 0
(u2 + v 2 )/2 −u −v 1
Ac = (γ − 1) ,
0 0 0 0
c
a41 c
a42 −uv u
0 0 0 0
0 0 0 0
Bc = (γ − 1) (5.41)
(u2 + v 2 )/2 −u −v 1
bc41 −uv bc43 v
where
Au = A − Ac , Bv = B − Bc (5.44)
This splitting produces matrices Au and Bv that are more complex than A and B. But it
is found that Q is an eigenvector of Au and Bv , i.e.
Au Q = uQ, Bv Q = vQ (5.45)
Insertion of Equation 5.46 into the equations for local linearization of the Jacobians, the
Cartesian equivalent of Equation 5.3), produces
with
The end result of this splitting is that the new operators Lx and Ly form matrices that
no longer require 4 × 4 block tridiagonal inversions. In matrix operator form, we have
1 0 0 0 u 0 0 0
0 1 0 0 c c
a21 u + a22 a23c ac24
Lx = + θ∆tδx (5.52)
0 0 1 0 0 0 u 0
0 0 0 1 ac41 ac42 ac43 u + ac44
1 0 0 0 v 0 0 0
0 1 0 0 0 v 0 0
Ly = + θ∆tδy (5.53)
0 0 1 0 bc31 bc32 v + bc33 bc34
0 0 0 1 bc41 bc42 bc43 v + bc44
where ac and bc are the respective elements of Ac and Bc given by Equation 5.41.
In the Lx operator, for example, the first and third rows decouple from the system and
can be solved as scalar tridiagonal matrices with their respective right-hand-sides. Once
these rows are solved, the elements of the first and third columns can be moved to the
right-hand-side. The second and fourth equations remain coupled and are solved as a 2 × 2
block tridiagonal matrix. The block decoupling of the Ly operator is even more conspicuous
and is inverted (i.e., solved for) in a similar manner.
The use of the pressure–velocity splitting has substantially reduced the computational
work over the basic block implicit scheme. A typical 2 × 2 block tridiagonal requires 55
operations per point, so the overall inversion, including the two scalar tridiagonals, requires
73 operations per entry. Because the two scalar tridiagonals have identical coefficients, this
work can be even further cut by solving them together.
The matrix splitting produces the flux vectors
where
ρu 0 ρv 0
ρu2 p ρuv 0
Eu = , Ec = , Fv = , Fc = (5.55)
ρvu 0 ρv 2 p
ue up ve vp
Note that the Jacobians of Ec and Fc are not Ac and Bc as defined above. Usually, the
use of implicit linearizations which are not the Jacobians of the explicit flux vectors leads to
restricted stability bounds or unconditional instability. Linear stability analysis presented
by Barth and Steger, as well as numerical experiment have shown though that the use of
Ac and Bc leads to unconditional stability.
The generalized coordinate form of pressure–velocity splitting is developed in Barth
and Steger [19]. A rotation transformation is used to align the momentum equations with
generalized coordinate directions, e.g. in the ξ direction they use
1 0 0 0
ξx ξy
0 L1 0
Cξ =
ξ
L1
ξx
(5.56)
0 − Ly1 L1 0
0 0 0 1
q
with L1 = ξx2 + ξy2 . This produces the transformed splitting matrix
0 0 0 0
L1 (u2 +v2 )
e
−U −Vb L1
Ac = (γ − 1)
2
(5.57)
0 0 0 0
ac41 ac42 − ULVb1 U
where
c2 c2 U2
ac41 = U [(u2 + v 2 )/2 − ], ac42 = L1 − (5.58)
(γ − 1)2 (γ − 1)2 L1
with Vb = ηy u − ηx v.
The structure of Equation 5.57 is identical to Equation 5.41 so that the implicit operators
in generalized coordinates are again reducible to 2 scalars and one 2 × 2 block operator for
each direction. Barth and Steger also discuss the application of pressure–velocity splitting
to the Navier-Stokes equations.
28 “Solution Methods In Computational Fluid Dynamics”
We see that central differencing in two dimension does satisfy the invariant relations.
This becomes obvious when one realizes that second order central differencing operators
commute, i.e. δξc δηc = δηc δξc . This is not true for general differences, e.g. δξb δηc 6= δξc δηb . Take
the case of central differencing to form metrics and one sided backward differencing for the
fluxes. Then we have,
∇ξ δη y − ∇η δξ y = [yj,k+1 − yj−1,k+1 − yj,k−1 + yj−1,k−1 ]/2
+ [−yj+1,k + yj+1,k−1 + yj−1,k − yj−1,k−1 ]/2 6= 0 (5.60)
The error associated with not satisfying the invariant relations is truncation error equal
to or less than the lowest order accurate operator used. The error can be eliminated by
modifying the difference formulas, for example introducing simple averages. The equiva-
lent relationships for three dimensions can be very complicated. In most cases the error
introduced is small except in regions of large mesh spacing or large distorted cells (high
aspect ratios). It should be stressed though, that the satisfaction of the invariant relations
is at least a high priority for any flow code. Hindman [13] has investigated this area for the
Euler equations and Flores et.al. [14] give an interesting account of similar problems and
solutions for the conservative full potential equations.
6. Artificial Dissipation Added to Implicit Schemes
Even though linear stability analysis shows unconditional stability for the implicit al-
gorithm, in practice stability bounds are encountered. This is especially true in strongly
nonlinear cases, such as flows with shocks.
T.H. Pulliam, NASA Ames 29
Whenever discrete methods are used to “capture” shocks (as opposed to fitting them), or
to compute high Reynolds number viscous behavior, scales of motion appear which cannot
be resolved by the numerics. These can be brought about by the nonlinear interactions in
the convection terms of the momentum equations. If scale is represented by wave length
or frequency, it can be easily shown that two waves interact as products to form a wave of
higher frequency (the sum of the original two) and one of lower frequency (the difference).
The lower frequencies do not cause a problem, but the continual cascading into higher and
higher frequencies does. It is accounted for physically by shock formation (the harmonic
analysis of a discontinuity contains all frequencies ) or by viscous dissipation of the very high
wave numbers. In numerical computations it can not be ignored and must be accounted
for in the algorithm constructed. In any finite discrete mesh the cascading frequencies can
eventually exceed the capacity of the mesh resolution at which point they can either alias
back into the lower frequencies or pile up at the higher frequency side. In either case, if
uncontrolled, these terms can lead to serious inaccuracies and possible numerical instability.
The most common way of coping with the high-frequency cascade is to add to the
complete algorithm some form of numerical dissipation with an error level that does not
interfere with the accuracy of any physical viscous effects. This can be done in a variety of
ways.
6.1. Constant Coefficient Implicit and Explicit Dissipation
Historically, in the class of implicit finite difference codes developed in the mid 1970’s, a
common procedure was to add explicit fourth difference artificial dissipation to the central
difference algorithm of the form
bn
−²e ∆tJ −1 [(∇ξ ∆ξ )2 + (∇η ∆η )2 ]J Q (6.1)
which is added to the right-hand side of Equation 5.26 and implicit second difference smooth-
ing
which was inserted into the respective implicit block operators. Second difference implicit
dissipation was used to keep the block implicit operators tridiagonal. The difference oper-
ators are defined as
and are applied at all interior points. The parameter ²e is chosen to be O(1) and ²i = 2²e .
The smoothing terms are scaled with ∆t which makes the steady state independent of the
time step.
It is important to assess the effect on stability when these terms are added. In Section7.2.,
we provide a linear analysis of the effect of added dissipation on stability. We summarize
30 “Solution Methods In Computational Fluid Dynamics”
the results here. In the original development of the implicit algorithm we only added in
the explicit dissipation, but this lead to a linear stability bound which was dependent on
the magnitude of ²e ∆t. The implicit second difference term was added to eliminate this
stability bound. The proper approach would be to make the fourth difference dissipation
implicit. This would then necessitate the use of block pentadiagonal solvers which is too
computationally expensive. The second difference implicit dissipation stabilizes the algo-
rithm and allows us to retain block tridiagonal inversions. Linear analysis shows that if
²i À ²e then unconditional stability is obtained. It should be noted that in practice for
nonlinear problems the total algorithm has large but conditional stability bounds.
Beam and Bailey [25] suggest that while the implicit second difference dissipation im-
proves the practical stability bound, the use of fourth difference implicit dissipation match-
ing the explicit terms produces larger stability bounds and enhanced convergence. This
is consistent with a concept which I will discuss in more detail below. That is, maximum
stability bounds and optimal convergence rates are only achieved if we properly linearize the
explicit side of the algorithm. In this case a proper linearization of the explicit dissipation
produces improved stability and convergence. Beam and Bailey employed a block penta-
solver which greatly increased the computational work and storage. We take advantage of
the diagonal algorithm to produce a much more efficient scheme. Within the framework
of the diagonal scheme we can replace the four scalar tridiagonals with scalar pentadiago-
nals which is just a minor increase in computational work. The resulting scheme has the
advantage of increased stability bounds and convergence rates with the total computation
work still less than the standard block tridiagonal scheme. Computational experiments
demonstrate the increased efficiency and stability.
The approach of adding a constant coefficient fourth difference explicit dissipation can
produce some problems which are only evident in the case of refined meshes. Initially,
because of computer limitations we only employed coarse grids and this type of dissipation
was sufficient to produce stability and limited accuracy. With the advent of more powerful
computers we have gone to grid refinement especially to resolve shocks. The use of the
above type of fourth difference dissipation with refined meshes produces wild oscillations
near shocks even in cases where the computation is completely stable and converged. In
Figure 3, we show a converged solution for a NACA 0012 airfoil at a transonic Mach number,
M∞ = 0.8 and angle of attack, α = 0◦ isolating the region near the shock.
As can be seen, the solution seems perfectly fine except in the region of the shock where
a large every other point oscillation is evident. Varying the coefficient of artificial dissipation
over a fairly wide range did not alter the nature of this oscillation. This is obviously an
undesirable result which can be eliminated as shown below.
an upwind fashion. Other schemes of a similar nature but based on complicated theories are
the flux difference scheme of Osher and Chakravarthy [29] and Harten’s TVD methods [30].
These schemes all claim (with good justification) to be physically consistent since they follow
in some sense the characteristics of the flow. They in general can be shown to produce sharp
oscillation free shocks without added artificial dissipation. They are, though, complicated
schemes which are just now being applied to complicated flowfield situations. Also these
schemes have an inherent amount of internal dissipation, due to the one sided differences,
which cannot be modified or decreased. It may be advantageous to have the flexibility of a
simple central difference scheme with a controllable amount of artificial dissipation.
It can be shown (as done below) that the upwind schemes have an equivalence to central
difference schemes with added dissipation. The central schemes are much simpler and more
flexible and are therefore desirable if the dissipation can be added in an analogous fashion
to the upwind schemes.
The plus - minus flux split method of Steger and Warming [26] will be used here to
demonstrate the dissipative nature of upwind schemes. The approach taken is to split the
eigenvalue matrix Λ of the flux Jacobians into two matrices, one with all positive eigenvalues
and the other with all negative eigenvalues. Then the similarity transformations X or Y
are used to form new matrices A+ , A− and B + , B − . Formally,
A = XΛA X −1 = X(Λ+ −
A + ΛA )X
−1
= A+ + A− (6.4)
with
ΛA ± |ΛA |
Λ±
A = (6.5)
2
32 “Solution Methods In Computational Fluid Dynamics”
Here, |Λ| implies that we take the absolute values of the elements of Λ. The two matrices,
A+ and A− have by construction all positive and all negative eigenvalues, respectively.
New flux vectors can be constructed as
E = AQ = (A+ + A− )Q = E + + E −
F = BQ = (B + + B − )Q = F + + F − (6.6)
Different types of spatial differencing can now be used for each of the new flux vectors. One
stable form is to use one sided backward differencing for the positive terms and one sided
forward differencing for the negative terms. The one-sided difference operators are usually
either first order accurate
uj,k − uj−1,k uj+1,k − uj,k
∇bξ uj,k = and ∆fξ uj,k = (6.7)
∆ξ ∆ξ
or second order accurate
3
uj,k − 2 uj−1,k + 21 uj−2,k
δξb uj,k = 2
∆ξ
− uj,k + 2 uj+1,k − 12 uj+2,k
3
δξf uj,k = 2 (6.8)
∆ξ
Similar expressions are used for the η derivatives. Note that ∆ξ = 1, bull will appear in
formulas where its presence conveys meaning.
The plus-minus matrices, A+ and A− can be written as
µ ¶
Λ ± |Λ| A ± |A|
A± = X X −1 = (6.9)
2 2
which gives
A |A| E |A|
E ± = A± Q = Q± Q= ± Q (6.10)
2 2 2 2
Similar expressions are obtainable for the B matrices and flux vector F .
Examining the flux derivative
1h b i
(δξ + δξf )E + (δξb − δξf )|A|Q (6.13)
2
which is a second order central difference operator, but not δξ . The other term of Equa-
tion 6.13 is of more interest, where
1
(δξb − δξf )/2 = (E +2 − 4E +1 + 6I − 4E −1 + E −2 )/(4∆ξ) = (∆ξ ∇ξ )2 (6.15)
4∆ξ
which is a fourth difference stencil. The difference operators are defined in Equation 6.3.
Now Equation 6.13 can be written as
µ ¶
1
δξ E + (∆ξ ∇ξ )2 |A|Q (6.16)
4∆ξ
The form now is a second order central difference term plus fourth difference dissipation.
The dissipative term is a natural consequence of the upwind differencing. It is interesting
to note that the central difference term Equation 6.14 is not the standard three point
difference. If first order formulas are employed for the upwind differences then a similar
analysis would produce the standard second order three point central differencing plus a
second differencing dissipative term. For instance, Equation 6.16 is replaced by
µ ¶
1
δξ E − (∆ξ ∇ξ )|A|Q (6.17)
2∆ξ
We note a number of things from the form of Equations 6.16, 6.17 which can guide us
in developing artificial dissipation models for a central difference scheme. Adding fourth
difference dissipation to a central difference produces the equivalent of some second order
upwind scheme. The use of second difference dissipation can produce a first order upwind
equivalent. Research has shown that applying flux limiters to upwind schemes and some
of the TVD concepts suggest that the best approach for an upwind algorithm is to use a
locally first order upwind difference at a shock and second order elsewhere. This can be
accomplished by some switching and transitioning of second difference and fourth differ-
ence dissipation added to a central scheme. The coefficients for the dissipation parts of
Equation 6.16, 6.17 suggest some sort of flux Jacobian scaling where for instance a spectral
radius of the Jacobians could be used.
34 “Solution Methods In Computational Fluid Dynamics”
with
(2)
²j,k = κ2 ∆t max(Υj+1,k , Υj,k , Υj−1,k )
|pj+1,k −2pj,k +pj−1,k |
Υj,k = |pj+1,k +2pj,k +pj−1,k |
(4) (2)
²j,k = max(0, κ4 ∆t − ²j,k ) (6.19)
where typical values of the constants are κ2 = 1/4 and κ4 = 1/100. Similar terms are used
in the η direction. The term σj,k is a spectral radius scaling and is defined as
q
σj,k = |U | + a ξx2 + ξy2 (6.20)
The first term of Equation 6.18 is a second difference dissipation with an extra pressure
gradient coefficient to increase its value near shocks. The second term is a fourth difference
(4)
term where the logic to compute ²j,k switches it off when the second difference nonlinear
coefficient is larger then the constant fourth difference coefficient. This occurs right near a
shock. In Figures 4 and 5, we show solutions for the flow problem of Figure 3, using this
nonlinear artificial dissipation. For Figure 4 we employ just the fourth difference term, i.
e. κ2 = 0.
The oscillations at the shock are eliminated and a sharp shock is obtained. In this case
though there is an overshoot and undershoot at the top and bottom of the shock which is
eliminated in Figure 5 by adding the second difference term, κ2 = 1/4.
The results shown are fully converged to machine zero and in the case of Figure 5
represent the current quality of our shock capturing capabilities. The chosen values of the
coefficients have, at least to date, been static and are not changed from case to case.
The implicit dissipation used with Equation 6.18 is the linearization of the equation
treating the pressure coefficient Υ and the spectral radius σ as space varying functions
but ignoring their functional dependency on Q. Then the dissipation is linear in Qj,k and
is added to the diagonal algorithm again necessitating scalar pentadiagonal solvers. This
produces a very efficient, stable and convergent form of the implicit algorithm.
Near computational boundaries we modify the fourth difference dissipation so as to
maintain a dissipative term. A derivation and analysis of various boundary treatments in
T.H. Pulliam, NASA Ames 35
given in Ref. [31]. The modification is needed at the first interior point (e.g. Qj+1,k ) where
the five point fourth difference term Qj+2,k − 4Qj+1,k + 6Qj,k − 4Qj−1,k + Qj−2,k is to be
applied. There the point Qj+2,k doesn’t exist, the formula is modified to a one sided second
difference term with the differencing stencil −2Qj+1,k + 5Qj,k − 4Qj−1,k + Qj−2,k . Similar
formulas are used at other boundaries.
6.4. Total Variation Diminishing Schemes, TVD
The development of monotone, flux vector/difference splitting, TVD and other nonoscil-
latory schemes can be found in numerous publications, see for example, Refs. [7], [8], [9], [26],
[27], [28], [29], [30]. Here we shall just briefly define the conditions for a class of TVD schemes
introduced by Harten [30].
The conditions for a scheme to be TVD in Harten’s sense can be developed for the scalar
hyperbolic conservation law
∂u ∂f (u)
+ =0 (6.21)
∂t ∂x
where f (the flux) is a nonlinear function of u . We can define a characteristic speed
a(u) = ∂f /∂u.
A one parameter family of schemes can be defined
µ ¶ µ ¶
un+1
j + λθ hn+1
j+ 1
− hn+1
j− 1
= unj − λ(1 − θ) hnj+ 1 − hnj− 1 (6.22)
2 2 2 2
rewritten as
where unj = u(j∆x, n∆t), λ = ∆t/∆x, θ parameterizes the equations from the fully explicit
to fully implicit forms, and h is the numerical flux function with hj± 1 = h (uj∓1 , uj , uj±1 , uj±2 ).
2
For Equations 6.22, 6.23 the conditions due to Harten [30] are
and
³ ´ ³ ´
T V Lun+1 ≥ T V un+1 (6.27)
µ ¶n+1
un+1
j − −
λθ Cj+ 1 ∆j+ 1 u − C
+
∆ 1u
j− 1 j−
=
2 2 2 2
µ ¶n
− +
unj + λ(1 − θ) Cj+ 1 ∆j+ 1 u − C ∆ 1u
j− 1 j−
(6.28)
2 2 2 2
with C ∓ bounded functions. Sufficient conditions for Equations 6.26, 6.27 are
for all j :
∓
λ(1 − θ)Cj+ 1 ≥0
2
µ ¶
+ −
λ(1 − θ) Cj+ 1 + C
j+ 1
≤1
2 2
±
−∞ < C ≤ −λθCj+ 1 ≤ 0 (6.29)
2
for finite C.
These conditions can be used to analyze and construct various TVD schemes. Refer to
References [30], [7], and [8] for two forms of high resolution (at least second order accurate)
TVD schemes applied to hyperbolic conservation law equations.
7. Time Accuracy, Steady States, Convergence and Stability
7.1. Time Accuracy vrs Steady-State Computation
The implicit algorithm is designed to be time accurate where second order accuracy
can be maintained and the equations are integrated through time from some meaningful
initial condition to the the solution at time T . In this case the time step is chosen to be
commensurate with some time scale of the problem. The evolution of the solution through
time is physically realistic and good solution accuracy is dependent on the mesh spacing
and boundary conditions.
The equations can also be applied to steady-state problems. Typically we employ the
first order scheme in time and attempt to accelerate the algorithm by various non-time-like
maneuvers. The equations are then integrated from an arbitrary initial condition to a time
asymptotic state. Any procedure which drives us to the steady-state faster must also be
stable and accurate at convergence. It might be expected that large time steps could be
used to drive the solution to the steady-state faster. As we shall see, based on linear analysis
large time steps can increase the convergence rate, but for factored forms the limit of the
amplification factor (a measure of the maximum convergence rate) as h = ∆t → ∞ is 1.
38 “Solution Methods In Computational Fluid Dynamics”
e eiβ x
w = w(t) (7.1)
and treat the spatial derivative ∂x analytically, then examine the temporal differencing
schemes in one and two dimensions. This gives us the purely transient one dimensional
model problem
w
et + λx w
e=0 (7.2)
with λx = iβλ.
The delta form of the first order implicit algorithm is
e n = −hλx w
(1 + h λx ) ∆w en (7.3)
e 0 is some initial value. The term in the brackets is the amplification factor, σ.
where w
e n → 0 and the transient can actually be eliminated directly for large h.
For h → ∞, w
In contrast, let us examine a two-dimensional factored implicit scheme for the two-
dimensional transient problem
w
et + λx w
e + λy w
e=0. (7.5)
This is the two-dimensional counterpart to Equation 7.2. Applying the first order implicit
approximate factorization delta algorithm to Equation 7.5 we have
e n = −h (λx + λy ) w
[1 + h λx ] [1 + h λy ] ∆w en (7.6)
en = w
Expanding ∆w e n+1 − w
e n and combining terms we have
" ¡ ¢ #n
1 + h2 λx λy
en =
w e0
w (7.7)
(1 + h λx + h λy + h2 λx λy )
T.H. Pulliam, NASA Ames 39
and so |σ| → 1 as h → ∞.
A close examination of this result shows that the factorization has destroyed the good
convergence characteristics at large time steps. The factoring error term has introduced a
h2 term in the numerator of the amplification factor. Therefore the factored schemes do
not have good convergence characteristics for large time steps. Actually, there is a range of
moderately large time steps where the amplification factor is a minimum, see for instance
Abarbanel, Dwoyer, and Gottlieb [32]. Convergence can therefore be accelerated by using
a time step which minimizes the amplification factor.
Note that for the delta form of the algorithm (either factored or unfactored) the steady-
state solution is independent of the time step, h. (There are numerical schemes where this
is not the case, such as Lax-Wendroff.) Therefore, the time step path to the steady-state
does not affect the final solution and we can envision using time step sequences or spatially
variable time steps to accelerate convergence.
B. Space Varying ∆t
Manipulation of the time step can have a substantial influence on convergence even
within the framework of the factored algorithms. If only a steady state solution is required,
one can let h (or ∆t) change in space. This approach can be viewed as a way to condition the
iteration matrix of the relaxation scheme defined via Equation 5.26 or Equation 5.30. Use
of a space varying ∆t can also be interpreted as an attempt to use a more uniform Courant
number throughout the field. In any event, changing ∆t can be effective for grid spacings
that vary from very fine to very coarse - a situation usually encountered in aerodynamic
simulations where grids contain a wide variety of length scales.
A space varying ∆t has been used in both explicit and implicit schemes ( e.g. Shang and
Hankey [33], McDonald and Briley [34], Shirnivasan et al [35], Coakley [36], Jameson [6],
etc ). As a rule one wishes to adjust ∆t at each point proportional to the grid spacing and
the characteristic speed of the flow. Something like the Courant number restriction ( which
for the Euler equations in multi-dimensions is a bit of an approximation).
For highly stretched grids the space variation of the grid is the most important parameter
to scale with. In subsonic and transonic flow the characteristic speeds have moderate
variation and we have found that a purely geometric variation of ∆t is adequate, specifically
∆t|ref
∆t = √ (7.8)
1. + J
To illustrate the advantage of using a variable time step, Figure 6 shows the degradation
in convergence rate when a constant step size is substituted for the variable time step in a
NACA 0012 test case. For this comparison all other possible parameters were held constant
and no other changes were employed. We should note at this time that the above variation
of time step has often worked poorly in viscous flow until the numerical dissipation terms
40 “Solution Methods In Computational Fluid Dynamics”
were also put in implicitly as described later. Also other forms of the variable step size
sometimes perform better than Equation 7.8, for example
∆t|ref
∆t = q (7.9)
|U | + |V | + a ξx2 + ξy2 + ηx2 + ηy2
which is approximately a constant CFL condition. However, Equation 7.9 is more costly to
compute then Equation 7.8.
C. Mesh Sequences
For inviscid airfoil calculations on a grid of O(250 x 50) practical convergence is usually
obtained in 500-600 fine grid iterations when the flow field has been started from an initial
condition of uniform free stream flow. Typically the first 100 to 200 iterations on the fine
mesh are needed to get past the initial transients which can be a substantial portion of
the total solution time. For instance, in the above test case it takes on the order of 600
fine grid iterations for a tight convergence criteria (e.g. lift to 5 decimal places) , 200 of
which are spent on clearing out the impulsive start. One way to accelerate convergence
to a steady state is to obtain a good initial guess for a fine mesh by first iterating on a
sequence of coarse grids and then interpolating the solution up to the next refined grid.
Such a mesh sequence procedure can often reduce the amount of time required to obtain a
solution to plotable accuracy by a factor of two. Also, because a coarse grid tends to damp
high frequency waves, using a mesh sequence procedure can improve the overall robustness
of the code.
T.H. Pulliam, NASA Ames 41
detrimental effect on the convergence rates of an implicit algorithm for steady-state com-
putations.
Consider the model problem in one-dimension (equivalent to Equation 5.16 with a con-
venient change in notation),
qt + aqx = 0 (7.10)
Applying the first-order time accurate Euler implicit scheme in delta form to Equation 7.10
and adding explicit fourth-difference dissipation (β4 > 0), implicit second-difference dissi-
pation (α2 > 0), and implicit fourth-difference dissipation (α4 > 0) gives the algorithm
h i ³ ´
1 + haδx − hα2 ∇x ∆x + hα4 (∇x ∆x )2 (q n+1 − q n ) = −h aδx + β4 (∇x ∆x )2 q n (7.11)
Fourier analysis using q n = wn eikj j∆x (with kj the wave number in x) produces
h i ³ ´
1 + haλx − hα2 µx + hα4 µ2x (wn+1 − wn ) = −h aλx + β4 µ2x wn (7.12)
where λx = 2i sin(kj ∆x)/∆x represents the Fourier signature for the central difference δx ,
µx = −2 + 2 cos(kj ∆x) the signature of the second-difference dissipation operator ∇x ∆x ,
and µ2x for the fourth-difference dissipation.
The amplification factor for wn+1 = σwn is then
¡ ¢
1 + h (α4 − β4 )µ2x − α2 µx
σ= (7.13)
1 + h (aλx − α2 µx + α4 µ2x )
1 − hβ4 µ2x
σ= (7.14)
1 + haλx
Now, since λx is pure imaginary and has a minimum of 0, and −4 ≤ µx ≤ 0 the explicit
stability bound is hβ4 < 81 . This is a limit on the product of h and β4 and therefore one
can always find a combination which will be stable. But, for arbritrary h, especially in the
case where large h are used to accelerate convergence, this bound is too restrictive.
T.H. Pulliam, NASA Ames 43
In the second case, implicit second-difference dissipation can eliminate the above stabil-
ity bound. The amplification factor σ is now
The numerator term λx can only improve the stability bounds since it is pure imaginary,
so it is taken at its minimum, 0. Let α2 = 2β4 and apply the stability condition |σ| ≤ 1
which results in the condition −2 ≤ −hβ4 µx (4 + µx ). Since µx ≤ 0, the condition can be
rewritten as −2 ≤ hβ4 |µx |(4 + µx ) which is satisfied because −4 ≤ µx . Therefore, using
α2 = 2β4 leads to unconditional stability. The disadvantage of this is form is evident from
the amplification factor, Equation 7.15. Even though the scheme is unconditionally stable,
σ → 1 as h → ∞. In fact, the amplification factor has a minimum at a finite h and then
asymptotes rapidly to 1 as h increases. For this reason, large h cannot be used to accelerate
convergence even in this simple one-dimensional example.
In contrast, the third case of implicit and explicit dissipation is unconditionally stable
and has good convergence characteristics for large h. The amplification factor σ for α4 = β4
and α2 = 0 is
1
σ= (7.16)
1 + h (aλx + α4 µ2x )
advantage of vectorized computer processors and have been implemented for the Control
Data 205 and the CRAY 1-S and X-MP. On a single processor of the X-MP a vectorized
version of the code runs approximately 20 times faster than the original code which was
written for the Control Data 7600.
Traditionally gains in computational efficiency due to improved numerical algorithms
have kept pace with gains due to increased computer power. Since the ARC2D and ARC3D
codes were introduced, a variety of algorithmic changes have been individually tested and
have been shown to improve overall computational efficiency. These include use of a spa-
tially varying time step (∆t), use of a sequence of mesh refinements to establish approximate
solutions, implementation of various ways to reduce inversion work, improved numerical
dissipation terms, and more implicit treatment of terms. Although the various individual
algorithm improvements can interact with each other, sometimes adversely making opti-
mization difficult, their combined effect has lead to an order of magnitude gain in compu-
tational efficiency for steady state applications. This is a gain equivalent to that achieved
with computer hardware. Unsteady flow calculations have also benefited from some of the
above improvements.
We now summarize the two basic algorithms used in the code ARC2D, for details of
ARC3D see Section 8.. The standard algorithm is used mainly for time accurate calcula-
tions. The equations are reproduce from Equation 5.26
h ih i
I + hδξ Abn b n − hRe−1 δη M
I + hδη B cn ∆Q
bn = R
bn
T.H. Pulliam, NASA Ames 45
h i
b n = −h δξ E
R b n + δη Fb n − Re−1 δη Sbn (8.1)
This scheme consists of first forming the right hand side, R b n then performing two block
tridiagonal inversions. Central differences are used for the flux and Jacobian differences.
The dissipation models used are the implicit second order, Equations 6.2, added to the
appropriate implicit operator to keep the band width tridiagonal and the explicit nonlinear
term Equation 6.18. Since this scheme is used for time accurate calculation the typical
time step will be small enough to assure stability even though the explicit dissipation
operator is not properly linearized. The advantage of this scheme is time accuracy while
the disadvantage is substantial computational work.
In most instances we are interested in steady state computations. In that case we can
take advantage of simplifications such as the diagonal algorithm as long as the scheme
converges and we do not distort the steady state equations. The diagonal algorithm used
in ARC2D is
b [I + h δη Λη ] T −1 ∆Q
Tξ [I + h δξ Λξ ] N bn = R
bn (8.2)
η
In this case we always employ the nonlinear dissipation models, Equation 6.18 with a
linearization of the terms which necessitates the use of scalar pentadiagonal solvers. (Note
that the implicit artificial dissipation terms are placed inside the bracketed terms and are
operated on by the similarity transformations). This form of the diagonal scheme gives us a
very efficient code in terms of computational work and enhanced stability and convergence
due to the proper linearization of the explicit steady state equations. The time accuracy
though is at most first order. We also employ the variable time step Equations 7.8, 7.9 and
mesh sequencing to accelerate convergence.
We shall employ ARC2D and ARC3D to demonstrate various aspects of algorithm
improvements, accuracy, and application in the remainder of these notes.
9. Boundary Conditions
The implementation of a sophisticated numerical algorithm and the development of
a flow code usually are trivial tasks when compared with the work of actually solving a
particular fluid dynamics problem. We can always assess the applicability of a numerical
algorithm based on stability and accuracy considerations. The writing of specialized input
and output routines, while not unimportant, is usually mechanical. The real stumbling
block comes with the selection, implementation and assessment of boundary conditions
(BC).
There is a hierarchy of decisions which are made when the boundary condition problem
is attacked. The important aspects of boundary condition development are:
1. The physical definition of the flow problem must be satisfied. For example, inviscid
flow requires tangency at solid surfaces, or we may want to specify pressure at some
boundary.
46 “Solution Methods In Computational Fluid Dynamics”
2. The physical conditions must be posed in terms of the mathematics of the problem.
Characteristic theory suggests the number of conditions required at a boundary. The
condition of no slip for viscous flow is imposed by setting the flow velocities to zero at
solid surfaces.
3. The mathematical conditions are numerically approximated.
4. The numerical interior scheme may require more boundary information than the physics
provides. For example, standard central differencing as an interior scheme requires all
flow quantities at boundaries, but this may not be consistent with mathematical theory.
Additional numerical boundary conditions may be adjoined.
5. The combination of interior numerical scheme and boundary scheme should be checked
for stability and accuracy.
6. Finally, we must assess the efficiency and generality of a flow code in terms of its ability
to handle a wide variety of problems and flow conditions.
The physical definition of the flow problem is the first and foremost consideration. Once
a geometry and topology have been chosen, then physics dictates the constraints on the
boundaries.
9.1. Characteristic Approach
The concept of characteristic theory is best demonstrated with the one-dimensional
Euler equations, where
∂t Q + ∂x (AQ) = 0 (9.1)
represents the model equation. Assuming that A is a constant coefficient matrix we can
diagonalize Equation 9.1 using the eigenvector matrix X, so that
³ ´ ³ ´
∂t X −1 Q + ΛA ∂x X −1 Q = 0. (9.2)
At the left boundary of a closed physical domain, see Figure 9, where say 0 < u < a,
(for example, subsonic inflow for a channel flow) the two characteristic speeds u, u + a
are positive, while u − a is negative. At inflow then, only two pieces of information enter
the domain along the two incoming characteristics and one piece leaves along the outgoing
characteristic. At the outflow boundary one piece of information enters and two leave.
We would like to specify the first two components of W , which are the two incoming
characteristic variables and then handle the third characteristic variable such that its value
is not constrained, i.e., is determined by the interior flow.
T.H. Pulliam, NASA Ames 47
It is not necessary to fix values only in terms of the characteristic variables, other
flow quantities could be employed, as long as they lead to well posed conditions (that is,
conditions which guarantee the stability of the mathematical problem). Yee [37] provides
an excellent survey and development of boundary conditions within the framework of an
implicit algorithm. The major constraints which occur are that the correct number of
boundary values corresponding to incoming characteristics are specified and that the actual
implementation is stable and well posed. Chakravarthy [38] presents an implicit charac-
teristic boundary procedure. In this the eigenvectors of the system are coupled with the
chosen fixed boundary values and one sided finite differences to develop an equation which
is solved for the flow variables at the boundaries.
9.2. Well Posedness
A simple check on the well posedness of boundary conditions is obtained as part of
Chakravarthy’s development. As an example, let us consider one-dimensional flow with
subsonic inflow and subsonic outflow. Then two variables can be specified at inflow, associ-
ated with the first two eigenvalues, and one variable can be specified at outflow, associated
with the third eigenvalue. As specified values take, ρ = ρin , ρu = (ρu)in and p = pout .
These can be written as
q1
Bin (Q) = q2 = Bin (Qin ) (9.4)
0
0
Bout (Q) = 0 = Bout (Qout ) (9.5)
1 2
(γ − 1)(q3 − 2 q2 /q1 )
with q1 = ρ, q2 = ρu, q3 = e.
Forming the Jacobians Cin = ∂Bin /∂Q, and Cout = ∂Bout /∂Q we have
1 0 0 0 0 0
Cin = 0 1 0 and Cout = 0 0 0 (9.6)
0 0 0 ((γ − 1)/2) u2 −(γ − 1)u γ − 1
48 “Solution Methods In Computational Fluid Dynamics”
and
2
1 − u2 (γ − 1)a−2 (γ − 1)ua−2 −(γ − 1)a−2
2
C out = β[(γ − 1) u2 − ua] β[a − (γ − 1)u] β(γ − 1) (9.9)
2
(γ − 1) u2 −(γ − 1)u γ−1
These matrices are formed by adjoining the eigenvector associated with the outgoing
characteristic to the Jacobian matrices of the boundary conditions. The inverses of the
above matrices will exist if det(C) is nonzero. For the two boundaries we have det(C in ) =
β(γ − 1) 6= 0 and det(C out ) = β(γ − 1)a 6= 0 and so the boundary conditions are well posed.
Other choices for specified boundary values can be similarly checked.
Rather than go into any more detail on boundary condition theory we refer the reader
to Refs. [37] and [38]. For the remainder of this section, we shall discuss some of the current
types of physical conditions which are used in ARC2D.
9.3. Computational Mapping of Boundaries
Usually in a flow field computation, we are faced with a variety of boundary surfaces
and conditions. Our experience has been mostly in external flows and so we shall outline
below some of the more commonly used boundary conditions. The curvilinear coordinate
transformations are made such that physical boundaries are mapped to boundaries in the
computational domain. This makes the formulation and implementation of boundary con-
ditions easier.
At this stage in the development of ARC2D we have kept the boundary conditions
explicit. This gives us a more flexible clean code since all BC are handled in just one
subroutine. We realize that implicit boundary treatment will enhance the stability and
convergence rates of the codes. Our experience has been that the basic code in its present
form is fairly robust and can be implemented for a wide variety of cases. The user is then
responsible for the implementation of boundary conditions.
T.H. Pulliam, NASA Ames 49
A particular set of BC for an airfoil calculation is used below for demonstration purposes.
The geometry is mapped onto the computational rectangle such that all the boundary
surfaces are edges of the rectangle, for example see Figure 10 .
In “O” mesh topologies the wake cut boundary is periodic and can be handled as such
where periodic solvers are used in the implicit inversions.
9.3.1. Body Surfaces
At a rigid body surface, tangency must be satisfied for inviscid flow and the no slip
condition for viscous flow. In two-dimensions body surfaces are usually mapped to η =
constant coordinates. The normal component of velocity in terms of the curvilinear metrics
is given by
ηx u + ηy v
Vn = q (9.10)
(ηx2 + ηy2 )
ηy u − ηx v
Vt = q (9.11)
(ηx2 + ηy2 )
Therefore, tangency is satisfied by Vn = 0 (no flow through the body). The tangential
velocity Vt is obtained at the body surface through linear extrapolation for inviscid cases
and is set to zero for viscous cases. The Cartesian velocities are then formed from the
inverse relation between them and Equations 9.10, 9.11 where
50 “Solution Methods In Computational Fluid Dynamics”
à ! " # à !
u 1. ηy ηx Vt
=q (9.12)
v (ηx2 + ηy2 ) −ηx ηy Vn
The extrapolation of Vt produces less error if the mesh lines are clustered to the body
surface. The velocities of Equations 9.10, and 9.11 are scaled such that the metric variations
are removed which decreases the errors in the extrapolations, especially for nonorthogonal
meshes.
The pressure on the body surface is obtained from the normal momentum equation
where n is the local normal to the body surface. Equation 9.13 is solved at the surface using
central second-order accurate differences in ξ and one-sided first- or second-order accurate
differences in η. For steady uniform incoming flow free-stream stagnation enthalpy H0 is held
constant along the body in inviscid flow. Using the equation for enthalpy H0 = (e+p)/ρ and
the computed velocities and pressure, a value of density is obtained at the body. Adiabatic
or constant temperature walls are used for viscous and unsteady flows to obtain density at
the surface. In either case, total energy e is decoded from the equation of state.
9.3.2. Free Surfaces
Stretched grids are usually used to place far field boundaries far away from body surfaces.
When bow shocks and attached shocks are generated at a body surface care is taken to
ensure that the shocks are sufficiently weak when they reach far field boundaries so that
they are not reflected or at least they reflect outside the flow domain. A nonreflective
characteristic like boundary procedure is used at far field boundaries.
For subsonic free stream locally one-dimensional Riemann invariants are used at the
outer far field boundaries. The locally one-dimensional Riemann invariants are given in
terms of the normal velocity component as
The Riemann invariants R1 , R2 are associated with the two characteristic velocities (locally
one-dimensional) λ1 = Vn − a and λ2 = Vn + a respectively. Two other equations are
needed so that four unknowns (the four flow variables) can be calculated. We choose Vt and
S = ln(p/ργ ) where S is entropy. At the far field boundaries shown in Figure 10, the normal
n is directed away from the boundary. For subsonic inflow Vn < 0 and the characteristic
velocity λ2 > 0, therefore the characteristic variable R2 can be specified along with two other
conditions. The Riemann invariant R2 , Vt and S are all set to free stream values. The other
characteristic velocity λ1 < 0 and R1 is extrapolated from the interior flow variables. On
T.H. Pulliam, NASA Ames 51
subsonic outflow Vn > 0 and λ2 > 0 while λ1 < 0 so only R1 is fixed to free stream and R2 ,
Vt and ln(S) are extrapolated. Once these four variables are available at the boundary the
four flow variables Q can be obtained. For supersonic inflow boundaries all flow variables
are specified and for supersonic outflow all variables are extrapolated.
Along singularities or cuts in the geometry (such as the wake cut in a “C” mesh), aver-
aging is used to provide continuous flow variables. As mentioned above periodic conditions
are used for “O” meshes.
9.3.3. Far Field Circulation Correction
For lifting airfoils in subsonic free stream, circulation at the far field boundary is ac-
counted for to first-order (following Salas, et. al. [39]) by imposing a compressible po-
tential vortex solution which is added as a perturbation to the free stream quantities
(u∞ = M∞ cos(α) and v∞ = M∞ sin(α)). The perturbed far field boundary velocities
are defined as
βΓ sin(θ)
uf = u∞ + ¡
2 sin2 (θ − α)
¢ (9.15)
2πr 1 − M∞
and
βΓ cos(θ)
vf = v∞ − ¡
2 sin2 (θ − α)
¢ (9.16)
2πr 1 − M∞
are polar coordinates to the point of application on the outer boundary relative to an origin
at the quarter chord point on the airfoil center line. A corrected speed of sound is also used
which enforces constant free stream enthalpy at the boundary where
µ ¶
1
a2f = (γ − 1) H∞ − (u2f + vf2 ) (9.17)
2
Equations 9.15, 9.16 and 9.17 are used instead of free stream values in defining the fixed
quantities for the far field characteristic boundary conditions to be consistent with the
surface lift.
Figure 11 shows the coefficient of lift Cl plotted against the inverse of the distance to the
outer boundary for a NACA 0012 airfoil at the transonic condition M∞ = 0.8, α = 1.25◦
and at subcritical conditions M∞ = 0.63, α = 2.0◦ . In these cases the outer boundary
varies for 4.5 chords to 96 chords where outer mesh rings were eliminated from the 96
chord grid to produce the cut down meshes. This insures that the grid spacing between
the body and outer boundary is identical for all the cases. Without the far field vortex
correction the lift of the subcritical case can vary by as much as 12 % as seen in Figure 11.
With the far field vortex logic the subcritical case now has virtually no variation with outer
boundary distance. For the transonic case we see roughly a 1 - 2 % change which is quite
52 “Solution Methods In Computational Fluid Dynamics”
Figure 11: Ėffect on Lift of Varying Outer Boundary Distances With and Without Vortex
Correction.
good considering the strength of the shocks. The typical distance chosen for most cases
presented here is 25 chords.
The vortex correction logic can be modified to produce boundary conditions which allow
one to compute the angle of attack for a given lift. This is done by fixing the circulation
Γ in Equations 9.15, 9.16,and 9.17, at its value for the given lift. An iterative procedure
is used where the lift computed at the surface is compared to the desired lift and then the
initial angle of attack is modified by the formula
∆α = −βα (Cl (input) − Cl (calculated)) (9.18)
with βα a relaxation parameter on the order of 2 . Computations in which a specified
lift resulted in an angle of attack were compared with fixed α solutions at the same Mach
number and showed excellent agreement. This procedure has been verified in numerous
numerical examples.
10. Geometry and Grid Generation
The generalized coordinate transformation produces a system of equations which can
be applied to any regular and nonsingular geometry or grid system. The advantages of
this form are : Since uniform unit spacing is used, the computational domain has a one to
one correspondence with the positive integers and therefore regular unweighted difference
formulas can be used in the numerical scheme. This produces a computer code which can be
applied to a wide variety of problems without modification of the equations and numerical
scheme. Physical boundary surfaces can be mapped onto coordinate surfaces, which makes
T.H. Pulliam, NASA Ames 53
application of boundary conditions easier. The transformation allows for unsteady motion
of the coordinates, so that moving meshes and distorting surfaces can be computed. Grid
lines can be concentrated in regions of high gradients, for instance clustered to body surfaces
to calculate boundary layers, or clustered near shocks.
There are a wide variety of methods for generating grid systems. Algebraic methods
such as conformal mappings, quadratic functions, or the control function approach of Eise-
man [40] have been widely employed. The numerical approach of using elliptic solvers,
Thompson, Thames and Mastin [41], is also widely used. Thompson [42] provides a good
review of the current state of the art in grid generation. Figures 12 and 13 show a “C”
mesh and an “O” mesh topology for an airfoil where the mesh has been generated using a
variant of Eiseman’s method [40].The terminology “C” comes from the wrap around nature
of the grid. In the case shown grid lines are clustered at the leading and trailing edge, near
the body in the near normal direction and on the upper surface to capture an expected
shock.
One of the major deficiencies in computational fluid dynamics today lies in the area of
surface definition and grid generation. While there are a wide variety of generation methods,
there is no efficient and accurate means to assess the usefulness of a particular grid. The
obvious first checks such as not having grid lines cross, no discontinuous changes in grid
spacing and other cosmetic qualities can be checked. But, aspects such as high skewness,
curvature smoothness, and other intrinsic properties are hard to assess. What is needed at
54 “Solution Methods In Computational Fluid Dynamics”
this point is a set of well defined qualitative and quantitative checks for grid systems which
will allow us to distinguish between a “bad” grid and a “good” grid. A systematic study of
this form is lacking and hopefully will be pursued in the near future.
11. Examples and Application in 2-D
11.1. Code Validation
Once a computational code has been written and debugged, the author is faced with the
difficult challenge of assessing the accuracy, efficiency and robustness of the piece of work.
Each code is obviously tailored toward a class of problems and at least initially one should
restrict attention to that class. A series of test cases should be evaluated and then detailed
goals should be attacked.
In the case of ARC2D and ARC3D a number of sample cases have been computed and
details of these can be found in the literature, see for instance [2], [3], [43]– [50], and
numerious others. Here I shall briefly discuss some of the more exotic cases with the goal of
demonstrating the versatility and breath of applications. I refer the reader to the original
papers (where appropriate) for extensive details.
11.2. Inviscid Airfoils
The code ARC2D has the option of computing the inviscid equations (the Euler equa-
tions). The basic version of the code is written specifically for airfoil computations. The
T.H. Pulliam, NASA Ames 55
particular set of boundary conditions used now though are directed toward the solution
of flow past general airfoil shapes. The code has been applied to a wide variety of airfoil
shapes, flow conditions, and other geometries. We have validated the code against other
computational methods, [2], [50]. To demonstrate the accuracy and efficiency we have cho-
sen two test cases, a NACA0012 airfoil at M∞ = 0.8 , α = 1.25◦ on a coarse grid (192
by 33 points) and a fine grid (248 by 49 points). For comparison purposes we use results
from Jameson’s multigrid Euler code FLO52R [51]. FLO52R is an Euler code using a mul-
tistage Runge-Kutta like algorithm with a multigrid scheme to accelerate convergence. The
code employs enthalpy damping, residual averaging and an artificial dissipation model of
the same form as presented in Section 6.3.. In fact the boundary conditions and artificial
dissipation model used in ARC2D were modified to be the same as in FLO52R so that
quantitative as well as qualitative comparisons could be made. The two codes were run on
the same machine, the CRAY XMP at NASA Ames, on the same meshes and at the same
flow conditions.
The first case is the NACA0012 airfoil at M∞ = 0.8 and α = 1.25◦ . The grid used is an
“O” mesh topology with 192 points on the airfoil surface (running from the lower trailing
edge around the nose to the upper trailing edge) and 33 points in the normal direction. The
grid which is clustered at the leading and trailing edges, near the expected shock locations
on the upper and lower surfaces and in the normal direction is shown in Figure 14
Results from this case using ARC2D are shown in Figure 15. We show here coefficient of
pressure, Mach contours, pressure contours and contours of entropy. In Figure 16 we show
similar results for FLO52R. Computed lift for ARC2D is CL = 0.33957 and for FLO52R
CL = 0.32408. The comparison between the two codes is quite good, despite the differences
in spatial discretization.
We have established a number of accuracy checks and convergence criteria for compar-
ison purposes. In terms of accuracy we recommend comparison of pressure coefficients, lift
and other flow quantities. It is also important to establish the accuracy of certain flow re-
gions. The stagnation region near the nose of the airfoil is particularly susceptible to errors
due to poor boundary conditions, resolution, or physical assumptions. The best measure of
this error is the entropy field. For inviscid flow there should be no generation of entropy at
the leading edge of an airfoil in the absence of a leading edge shock. Examination of the
entropy at the leading edge for the above case shows, see Figure 17, that both codes give
rise to some error at the leading edge, although the magnitude is rather small.
A number of convergence criteria have been chosen to assess the efficiency and conver-
gence rates of the codes. We have chosen to use computer times as our measure of relative
speed. Since the two codes are run on the same machines and with the same meshes this is
an adequate measure. Other measures such as operation count, work or iteration are usually
programming dependent or susceptible to misinterpretation. The convergence criteria used
here are:
56 “Solution Methods In Computational Fluid Dynamics”
The residual is the l2 norm of the explicit or right hand side of Equation 8.1. We use
just the component from the continuity equation, the other components behave similarly.
For the above case on the 192 by 33 mesh the computer times for the convergence criteria
are given in Table 1.
As can be seen for this case FLO52R is up to twice as fast as ARC2D for some criteria.
In either event these are fairly good convergence times. In general, these numbers carry
over fairly consistently for a wide variety of airfoils and flow conditions for similar meshes.
A more stringent test is obtained with a finer grid and more grid points. A grid of 248
by 49 points is employed as the second study. The mesh is refined more at the nose, tail
and near the shocks. Also to reduce the entropy errors at the nose the grid is clustered
more tightly in the normal direction by reducing the minimum normal spacing by a factor
of 2. The mesh is shown in Figure 18.
Computational results for ARC2D and FLO52R are shown in Figures 19 and 20. In
this case the shocks are sharper and entropy errors at the leading edge are eliminated.
Convergence data for this case is contained in Table 2. In Figure 21, we show convergence
history vrs iteration for the two ARC2D results. All the results obtained with ARC2D were
done using the fully implicit pentadiagonal algorithm. As mentioned above, numerous other
cases and airfoils have been computed and perform similarly.
11.3. Viscous Airfoils
The code ARC2D has been applied to a wide variety of viscous computations for air-
foils [2], cascades [44], inlets [45], airfoils with a spoiler [46], circulation controlled air-
foils [47], and others. It has been used in an unsteady mode (see the next section) and
T.H. Pulliam, NASA Ames 59
for steady viscous computations. The algorithm as presented above performs very well for
viscous cases. It is convergent, fast and accurate. Two example cases are presented below.
The cases are taken from the suggested problems of the 1981 Stanford Olympics [52], an
RAE2822 airfoil at M∞ = 0.676, α = 1.93◦ , Re = 5.7 · 106 and M∞ = 0.73, α = 2.79 and
Re = 6.5 · 106 .
Results obtained from ARC2D for the first case are shown in Figure 22. The grid used
is a 248 by 51 point “O” mesh. The turbulence model was used and transition was fixed
at 11% chord. Experimental data due to Cook et. al. [53] is used for comparison. We see a
good comparison with experiment for pressure coefficient, and boundary layer properties.
The computed lift, drag and moment are compared with other computations and the
experiment in Table 3. Due to the uncertainty of the angle of attack correction all computors
matched lift. We show here our computation for both the experimentally corrected angle
of attack and the values when lift is matched. Also shown are results from computors at
the Stanford Olympics and some results of Mehta [54]. For the present computations at
the two angles of attack the pressure and boundary layer quantities are almost identical.
The changes in lift and drag are noted. The overall comparison with experiment and other
computations is quite good.
Results obtained for the second case are shown in Figure 23. The grid used is a 248 by
51 point “O” mesh. The turbulence model was used and transition was fixed at 3% chord.
62 “Solution Methods In Computational Fluid Dynamics”
Figure 22: V̇iscous Results for RAE2822 Airfoil at Re = 5.7 × 106 , M∞ = 0.676, α = 1.93◦ .
α CL CDP CDf CD CM
Experiment 2.40 0.566 0.0085 –0.082
Corrected Exp. 1.93 0.566 0.0085 –0.082
Mehta (1983) 1.80 0.566 0.0033 0.0061 0.0094 –0.087
Melnik (1981) 1.84 0.566 0.0027 0.0060 0.0087 –0.082
Le Balleur (1981) 1.93 0.566 0.0036 0.0056 0.0092 –0.080
Present 1.93 0.576 0.0034 0.0055 0.0089 –0.081
Present Cor. α 1.87 0.566 0.0034 0.0055 0.0089 –0.081
Figure 23: V̇iscous Results for RAE2822 Airfoil at Re = 6.5 × 106 , M∞ = 0.73, α = 2.79◦ .
We again see a good comparison with experiment for pressure coefficient, and boundary
layer properties.
The computed lift, drag and moment are compared with other computations and the
experiment in Table 4. Results from computors at the Stanford Olympics and some results
of Mehta [54] are shown. The overall comparison with experiment and other computations
is again quite good. The shock location on the upper surface compares well. In the present
computations a small region of separated flow occurs at the base of the shock and near the
trailing edge on the upper surface.
α CL CDP CDf CD CM
Experiment 3.19 0.803 0.0168 –0.099
Corrected Exp. 2.79 0.803 0.0168 –0.099
Mehta (1983) 2.79 0.793 0.0118 0.0059 0.0177 –0.094
Melnik (1981) 2.54 0.803 0.0100 0.0057 0.0157 –0.094
Le Balleur (1981) 2.79 0.787 0.0111 0.0055 0.0166 –0.086
Present 2.79 0.824 0.0128 0.0050 0.0178 –0.092
Present Cor. α 2.67 0.803 0.0113 0.0051 0.0164 –0.092
Convergence history vrs iteration for these cases are shown in Figure 24. Table 5 shows
the computed convergence criteria for these cases. The convergence for these cases is quite
good.
where below the boundary the aileron remained stationary. Above the buzz boundary the
shock system on the airfoil moves onto the aileron and excites the buzz motion of the aileron.
An unsteady harmonic motion occurs with the upper and lower shocks running across the
hinge onto and off of the aileron.
Steger and Bailey simulated this flow using the thin layer Navier Stokes equations for
the conditions shown by the symbols in Figure 25b. Figure 25c shows a case below the
buzz boundary. In this case they gave the aileron an initial deflection of 4◦ and integrated
forward in time. As seen the aileron motion damps to the neutral position of 0◦ deflection.
Above the buzz boundary even an aileron deflection of 0◦ is excited to the unsteady motion.
In Figure 25c the results are compared with the measure deflection angles. In Figure 25c
the computed buzz boundary compares quite well with the measured boundary.
Figure 26: Ėntropy Contours at Leading Edge Before and After Improved Accuracy.
with viscous experimental data was carried out by Anderson, Thomas and Rumsey [57]
in which good quantitative comparison were obtained. We shall briefly discuss here the
computations of Barton and Pulliam.
Barton and Pulliam presented two types of inviscid flow separation. In the first case at
flow conditions, M∞ = 0.25 α = 15◦ a shock free solution with inviscid flow separation was
obtained and the cause of the separation was traced to numerical error. At the high angle
of attack, inaccurate boundary conditions and resolution at the leading edge produced
vorticity (entropy gradients) which was convected downstream, resulting in an unsteady
separation on the upper surface near the trailing edge. By refining the grid and improving
the boundary conditions a steady error free solution was obtained. Figure 26 shows the
entropy contours at the leading edge before and after the improvement. Figure 27 shows a
comparison with full potential results using TAIR [58] and shows that good inviscid results
are obtained.
At a higher Mach number M∞ = 0.4 and the same angle of attack α = 15◦ a shock
forms at the leading edge, see Figure 28. In this case the shock is the source of vorticity
which is then convected downstream and forms an unsteady separation. Grid refinement
and the improved boundary conditions were used producing an error free leading edge, but
the unsteady motion was unaffected. The vorticity (entropy) generation is a result of the
strong normal shock strength gradient and high curvature of the leading edge.
The unsteady motion of the solution is depicted in Figure 29, 30, 31 , which shows the
time history of the pressure coefficient, stream function contours and entropy fields over a
complete cycle. A description of the evolution of this case is as follows. As the flow develops,
a strong shock is generated at the leading edge. Entropy, vorticity, and pressure loss are
created at the shock near the leading edge, and convected downstream along the body. A
small separation region appears at the trailing edge, which grows along the body towards
the leading edge. At some point the recirculation region is captured by the oncoming flow
T.H. Pulliam, NASA Ames 67
and is swept off the airfoil by convection. As the recirculation region passes the trailing
edge, another pocket of recirculation forms at the trailing edge, rotating in the opposite
direction. This counter-rotation is caused by the flow off the lower surface, whose direction
is opposite to that of the original region of recirculation. The shock then collapses, and
begins to slowly grow in strength as the pattern repeats itself. This flow pattern has a well
defined period and amplitude and has been reproduced in other computations with similar
grids and different values of time step and artificial viscosity.
As a final case Barton and Pulliam computed a viscous calculation at similar conditions,
a Mach number M∞ = 0.301 and α = 13.5◦ . The Reynolds number used was Re = 3.91×106
and the calculation was performed using the algebraic eddy-viscosity turbulence model. In
this case experimental data due to McCroskey [59] was available. For an inviscid simulation
unsteady results similar to the above M∞ = 0.4 case were obtained, but for the viscous
computation, a steady results occurred, which compared well with the experimental data.
The steady viscous comparison is shown in Figure 32.
Assuming the validity of the inviscid oscillation for this case, it was concluded that
the Euler solution is not a good approximation to the Navier-Stokes solution, under these
conditions. I refer the reader to the full paper for more details.
12. Three - Dimensional Algorithm
The 3 - D form of the implicit algorithm follows the same development as the 2 -
T.H. Pulliam, NASA Ames 69
Figure 32: V̇iscous Solution Compared with Experimental Data of McCroskey, et. al.
D algorithm. The curvilinear transformations are carried out in the same fashion. The
standard and diagonal algorithm take the same format. We also employ the thin layer
approximation. Boundary conditions are similar. The equations, algorithm, and other
details can be found in Pulliam and Steger [3]. We shall briefly outline the important
aspects and point out the pertinent differences from the 2 - D development.
12.1. Flow Equations
The full three dimensional Navier-Stokes equations in strong conservation law form
are reduced to the thin layer form under the same restrictions and assumptions as in two
dimensions. The equations in generalized curvilinear coordinates are
b + ∂ξ E
∂τ Q b + ∂η Fb + ∂ζ G
b = Re−1 ∂ζ Sb (12.1)
where now
ρ ρU
ρu ρuU + ξx p
b = −1 b −1
Q J ρv , E=J ρvU + ξy p ,
ρw ρwU + ξz p
e U (e + p) − ξt p
T.H. Pulliam, NASA Ames 71
ρV ρW
ρuV + ηx p ρuW + ζx p
−1 −1
Fb = J ρvV + ηy p , b=J
G ρvW + ζy p (12.2)
ρwV + ηz p ρwW + ζz p
V (e + p) − ηt p W (e + p) − ζt p
with
U = ξt + ξx u + ξy v + ξz w,
V = ηt + ηx u + ηy v + ηz w
W = ζt + ζx u + ζy v + ζz w (12.3)
with
0
µm1 uζ + (µ/3)m2 ζx
b −1
S=J µm1 vζ + (µ/3)m2 ζy (12.4)
µm1 wζ + (µ/3)m2 ζz
µm1 m3 + (µ/3)m2 (ζx u + ζy v + ζz w)
ξx = J(yη zζ − yζ zη ), ηx = J(zξ yζ − yξ zζ )
ξy = J(zη xζ − zζ xη ), ηy = J(xξ zζ − zξ xζ )
ξz = J(xη yζ − yη xζ ), ηz = J(yξ xζ − xξ yζ )
ζx = J(yξ zη − zξ yη ), ξt = −xτ ξx − yτ ξy − zτ ξz
ζy = J(zξ xη − xξ zη ), ηt = −xτ ηx − yτ ηy − zτ ηz
ζz = J(xξ yη − yξ xη ), ζt = −xτ ζx − yτ ζy − zτ ζz (12.6)
with
J −1 = xξ yη zζ + xζ yξ zη + xη yζ zξ − xξ yζ zη − xη yξ zζ − xζ yη zξ (12.7)
The implicit approximate factorization algorithm applied to the three dimensional equa-
tions is
h ih i h i
I + hδξ Abn bn
I + hδη B I + hδζ Cb n − hRe−1 δζ M
cn ∆Q
bn =
³ ´
− b n + δη Fb n + δζ G
h δξ E b n − Re−1 δζ Sbn (12.8)
with Nb = T −1 Tη and Pb = T −1 Tζ . Just as in two dimensions we use the explicit and implicit
ξ η
nonlinear artificial dissipation terms.
It is interesting (and somewhat disturbing) to note that a linear constant coefficient
Fourier analysis (periodic BC) for the three dimensional model wave equation shows uncon-
ditional instability for the three dimensional factored algorithm. This is due to the cross
term errors. In contrast to the case of two dimensions where the cross term errors just
affect the rapid convergence capability (at large time steps) of the algorithm. In three di-
mensions we also have a weak instability due to the cross terms. Linear analysis shows that
the instability is a weak one where the amplification factor is very close but greater than
one. It can be shown though that a small amount of added artificial dissipation moves the
amplification factor below one and therefore we have conditional stability. Practical model
equation analysis using nonperiodic boundary conditions also shows a stability range. In
actual practice on three dimensional nonlinear problems we have never encountered a case
where we could attribute an instability to this problem area. In fact, numerical experiments
show that if anything the three dimensional algorithm seems to be more stable and con-
vergent for a given problem than in two dimensions. Numerous cases have been calculated
where the three dimensional algorithm converges and residuals go to machine zero.
12.3. Boundary Conditions and Geometry
Physical boundaries are again mapped to computational boundaries. Explicit numer-
ical and physical conditions can be applied as necessary. The actual conditions used are
the straightforward extensions of the methods outlined in Section 9.. One aspect of the
three dimensional application which is more complicated then in two dimensions is the de-
velopment of grid topology and mesh systems. Surface definitions and the computational
mapping are complicated by the many surfaces involved, which include coordinate singular-
ities (unavoidable when mapping a closed 3-D body) and the lack of an adequate number of
grid points since 3-D is a bigger strain on computer storage limitations. This will definitely
T.H. Pulliam, NASA Ames 73
drive the computational community towards zonal or patch methods, i.e. the large 3-D
problem is broken into multiple sections and each section is handled separately. The inter-
action between zones can be explicit, see Rai [60] or implicit, Hessenius and Pulliam [61].
Also grid refinement techniques need more development, see Berger [62] and Nakahashi and
Deiwert [63] for examples of these concepts.
Geometry seems to be one of the biggest stumbling blocks in three dimensions. Another
aspect of 3-D which causes problems is the question of the metric invariants. If central
differencing is used to compute the metrics and for the flux derivatives then for 3-D the
metric invariants are not satisfied. This was pointed out in the original paper by Pulliam
and Steger [3]. By modifying the computation of the metrics we can satisfy the invariants.
This is done by averaging the central differences of the grid values, (xξ , xη , etc. ) to produce
metrics which are similar to terms which would be computed by a finite volume method.
For instance, ξx would be computed as
where δ is the standard central difference operator and µ is an average operator, for instance
µη xk = (xk+1 +xk−1 )/2. If all the metric terms are calculated is this manner then the metric
invariants are satisfied.
12.4. Code Structure and Vectorization
With the advent of vectorized computer architecture one cannot just program in a
linear fashion without regard to code structure and expect to produce efficient code. Two
major decisions face the programmer when writing code for a vector machine, for instance
CRAY type architecture. First you must identify the vector length or construct. Which
indices can be considered vectors, what are the vector lengths (some machines require long
vectors) and how is a vector loop implemented. Secondly, how do we manage the large data
bases which now can be processed because of the efficiency and speed of vector machines.
Many of the near future computers will have either large in-core memory or high speed
out-of-core storage. These large amounts of data will have to be managed efficiently. In
three dimensions for instance a data base of 1 million grid points and 32 variables will be
common place requiring us to manage 32 millions words. To efficiently handle this data base,
management systems such as plane slices or pencil concepts, see Lomax and Pulliam [18],
need to be developed and refined. The proper way to handle this is to divide memory into
two parts, the operating part of memory and the storage part. Identifiable blocks of data
are brought into the working area, processed and then moved back to storage. There are
a number of advantages to such a system. One is that large blocks of data can be moved
more efficiently especially for out-of-core storage devices but even for in-core storage. The
dimensions of the blocks define the vector lengths. For SIMD (single instruction - multiple
data) or MIMD (multiple instruction - multiple data) architecture the blocks define multiple
data strings to be operated on. In general, as machines become larger and more powerful
we will have to take more care in the development of a well structure efficient code.
74 “Solution Methods In Computational Fluid Dynamics”
Computed surface oil flow (particle paths restricted to the body) and surface pressure for
α = 6◦ show the type of results presented. The results reported by Deiwert [67] compared
very well with the experimental data of Shrewsbury [69].
The boattail study was undertaken as a first step toward the simulation of boattail
exhaust plumes which has been carried out in a preliminary stage by Deiwert [68]. In
this simulation the boattail sting is eliminated and a conical exhaust jet is added at the
base region. Figures 37 and 38 show comparisons of density contours and streamlines
with Schlieren photographs from Agrell and White [70]. In Figure 37 the pressure ratio for
the jet was 3 and we see an expanded exhaust plume and a complicated shock shear flow
pattern ( depicted by the bold lines). The qualitative comparison with the photograph is
quite remarkable. At a higher exhaust ratio, Figure 38 the jet is tighter and the shock shear
surface pattern more complicated. Again the qualitative comparison with the photograph is
quite good. I refer the reader to the original papers and one by Nakahashi and Deiwert [63]
for a more detailed analysis of these flowfields.
13. Summary
In summary, the development of some computational algorithms in two- and three-
dimensions have been presented. Details of two computational codes, ARC2D and ARC3D
have been presented. The basic algorithm used is the Beam and Warming implicit approxi-
mate factorization scheme or variants of that scheme such as the diagonalization. The codes
employ improvements to enhance accuracy, (grid refinement, better boundary conditions,
more versatile artificial dissipation model) and efficiency (diagonal algorithm, implicit treat-
ment of artificial dissipation terms, variable time steps). Results for a wide variety of cases
substantiate the accuracy and efficiency claims.
Future work is required to address improvement of boundary conditions, examining
T.H. Pulliam, NASA Ames 77
stability questions, eliminating cross term errors and more. We are also interested in devel-
oping new grid generation concepts both in terms of generation and grid quality. In three
dimensions we see the area of zonal concepts as the newest horizon and envision substantial
gains in solution capability as a result.
T.H. Pulliam, NASA Ames 81
References
[2] Steger, J. L. Implicit Finite Difference Simulation of Flow About Arbitrary Geome-
tries with Application to Airfoils, AIAA Paper 77-665, 1977
[5] Rizzi, A. and Eriksson, L. Explicit Multistage Finite Volume Procedure to Solve the
Euler Equations for Transonic Flow , Lecture Series on Computational Fluid Dynamics,
von Kárm án Institute, Rhode-St-Genese, Belgium 1983
[6] Jameson, A., Schmidt, W. and Turkel, E. Numerical Solutions of the Euler
Equations by Finite Volume Methods Using Runge- Kutta Time- Stepping Schemes,
AIAA paper 81-1259 AIAA 14th Fluid and Plasma Dynamics Conference, Palo Alto
1981
[7] Yee, H. C., and Harten, A. Implicit TVD Schemes for Hyperbolic Conservation
Laws in Curvilinear Coordinates, AIAA-85-1513 AIAA 7th Computational Fluid Dy-
namics Conference, Cincinatti, Oh.,1985
[8] Chakravarthy, S., and Osher, S. A New Class of High Accuracy TVD Schemes for
Hyperbolic Conservation Laws, AIAA-85-0363 AIAA 23rd Aerospace Sciences Meeting,
Reno, Nevada.,1985
[9] Anderson, W. K., Thomas, J. L., and Van Leer, B. A Comparison of Finite
Volume Flux Vector Splittings for The Euler Equations, AIAA-85-0122, AIAA 23rd
Aerospace Sciences Meeting, Reno,NV, 1985
[12] Korn, G. and Korn, T., Mathemaical Handbook for Scientists and Engineers, Mc
Graw Hill Book Co, New York, 1961
[13] Hindman, R. Geometrically Induced Errors and Their Relationship to the Form of
the Governing Equations and the Treatment of Generalized Mappings, AIAA Paper
81-1008 1981
82 “Solution Methods In Computational Fluid Dynamics”
[14] Flores, J., Holst, T., Kwak, D., and Batiste, D. A New Consistent Spatial
Differencing Scheme for the Transonic Full Potential Equation , AIAA Paper 83-073
1983
[15] Baldwin, B. S. and Lomax, H. Thin Layer Approximation and Algebraic Model for
Separated Turbulent Flows, , AIAA Paper No. 78-257 1978
[16] Spalart, P.R. and S.R. Allmaras A One Equation Turbulence Model for Aerody-
namic Flows AIAA Paper No. 92–0439, 1992.
[17] Baldwin, B. and Barth, T.J. A One-Equation Turbulence Transport Model for
High Reynolds Number Wall-Bounded Flows TM-102847, 1990.
[18] Lomax H. and Pulliam T. H. A Fully Implicit Factored Code for Computing Three
Dimensional Flows on the ILLIAC IV, Parallel Computations, G. Rodrigue, Ed., Aca-
demic Press, New York 1982 pp. 217-250
[19] Barth, T. J. and Steger, J. L. A Fast Efficient Implicit Scheme for the Gasdy-
namics Equations Using A Matrix Reduction Technique, AIAA-85-0439, AIAA 23rd
Aerospace Sciences Meeting, Reno,NV, 1985
[20] Warming, R. F., and Beam, R. M. On the Construction and Application of Implicit
Factored Schemes for Conservation Laws, SIAM-AMS Proceedings Vol. 11 1978 pp. 85–
129
[21] Jespersen, D. C. Recent Developments in Multigrid Methods for the Steady Euler
Equations, Lecture Notes for Lecture Series on Computational Fluid Dynamics 1984,
von Kárm án Institute, Rhode-St-Genese, Belgium
[22] Pulliam, T. H. and Chaussee, D. S. A Diagonal Form of an Implicit Approximate
Factorization Algorithm, JCP Vol. 39 1981 page 347
[23] Warming, R. F., Beam, R., and Hyett, B. J. Diagonalization and Simultaneous
Symmetrization of the Gas-Dynamic Matrices, Math Comp, Vol. 29 1975 page 1037
[24] Steger, J. Coefficient Matrices for Implicit Finite Difference Solution of the Invis-
cid Fluid Conservation Law Equations, Computer Methods In Applied Mechanics and
Engineering Vol.13 pp. 175-188, 1978
[25] Beam, R. and Bailey, H Private Communication
[26] Steger J. L. and Warming, R. F. Flux Vector Splitting of the Inviscid Gas Dynamic
Equations with Applications to Finite Difference Methods, J. Comp. Phys. Vol. 40 pp.
263-293 1981
[27] Roe, P. L. The Use of the Riemann Problem in Finite Difference Schemes, Presented
at the Seventh International Conference on Numerical Methods in Fluid Dynamics,
Stanford, CA 1980
T.H. Pulliam, NASA Ames 83
[28] van Leer, B. Flux-Vector Splitting for the Euler Equations Eighth International Con-
ference on Numerical Methods in Fluid Dynamics, Springer Lecture Notes in Physics
no. 170, ed. E. Krause 1983
[29] Osher S. and Chakravarthy, S. Upwind Schemes and Boundary Conditions with
Applications to Euler Equations in General Geometries, J. Comp. Phys. Vol. 50 pp.
447–481 1983
[30] Harten, A. A High Resolution Scheme for the Computation of Weak Solutions of
Hyperbolic Conservation Laws, J. Comp. Phys. Vol. 49 1983 pp. 357-393
[31] Pulliam, T. H. Artificial Dissipation Models For the Euler Equations , AIAA 85-0438
AIAA 23rd Aerospace Sciences Meeting, Reno, Nevada., 1985
[32] Abarbanel, S., Dwoyer, D. and Gottlieb, D. Improving the Convergence Rate
of Parabolic ADI Methods, ICASE Report 82-28 1982
[33] Shang, J. and Hankey, W. , Numerical Solution of the Compressible Navier- Stokes
Equations for a Three-Dimensional Corner, AIAA paper 77-169 1977
[35] Srinivasan, G., Chyu, W. , and Steger, J. Computation of Simple Three- Di-
mensional Wing- Vortex Interaction in Transonic Flow , AIAA paper 81-1206, 1981
[36] Coakley, T. J. Numerical Method for Gas Dynamics Combining Characteristic and
Conservation Concepts , AIAA Paper 81-1257 1981
[38] Chakravarthy, S. Euler Equations – Implicit Schemes and Implicit Boundary Con-
ditions , AIAA Paper 82-0228, Orlando, Fla. 1982
[39] Salas, M. , Jameson, A., and Melnik, R. A Comparative Study of the Nonuni-
queness Problem of the Potential Equation , AIAA paper 83-1888, AIAA 6th Compu-
tational Fluid Dynamics Conference 1983
[42] Thompson, J. Numerical Grid Generation, J. Thompson Ed. North Holland, 1982
84 “Solution Methods In Computational Fluid Dynamics”
[57] Anderson, W., Thomas, J. , and Rumsey, C. Application of Thin Layer Navier
Stokes Equations Near Maximum Lift , AIAA 84–0049, AIAA 22nd Aerospace science
Meeting, Reno, Nevada 1984
[58] Holst, T. L. Implicit Algorithm for the Conservative Transonic Full-Potential Equa-
tion Using an Arbitrary Mesh , AIAA J. 17, 1979 pp. 1038–1045
[59] McCroskey, W., McAlister, K., Carr, L., and Pucci, S. An Experimental
Study of Dynamic Stall on Advanced Airfoil Sections, Vol. 1, Summary of Experiment
NASA TM 84245 1982
[60] Rai, M. M. A Conservative Treatment of Zonal Boundaries for Euler Equations Cal-
culations, AIAA paper 84–0164, AIAA 22nd Aerospace Science Meeting, Reno, NV
1984
[62] Berger,M. Adaptive Mesh Refinement for Hyperbolic Partial Differential Equations,
Ph. D. Thesis, Department of Computer Science, Stanford University 1982.
[63] Nakahashi, K. and Deiwert, G. S. A Practical Adaptive Grid Method for Com-
plex Fluid Flow Problems, Submitted to Ninth International Conference on Numerical
Methods in Fluid Dynamics, Saclay, France 1984
Appendix
The flux Jacobian matrices of Equation 5.4 have real eigenvalues and a complete set of
eigenvectors. The similarity transforms are
Ab = Tξ Λξ Tξ−1 b = Tη Λη T −1
and B η (A1)
where
U
U
q
Λξ =
U + a ξx2 + ξy2
(A2)
q
U − a ξx2 + ξy2
V
V
q
Λη =
V + a ηx2 + ηy2
(A3)
q
V − a ηx2 + ηy2
with
1 0 α α
u κy ρ
e α(u + κx a)
e α(u − κ e x a)
Tκ = v −κex ρ (A4)
hα(v +κ e y a) i hα(v −κ e y a) i
φ2 φ2 +a2 e φ2 +a2
(γ−1) ρ(κ
ey u − κe x v) α (γ−1) + aθ α (γ−1) − aθe
(1 − φ2 /a2 ) (γ − 1)u/a2
−(κey u − κ
e x v)/ρ κ
e y /ρ
Tκ−1 =
β(φ2 − aθ) e β[κ
e x a − (γ − 1)u]
2
β(φ + aθ) e −β[κ
e x a + (γ − 1)u]
(γ − 1)v/a2 −(γ − 1)/a2
−κe x /ρ 0
(A5)
β[κ
e y a − (γ − 1)v] β(γ − 1)
−β[κe y a + (γ − 1)v] β(γ − 1)
√ √ q
and α = ρ/( 2a), β = 1/( 2ρa), θe = κ
ex u + κ e x = κx / κ2x + κ2y .
e y v, and, forexample, κ
b = T −1 Tη ,
N b −1 = T −1 Tξ
N (A6)
ξ η
88 “Solution Methods In Computational Fluid Dynamics”
where
1 0 0 0
0 m1 −µm2 µm2
b =
N (A7)
0 µm2 µ2 (1 + m1 ) µ2 (1 − m1 )
0 −µm2 µ2 (1 − m1 ) µ2 (1 + m1 )
and
1 0 0 0
0 m1 µm2 −µm2
b −1 =
N
(A8)
0 −µm2 µ2 (1 + m1 ) µ2 (1 − m1 )
0 µm2 µ2 (1 − m1 ) µ2 (1 + m1 )
³ ´ ³ ´ √
with m1 = ξex ηex + ξey ηey , m2 = ξex ηey − ξey ηex and µ = 1/ 2.
b is only a function of the metrics and not the
It is interesting to note that the matrix N
flow variables.
In three dimensions the Jacobian matrices Ab or B
b or C
b=
κt κx
φ2
κx − uθ κt + θ − κx (γ − 2)u
κy φ2 − vθ κx v − κy (γ − 1)u
κ¡z φ2 − wθ ¢ κ
¡ x
w − κz¢(γ − 1)u
−θ γe/ρ − 2φ 2 κx γe/ρ − φ2 − (γ − 1)uθ
(A9)
κy κz 0
κy u − κx (γ − 1)v κz u − κx (γ − 1)w κx (γ − 1)
κt + θ − κy (γ − 2)v κz v − κy (γ − 1)w κy (γ − 1)
κ w − κz (γ
¡ y ¢
− 1)v κ
¡ t
+ θ − κz¢(γ − 2)w κz (γ − 1)
κy γeρ−1 − φ2 − (γ − 1)vθ κz γeρ−1 − φ2 − (γ − 1)wθ κt + γθ
where
θ = κx u + κy v + κz w
2 2 2
φ2 = (γ − 1)( u +v2 +w ) (A10)
b B,
with κ = ξ, or η or ζ for A, b or, C
b respectively.
where
The eigensystem decomposition of the three dimensional Jacobians have the form Ab =
b = Tη Λη T −1 , and C
Tξ Λξ Tξ−1 , B b = Tζ Λζ T −1 . The eigenvalues are
η ζ
λ1 = λ2 = λ3 = κt + κx u + κy v + κz w
λ4 = λ1 +qκa λ5 = λ1 − κa
κ = κ2x + κ2y + κ2z (A13)
κex κey
κ
ex u κ
ey u − κez ρ
Tκ = κ
ex v + κez ρ κ
ey v
κ
ex w − κ ey ρ κ
ey w + κ ex ρ
£ ¤ £ ¤
κe x φ2 /(γ − 1) + ρ(κ ez v − κ
e y w) e y φ2 /(γ − 1) + ρ(κ
κ ex w − κ
e z u)
(A14)
κ
ez α α
κez u + κey ρ α(u + κe x a) α(u − κe x a)
κez v − κex ρ α(v + κe y a) α(v − κe y a)
κ
ez w h α(w + κ
e z a) i h α(w − κ
e z a) i
£ ¤
κe z φ2 /(γ − 1) + ρ(κ ey u − κ
e x v) e
α (φ2 + a2 )/(γ − 1) + θa e
α (φ2 + a2 )/(γ − 1) − θa
where
ρ κx κy κz θ
α= √ , κ
ex = , κ
ey = , κ
ez = , θe = (A15)
2a κ κ κ κ
90 “Solution Methods In Computational Fluid Dynamics”
(A16)
e x (γ − 1)v/a2 + κ
κ e z /ρ e x (γ − 1)w/a2 − κ
κ e y /ρ e x (γ − 1)/a2
−κ
κe y (γ − 1)v/a2 κ 2
e y (γ − 1)w/a + κ e x /ρ e y (γ − 1)/a2
−κ
e z (γ − 1)v/a2 − κ
κ e x /ρ κe z (γ − 1)w/a2 e z (γ − 1)/a2
−κ
−β[(γ − 1)v − κ e y a] −β[(γ − 1)w − κ e z a] β(γ − 1)
−β[(γ − 1)v + κ e y a] −β[(γ − 1)w + κ e z a] β(γ − 1)
where
1
β=√ (A17)
2ρa
Thomas H. Pulliam
Mail Stop T27B-1, NASA Ames Research Center
Moffett Field, California, 94035
Phone : 650 - 604 - 6417
email : tpulliam@mail.arc.nasa.gov
January, 1986