Orthogonal Polynomials (In Matlab) : Walter Gautschi

Orthogonal Polynomials (in Matlab)
Walter Gautschi
Abstract. A suite of Matlab programs has been developed as part of the book
“Orthogonal Polynomials: Computation and Approximation” expected to be pub-
lished in 2004. The package contains routines for generating orthogonal polyno-
mials as well as routines dealing with applications. In this paper, a brief review
is given of the first part of the package, dealing with procedures for generating
the three-term recurrence relation for orthogonal polynomials and more general
recurrence relations for Sobolev orthogonal polynomials. Moment-based methods
and discretization methods, and their implementation in Matlab, are among the
principal topics discussed.
Keywords: Orthogonal polynomials; recurrence relations; Matlab.
1. Introduction
The analytic theory of orthogonal polynomials is well documented in a number

of treatises; for classical orthogonal polynomials on the real line as well as on
the circle, see [25], for those on the real line also [24]. General orthogonal
polynomials are dealt with in [5] and more recently in [22], especially with
regard to nth-root asymptotics. The text [3] is rooted in continued fraction
theory and recurrence relations.
While the theory of orthogonal polynomials is well developed, the practice of

orthogonal polynomials — constructive, computational, and software aspects
— is still in an early stage of development. An effort in this direction is being
made by the author’s forthcoming book [13] and the accompanying package
OPQ: a Matlab Suite of Programs for Generating Orthogonal Polynomials and
Related Quadrature Rules, which can be found at the URL
http://www.cs.purdue.edu/archives/2002/wxg/codes.
The purpose of the work in [13] is twofold: (i) to present various procedures for
generating the coefficients of the recurrence relations satisfied by orthogonal
polynomials on the real line and by Sobolev orthogonal polynomials; and (ii) to
discuss selected applications of these recurrence relations, including numerical
quadrature, least squares and moment-preserving spline approximation, and
the summation of slowly convergent series. All is to be implemented in the
1
form of Matlab scripts. In the present article we wish to give a brief account
of the first part of [13]: the generation of recurrence coefficients for orthogonal
polynomials and related Matlab programs. All Matlab routines mentioned in
this paper, and many others, are downloadable individually from the above
Web site.
2. Orthogonal polynomials
We begin with some basic facts about orthogonal polynomials on the real line
and introduce appropriate notation as we go along. Suppose dλ is a positive
measure supported on an interval (or a set of disjoint intervals) on the real
line such that all moments µr = R tr dλ(t) exist and are finite. Then the inner
R
product Z
(p, q)dλ = p(t)q(t)dλ(t) (1)
R
is well defined for any polynomials p, q and gives rise to a unique system
πr (t) = tr + · · · , r = 0, 1, 2, . . . , of monic orthogonal polynomials

=

0, k 6= `,
πk ( · ) = πk ( · ; dλ) : (πk , π` )dλ (2)
>

0, k = `.
It is well known that they satisfy a three-term recurrence relation
πk+1 (t) = (t − αk )πk (t) − βk πk−1 (t), k = 0, 1, 2, . . . ,

(3)
π−1 (t) = 0, π0 (t) = 1,
where αk = αk (dλ) and βk = βk (dλ) are real resp. positive Rconstants which de-
pend on the measure dλ. For convenience, we define β0 = R dλ(t). Associated
with the recurrence relation (3) is the Jacobi matrix
√
α β 0
 
0 1
 √β α √β 
 1 1 2 
 √ .. 
J(dλ) = 

β2 α2 . , (4)
.. .. 
 
. . 


0
a symmetric tridiagonal matrix of infinite order. Its leading principal minor

matrix of order n will be denoted by
J n (dλ) = J (dλ)[1:n,1:n] . (5)
As already indicated in §1, the basic problem is this: for a given measure
dλ and for given integer n ≥ 1, generate the first n coefficients αk (dλ), k =
2
0, 1, 2, . . . , n − 1, and the first n coefficients βk (dλ), k = 0, 1, 2, . . . , n − 1, that
is, the Jacobi matrix J n (dλ) of order n and β0 .
2.1. Recurrence coefficients. Frequently, the measure dλ is absolutely continu-

ous, i.e., representable in the form
dλ(t) = w(t)dt, (6)
where w is a nonnegative function, called weight function, integrable on the

support of dλ and not identically zero. Among the best-known weight functions
are the classical weight functions, the more important of which are listed in
Table 2.1.
name w(t) supported on
α β
Jacobi (1 − t) (1 + t) , α > −1, β > −1 [−1, 1]
Laguerre tα e−t , α > −1 [0, ∞]
2
Hermite |t|2α e−t , 2α > −1 [−∞, ∞]
Table 2.1. Classical weight functions
For these, the recurrence coefficients are explicitly known. In Matlab, the first
N recurrence coefficients are always stored in an N × 2 array ab as shown in
Fig. 2.1.
α0 β0
α1 β1
.. ..
. .
αN −1 βN −1
Figure 2.1. The array ab of recurrence coefficients
The Matlab command to compute them has the syntax ab=r name(parameters),
where name identifies the weight function, and parameters is a list of parame-
ters including N. Thus, for example, in the case of the Jacobi weight function,
the Matlab command is
ab=r jacobi(N,a,b).
Here, a, b are the Jacobi parameters (denoted by α and β in Table 2.1). If

α = β, it suffices to write ab=r jacobi(N,a), and if α = β = 0, to write
ab=r jacobi(N).
Demo#1. The first ten recurrence coefficients for the Jacobi polynomials with
parameters α = − 21 , β = 23 .
The Matlab command, followed by the output, is shown in the box below.
3
>> ab=r jacobi(10,-.5,1.5)
ab =
6.666666666666666e-01 4.712388980384690e+00
1.333333333333333e-01 1.388888888888889e-01
5.714285714285714e-02 2.100000000000000e-01
3.174603174603174e-02 2.295918367346939e-01
2.020202020202020e-02 2.376543209876543e-01
1.398601398601399e-02 2.417355371900826e-01
1.025641025641026e-02 2.440828402366864e-01
7.843137254901961e-03 2.455555555555556e-01
6.191950464396285e-03 2.465397923875433e-01
5.012531328320802e-03 2.472299168975069e-01
Classical weight functions are not the only ones for which the recurrence co-
efficients are explicitly known. For example, the logistic weight function
e−t
w(t) = , t ∈ R,
(1 + e−t )2
of interest in statistics, has all coefficients αk = 0 (by symmetry) and β0 = 1,

βk = k 4 π 2 /(4k 2 − 1), k ≥ 1 ([3, Eq. (8.7) where λ = 0, x = t/π]). The
corresponding Matlab routine is r logistic.m. Other examples are measures
occurring in the diatomic linear chain model, which are supported on two
disjoint intervals; cf. [10].
Many nonclassical weight functions and measures, however, are such that their
recurrence relations are not explicitly known. In these cases, numerical tech-
niques must be used, some of which are to be described in the next four
subsections.
2.2. Modified Chebyshev algorithm. In principle, the desired recurrence coeffi-

cients can be computed from well-known formulae expressing them in terms
of Hankel-type determinants involving the moments µr of the given measure
dλ. The problems with this are: excessive complexity and, more seriously, ex-
treme numerical instability. To avoid these problems, one can attempt to use
modified moments
Z
mr = pr (t)dλ(t), r = 0, 1, 2, . . . , (7)
R
where pr are monic polynomials of degree r “close” in some sense to the desired
polynomials πr . In particular, they are assumed to also satisfy a three-term
recurrence relation
pk+1 (t) = (t − ak )pk (t) − bk pk−1 (t), k = 0, 1, 2, . . . ,

(8)
p−1 (t) = 0, p0 (t) = 1,
4
but this time with known coefficients ak ∈ R, bk ≥ 0. (We allow for zero
coefficients bk , since ak = bk = 0 yields the ordinary moments.) There is then
a unique map
R2n 7→ R2n : [mk ]k=0
2n−1 n−1
7→ [αk , βk ]k=0 (9)
that takes the first 2n modified moments into the desired n recurrence coef-
ficients αk and βk . An algorithm implementing this map has been developed
by Sack and Donovan [21], and in more definitive form, by Wheeler [26]. In
the case of ordinary moments (ak = bk = 0), it reduces to an algorithm alrady
developed (for discrete measures) by Chebyshev [2]. We called it, therefore,
the Modified Chebyshev Algorithm. It is implemented in the Matlab procedure
ab=chebyshev(N,mom,abm),
where N is the number n in (9), mom the 1 × 2N array of modified moments,

and abm the (2N − 1) × 2 array of the first 2N − 1 recurrence coefficients ak , bk
in (8). If abm is omitted from the list of input parameters, the routine assumes
abm=zeros(2*N-1,2), that is, ordinary moments.
In view of the highly ill-conditioned nature of the map (9) when mr = µr

are ordinary moments, the conditioning of the modified moment map is an
important question that has been studied already in [7], and more definitively
in [9]. There are examples where the map is entirely well conditioned, but also
others, especially when the measure dλ has unbounded support, in which the
map is almost as ill conditioned as for ordinary moments.
Demo#2. The weight function
w(t) = [(1 − ω 2 t2 )(1 − t2 )]−1/2 on [−1, 1], 0 ≤ ω < 1,
of the “elliptic orthogonal polynomials”.
Since the weight function reduces to the Chebyshev weight function when
ω = 0, it seems natural to use as modified moments those relative to the
monic Chebyshev polynomials,
Z 1 1 Z 1
m0 = w(t)dt, mk = Tk (t)w(t)dt, k ≥ 1.
−1 2k−1 −1
Their computation, though not trivial by any means, can be accomplished in

a very stable fashion [9, Example 3.3]. The first 2N of them are generated
in the Matlab routine mm ell.m. The following box shows the Matlab script
required to generate elliptic polynomials.
function ab=r elliptic(N,om2)

abm=r jacobi(2*N-1,-1/2);
mom=mm ell(N,om2);
ab=chebyshev(N,mom,abm);
5
The routine works well even for ω 2 quite close to 1, as is shown by the output
below (displayed only partially) for N=40, om2=.999.
ab =
0 9.682265121100620e+00
0 7.937821421385184e-01
0 1.198676724605757e-01
0 2.270401183698990e-01
0 2.410608787266061e-01
0 2.454285325203698e-01
··· ··················
0 2.499915376529289e-01
0 2.499924312667191e-01
0 2.499932210069769e-01
All coefficients are accurate to machine precision.
2.3. Discrete Stieltjes and Lanczos algorithm. Partly in preparation for the
next subsection, we now consider a discrete N-point measure
N
X
dλN (t) = wk δ(t − xk ), wk > 0, (10)
k=1
where δ is the Dirac delta function. Thus, the measure is supported on N

distinct points xk on the real axis, where it has positive jumps wk . The corre-
sponding inner product is a finite sum,
Z N
X
(p, q)N = p(t)q(t)dλN (t) = wk p(xk )q(xk ). (11)
R k=1
There are now only a finite number, N, of recurrence coefficients αk = αk (dλN ),

βk = βk (dλN ), which can be computed by either of two algorithms, one men-
tioned briefly by Stieltjes [23], and a more recent one based on ideas of Lanczos
[18].
The former combines Darboux’s formulae for the recurrence coefficients,


 (tπk , πk )N
 αk = , k = 0, 1, . . . , n − 1,



(πk , πk )N
(12)
 (πk , πk )N
 βk = , k = 1, 2, . . . , n − 1,



(πk−1 , πk−1 )N
with the recurrence relation (3). In (12), the πk are the (as yet unknown)
discrete orthogonal polynomials πk ( · ; dλN ). Stieltjes’s Procedure consists in
starting with k = 0 and successively increasing k by 1 until k = n − 1. Thus,
when k = 0, we have π0 = 1, so that α0 can be computed by the top relation in
6
(12) with k = 0 and β0 by β0 = N k=1 wk . With α0 , β0 at hand, we can go into
P
(3) with k = 0 and compute π1 (xk ) for all the support points xk . This then
in turn allows us to reapply (12) with k = 1 and compute α1 and β1 . Going
back to (3) with k = 1, we compute π2 (xk ), whereupon (12) with k = 2 yields
α2 , β2 , etc. In this manner we continue until αn−1 , βn−1 have been computed.
Here n ≤ N.
The second algorithm is based on the existence of an orthogonal similarity

transformation
 √ √ √  √  
1 w1 w2 · · · wN √1 β 0 √0 · · · 0
√
 w1 x1
√ 0 ··· 0 

 β0 α0
 √ β1 · · · 0  
T 
Q  w 2 0 x 2 ··· 0  0
Q = 
 β1 α1 · · · 0  ,
 .. .. .. . . ..   .. .. .. . . .. 
 . . . . .   . . . . . 
√
wN 0 0 · · · xN 0 0 0 · · · αN −1
where Q is an orthogonal matrix of order N + 1 having the first coordinate
vector e1 ∈ RN +1 as its first column. Lanczos’s Algorithm [18] carries out
this transformation and thus, since the wk and xk are given, determines the
recurrence coefficients αk , βk . The algorithm, unfortunately, is unstable, but
can be stabilized by using ideas of Rutishauser [20]; see [16].
In Matlab, the two algorithms are implemented in the routines

)
ab=stieltjes(n,xw)
n ≤ N,
ab=lanczos(n,xw)
where xw is the N × 2 array of the support points and weights of the given
discrete measure (10); see Fig. 2.2.
x1 w1
x2 w2
.. ..
. .
xN wN
Figure 2.2. The array xw of support points and weights
The first routine is generally the one to be preferred, although as n approaches

N, it may gradually become unstable. If such is the case, and values of n near
N are indeed required, the second routine is preferable but is considerably
more time-consuming than the first.
2.4. Discretization methods. The basic idea, first advanced in [7] and more fully
developed in [9], is very simple: One first approximates the given measure dλ
by a discrete N-point measure,
dλ(t) ≈ dλN (t), (13)
7
typically by applying some appropriate quadrature scheme. Thereafter, the
desired recurrence coefficients are approximated by those of the discrete mea-
sure, 
 αk (dλ)

≈ αk (dλN ),
(14)
 βk (dλ)

≈ βk (dλN ).
If necessary, the integer N is increased to improve the approximation. For each
N, the approximate recurrence coefficients on the right of (14) are computed
by one of the methods described in §2.3. To come up with a good discretization
(13) that yields fast convergence as N → ∞ may require skill and inventiveness
on the part of the user. But if implemented intelligently, the method is one of
the most effective ones for generating orthogonal polynomials.
The seemingly complicated constructions of multicomponent discretizations

to be described further on will first be motivated by a simple example.
Example 2.1. The weight function
w(t) = (1 − t2 )−1/2 + c on [−1, 1], c > 0.
When c = 0, this is the Chebyshev weight, and as c → ∞, one expects to

recover the Legendre polynomials. Thus, in a sense, the polynomials orthog-
onal with respect to w “interpolate” between the Legendre and Chebyshev
polynomials.
It would be very difficult to find a single quadrature scheme that would ad-
equately approximate an integral with respect to the weight function w by a
finite sum. However, by considering w as a 2-component weight function, the
first component consisting of the Chebyshev weight, and the second of a con-
stant weight function, a natural discretization is obtained by applying Gauss-
Chebyshev quadrature to the first component, and Gauss-Legendre quadrature
to the second. Thus, the inner product with respect to the weight function w
is approximated by
Z 1 Z 1
(p, q)w = p(t)q(t)(1 − t2 )−1/2 dt + c p(t)q(t)dt
−1 −1
M M (15)
wkCh p(xCh Ch
wkLp(xLk )q(xLk ),
X X
≈ k )q(xk ) + c
k=1 k=1
where xCh
k , wk
Ch
are the nodes and weights of the M-point Gauss-Chebyshev
quadrature formula, and xLk , wkL those of the M-point Gauss-Legendre quadra-
ture formula. This in effect approximates the measure dλ(t) = w(t)dt by a
discrete N-point measure dλN , where N = 2M. Since M-point Gauss quadra-
ture integrates polynomials of degree 2M − 1 exactly and all inner products
in the Darboux formulae (12) involve polynomials of degree at most 2n − 1,
the choice M = n will insure that αk (dλ) = αk (dλN ) for all k ≤ n − 1, and
8
similarly for the βk . Thus, Stieltjes’s procedure, and therefore also Lanczos’s
algorithm, produces exact results. There is no need to increase N any further.
In general, the support interval [a, b] of dλ is decomposed into m subintervals

m
[
[a, b] = [aµ , bµ ], m ≥ 1,
µ=1
which may or may not be disjoint. The integral of a polynomial f against the
measure dλ(t) = w(t)dt is then represented somehow in the form
Z b m Z
X bµ
f (t)w(t)dt = fµ (t)wµ (t)dt, (16)
a µ=1 aµ
where in the most general case fµ will differ from f (and in fact may no longer
be a polynomial) and wµ is a positive weight function which, too, may be
different from w. The Multicomponent Discretization Method uses (16) with
f (t) = p(t)q(t) to approximate the inner product (p, q)w by applying an ap-
propriate M-point quadrature rule to each constituent integral on the right of
(16). This yields an approximation dλ ≈ dλN with N = mM. If the given mea-
sure dλ, in addition to the absolutely continuous component, contains also a
discrete p-point component, then the latter is simply added to the (mM)-point
approximation to yield an N-point approximation dλN with N = mM + p.
Using either Stieltjes’s procedure or Lanczos’s algorithm, we then compute the
approximations αk (dλN ), βk (dλN ) of αk (dλ), βk (dλ) for k = 0, 1, . . . , n − 1.
The integer M (and with it N) may be successively increased in an attempt
to obtain sufficient accuracy.
In Matlab, the multicomponent discretization method is implemented in the

routine
[ab,Mcap,kount]=mcdis(n,eps0,quad,Mmax).
Here, n is the number of recurrence coefficients to be computed, and eps0

the desired relative accuracy in the β-coefficients. (The α-coefficients, if they
are small, or even zero, may be obtained only to an absolute accuracy of
eps0.) The input parameter quad is a quadrature routine that generates the
M nodes and weights of the quadrature approximation of the µth component
of dλ for the current discretization parameter M. It may be a user-defined
routine tailored to the specific problem at hand, or a general-purpose routine
provided automatically. The last input parameter Mmax is an upper bound
for the discretization parameter M, which, when exceeded, causes the routine
to issue an error message. The output parameter ab is the n×2 array of the
desired recurrence coefficients, Mcap the value of M that yields the requested
accuracy, and kount the number of iterations required to achieve this accuracy.
9
The details of the discretization must be specified prior to calling the proce-
dure. They are embodied in the following global parameters:
mc the number of component intervals

mp the number of points in the discrete part of the
measure (mp=0 if there is none)
iq to be set equal to 1 if a user-defined quadrature
routine is to be used, and different from 1 otherwise
idelta a parameter whose default value is 1, but which
is preferably set equal to 2 if iq=1 and the user
provides Gauss-type quadrature routines
irout to be set equal to 1 if Stieltjes’s procedure is to
be used, and different from 1 otherwise
DM if mp> 0 an mp×2 array [[x1 y1 ]; [x2 y2 ]; . . . ; [xmp ymp ]]
containing the abscissae and jumps of the discrete
component of the measure
AB an mc×2 array specifying the component intervals
[[a1 b1 ]; [a2 b2 ]; . . . ; [amc bmc ]].
Example 2.2. Normalized Jacobi weight function plus a discrete measure,

p
dλ(t) = [β0J ]−1 (1 − t)α (1 + t)β dt +
X
yj δ(t − tj )dt, α > −1, β > −1, yj > 0,
j=1
R1
where β0J = −1 (1 − t)α (1 + t)β dt.
Similarly as in Example 2.1, we use the M-point Gauss-Jacobi quadrature

rule with M = n and Jacobi parameters α, β to discretize the absolutely
continuous component, but now add on the discrete p-point measure. As in
Example 2.1, this will produce the first n recurrence coefficients exactly. The
Matlab routine implementing this is shown in the box below.
function ab=r jacplus(n,alpha,beta,ty)

global mc mp iq idelta irout DM AB
global a b
a=alpha; b=beta;
mc=1; mp=size(ty,1); iq=1; idelta=2; irout=1;
Mmax=n+1; DM=ty; AB=[-1 1]; eps0=1e3*eps;
[ab,Mcap,kount]=mcdis(n,eps0,@quadjp,Mmax);
The variables a and b are declared global since they are used in the quadra-
ture routine quadjp.m, which is shown in the next box. Note also the choice
Mmax=n+1, which is legitimate since the discretization parameter M = n yields
exact results.
10
function xw=quadjp(N,mu)
global a b
ab=r jacobi(N,a,b); ab(1,2)=1;
xw=gauss(N,ab);
The integer mu in the routine quadjp (in the present case mu=1) specifies
the muth component interval. The call to gauss(N,ab) generates the N-point
Gaussian quadrature rule for the measure identified via the N×2 array ab of
its recurrence coefficients.
Demo#3. The first 40 recurrence coefficients of the normalized Jacobi weight

function with parameters α = − 12 , β = 23 and a mass point of strength 2 added
at the left endpoint of [−1, 1].
The Matlab program, followed by the output (only partially displayed), is

shown in the box below.
>> ty=[-1 2];

>> ab=r jacplus(40,-.5,1.5,ty)
ab =
-4.444444444444e-01 3.000000000000e+00
2.677002583979e-01 6.635802469136e-01
3.224245925965e-01 8.620335316387e-02
1.882535273840e-01 1.426676765162e-01
1.207880431181e-01 1.809505902299e-01
8.380358927439e-02 2.025747903114e-01
·················· ··················
2.077921831426e-03 2.489342817850e-01
1.972710627986e-03 2.489888786295e-01
1.875292842444e-03 2.490393860403e-01
The results can be compared with analytic answers (cf. [11, p. 43]) and are
found to be accurate to all digits shown.
Example 2.3. A weight function involving the modified Bessel function,
w(t) = tα K0 (t) on [0, ∞], α > −1.
This has applications in the asymptotic approximation of oscillatory integral

transforms [27].
The discretization of the measure dλ(t) = w(t)dt should be done with due
regard to the properties of the weight function, especially its behavior for
11
small and large t. This behavior is determined by

 R(t) + I0 (t) ln(1/t)

if 0 < t ≤ 1,
K0 (t) =
 t−1/2 e−t S(t)

if 1 ≤ t < ∞,
where I0 is the “regular” modified Bessel function and R, S are smooth func-
tions for which good rational approximations are known [19]. This suggests
the decomposition [0, ∞] = [0, 1] ∪ [0, 1] ∪ [0, ∞] and the representation
Z ∞ Z 1 Z 1
f (t)w(t)dt = [R(t)f (t)]tα dt + [I0 (t)f (t)]tα ln(1/t)dt
0 Z ∞
0 0 (17)
+ e−1 [(1 + t)α−1/2 S(1 + t)f (1 + t)]e−t dt.
0
Thus, in the notation of (16),
f1 (t) = R(t)f (t), w1 (t) = tα on [0, 1],
f2 (t) = I0 (t)f (t), w2 (t) = tα ln(1/t) on [0, 1],
f3 (t) = e−1 (1 + t)α−1/2 S(1 + t)f (1 + t), w3 (t) = e−t on [0, ∞].
The appropriate discretization of (17), therefore, involves Gauss-Jacobi quadra-

ture (with parameters 0 and α) for the first integral, Gauss quadrature relative
to the weight function w2 on [0, 1] for the second integral, and Gauss-Laguerre
quadrature for the third integral. The Gaussian quadrature rules required are
readily generated, the first and third by classical means, and the second by
using the routine r jaclog.m for generating the recurrence coefficients for the
weight function w2 followed by an application of the routine gauss.m. This
is implemented for arbitrary α > −1 in the routine r modbess.m shown in
the next box. The routine r jacobi01.m called in the sixth line generates the
recurrence coefficients for the shifted Jacobi polynomials (supported on the
interval [0, 1]). The variables abjac, abjaclog, ablag, declared global, are
used in the quadrature routine quadbess.m, which also incorporates one of
the rational approximations of [19] for computing R, S.
function ab=r modbess(N,a,Mmax,eps0)

global mc mp iq idelta irout AB
global abjac abjaclog ablag
mc=3; mp=0; iq=1; idelta=2; irout=1;
AB=[[0 1];[0 1];[0 Inf]];
abjac=r jacobi01(Mmax,0,a);
abjaclog=r jaclog(Mmax,a);
ablag=r laguerre(Nmax);
ab=mcdis(N,eps0,@quadbess,Mmax);
12
Demo#4. Compute
√
∞ π Γ2 (α + 1)
Z
−t α
e t K0 (t)dt = .
0 2α+1 Γ(α + 3/2)
The routine in the box below applies n-point Gauss quadrature of e−t relative
to the weight function w(t) = tα K0 (t) and determines the smallest n for which
the relative error is less than eps0.
>> global a
>> a=-1/2; N=20; Mmax=200; eps0=1e4*eps;
>> exact=sqrt(pi)*(gamma(a+1))^2/(2^(a+1)*gamma(a+3/2));
>> ab=r modbess(N,a,Mmax,eps0); s=0; n=0;
>> while abs(s-exact)>abs(exact)*eps0
n=n+1;
xw=gauss(n,ab);
s=sum(xw(:,2).*exp(-xw(:,1)));
end
>> n, s, abs(s-exact)/abs(exact)
For the choices made of a, N, Mmax, and eps0=2.22×10−12 , the routine yields
n = 12, s = 3.937402486427721, with a relative error of 7.32 × 10−13 .
2.5. Modification algorithms. The problem to be considered here is the follow-

ing: Given the recurrence coefficients of dλ, generate those of the modified
measure
dλmod (t) = r(t)dλ(t), r rational ≥ 0 on supp(dλ).
The problem can be reduced to the one in which r is either a real linear, or
a real quadratic factor or divisor, since any general real r can be written as
a product of such factors and divisors. For these special cases, the problem
has been solved in [8]. (Other approaches have been taken in [17] and [4]; see
also [12, §3].) We briefly discuss the case of a linear factor, already solved by
Galant [6].
Example 2.4. Modification by a liner factor,
r(t) = s(t − c), c ∈ R\supp(dλ),
where s = ±1 is chosen such that r is nonnegative on the support of dλ.
The solution given by Galant is most elegantly described in linear algebra

terms. It consists in applying one step of the (symmetric) shifted LR algorithm
to the Jacobi matrix of the measure dλ. Specifically, the matrix s[J n+1 (dλ) −
cI], which by assumption is positive definite, is first Cholesky decomposed,
s[J n+1 (dλ) − cI] = LLT ,
13
whereupon the factors on the right are interchanged and the shift cI added
back. Discarding the last row and column of the resulting matrix yields the
desired Jacobi matrix of order n,
J n (dλmod ) = (LT L + cI)[1:n,1:n].
The solution can also be described in terms of a nonlinear recurrence algo-

rithm, which in Matlab is implemented by the routine
ab=chri1(N,ab0,c),
where ab0 contains the first N + 1 recurrence coefficients of dλ and c is the

shift parameter.
Our package includes seven additional routines chri2.m, chri3.m, . . ., chri8.m

corresponding to quadratic factors of various types, linear divisors, and quad-
ratic divisors of different kinds. The routine chri7.m, for example, deals with
a quadratic factor of the form r(t) = (t−x)2 with x ∈ R. It would be tempting
to apply the routine chri1.m for the linear factor t − x twice in succession,
but this may be risky if x is inside the support of dλ. There is, however, an
algorithm similar to Galant’s algorithm, which applies one step of the shifted
QR algorithm to the Jacobi matrix J n+2 (dλ) and discards the last two rows
and columns of the result to obtain J n (rdλ) (cf. [12, §3.3]).
Example 2.5. Induced orthogonal polynomials ([14]).
Given an orthogonal polynomial πm ( · ; dλ) of fixed degree m, the induced

orthogonal polynomial of degree k is orthogonal with respect to the weight
2
function w(t) = πm (t)dλ(t).
Here,
m
(t − xµ )2 ,
Y
r(t) =
µ=1
where xµ are the zeros of πm . This calls for m successive applications of the
routine chri7.m with x = xµ , µ = 1, 2, . . . , m. The routine indop.m shown in
the box below implements this.
14
function ab=indop(N,m,ab0)
N0=size(ab0,1);
if N0<N+m, error(’input array ab0 too short’), end
ab=ab0;
if m==0, return, end
zw=gauss(m,ab0);
for imu=1:m
mi=N+m-imu;
for n=1:mi+1
ab1(n,1)=ab(n,1);
ab1(n,2)=ab(n,2);
end
x=zw(imu,1);
ab=chri7(mi,ab1,x);
end
Demo#5. Induced Legendre polynomials.
The routine shown in the next box generates the first 20 recurrence coefficients
of selected induced orthogonal polynomials when dλ is the Legendre measure.
>> N=20; M=11;

>> ab0=r jacobi(N+M);
>> for m=[0 2 6 11]
ab=indop(N,m,ab0)
end
By symmetry, all the α-coefficients are zero. Selected values of the β-coefficients
returned by the routine (rounded to 10 decimal places) are shown in Table
2.2.
k βk,0 βk,2 βk,6 βk,11
0 2.0000000000 0.1777777778 0.0007380787 0.0000007329
1 0.3333333333 0.5238095238 0.5030303030 0.5009523810
6 0.2517482517 0.1650550769 0.2947959861 0.2509913424
12 0.2504347826 0.2467060415 0.2521022519 0.1111727541
19 0.2501732502 0.2214990335 0.2274818789 0.2509466619
Table 2.2. β-coefficients of induced Legendre polynomials
The procedure is remarkably stable, not only for the Legendre measure, but
also for other classical measures, and for n and m as large as 320; see [11,
Tables X and XI].
15
3. Sobolev orthogonal polynomials
These are polynomials orthogonal with respect to an inner product that in-
volves derivatives in addition to function values, each derivative having asso-
ciated with it its own (positive) measure. Thus,
Z Z Z
(p, q)S = p(t)q(t)dλ0 (t) + 0 0
p (t)q (t)dλ1 (t) + · · · + p(s) (t)q (s) (t)dλs (t).
R R R
(18)
The Sobolev polynomials {πk ( · ; S)} are monic polynomials of degree k or-
thogonal with respect to the inner product of (18),
(
= 0, k 6= `,
(πk , π` )S (19)
> 0, k = `.
These polynomials no longer satisfy a three-term recurrence relation, but like

any other system of monic polynomials whose degrees increase by 1 from one
polynomial to the next, they must satisfy a recurrence relation of the extended
form
k
βjk πk−j (t),
X
πk+1 (t) = tπk (t) − k = 0, 1, 2, . . . . (20)
j=0
In place of the Jacobi matrix, we now have an upper Hessenberg matrix of

recurrence coefficients,
n−2 n−1
β00 β11 β22 ··· βn−2 βn−1
 
n−2 n−1 
 1

β01 β12 ··· βn−3 βn−2 
n−2 n−1 
 0 1 β02 ··· βn−4 βn−3 

Hn = 
···
. (21)
 ··· ··· ··· ··· ··· 
 0 0 0 ··· β0n−2 β1n−1 
 
0 0 0 ··· 1 β0n−1
In the case s = 0 corresponding to ordinary orthogonal polynomials, one has

βjk = 0 for j > 0, and the matrix H n is tridiagonal. It can be symmetrized
by a (real) diagonal similarity transformation and then becomes the Jacobi
matrix J n (dλ0 ) (cf. (4)). When s > 0, symmetrization is no longer possible,
since some of the eigenvalues of H n may well be complex.
3.1. Moment-based algorithms. We define modified moments similarly as in

(7), but now a separate set of them for each measure dλσ ,
Z
(σ)
mk = pk (t)dλσ (t), k = 0, 1, 2, . . . ; σ = 0, 1, . . . , s. (22)
R
For simplicity, we use the same set of polynomials {pk } for each measure and
assume, as in (8), that they satisfy a three-term recurrence relation. In analogy
to (9), there is now a unique map that takes the first 2n modified moments of
16
all the measures dλσ into the recurrence coefficients βjk ,
(σ)
2n−1
[mk ]k=0 , σ = 0, 1, . . . , s 7→ [βjk ], k = 0, 1, . . . n − 1; j = 0, 1, . . . , k. (23)
The conditioning of this map has been studied in [28], and an algorithm,
analogous to the modified Chebyshev algorithm, developed (for s = 1) in [15].
The corresponding routine in Matlab is
[B,normsq]=chebyshev sob(N,mom,abm).
Here, N is the n in (23), mom the 2×2N array of the first 2N modified moments
corresponding to dλ0 and dλ1 , and abm the (2N −1)×2 array of the recurrence
coefficients in (8). The output variable B is the N ×N matrix of the recurrence
coefficients βjk , k = 0, 1, . . . , N − 1, 0 ≤ j ≤ k, where βjk occupies the position
B(j + 1, k + 1) of the matrix B; all remaining elements of B are zero. The
routine also returns the optional N-vector normsq of the squared norms kπk k2S
of the Sobolev orthogonal polynomials. If abm is absent in the list of input
parameters, then ordinary moments are assumed (ak = bk = 0).
Example 3.1. The polynomials of Althammer [1].
These are the Sobolev orthogonal polynomials with s = 1 and dλ0 (t) = dt,
dλ1 (t) = γdt on [−1, 1], where γ > 0. There is a fairly obvious choice of
the polynomials {pk } for defining the modified moments, namely the monic
Legendre polynomials. All modified moments in this case, by orthogonality,
are zero except for
(0) (1)
m0 = 2, m0 = 2γ.
In Matlab, the recurrence matrix B for the Althammer polynomials is gener-
ated as shown in the box below (where N=n and g=γ).
>> N=20; g=1;

>> %g=0;
>> mom=zeros(2,2*N);
>> mom(1,1)=2; mom(2,1)=2*g;
>> abm=r jacobi(2*N-1);
>> B=chebyshev sob(N,mom,abm);
Demo#6. Legendre vs Althammer polynomials.
The routine in the box below generates and plots the Sobolev polynomial of
degree N = 20 corresponding to s = 1 and γ = 0 (Legendre polynomial) resp.
γ = 1 (Althammer polynomial). It is assumed that the matrix B has already
been generated by the routine for Althammer polynomials shown above with
N=20 and g=0 resp. g=1.
17
>> N=20;
>> pi=zeros(N+1,1); np=500; y=zeros(np+1,1);
>> for it=0:np
t=-1+2*it/np;
pi(1)=1;
for k=1:N
temp=0;
for l=1:k
temp=temp+B(l,k)*pi(k-l+1);
end
pi(k+1)=t*pi(k)-temp;
end
y(it+1)=pi(N+1);
end
>> x=linspace(-1,1,np+1);
>> hold on
>> plot(x’,y)
>> plot([-1 1],[0 0],’--’)
>> hold off
The plot for the Legendre polynomial is shown in Fig. 3.1 in the left frame,
and the one for the Althammer polynomial in the right frame.
−6 −6
x 10 x 10
8 3
6 2
4 1
2 0
0 −1
−2 −2
−4 −3
−1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1 −1 −0.8 −0.6 −0.4 −0.2 0 0.2 0.4 0.6 0.8 1
Figure 3.1. Legendre vs Althammer polynomial
Interestingly, for the Legendre polynomial the envelope of the extreme points
is convex on top and concave at the bottom, whereas for the Althammer
polynomial it is the other way around. Note also that π20 (±1) = .7607 × 10−5
for the Legendre, and π20 (±1) = 0 for the Althammer polynomial.
18
3.2. Discretization algorithm. The analogue for Sobolev orthogonal polynomi-
als of the Darboux formulae (12) is
(tπk , πk−j )S
βjk = , j = 0, 1, . . . , k, (24)
(πk−j , πk−j )S
with the inner product ( · , · )S defined as in (18). The Discretized Stieltjes Al-
gorithm, similarly as for ordinary orthogonal polynomials, consists in combin-
ing the formulae (24) with the recurrence relation (20), discretizing the inner
products in (24) by suitable quadrature schemes. We chose to approximate
the absolutely continuous component of each measure dλσ by a Gauss-type
quadrature rule,
nσ
X (σ) (σ) (σ)
(p, q)dλσ ≈ wk p(xk )q(xk ), σ = 0, 1, . . . , s, (25)
k=1
and to add on any discrete component of dλσ if present. In Matlab, the quadra-
ture schemes are identified by an md × 2(s + 1) array xw,
(0) (s) (0) (s)

x1 ··· x1 w1 ··· w1
(0) (s) (0) (s)
x2 ··· x2 w2 ··· w2
xw=
.. .. .. ..
. . . .
(0) (s) (0) (s)
xmd ··· xmd wmd ··· wmd
where md=max(nσ ). In each column of xw the entries after x(σ) (σ)

nσ resp. wnσ (if
any) are ignored by the routine. The routine itself has the form
B=stieltjes sob(N,s,nd,xw,a0,same),
where nd=[n0 , n1 , . . . , ns ], a0=α0 (dλ0 ), and same is a logical variable to be set

equal to 1 if all quadrature rules have the same nodes, and equal to 0 otherwise.
If same=1, the routine takes advantage of significant simplifications that are
possible and reduce running time.
Example 3.2. The Althammer polynomials, revisited.
The box below shows the generation of the recurrence matrix B for the Al-
thammer polynomials using the routine stieltjes sob.m.
19
>> N=20; g=1;
>> nd=[N N]; s=1; a0=0; same=1;
>> ab=r jacobi(N);
>> zw=gauss(N,ab);
>> xw=[zw(:,1) zw(:,1) zw(:,2) g*zw(:,2)];
>> B=stieltjes sob(N,s,nd,xw,a0,same);
The results are identical with those produced by the routine chebyshev sob.m.
There is no restriction, however, on the parameter s when using the routine
stieltjes sob.m.
3.3. Zeros. If π(t) is the vector of the first n Sobolev orthogonal polynomials,
π T (t) = [π0 (t), π1 (t), . . . , πn−1 (t)],
then the recurrence relation (20) can be written in matrix form as follows,
tπ T (t) = π T (t)H n + πn (t)eTn ,
where en is the last coordinate vector in Rn . If t = τν is a zero of πn , the last

term vanishes, implying that τν is an eigenvalue of the matrix H n and π T (τν )
a corresponding (left) eigenvector. Thus, the zeros of Sobolev orthogonal poly-
nomials can be computed as eigenvalues of an upper Hessenberg matrix. In
Matlab, this is done by the routine sobzeros.m shown in the box below.
function z=sobzeros(n,N,B)
H=zeros(n);
for i=1:n
for j=1:n
if i==1
H(i,j)=B(j,j);
elseif j==i-1
H(i,j)=1;
elseif j>=i
H(i,j)=B(j-i+1,j);
end
end
end
z=sort(eig(H));
Here B is the recurrence matrix of order N for the Sobolev orthogonal poly-
nomials, and n≤N. The zeros are arranged in increasing order.
Demo#7. The zeros of the Althammer polynomial of degree 20 with γ = 1.
Assuming that the matrix B has already been generated by either the modified
20
Chebyshev algorithm or the Stieltjes procedure as described in §§3.1 and 3.2,
the box below shows the Matlab commands and output (only the positive
zeros are shown, rounded to 12 decimals).
<< N=20; z=sobzeros(N,N,B)

z =
8.05392515636e-02
2.39532838077e-01
3.92325438959e-01
5.34960935873e-01
6.63745343244e-01
7.75342384688e-01
8.66859942239e-01
9.35924777578e-01
9.80740571465e-01
1.00000000000e-01
Judging from how well the symmetry of the roots is satisfied, the results appear
to be accurate to all digits shown except the last, which may be in error by one
or two units. Generating the matrix B by the modified Chebyshev algorithm
or Stieltjes’s procedure produces the same results to this accuracy, but the
Stieltjes procedure is considerably slower (by a factor of about 14) than the
modified Chebyshev algorithm.
References
[1] Althammer, P. 1962. Eine Erweiterung des Orthogonalitätsbegriffes bei

Polynomen und deren Anwendung auf die beste Approximation. J. Reine
Angew. Math. 211, pp. 192–204.
[2] Chebyshev, P. L. 1859. Sur l’interpolation par la méthode des moindres

carrés. Mem. Acad. Impér. Sci. St. Petersbourg (7)1(15), pp. 1–24. Also in
Œuvres I, pp. 473–498.
[3] Chihara, T. S. 1978. An introduction to orthogonal polynomials.

Mathematics and Its Applications 13, Gordon and Breach, New York.
[4] Fischer, Bernd and Golub, Gene H. 1992. How to generate unknown
orthogonal polynomials out of known orthogonal polynomials. J. Comput.
Appl. Math. 43, pp. 99–115.
[5] Freud, Géza 1971. Orthogonal polynomials. Pergamon Press, New York.
(English translation of Orthogonale Polynome, Birkhäuser, Basel, 1969.)
21
[6] Galant, David 1971. An implementation of Christoffel’s theorem in the
theory of orthogonal polynomials. Math. Comp. 25, pp. 111–113.
[7] Gautschi, Walter 1968. Construction of Gauss-Christoffel quadrature

formulas. Math. Comp. 22, pp. 251–270.
[8] Gautschi, Walter 1982. An algorithmic implementation of the generalized

Christoffel theorem. In: Numerical integration (G. Hämmerlin, ed.), Internat.
Ser. Numer. Math. 57, pp. 89–106, Birkhäuser, Basel.
[9] Gautschi, Walter 1982. On generating orthogonal polynomials. SIAM J.

Sci. Statist. Comput. 3, pp. 289–317.
[10] Gautschi, Walter 1984. On some orthogonal polynomials of interest in

theoretical chemistry. BIT 24, pp. 473–483.
[11] Gautschi, Walter 1994. Algorithm 726: ORTHPOL — a package of routines

for generating orthogonal polynomials and Gauss-type quadrature rules. ACM
Trans. Math. Software 20, pp. 21–62.
[12] Gautschi, Walter 2002. The interplay between classical analysis and
(numerical) linear algebra — a tribute to Gene H. Golub. Electron. Trans.
Numer. Anal. 13, pp. 119–147 (electronic).
[13] Gautschi, Walter 2004. Orthogonal polynomials: computation and

approximation. Clarendon Press, Oxford, to appear.
[14] Gautschi, Walter and Li, Shikang 1993. A set of orthogonal polynomials
induced by a given orthogonal polynomial. Aequationes Math. 46, pp. 174–198.
[15] Gautschi, Walter and Zhang, Minda 1995. Computing orthogonal

polynomials in Sobolev spaces. Numer. Math. 71, pp. 159–183.
[16] Gragg, William B. and Harrod, William J. 1984. The numerically

stable reconstruction of Jacobi matrices from spectral data. Numer. Math.
44, pp. 317–335.
[17] Kautsky, J. and Golub, G. H. 1983. On the calculation of Jacobi matrices.

Linear Algebra Appl. 52/53, pp. 439–455.
[18] Lanczos, Cornelius 1950. An iteration method for the solution of the
eigenvalue problem of linear differential and integral operators. J. Research
Nat. Bur. Standards 45, pp. 255–282. Also in Collected Published Papers with
Commentaries, Vol. V, pp. 3-9–3-36.
[19] Russon, A. E. and Blair, J. M. 1969. Rational function minimax

approximations for the Bessel functions K0 (x) and K1 (x). Rep. AECL-3461,
Atomic Energy of Canada Limited, Chalk River, Ontario.
[20] Rutishauser, H. 1963. On Jacobi rotation patterns. In: Experimental

arithmetics, high speed computing and mathematics (N. C. Metropolis, A. H.
Taub, J. Todd, and C. B. Tompkins, eds.), Proc. Sympos. Appl. Math. 15, pp.
219–239, American Mathematical Society, Providence, RI.
22
[21] Sack, R. A. and Donovan, A. F. 1972. An algorithm for Gaussian
quadrature given modified moments. Numer. Math. 18, pp. 465–478.
[22] Stahl, Herbert and Totik, Vilmos 1992. General orthogonal polynomials.
Encyclopedia of Mathematics and Its Applications 43, Cambridge University
Press, Cambridge.
[23] Stieltjes, T. J. 1884. Quelques recherches sur la théorie des quadratures

dites mécaniques. Ann. Sci. École Norm. Paris (3) 1, pp. 409–426. Also in
Œuvres I, pp. 377–396.
[24] Suetin, P. K. 1979. Classical orthogonal polynomials (Second ed.). “Nauka”,

Moscow. (Russian)
[25] Szegö, Gabor 1975. Orthogonal polynomials (Fourth ed.). AMS Colloquium
Publications 23, American Mathematical Society, Providence, RI.
[26] Wheeler, John C. 1974. Modified moments and Gaussian quadratures.

Rocky Mountain J. Math. 4, pp. 287–296.
[27] Wong, R. 1982. Quadrature formulas for oscillatory integral transforms.

Numer. Math. 39, pp. 351–360.
[28] Zhang, Minda 1994. Sensitivity analysis for computing orthogonal

polynomials of Sobolev type. In: Approximation and computation (R.V.M.
Zahar, ed.), Internat. Ser. Numer. Math. 119, pp. 563–576, Birkhäuser Boston,
Boston, MA.
Department of Computer Sciences

Purdue University
West Lafayette, IN 47907-1389
USA
E-mail address: wxg@cs.purdue.edu (W. Gautschi)
23

Orthogonal Polynomials (In Matlab) : Walter Gautschi

Uploaded by

Copyright:

Available Formats

Orthogonal Polynomials (In Matlab) : Walter Gautschi

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Orthogonal Polynomials (In Matlab) : Walter Gautschi

Uploaded by

Copyright:

Available Formats

Orthogonal Polynomials (in Matlab)

The analytic theory of orthogonal polynomials is well documented in a number

While the theory of orthogonal polynomials is well developed, the practice of

It is well known that they satisfy a three-term recurrence relation

πk+1 (t) = (t − αk )πk (t) − βk πk−1 (t), k = 0, 1, 2, . . . ,

a symmetric tridiagonal matrix of infinite order. Its leading principal minor

J n (dλ) = J (dλ)[1:n,1:n] . (5)

2.1. Recurrence coefficients. Frequently, the measure dλ is absolutely continu-

dλ(t) = w(t)dt, (6)

where w is a nonnegative function, called weight function, integrable on the

Table 2.1. Classical weight functions

Figure 2.1. The array ab of recurrence coefficients

Here, a, b are the Jacobi parameters (denoted by α and β in Table 2.1). If

of interest in statistics, has all coefficients αk = 0 (by symmetry) and β0 = 1,

2.2. Modified Chebyshev algorithm. In principle, the desired recurrence coeffi-

pk+1 (t) = (t − ak )pk (t) − bk pk−1 (t), k = 0, 1, 2, . . . ,

where N is the number n in (9), mom the 1 × 2N array of modified moments,

In view of the highly ill-conditioned nature of the map (9) when mr = µr

Demo#2. The weight function

w(t) = [(1 − ω 2 t2 )(1 − t2 )]−1/2 on [−1, 1], 0 ≤ ω < 1,

of the “elliptic orthogonal polynomials”.

Their computation, though not trivial by any means, can be accomplished in

function ab=r elliptic(N,om2)

All coefficients are accurate to machine precision.

where δ is the Dirac delta function. Thus, the measure is supported on N

There are now only a finite number, N, of recurrence coefficients αk = αk (dλN ),

The former combines Darboux’s formulae for the recurrence coefficients,

The second algorithm is based on the existence of an orthogonal similarity

In Matlab, the two algorithms are implemented in the routines

Figure 2.2. The array xw of support points and weights

The first routine is generally the one to be preferred, although as n approaches

dλ(t) ≈ dλN (t), (13)

The seemingly complicated constructions of multicomponent discretizations

Example 2.1. The weight function

w(t) = (1 − t2 )−1/2 + c on [−1, 1], c > 0.

When c = 0, this is the Chebyshev weight, and as c → ∞, one expects to

In general, the support interval [a, b] of dλ is decomposed into m subintervals

In Matlab, the multicomponent discretization method is implemented in the

Here, n is the number of recurrence coefficients to be computed, and eps0

mc the number of component intervals

Example 2.2. Normalized Jacobi weight function plus a discrete measure,

Similarly as in Example 2.1, we use the M-point Gauss-Jacobi quadrature

function ab=r jacplus(n,alpha,beta,ty)

Demo#3. The first 40 recurrence coefficients of the normalized Jacobi weight

The Matlab program, followed by the output (only partially displayed), is

>> ty=[-1 2];

Example 2.3. A weight function involving the modified Bessel function,

w(t) = tα K0 (t) on [0, ∞], α > −1.

This has applications in the asymptotic approximation of oscillatory integral

Thus, in the notation of (16),

f1 (t) = R(t)f (t), w1 (t) = tα on [0, 1],

f2 (t) = I0 (t)f (t), w2 (t) = tα ln(1/t) on [0, 1],

The appropriate discretization of (17), therefore, involves Gauss-Jacobi quadra-

function ab=r modbess(N,a,Mmax,eps0)

2.5. Modification algorithms. The problem to be considered here is the follow-

Example 2.4. Modification by a liner factor,

r(t) = s(t − c), c ∈ R\supp(dλ),

where s = ±1 is chosen such that r is nonnegative on the support of dλ.