Chebyshev Interpolation: an interactive tour
Scott A. Sarra
Marshall University
* NOTE: For the proper typesetting of the mathematical symbols in this document, it must be viewed with Internet Explorer. In order for the applets to function, the Java 1.5 (5.0) runtime environment must be installed on your computer.
1 Introduction
Most areas of numerical analysis, as well as many other areas of
Mathematics as a whole, make use of the Chebyshev polynomials. In
several areas, e.g. polynomial approximation, numerical
integration, and pseudospectral methods for partial differential
equations, the Chebyshev polynomials take take a significant role.
In fact, the following quote has been attributed to a number of
distinguished mathematicians:
"The Chebyshev polynomials are everywhere dense in numerical
analysis."
In this manuscript we make use of Java applets to interactively
explore some of the classical results on approximation using
Chebyshev polynomials. The three applets used are the Chebyshev
approximation (CA) applet, the Chebyshev polynomial (CP) applet,
and the exponential filter (EF) applet. We also discuss an active
research areas that uses the Chebyshev polynomials. The following
books [7,8] devoted to the
Chebyshev polynomials may be consulted for more detailed
information than we provide in this brief presentation.
2 Chebyshev Polynomials
The Chebyshev Polynomials (of the first kind) are defined
as
Tn(x) = cos[ n arccos(x) ].
(1)
They are orthogonal with respect to the weight w(x) = (1-x2)-1/2 on the interval [-1,1]. Intervals [a,b] other
than [-1,1] are easily handled by the change of variable xョ [1/2][ (b-a)x + a + b ]. Although
not immediately evident from definition (1), each
Tn(x) is a polynomial of degree n. From definition
(1) we have that T0( x ) = cos(0 ) = 1
and T1( x ) = cos(arccosx) = x. For n ウ 1 we have
the triple recursion relation
Tn+1(x) = 2 x Tn(x) - Tn-1(x)
(2)
which can be shown to be true using basic trig identities. Using
equation (2) we see
T2( x )
=
2x (x) - 1 = 2x2 - 1
T3( x )
=
2x (2x2 - 1) - x = 4x3 - 3x
T4( x )
=
2x (4x3 - 3x) - (2x2 - 1) = 8x4 - 8x2 + 1
and that the Chebyshev polynomials are indeed polynomials of
degree n. What do the Chebyshev polynomials look like? The
Chebyshev polynomials of degree k=0,1,シ,12 can be plotted
in the CP applet below. Notice that |Tn(x)| 」 1.
3 Continuous Chebyshev Expansion
The infinite continuous Chebyshev series expansion is
f(x) サ
・
?
n=0
「an Tn(x)
(3)
where
a
n =
2
p
?
?
1
-1
(1-x
2)
-1/2 f(x) T
n(x) dx.
(4)
Truncating the series after N+1 terms, we get the
truncated continuous Chebyshev expansion
SN (x)=
N
?
n=0
「an Tn(x).
(5)
The single prime notation in the summation indicates that the
first term is halved. There are several functions in which the
integral for the coefficients an can be evaluated
explicitly, but this is not possible in general. Examples included
in the CA applet for which a continuous truncated expansion can be
derived are the sign function (26),
f(x)=ヨ{1-x2}, and f(x) = |x|.
The conditions which must be placed on f to ensure the
convergence of the series (3) depend on
the type of convergence to be established: pointwise, uniform,
L2, or Cesaro mean. At the lowest level, the series
(3) converges pointwise to f at points
where f is continuous in [-1,1] and to the left and right
limiting values of f at any of a finite number of jump
discontinuities in the interior of the interval. For example, the
sign function in the CA applet has a jump discontinuity at
x0=0 and has the limiting values on each side of the
discontinuity of f(x0+) = 1 and f(x0-) = -1. Thus the series
converges to zero at this point, i.e.
SN ( x
0 ) サ
1
2
[ f(x
0+) +f(x
0-) ].
If f(x) is an even function then ak = 0 for k = 1,3,5,.... If f(x) is an odd function then ak = 0 for k = 0,2,4, .... This can be observed in the truncated continuous
expansion of f(x) = ヨ{1-x2} and f(x)=|x| (even) and f(x) = sign(x) (odd) in the CA applet.
4 Discrete Chebyshev Expansions
When the integral in (4) can not be evaluated
exactly, we can introduce a discrete grid and use a numerical
quadrature (integration) formula. Several possible grids, and
related quadrature formulas exist. The Chebyshev-Gauss-Lobatto
(CGL) points
x
k = -cos
?
?
k p
N
?
?
k=0,1,シ,N
(6)
are a popular choice. The CGL points are the n-1 extrema of
Tn(x) plus the endpoints of the interval [-1,1]. Using the CP
applet, observe how the extrema of the Chebyshev polynomials are
not evenly distributed and how they cluster around the boundary.
In the CA applet, the CGL points may be plotted by checking plot
CGL points on the options menu. Try this with the sign function
starting with N=9 and then increasing N.
The corresponding CGL quadrature formula is
?
?
1
-1
f(x) dx
サ
p
N
N
?
j=0
「「 f(x
k).
(7)
Using the CGL quadrature formula, the discrete Chebyshev
coefficients an are defined to be
a
n サ a
n =
2
N
N
?
k=0
「「 f(x
k) T
n(x
k)
(8)
and the discrete truncated partial sum is
PN (x)=
N
?
n=0
「an Tn(x).
(9)
Using definition
(8) takes O(N2) operations to
evaluate the discrete Chebyshev coefficients. For large N, a
better choice is the fast cosine transform which takes
O(N logN) operations. If f(x) is a polynomial of
at most degree 2N-1, the CGL quadrature formula is exact, and
PN (x) and SN (x) coincide.
4.1 Interpolating Partial Sum
Requiring that the approximation be interpolating, i.e., requiring
that it satisfy
PN (xi) = f(xi) i = 0,1,シ,N
(10)
we get the interpolating partial sum
IN (x)=
N
?
n=0
「「 an Tn(x).
(11)
The double prime notation in the summation indicates that the
first and last terms are halved. The interpolating partial sum would be equal to the truncated
series with the coefficients approximated via
CGL quadrature except the last coefficient is halved. This is due
to the choice of quadrature points. If Gaussian quadrature which
uses the Chebyshev-Gauss (CG) points had been used instead of CGL
quadrature, the interpolating and discrete truncated partial sum
would be identical. The CG points are the zeros of Tn(x) and
do not include x ア1. Chebyshev pseudospectral methods for solving PDEs usually incorporate the
CGL points and not the CG points. The reason for this is that the
discrete grid must include the boundary points so that the
boundary conditions of the PDE can be incorporated into the
numerical approximation. Using the applet we can observe the difference between SN,
PN, and IN. For example if we use the
sign function with N=11 and plot the CGL points we see that
IN goes through the interpolation sites while
SN and PN do not.
By using the CGL points (6), which cluster densely
around the endpoints of [-1,1], as interpolation sites, the
nonuniform convergence (the Runge Phenomena) associated
with equally spaced polynomial interpolation is avoided.
4.2 Aliasing
Introducing a discrete grid leads to aliasing. The
discrete coefficients can be expressed in terms of the continuous
coefficients as
an = an +
・
?
j=1
( an+2jN + a-n+2jN ).
(12)
As an example consider the sign function with N=9. The
difference between the discrete coefficient a5 and the
continuous coefficient a5 can be quantified by the
aliasing relation (12) as
a5 - a5
=
a23 + a41 + a59 + シ
This relation is a result of the fact that on the discrete grid,
T5 is identical to T23,T41,T59,シ and also to
T13,T31,T49,シ as is illustrated in figure
1. In the CA applet, observe the difference
between the odd numbered coefficients of S9,
P9 and I9. There is no difference in the
even numbered coefficients as the sign function is odd. Thus the
continuous even coefficients that are involved in the aliasing
relation are all zero.
convergence plot
Figure 1: On the CGL grid (open black circles) for N=9 T5 is identical to T13 (green) and T23 (cyan). Points of intersection on the CGL grid are marked with red *'s.
5 Rates of Convergence
Repeatedly integrating equation (4) by parts we
get
a
n =
1
n
m
2
p
?
?
1
-1
(1-x
2)
-1/2 f
(m)(x) T
n(x) dx.
(13)
Thus, if f(x) is m-times (m ウ 1) continuously
differentiable in [-1,1] the above integral will exist and we
can conclude that
an = O(n-m), n = 1,2,シ.
(14)
If we make a careful choice of which definition of the integral to
use, the same result can be shown to be true if f(x) is
(m-1)-times differentiable a.e. (almost everywhere) in [-1,1] with its
(m-1)th derivative of bounded variation in [-1,1].
Since the absolute value of each Tk is bounded above by one
on [-1,1], it follows that the truncation error for the
continuous expansion is bounded by the sum of the absolute value
of the neglected coefficients:
|f(x) - SN (x) | 」
・
?
n=N+1
|an|.
(15)
A similar bound, with an additional factor of two, holds for the
interpolating partial sum:
|f(x) - IN (x) | 」 2
・
?
n=N+1
|an|
(16)
for x ホ [-1,1]. From (14),
(15), and (16) we conclude
that
|f(x) - SN (x) | = O(N-m)
(17)
and
|f(x) - IN (x) | = O(N-m).
(18)
If f is infinitely differentiable the convergence is faster than
O(N-m) no matter how large we take m. This is
commonly termed spectral accuracy or exponential
accuracy. If f can be extended to an analytic function in a
suitable region of the complex plane the pointwise error on
[-1,1] can be shown to be
for some r > 1. In figure 2 the rather slow
decay rate of the error with increasing N is illustrated for the
absolute value function for which m=1. This can be contrasted
with the rapid spectral convergence of the infinitely smooth
function which is also illustrated in figure
2.
No matter what rate of decay the coefficients have, the
convergence rate is only observed for n > n0. Using an approximation with fewer than
n0 terms may result in a very bad approximation. For example,
the decay rate of the coefficients of the infinitely smooth function
in the applet is not yet evident for N=17 and the approximation
is very poor.
convergence plot
Figure 2: Convergence of an infinitely differentiable function f2(x) = ecos(8x3+1) vs. convergence of a continuous function f5(x)=|x|.
Equation (13) allows us to conclude that if
f is a polynomial of degree N then an = 0 for all
n > N. For example, observe that the seventh degree polynomial in
the CP approximation applet has ak = 0 (to within machine
precision of 2.2204e-16) for n > 7.
If m=0, i.e, f is discontinuous, the accuracy of the Chebyshev
approximation methods reduces to O(1) near the
discontinuity. Sufficiently far away from the discontinuity, the
convergence will be slowed to O(N-1). Oscillations
will be present near the discontinuity and will not diminish as Nョ ・. Additionally, the oscillations will not even
be localized near a discontinuity. This situation is referred to
as the Gibbs Phenomenon. The Gibbs' oscillations are very
visible in the Chebyshev approximations of the sign function in
the CP applet.
6 Filters
Spectral filters can be used to enhance the decay rate of the
Chebyshev coefficients [12]. The filtered Chebyshev
approximation is defined as
FN ( x) =
N
?
n=0
s
?
?
n
N
?
?
a
n T
n(x)
(20)
where s is a spectral filter. A pth (p > 1) order
spectral filter is defined as a sufficiently smooth function
satisfying
s(m)(0)
=
0 m = 1,2,シ,p-1
(22)
s(m)(1)
=
0 m = 0,1,シ,p-1.
(23)
The convergence rate of the filtered approximation is determined
solely by the order of the filter and the regularity of the
function away from the point of discontinuity.
If p is chosen increasing with N, the filtered expansion
recovers exponential accuracy away from a discontinuity. Assuming
that f(x) has a discontinuity at x0 and setting d(x) = x -x0, the estimate
|f(x) -
FN(x)| 」
K
d(x)
p-1N
p-1
(24)
holds where K is a constant. If p is sufficiently large, and
d(x) not too small, the error goes to zero faster than any
finite power of N, i.e. spectral accuracy is recovered. When x
is close to a discontinuity the error increases. If d(x) = O(1/N) then the error estimate is O(1).
Many different filter functions are available, but perhaps the
most versatile and widely used filter is the exponential filter
s(w)=exp(-aw2p) p = 1,2,シ
(25)
of order 2p. In order for condition (23)
to be satisfied, the parameter a is taken
a = -lnem where em is defined as
machine zero. On a 32-bit machine using double precision
floating point operations, em=2-52 サ
2.2204e-16 and ln(e) サ -36.0437. The
exponential filter is implemented in the CA applet. To apply the
filter in the applet select the filter option on the approximation
menu. The filter will be applied to the approximation that is
marked in blue on the approximation menu. The order (the default
is a fourth order filter) of the filter may be changed in the
parameter dialog box which is available from the options menu.
Select the sign functions and apply the filter. Observe the
improved accuracy away from the discontinuity as well as the effect
it has on the magnitude of the coefficients. Experiment with
filters of various orders. Notice how a lower order filter smears
or rounds off the discontinuity while a higher order filter does
not completely remove the oscillations around the discontinuity.
The EP applet illustrates the strength of the damping applied in
equation (20) to the coefficients ak from
k=0,1,シ,N for filters of order 2 to 32.
7 Applet
Use the CA applet to further experiment with the concepts we have
discussed. The example functions included in the applet:
- The sign function (m=0)
- The "smooth" function is f2(x) = ecos(8x3+1), (m=・)
- Square root function: f4(x) = ヨ{1-x2}, (m=1)
- Absolute value function: f5(x) = |x|, (m=1).
- Seventh degree polynomial: f6(x) = x7 - 2x6 + x + 3.
8 Current Research Area
Chebyshev approximation is an old and rich subject. However, many
areas that employ Chebyshev polynomials have open questions that
have attracted the attention of current researchers. One example
is pseudospectral methods for the numerical solution of partial
differential equations (PDEs). Chebyshev pseudospectral methods,
which are based on the interpolating Chebyshev approximation
(11), are well established as powerful
methods for the numerical solution of PDEs with sufficiently
smooth solutions.
Interpolation means that f(x), the function that is
approximated, is already a known function. The terms
collocation and pseudospectral are applied to global
polynomial interpolatory methods for solving differential
equations for an unknown function f(x). Detailed information on
pseudospectral methods may be found in the standard references
[1,2,3,4,5,11].
Many important PDEs have discontinuous (or nearly discontinuous)
solutions. See the article [9] for a discussion of one
such class of PDEs, nonlinear hyperbolic conservation laws. In these
cases, the Chebyshev pseudospectral method produces approximations
that are contaminated with Gibbs oscillations and suffer from the
corresponding loss of spectral accuracy, just like the Chebyshev
interpolation methods that the pseudospectral methods are based on.
An active research area is the development of postprocessing methods
to remove the Gibbs oscillations from PDE solutions and to restore
spectral accuracy. Spectral filters may be used but they perform
poorly in the neighborhood of discontinuities. More sophisticated
methods that do better in the area of discontinuities, but may need
to know the exact location of the discontinuities, include Spectral
Mollification, Gegenbauer Reconstruction [6],
Padé Filtering, and Digital Total Variation Filtering. Several
postprocessing methods with applications are discussed in
[10] with supporting web material at
http://www.scottsarra.org/signal/signal.html. The ultimate
goal is a "black box" postprocessing algorithm which can be given
an oscillatory PDE solution and return a postprocessed solution with
spectral accuracy restored.
References
- [1]
-
John P. Boyd.
Chebyshev and Fourier Spectral Methods.
Dover Publications, Inc, New York, second edition, 2000.
- [2]
-
Claudio Canuto, M. Y. Hussaini, Alfio Quarteroni, and Thomas A. Zang.
Spectral Methods for Fluid Dynamics.
Springer-Verlag, New York, 1988.
- [3]
-
D. Funaro.
Polynomial Approximation of Differential Equations.
Springer-Verlag, New York, 1992.
- [4]
-
David Gottlieb, M. Y. Hussaini, and Steven A. Orszag.
Theory and application of spectral methods.
In R. G. Voigt, D. Gottlieb, and M. Y. Hussaini, editors,
Spectral Methods for Partial Differential Equations, pages 1-54. SIAM,
Philadelphia, 1984.
- [5]
-
David Gottlieb and Steven A. Orszag.
Numerical Analysis of Spectral Methods.
SIAM, Philadelphia, PA, 1977.
- [6]
-
David Gottlieb and Chi-Wang Shu.
On the Gibbs phenomenon and its resolution.
SIAM Review, 39(4):644-668, 1997.
- [7]
-
J. Mason and D. Handscomb.
Chebyshev Polynomials.
CRC, 2003.
- [8]
-
T. Rivlin.
The Chebyshev Polynomials.
Wiley, 1974.
- [9]
-
S. A. Sarra.
The method of characteristics with applications to conservation laws.
Journal of Online Mathematics and its Applications, 3, 2003.
http://www.joma.org/mathDL/4/?pa=content&sa=viewDocument&nodeId=389
(accessed March 1, 2005).
- [10]
-
S. A. Sarra.
The spectral signal processing suite.
ACM Transactions on Mathematical Software, 29(2):1-23, 2003.
- [11]
-
L. N. Trefethen.
Spectral Methods in Matlab.
SIAM, Philadelphia, 2000.
- [12]
-
H. Vandeven.
Family of spectral filters for discontinuous problems.
SIAM Journal of Scientific Computing, 6:159-192, 1991.
File translated from
TEX
by
TTH,
version 3.66.
On 01 Mar 2005, 09:38.