Numerical Procedures for Analyzing Dynamical Processes

Edward Ott

Numerical Procedures for Analyzing Dynamical Processes

17AD-A248 175i iI-fli I i~Ih NUMERICAL PROCEDURES FOR ANALYZING DYNAMICAL PROCESSES fTIC FINAL REPORT Project Period: October 1, 1990 - February 29, 1992 Celso Grebogi Edward Ott James A. Yorke University of Maryland College Park, MD 20742-3511 92-07966 .q2 q 20 04 7 111111111111I11I11 Prepared for the Office of Naval Research (DARPA) under grant number N00014-88-K-0657 DISC LAIMR NOTICE \s \ THIS DOCUMENT IS BEST QUALITY AVAILABLE. THE COPY FURNISHED TO DTIC CONTAINED A SIGNIFICANT NUMBER OF PAGES WHICH DO REPRODUCE LEGIBLY. NOT OFFICE OF NAVAL RESEARCH PUBLICATIONS / PATENTS / PRESENTATIONS / HONORS FOR 1 OCTOBER 1990 THROUGH 29 FEBRUARY 1992 CONTRACT: N00014-88-K-0657 R&T NO.: b41u001- -- 04 TITLE OF CONTRACT: Numerical Procedures for Analyzing Dynamical Processes NAME(S) OF PRINCIPAL INVESTIGATOR(S): Celso Grebugi Edward Ott James A. Yorke NAME OF ORGANIZATION: University of Maryland ADDRESS OF ORGANIZATION: Laboratory for Plasma Research College Park, Maryland 20742-3511 Reproduction in whole, or in part, is permitted for any purpose of the United States Government. This document has been approved for public release and sale; its distribution is unlimited. 2 INTRODUCT-ON The following report summarizes our activities under the Office of Naval Research (DARPA) Contract No. N0014-88-K-0657. We have organized this report under the following five categories: I. Deliverables: computer tape and disk with instructions, and summary of accomplishments related to the proposed projects. II. List of publications for the period of this report. III. Appended respective reprints and preprints. N I ," .. . .. A 3 qUo I. Deliverables: Computer Tape and Disk with Instructions arid 3ummary of Accomplishments Related to the Proposed Projects PROJECT I Deliverable: \XVQ are ddivering a tape with a software package for UNIX workstations with documentat'on for analyzing low dimensional dynamical behavior from time series. In particur.z the Lyapunov exponent code will, together with the di- niension code. permit the user to distinguish between periodic, chaotic, and random processes, "Random processes" here means behavior whose dimension is too high to compute. The code computes the information dimension of the time series. We are also including in the same tape a noise-redaction code with documentation. Summary of Project 1: Nonlinear Noise Filteringof Experimental Datafrom Chaotic Processes Many attempts have been made to apply ideas from dynamical sytems to the analysis of experimental data including estimates of attractor dimension and measurement of Lyapunov exponents. Ai, essential problem is that noise often complicates the analysis. For example, noise obscures the fractal structure of the attractor, so that estimates of the attractor dimension cmn be difficult to obtain. Various methods have been proposed to estimate the noise levels in the data, and these are useful for determining the smallest scales at which dimension measurements are feasible. However, up until now no systematic method has been developed for noise reduction. We have developed a method which we believe is a potential breakthrough in the analysis of experimental data. Typically, attractors are reconstructed from a scalar time series of experimental data using time delays. Conventional signal filtering techniques are not useful in this case, because they exanine only portions of the signal which are close in time. We examine points on an attractor which are close in 4 phase space; the corresponding parts of the original signal in general are far apart intime. Our method is a linearization technique which uses the dynamics of the reconstructed attractor to estinate and correct errors in the trajectories. The method relies on the assumption that in a small neighborhood about a point on the trajectory, the dynamics on the attractor is nearly linear. in other words, given a point xi on the attractor, its image is xi+1 = f(xi) for some nonlinear, unlivown function f. We assur- that it is possible to find a matrix A and a vector b such that x,+, - Axi + b. The mehod has two steps: first, to compute the matriecs A and vectors b for each point on the trajectory, and second, to find a new trajectory near the original one which best satisfies the linear approximation. We believe that a reliable procedure like the one outlined above will be invaluable for the analysis of experimental data. 5 PROJECT 2 Deliverable: We are delivering a disk containing a Dynamics code (for IBM com- I patible PCs and for UNIX/X window workstations) with a Manual for computing and evaluating dynamical processes. The source code contains 20,000 lines of code. I In particular, the program will compute stable and unstable manifolds as described below. Summary of Project 2: A fast Reliable Method for the Numerical Computation of Stable Manifolds of Chaotic P? ocesses 3 Saddle points often play a crucial role in the dynamics of a particular map f. A schematic illustration of a saddle p,:int in two dimensions is given in the following 3 figure: __- --_ S I |I Because p is an unstable fixed point, any point p' eventually moves away from I p as f is iterated, even though f(p) = p. For example, in Fig. 1, initial conditions slightly to the right of the curve labeled S move toward p for a few iterates, then are repelled to the right thereafter, eventually approaching the curve U. Initial conditions slightly to the left of S wili move close to p, then off to the left. The cirve S is the stable manifold of p: it is the set of initial conditions which are attracted to p. The cure U is the unstable manifold. If f is invertible then U is the stable manifold of p for the inverse map f-1. More generally, U is the set of points whose preimages tend to p. * 6 I I In many cases, the stable and unstable manifolds wind around in complicated ways. Because the manifolds arc intertwined so closely, initial conditions can approach and be repelled from the saddle point repeatedly, leading to complex behavior. Stable manifolds of fixed points often form part of the boundary between two basins of attraction. In this case, the structure of the stable manifold determines how sensitive the system is to small errors in measuring an initial condition. In addition, it is often important to know whether the stable and unstable manifolds cross at a point other than the saddle point p. Such homoclinic intersections are often of interest, especially in cases where the map depends on a parameter. Hence, I 5 a knowledge of the structure of the stable and unstable manifolds is essential to understanding the dynamics. We have developed efficient, reliable numerical methods to calculbie them. ! I I I I I I II I7 II II. LIST OF PUBLICATIONS FOR * 1 THE PERIOD OF THIS REPORT 1. "Noise Reduction: Finding the Simplest Dynamical System Consistent with the Data," E. J. Kostelich and J. A. Yorke, Physica 41D, 183 (1990). 1 2. "Shadowing of Physical Trajectories in Chaotic Dynamics: Containment and Refinement," C. Grebogi, S. M. Hammel, J. A. Yorke, and T. Sauer, Phys. I Rev. Lett. 65, 1527 (1990). 3. Antimonotonicity: Concurrent Creation and Annihilation of Periodic Orbits," Bulletin AMS 23, 469 (1990). 4. "Chaotic Scattering in Several Dimensions," Q. Chen, M. Ding, and E. Ott, Phys. Lett. 145A, 93 (1990). 5. "Cross-sections of Chaotic Attractor-,," Q. Chen and E. Ott, Phys. Lett. 147A, 450 (1990). 6. "Rigorous Verification of Trajectories for the Computer Simulation of Dynam- I ical Systems," T. Sauer and J. A. Yorke, Nonlinearity 4, 961 (1991). 7. "Analysis of a Procedure for Finding Numerical Trajectories Close to Chaotic Saddle Hyperbolic Sets," H. E. Nusse and J. A. Yorke, Ergod. Th. & Dynam. Sys. 11, 189 (1991). 3 8. "Embedology," T. Sauer, J. A. Yorke, and M. Casdagli, J. Stat. Phys. 65, 579 (1991). 9. "A Numerical Procedure for Finding Accessible Trajectories of Basin Boundaries," H. E. Nusse and J. A. Yorke, Nonlinearity 4, 1183 (1991). 3 10. "Calculating Topological Entropies of Chaotic Dynamical Systems," Q. Chen, E. Ott, and L,Hurd, Phys. Lett. 156A, 48 (1991). 3 11. "On the Tendency Toward Ergodicity with Increasing Number of Degrees of Freedom in Hamiltonian," L. Hutd, C. Grebogi, and E. Ott, submitted for * publication. 8 12. "Metamorphoses: Sudden Jumps in Basin Boundaries," K. T. Alligood, L. Tedeschini-Lalli, and J. A. Yorke, submitted for publication. 13. "Accessible Saddles on Fractal Basin Boundaric ,," K. '. Alligood and J. A. Yorke, submitted for publication. 14. "The Analysis of Experimental Data Using Time-Delay Embedding Methods," E. J. Kostelich and J. A. Yorke, submitted for publication. 15. "When Cantor Sets Intersect Thickly," B. R. Hunt, I. Kan, and J. A. Yorke, submitted for publication. 16. "Border-Collision Bifurcations Including 'Period Two to Period Three' for Piecewise Smooth Systems," H. E. Nusse and J. A. Yorke, submitted for publication. I9 I I I I I I I 3 III. APPENDED RESt'ECTIVE REPRINTS AND PREPRINTS U U I I I I I I I I 10 Phvsica D A1(1990) 183-196 North-Holiand NOISE REDUCTION: FINDING THE SIMPLEST DYNAMICAL SYSTivl CONSISTENT WITH THE DATA Eric J. KOSTELICHab and James A. YORKE Institute for Phvsical Science and Technology, Uniersitv of Manland. College Park, MD 20742. LSA hCenter /or Nonlinear Dvnanics. Department of Phi-sics. Untverstiv ol Texas. A usttn. TX 78712. USA Department of Mathematics. Universitt of Maryland. Colleqe Park. MD 20742. USA Received 3 March 1989 Rev:scd manuscnpt received I October 1989 Accepted 18 October 1989 Communicated by R Westervelt I A novel method is desLnbed for noist. rcduttion II Laotii, experimental data %.hoednami.s arc low dimensional In ad lition. we show how the approaLh allow,. cpenmentalists to use n n of tle same tethniqucs that hae been essential for the analysis of nonlinear systems of ordinary differential equations and difference equations. I. Introduction Numerical computation and computer graphics have been essential tools for investigating the behavior of nonlinear maps and differential equations. The pioneering work of Lorenz 1251 was made possible by numencal integration on a computer, allowing him to take nearby pairs of initial conditions ankd compare the trajectories. H16non 1191 discovered ,he complex dynamics of his celebrated quadratic map with the aid of a programmable calculator. A variety of classical and modern techniques has been exploited to find pertodic orbits, their stable and unstable manifolds [141, basins of attraction [26], fractal dimension [27], and Lyapunov exponents [10, 31. 37]. In some cases, numerical methods can establish rigorou ly the existence of initial conditions whose trajectories have essentially the same intricate structure that one sees on a computer screen [18]. 'Curient address Department of Mathematics, Anzona State University. Tmpe, AZ 85287, USA 0167-2789/90/$03 50 (North-Holland) Elsevier Science Publishers B V Until recently, experimentalists have not been able to apply most of these methods to the analysis of experimental data, since they do not in genetal have explicit equations to model the behavior of thei: apparatus. In ,ases where it is possible to lind accurate models of the ph)sical system, quantitative predictions about the behavtor of actual experiments are possible 117]. However, all that is available in a typicl experiment is the time-dependent output (e.g. voltage) from one or more probes, which is a function of the dynamics. One fundamental problem in the analysis of experimental data concerns the correspondence between the dynamics that goerns the behavior of the apparatus and the discretely sampled time series that Limprises the data. Another question is how to unimize the effect of noise. In this paper, we show how the time dely embedding method, row commonly used to reconstruct an attractor from experimental data, yields a novel procedure for reducing noise in data whose dynamcs can be characterized as low dimensional. Moreover, we I IS4 EL. Kworkh OcdJA. Yar.ug / "'damm i show how the approach can be extended to allow experimentalists access to many of the analytical measurementsL In the embedding method. one generates a set of mdimensional points ow tools mentioned above, Section 2 reviews the time delay embedding method and some of its applications. Section 3 introduces some of the problems associated with traditional filters and outlines our noise reduction method. coordinates are vralucs in the time series separated by a conswant delay [I . For example. wlen m =3.I the reconstructed attractor is the set of points I x,= ;.. s,.,s..j) where ir is the time delay. Takens [341 has shown that under suitable hvpoheses. hsprocedure yields a set whose properties are equivalent to those of the original attractor provided that the embedding dimension 2. The time delayv es merln As stated in section 1. one problem in analyzing experimental data is how to relate the measurements with the dynamics. Before the early 1980s. power spectra were the principal method for analyz:ng such data. For instance. Fenstermacher et al. 1131 relied heavily on power spectra to detect transitions from periodic to weakly turbulent flow between concentric rotating cylinders. However. Fourier analysis alone is inadequate for describing the dynamics. Other methods also have been used to analvze time series output from dynamical systems. Lorenz 1251 used next amplitude maps to describe some features of the dynamics; that is. he plotted -,-,. against :,, where z, is the nth relative maximum of the third coordinate of the numerically calculated solution. Such maps are often useful. not only for investigating features of the Lorenz at- called crnamic in that information about the dynamics is stored in the compu;er for analysis. With each data vector x,, one stores the "next'" method has come into common use as a way of cal system underlying that data this approach, one supposes thaerientaldata.In they i bhaio goned b as linear approximation provides an estimate of the Jacobian of the map at x, 1111. Eckmann et al. [101 use linear maps computed in this way to integrate solution traveling along an attractor" t (which is not observable directly). However, one assumes there is a smooth function that maps points on the attractor to real numbers (the experimental ='Existing numerical methods requirc the attractor to be low dimensional I I In principle, the embedding method allows one to study the dvnamics i detail The earliest applications may be called staic in that the analsis focuses on the geome:ric properties of the set of points on the reconstructed attractor, For exampie. phase portraits and Poincari sections are used in ref. 151 to help determine the transition between quasiperiodic and chaotic flow in a CouetteTaylor experiment. Another important application is the estimation of attractor dimension from experimental data. for which tte:e i%a largx literature [271. In addition, various .nfornmtion theoretic notions can be used to find good choices of embedding dimension and time delay 1151. More recent applications of the embedding method are quite different in nature and can be vector, for example. x, s for some 8 > 0. This makes it possible to compute a linear approxima- that the dynam ica l behavio r is govern ed by a I i is large enough. tractor 1321, but also for instance in experiments on intermittency in oscillating chemical reactions 1301. In the past decade, the time delay embedding reconstructingdata. an attractor from a time series of experimental In l tion of the dynamics in a neighborhood of x,. assuming that there is a low-dimensional dynami- a s t o a i t o a q ai n '2 . In particular. a n i d t ep s t v a set of variational equations and find the positive w macrial wasirst presentd by D Rucik at a Nobel smosiin 19D Wolf ct al. [371 have proposed a diffcrcnt me'hod in which nearb' pairs of points arc followed to cstimatc the largest Lyapunov cxponent I I I Ei. Kti*h :3 diJA. Yare/.Vese mAwim In fact. the time delay embedding method pro- new time series with some of the high-frequency %idesa powerful set of tools for anahag the components removed. This isthe basic idea be- dynamics the breadth of which may not have hind Wee and other bandpass filters 1291 been realized by Eckmann and Rudle. In the remainder of this paper. we dicus two novel applications that are possiW specifically: However. as noted previously, power spectral analyss insufficient to characte.re the dynamics when the data are chaotic. Since the power at spectrum of a low-dimensional chaotic signal resembles that of a noisy one, the suppression of certain frequencies can alter the dynamics of the (1)Noise edtm.Sim one can ap the dynamics at each point it becomes ossible identify and correct inaccuracies in trajectories arising from random errors in the oinal series. Numerical evidence rigna th ie reduction procedure described below improves the yof other analyses such as Lyapnov accuracpofone an lyse s. ucats , exponents and dimension calculaaons. (2) Simplicial approximaions. Linear approximations can be computed at each point on a grid in a neighborb~od of the attractor 'o form a in aneihbor~odof te atracor o fom a simplicial approximation of the dynamical system. This can be used to locate unstable periodic orbits We consider noise reduction in section 3. 3. Noise reduction The ability to extract information from timevarying signals is limited by the presence of noise, Recent experiments to study the transition to turbulence in systems far from equilibrium, like those by Fenstermacher et al. [131, Behringer and Ahlers [21, and Libchaber et al. [241, succeeded largely because of instrumentation that enabled them to quantify and reduce the noise. However, it is often expensive and time consuming to redesign experimental apparatus to improve the signal to noise ratio. An important question, therefore, is how the experimental data can be filtered or otherwise preprocessed before it is analyzed further. One common approach is to use Fourier analysis: one might model the noise as a collection of highfrequency components and subtract them from a power spectrum (or Fourier transform) of the input data. The transform can be inverted to yield a I It5 e have shown that a simple low-pass filter effectively introduces an extra Lyapunov exponent that depends on the cutoff frequency. If the cutoff frequency is sufliciently low. then the filter can increase the fractal dimension of the reconstructed attractor. This resut also has been confirmed by Mitschke et al. suit also an coni c cyrit . 1281 with data from an electronic circuit. We now consider a different approach and show Wow cosidera differ et can so exploited to reduce the noise, at least in cases where the time series can be viewed as a dynamical system with a low-dimensional attractor. Our objective is to use the dynamics to detect and correct errors in trajectories that result from noise. This is done in two steps once an embedding dimension m and a time delay T have been fixed. In the first step. we consider the motion of an ensemble of points in a small neighborhood of each point on the attractor in order to compute a linear approximation of the dynamics there. In the second step, we use these approximations to consider how well an individual trajectory obeys them. That is, we ask how the observed trajectory can be perturbed slightly to yield a new trajectory that satisfies the linear maps better. The trajectory adjustment is done in such a way that a new time series is output whose dynamics are more consistent with those on the phase space attractor, This approach is fundamentally different from traditional noise reduction methods. Because we consider the motion of points on a phase space attractor, we are using information in the original signal that is not localized in a time or frequency domain. Points that are close in phase space correspond to data that in general are widely and F. 136 EJ. Kadc a .,d.A. 'YVoii w irr I oomu irregularly spaced in tme .due to de se ti ve-dependeuc on initial conditions on chaotic at- :: tractors. In contrast. Kalman 14 and similar filters examine data that are closely spaced in time: bandpass filters operate in the frequency domain.I Fig. 1. Schematic diagram for the first stage of the noise reduction method. A collection of points in an c-ball about the reference point xf is used to find a linear approximation of 4. Edmann-Ruele linearkiabo the dynamcs there. The discrete sampling of the original signal means that the points on the reconstructed attractor can be treated as iterates of a nonlinear map f whose exact form is unknown. We assume that f is nearly linear in a small neighborhood of each attractor point x and write We mention three difficulties in computing the local linear approximations in the subsections below. f(x) = Ax + b =- L(.) for some m x m matrix A and m-vector b. (The matrix A is the Jacobian of f at x.) This approximation, which we call the Eckmann-Ruelle linearization at x. can be computed with least-squares methods similar to those described in refs. [11, 101. Given a reference point Xrf, let (x, ),"- be a collection of the n points which are closest to xre1. With each point x, we store =4 the next point (i.e., the image of x,), denoted y, , The kth row a k of A and the kth component bk of b are given by the least-squares solution of the equation yA = bk + a k x, (1) where Yk is the k th component of y and the dot denotes the dot product. Fig. I illustrates the idea'. "The points x, are points on the attractor which are not consecutive in time. The subscnpt i merely enumerates all the points on the attractor contained witun a small distance (of Xrer. In this notation. x, and y, are consecutive in time. *SFarmer and Sidorowich [12] observe that the Eckmann- Ruelle bineanzation can be used for prediction Given a reference point x,, find the Eckmann-Ruelle hneanzation A,x + b,, compute x,, I = A,x, + b,, and repeat the process to get the predicted trajectory I I 4.1. 111 conditioned least squares There is a particular problem when one tries to compute solutions to eq. (1) with a finite data set of limited accuracy that has not been addressed in previous papers [10, 311. Suppose for example that all the points in a neighborhood of x,,f lie nearly along a single line. i.e., the attractor appears one dimensional within the available resolution. Although it is possible to measure the expansion along the unstable manifold at xrf there are not enough points in other directions to measure the contraction. Hence it is not possible to compute a 2 x 2 Jacobian matrix accurately. Any attempt to do so will result in an estimate of the Jacobian whose elements have large relative errors. This kind of least-squares problem is il conditioned. The ill conditioning can be avoided by changing coordinates so that the first vector in the new basis points in the unstable direction 6 . A one-dimensional approximation of the dynamics is computed using the new coordinates; that is, we approximate the dynamics only along the unstable manifold. We recover the matrix A by changing coordinates back to the original basis. For example, if we are working in the plane and the unstable direction is the line y = x, then we rotate the coordinate axes by 45', The dynamics are approximated by a one-dimensional linear map 6 This is done by computing the right singular vectors (9)of the n x m matrix whose jth row is x, The procedure is called principal component analysis in the statistical hterature. I i I E.. Kostehch and J.A. Yorke/ Noise reducion computed along the line Y = x. Then we rotate back to the original coordinates. (The resulting matrix A has rank 1 in this example.) This approach substantially enhances the robustness of the numerical procedure. 4.2. Finding nearest neighbors A second problem is finding an efficient way to locate all of the points closest to a given reference point. The dynamical embedding method imposes stringent requirements on any nearest-neighbor strigen onanyalgorithm reuireent algorithm. The storage overhead for the corre- I ~ 187 in the same box number. The search is extended if necessary to adjacent boxes. Only a crude partition is needed for this algorithm to work efficiently (typically we choose B = 40), and the grid4 is extended only to the first three coordinate axes. *"hen the embedding dimension is larger than three, a preliminary list of nearest neighbors is obtained using only the first three coordinates of each attractor point. The final list is extracte by computi each point in the preliunary list. Although there are circumstances where this can perform poorly (e.g.. when most of th can p o r e oo r ted ina ostndf sponding datather structures must be ttrctorpoits, small, because there are te ns ofthouand of ar th oten usands of of attracto r points. the boxes), attractorthepoints are concentrated of distribution of points ona ihandful typical The algorithm must be fast, since there is one nearest-neighbor problem for each linear map to be computed, We solve this problem by partitioning the phase space into a grid of boxes that is parallel to the coordinate axes. Each coordinate axis is divided into B intervals. (Fig. 2 illustrates the grid in two dimensions.) Each point on the attractor is asdimesios.) achpoit ontheattacto isassigned a box number according to its cotrdinates. For example, a point on the plane whose first coordinate falls in the jth interval (counting from 0) along the x axis and whose second coordinate falls in the k th interval along the y axis is assigned to box signd number kB +j. The list of box t boxnumer B along +j Th lis otbox numbers is sorted, carrying a pointer to the attractors is sufficiently uniform that the running time is very fast. Memory use is also efficient: a set of N attracto. points requires 3N storage locations. In contrast, the tree-search algorithm advocated in ref. [12] requires several times more storage (although the lookup time is probably slightly less). Because N = 10 in typical applications, we believe that the box-grid approach (or some variant) is the most practical. A survey of ome varaniheos al . A svey of other nearest-neighbor algorithms is given in ref. [3]. original data point. Given a reference point xrf, its box number is found using the above formula. A binary search in the list of box numbers then locates the address of xmf and all the other points ordinary least squares to compute the linear maps. In the usual statitical problem of fitting a straight line, one has observations (x,, I where x, is is measured. One assumes known exactly and y', that y, = ao + a~x, + c,, w'ere the (, are independent errors drawn from the same normal distribution. (Analogous assumptions hold in the BI- B _B +21BB B-_B+2 _B+ _ 1- _1 J+ SBIever, 2B+-1 0 1 __. B. -1 B- Fig. 2. Box numbenng scheme in two dimensions. The attractoris normalized to fit in the unit square. The bottom row of boxes rests against the x axis and the leftmost row of boxes against the Y axis. t r c o s i suf i e ly nf r m ha t e r n i g There potential difficulty Tria is a etial dicuty inthethe use se of multivariate case.) In the present situation, howboth x, and v,are measured with error. It can be shown that the ordinary least-squares method produces biased estimates of the parameters a0 and a, in this case [16, 23]. In practice this does not seem to be a serious problem, but statistical procedures to handle this situation (the socalled "errors in variables" methods) may provide 188 EJ. Kostehcl and J.A Yorke/ Noise reduction -The trajectory adjustment can be iterated. That is, once a new trajectory .1, has been found, one can replace each x, in eq. (2) by .,and compute a Fig. 3. Schematic diagram of the trajectory adjustment procedure. The trajectory defined by the Fquence (x, ) is perturbed to a new trajectory given by { i, ) which ismore consistent with the dynamics. In this example we show what the perturbed trajectory might look like J the dynanucs were approximately honzontal translation to the nght. an alternative approach to noise reduction. We consider this question in the appendix. 5. Trajectory adjustment by rnnimizing new sequence { .}. We place an upper limit on the distance a point can move. Points which seem to require especially large adjustments can be flagged and output unchanged. (This may be necessary if the input time series contains large "glitches" or if nonhnearities are significant over small distances in certain regions of the attractor.) When the input is a time series, we modify the above procedure slightly since we require a time series as output. The trajectory adjustment is done The Eckmann-Ruelle linearization procediire described above is computed and the resulting maps are stored for a sequence of reference points along a given trajectory (for the results quoted here, the sequence usually contains 24 points). We now consider how to perturb this trajectory so that it is more consistent with the dynamics. The objc:tive is to choose a new sequence of points 1, to minimize the sum of squares so that changes to the coordinates of x, (corresponding to particular time series values) are made consistently for all subsequent points whose coordinates are the same time series values. For example, suppose the time delay is I and the embedding dimension is 2. Then trajectories are perturbed so that the second coordinate of the ith point is the same as the first coordinate of the (i + l)st point. That is, when x, = (s, s,+1) is moved to the point 1, = ( ,, ,+ 1), we require that the first coordinate of 1 be , EW1l-,- X'11" 6. Results using experimental data self-inconsistency (2) We note that the attractor need not be chaotic where L(x,) =A~x, + b, w is a weighting factor, and the sum runs over all the points along the trajectory 17 . Eq. (2) can be slved using least squares, Heuristically, eq. (2) measures the selfinconsistency of the data, assuming that the linear approximations of the dynamics are accurate. See .3.We say the new sequence (x,) is more for this noise reduction procedure to be effective. Fig. 4a shows a phase portrait of noisy measurements of wavy vortex flow in a Couette-Taylor experiment (20]. This flow is periodic, so the attractor is a limit cycle (widened into a band because of the noise) and the power spectrum consists of one fundamental frequency and its 11 L, 1 +Ii, +I L, (I)11, self-consistent *7 #7In the results descnbed in this paper, the Eckmann-Ruelle points l;:.earization procedure is done using a collection of within a radius of 1-6% of each :,erence point, depending on the embedding dimension, the dimension of the attractor, and the number of attractor points. This results in collections of 50-200 pcnts per ball, which gives reasonably accurate map approxima,tions without making the computer program too slow. The weighting factor w is set to 1. harmonics above a noise floor. See fig. 4b. Figs. 4c, 4d show the same data after noise reduction. makes the limit The noise reduction procedure cycle much narrower, and the noise floor in the power spectrum is reduced by almost two orders of magnitude. However, no power is subtracted from any of the fundamental frequencies, and in fact some harmonics are revealed which previously were obscured by the noise. 189 E.J. Kostelich and J.A. Yorke/ Noise reduction ()3 () 2 I u 0 -2 -3 II 1e 3() 2 II -3. ! 0 1 .4 0)3 0.4 1 -2 * -3 0 I ! 0. 0. 0)5 Fig 4 Phase portraits and power spectra for measurements of wavy vortex flow in a Couette-Taylor experiment. (a). (b Phase portrait and power spectrum before noise reduction is apphed (c). (d) after noise reduction: (e), (f) after a low.pass hilter is applied to the onginal data The vertical axis in (b), (d) and (f) is the base-lO logarithm of the power spectral density, the honzontal axis is in multiples of the Nyqist frequency. These results are significantly different from those obtained by low-pass filtering. Figs. 4e, 4f show the phase portrait and power spectrum when the original data are passed through a 12th-order Butterworth filter with a cutoff frequency of 0.35. The dynamical noise reduction procedure is more effective than low-pass filtering since the noise appears to have a broad spectrum. However, the dynamical noise reduction method appears to subtract power from a mode whose fundamental frequency is approximately 0.3 times the Nyquist frequency. We do not know exactly 190 EJ. Kostelich and J.A. Yorke/ Noise reducton (bl I (a) 0 -2 .3 -4 (c)() -'4 0 8 16 24 12 404 Fig 5 Phase portraits and power spectra for measurements of weakly chaotic flow in a Couette-Taylor experiment (a). (b)Phase (d) after noise reduction The units for the power ,pectrum plot, portrait and power ,pectrum before noise reduction is applied. (c), are the same as those in ref. (51 why this occurs. However, this peak corresponds to the iotation frequency of the inner cylinder and may result from a defect in the Couette-Taylor apparatus [331. We do not consider this to be a serious problem. because the power associated with this mode is several orders of magnitude smaller than that of the wavy vortex flow. We emphasize that our objective is to find a simple dynamical system that is consistent with the data. It is possible for this method to elimnnate certain dynamical behavior from an attractor if those dynamics have very small amplitude, as fig. 4f shows. This situation is most likely to arise when there are not enough data to distinguish such dynamics from random noise. In the present example, the noise reduction procedure reveals the limit cycle behavior quite well". The results obtained by applying the method to chaotic data from the Couette-Taylor fluid flow experiment described in ref. [5] are shown in fig. 5. Fig. 5a shows a two-dimensional phase portrait of the raw time series at a Reynolds number R/Rc = 12.9, which corresponds to weakly chaotic flow [5]. The corresponding phase portrait from the filtered time series is shown in fig. 5b. Figs. 5c, 5d show 5 sWe have not attempted to find the smallest amplitude at which the noise reduction procedure can disunguish quasipenodic from penodic flow In general this will depend on the amount of data. the sampling rate. the embedding dimension, and other factors. I E.J. Kostelich andJ.A. Yorke/Noise reduction the power spectra for the coresponding time series" 9 . It is difficult to estimate how much aoise is removed from the data in this example on the basis of power spectra. One problem is that the transition from quasiperiodic to weakly chaotic fluid flow is marked by a sudden rise in the noise floor in the power spectrum (cf. fig. 3 in ref. [5]). Hence one carnot determine how much of the noise floor is c.ue to deterministic chaos and how much results from broad-band noise. The noise reduction procedure described here has the effect of reducing the power in the high-frequency components of the signal. One question therefore is whether reducing the high-frequency noise corre- sponds to discovering the true dynamics which have been masked by noise. We believe that the answer is yes, based on those cises where there is an underlying low-dimensional dynamical system. However, in chaotic processes some high-frequency components remain, because they are appropriate 191 stored, and a time series is generated by adding a uniformly distributed random number to each iterate. This simulates a time series with measurement noise, i.e., a time series where noise results from errors in measuring the signal, not from perturbations of the dynamics. We measure the improvement in the signal after processing by considering the pointwise error e,= x,,+i -f(x,, x,)II, i.e., the distance between the observed image and the predicted one. Let the mean error be 1/2 E t' =-k , the rmis value of the pointwise error over all A' points on the attractor. We define the noise reduction as R = I - Ef.,ied/Enomv, to the dynamics. We use eq. (3) to generate a time series as follows (with the standard parameter values a = 1.4, /f = 0.3). We choose an initial condition and discard where the mean errors are computed for the adjusted and original noisy time series, respectively. The quantity R is a measure of the self-consistency of the time series. (In other words. R measures how much better on the average the output attractor obeys eq. (3) as one hops from point to point.) When 1% noise is added to the input as described above, the noise reduction (measured with the actual map) is 79%Io . Nearly identical res,,ts are obtained when the input contains only 0.1% noise. In addition, noise levels can be reduced almost as much in cases where the noise is added to the dynamics. i.e., where the input is of the form {x,.: x,. =f(x, + 71,,X,_ + 7). , q,-I random). When the program is run on noiseless input, the mean error in the output is 0.025% the first 100 iterates. The next 32768 iterates are of the attractor extent, which suggests that errors *9 The time senes consists of 32 768 values, from which an attractor is reconstructed in four dimensions Linear maps are computed using 50-100 points in each ball Trajectones are Ut°The pointwise error is measured using eq (3) However. the attractor can be embedded in more than two dimensions fitted using sequences of 24 points, when performing the noise reduction 7. Numerical experiments on noise reduction One important question is how much noise this method removes from the data. The power spectra above suggest that the method eliminates most of the noise, but it is impossible to give a precise estimate for typical chaotic experimental data. However, the H~non map [191 provides a convenient way to quantify the noise reduction, because it can be written as a tinie delay map of the form x,+ I =(x,,x, 1) = 1 - ax" +1x,_ . (3) 1EJ. Kosteich and J.A 192 arising from small nonlinearities are negligible when the input contains enough points. 8. Simplicial approximations of dyaamical systems Recent work has shown that simplicial approximations of dynamical systems can reproduce the behavior of the original system to high accuracy 1361. (See also ref. [351 for a bilineat approach.) In particular. the fractal structure of the original attractors and basin boundaries is preserved over many scales. Such approximations can yield significant computational savings, especially when the original system consists of ordinary differential equations. This approach can be extended in a natural way to generate simplicial app .)ximations of the dynamics on attractors reconstructed from experimental data. Our objective here is to find an approximate dynamical system in a neighborhood of the attractor as follows, A simplex in an m-dimensional space is a triangle with #n + I vertices. Suppose the map is known at each point on a grid. Then there is a unique way to extend the map linearly to the interior of the simplex S whose vertices are grid points. Given a point P ia the interior of S. let { b, )'. be its corresponding barvcentric coordinates (see ref. [36] for an algorithm to compute them). Let 1(vo) be the mar at the ith vertex. The dynamical system at P i. rated by compi,in.g ,_. b,f(v. (4) -0 We apply this method to experimenal data by fin ling a linear approximation of the dynarrac , at each vertex v, with the least-squares method desc" -d above, .sing a collection of points in a si At ball around v,. The maps are stored and retrieved using a hashing algorithm similar to that described in ref. 136]. This yields a piecewise linear Yorke / Noise reductwn methods that previously were available only to theorists:-" . We illustrate the approach using a time series of 32768 values from the Hnon map with a = 1.2, /f = 0.3 uing eq. (3) and adding 0.1% noise as described above. The original attractor is shown in fig. 6a. We take a grid of points wbhch aie spaced at 1%intervals (,his and subsequent distances are expressed as a fraction of the original attractor extent). The trne series is embedded in two dimensions. and a linear approximation of the dymimics is computed at each grid point for which 50 or more attractor points can be collected with a ball of radius 0.03, the set of such grid points is shown in fig. 6b. We take an initial condition near the original attractor and show the first 3000 iterates using eq., (4) in fig. 6c. Although some defects are visible, the attractor produced by the approximate dynamical system looks almost identical to the original one. One application of simplicial approximations is the location of periodic saddles and the estimation of the largest eigenvalue of the corresponding Jacobian. That is. if x is a periodic point of period p, then we find the eigenvalue of DJ P(x) of largest modulus, where Df'( v) refers to the mairix of partial derivatives of the pth iterate of the map f evaluated at x. Given an initial guess for x. one can apply Newton's method using the maps computed at the grid points and eq. (4) to locate the saddle using the simplicial approximations. Likewise. eq. (3) can be used to locate the corresponding "exact" saddle. Saddle orbits up to period 8 have been computed in this way. In all cases. the saddle point for the simplicial approximation is within 2% of the corresponding saddle point for the H~non map. Table I shows the largest eigenvalues of the saddle orbits. (The columns labeled mi= 2 and m = 3 refer to the embedding dimension used to reconstruct the attractor.) In most cases, the =lus approach is less ambitious than that of Crutchfield approximation of the dynamics from a set of 181, experimental data which can be analyzed with the equations that creates the observed attractor who attempts to find a single set of nonlinear difference 193 EJ. Kostehch and J.A. Yorke/ Noise eduction (a) / ., (C) , f - 0 3 (b I I% . , / _____ ___ ___ _ • . (b) Fig. 6. i. Ur_ H~non attractor computed from eq. t3) with a ,, I 2. ft -0 3 (b) 1% grid on which hinear approximations of the -- .... .i) .. dynamics are computed from the available attractor points. (c) Attractor produced by the bimphcial approximations relative error is only a few percent, and in no case exceeds 25%. (The largest relative error is for the period 8 saddles, where one finds the eigenvalue of the product of 8 Jacobians computed from the lea5 squares.) "his meth,>d can be exten&d to experimental d it, sets. However, there are relatively stringent described in section 4.) The current computer implementation uses a large amount of disk space to store the linear map approximations at the grid points. We have constructed a simplicial approximation for an attractor obtained from a BelousovZhabotinskii chemical reaction [7, 301. The attrac- ,m.. irernmts on the data that can be handled: the time se, es must be long erl,)ugh to trace out many trajeclories near the principal unstable saddle orbits, and the noise Ivel must be low. (Presumably, ,.isy data can be preprocessed using the approach tor is reconstructed in three dimensions from a set of 32 768 measurements of bromide ion concentration. The phase portrait is shown in fig. 7a. Linear approximations of the dynamics are computed at each point of a grid consisting of 50 El. Kostehch and JA. Yorke/ Nom., reduction 194 Table I The largest eigenvalues of the Jacobian of the periodic orbits located using the simphcial approximalicri of the Hinon attractor, the attractor. Using initial guesses from some of the trajectories, we apply Newton's method to locate the saddle orbit shown in fig. 7b. Moreover, Period m-2 Exact m-3 we obtain estimates of the Jacobian Df of the map evaluated at a point on the saddle orbit. The 1 1.793 2.178 4.226 1.695 2.199 1.757 2.183 4051 eigenvalues of Df are estimated as X,- 1.14. X 2 - 0.102, and X 3 = - 1.53. These quantitative > 9.626 0 2 4 4.329 6 10.38 10.70 6 10.38 25.80 20.02 17.70 11.32 24.88 20.60 24,32 8 9 8 1212 30.25 20.38 21.70 intervals along each coordinate axis for which 50 within an points or more attractor a Ths poducs oin.be located ridofcan rdiu he 8% 8% radius of the grid point. This produces a database of 59 550 maps. We observe from graphi, cal evidence that many traje. tories approach what appears to be a period-3 saddle in the middle of results confirm that the orbit is a saddle since X, > X3 (Note that one expects X 0 for a flow generated from a set of diffetential equations.) 9. Conclusion Methods for approximating the dynamics of attractors reconstructed from experimental data provide powerful tools. Most of the same proce. dures that have been so important for theoretical insight. such as Poincari maps. unstable fixed points and their manifolds, basin boundarics, and the like, are now available to experimenters. at least in cases where the dynamics are low dimensional. There is little doubt that these tools will lead to breakthroughs in the understanding of a wide variety of physical systems. However. considerable effort is needed before we learn which kinds of systems will benefit most from these types of analyses. Significant improvements in technique will certainly extend the applicability of dynamical embedding methods, for example to higherdimensional attractors. i7- jI z In this appendix we outline a possible alterna- . ////' ,',regression I / i Fig 7. (a) The attractor reconstructed from a time series of bromide ion concentrauoas in a Belousov-Zhabounskil chemi- cal reacuon. (b) The penod-3 saddle orbit tive noise reduction method based on the theory when all the quantities in the of least squares are measured with error. In ordinary least squares, the variables in the problem fall into two classes: the independent variables, which are known exactly, and the dependent variables, which are observations assumed to be functions of the independent variables. The errors dependent variables are subject to random s th t areasuedinependent and enil that are assumed independent and identically dis- tributed (i.i.d.). I EJ.'AKoteIch and J.A. Yorkel Noe, reduction On an attractor reconstructed from experimental data. we assaime that the mapping which takes points in a sufficiently small '- 'I to their images is approximately linear. Howevt,. the locations of all the points are subject to small random errors because of the noise. Hence one cannot describe the points as independent variables and their images as dependent variables. The usual leastsquares method produces a biased estimate of the linear map, and this bias does not decrease if more observations are added [16. 231. The so-called "errors in variables" least-squares methods can be used to handle the latter problem, This approach can be used to obtain both an estimate of the linear map as well as estimates of the "true" values of each of the observations. At first this appears to be an underdetermined problem: from n pairs of observmptions one wants to compute the parameters of the functional relation between them as well as estimates of the n actual pairs " -. However, it is possible to solve this problem by making some assumptions about the errors [16, 231, In our case, we assume that the errors in the location of each point and its image are i.i.d. In particular, we let the covanance matrix of the errors in the variables be the identity matrix. This assumption is valid whenever the noise is indepen- dent of the dynamics, 13. We illustrate the procedure for the case where we are given a collection of n points (in R") and their images. Following Jefferys (211. we form a set of n equations of condition given by /,(x,) = x,,., - Ax, - b,mx,+, - L(x,), (5) where x, is the ith point, x,,, is its observed image, A4is an m X m matrix, and b is an m-vector. The goal is to find estimates of L (i.e.. A and " In the statistical hterature. the problem is said to be uidenufied 1Dyna.-ucal noise (iLe.. each poit is perturbed slightly before iteratun&, yields a covanance matnx which depends on the point However. as long as the dynamical noise small, is our assumpuons about the covanance mamx of the errors should not compromise the accuracy of the method. 195 b). together with perturbations 6. such that f,(x, + ,) . (x,,, +0Od,) - L(x, + 0) - 0 and such that the quadratic form so- tv-10 (6) is minimized. The superscript t denotes transpose and a is the covariance matrix of the observations (which we assume is the identity matrix here). This minimization problem can be solved using Lagrange multipliers (see refs. (21, 221 for a numerical algorithm). The solution gives A and b together with estimates x, -t- 0, of the "true" observations. It can be shown 1161 under fairly mild hypotheses that the estimates of L and the obser. vations are the best in the class of linear estimators. One way to approach noise reduction is to extend eq. (5) to include several iterations of the observed points. Given a collection of points in a ball, together with the next p iterates of each point, the method above is used to find a collection of linear maps Lt . L ...... L. approximating the dynamics. The method also finds estimates of the actual observations. In this approach. therefore, the calculation of the maps and the adjustment of the trajectones is done in one step. Moreover, each point and its image exactly satisfy a linear relationship. Of course, p cannot be too large, because nonlinear effects eventually will become significant when the dynamics are chaotic. On the other hand. eq. (5) provides a natural way to include quadratic or other nonlinear terms. We have written a computer program to imple-, ment this alternative noise reduction algorithm. So" far, the results of this approach have not been as good as those from the method described in the main part of the paper, but further refinement should improve them. Acknowledgements Dan Lathrop provided invaluable assistance in finding periodic orbits in the Hinon and BZ attractors. We thank Bill Jefferys for useful discus- I 196 I EJ Kosehch and J.A. Yorke/ Noise reduction sions and computer software for the errors in variables least-squares problem. Andy Fraser. RandiTab eand w npro all idred, Randy Tagg and Hquan HarryS Swinney all provided (131 P.R. Fenstermacher, H.L. Swinney and J.P, Gollub. J.Fluid Mech. 94 (1979) 103. (141 W. 737.Ftanceschini and L. Russo. J.Stat. Phys. 25 (1981) helpful suggestions, This research is supported by 11S) A. Fraser and H.L. Swinney. Phys. Rev. A 34 (1986) 1134. the Applied and Computational Mathematics Program of the Defense Advanced Research Projects Agency (DARPA-ACMP) ad by the Department of Energy Office of Basic Energy Sciences. W.A. Fuller, 161 York. 1987). Measurement Error Models (Wiley. New 1171 E.G. Gwinn and R.M. Westervelt. Phys, Rev. A 33 (1986) ,143. 181 S.M. Hammel. J.A. Yorke and C.Grebog, J.Complexity 3(1987) 136: Bull. Am. Math. Soc. 19 (1988) 465 1191 M. Hinon. Comm. Math. Phys. 50 (1976) 69. 1201 D Hirst, Ph.D. Dissertation. University of Texas iDecem. Reee111m III R. Badu. G.Bro.V, D.Denghett. M.Ravatu, S.Ciliberto 979 12 (I Lett. bO (1988) Rev. Phys..P.Behngerand and A Politi.121 .AknJ. FuidMc~h her 1987), (211 WH. Jll'erys, Astron. J. 85 (1980) 177 1221 W H. Joffervs, Astron. J 86 11981) 149 1231 M G Kendall and A Stuart, The Advanced Theory ot Statistics, Vol. 2 (ntlhn, London, 1961 p 375 I I 1.21241 A. Libchaber. S. Fauve and C. Latroche, Physica D "7 219: G. Ahlers and R.P. Behringer. Phys. Rev. Lett. 40 (19781) 712. 131JiL. Bentley and J.H. Friedman. ACM Lomput. Sugv. II (1979) 397. [4)SM. Boac. Lid.. Digtal and Kaman Filtering (Edward Arnold Publishcrs LondKn, 1979). A. PuIBrandster and I.L. Swny, Phys. Rev A 35(1987) 2207. 6 M. Caadagh, Nonlnar predicton of chaotic time senes, (1983) 73. (25 EN Lorenz. J Atmos. Sci. 20(1963)130 1261 S.W. MacDonald. C. Grebogi. E Ott and J.A. Yorke. Physica D 17 (1985) 125. (271 G.Mayer-Ktess. ed., Dimensions and Entropies in Chaotic Systems (Spnnger. B".lin, 1986), and references therein. 1281 F. Mitschke, M. Mdller and W. Lange. Phys. Rev. A 37 (1938) 4518. 1291 L.R. Rabiner and B. Gold. Theory and Application of Digital Signal Processing (Prentice.Hall. Englewood Cliffs, prepnnt (December 1937). 171 K.C. Coffman. Ph.D. Thesis. University of Texas at Austin (1987) J.P. Crutchield and B. McNamara. Complex Systems I 181 (19871 417H D. Abarbanel. R Brown and J B. Kdtke, Prediction and system idenufication in chaoUc nonlinear systems: time senes with broadband spectra. prepnnt (January 1989). Dongarra. C.B. Moler, J.R. Bunch and G.W Stewart. (9) J.J. LIN-PACK user's Guide (Sciety for lndustnal and Applied Mathematics. Philadelphia, 1979). (101 J.-P. Eckmann. S.O. Kamphorsth D. Ruelle and S. Ciliberto. Phys. Rev. A 34 (1986) 4971 D. Ruelle. Rev. Mod. Phys. 57 (1985) [11I J -P. Eckmann and 61. 617. (12) J.D. Farmer and JJ.Sidorowich, Phys, Rev. Lett. 59 (1987) 845 NJ. 197.7 (301 J.-C. Roux. Phvssca 0 7 (1983)57: J.C. Roux. R.H. Simoyi and H.L. Swinney, Physica I) 8 (1983) 257. (311 M Sno and Y Sawada. Phys. Rev. Lett 55 (1995)1082. 1321 C Sparrow. The Lorenz Equations: Bifurcations, Chaos. and Strange Attractors iSpnnger. New York. 19842). 1331 R Tagg, pnvate communication. 134] F Takens. in: Dynamical Systems and Turbulence. Spnnger Lecture Notes in Mathematics, Vol. 898, eds. D.A. Rand and L.-S Young tSpnnger, Berlin. 1980) p 36 (351 B.H. Tongue. Physica D 28 (1987) 401 1361 F Varosi. C. Grebogi and J A. Yorke, Phys Lett A 124 (1987) 9 371 A Wolf, J B Swift. H.L. Swinney and J A PyiaD1 18)25 Physica D 16 (1985) 285. Vastano, I I BULLETIN iNew Senesi oF THE %MERICANMATHEMATICAL SOCIETY % oume .23. Number 21. October 1'990) ANTIMIONOTONICITY: CONCURRENT CREATION AND ANNIHILATION OF PERIODIC ORBITS 1. KLAN AND 3. A. YORKE -kBSTRACT. One-parameter families , of diffeoimorphisms of the Euclidean plane are known to have a complicated bifurcation pattern as ;. %'ariesnear certain values. namely where homochnic tangencies are created. We argue that the bifurcation pattern is much more irregular than previously reported. I I Our results contrast with the monotonicity result for the well- understood one-dimensional family g (.r = ;x( I - tu where it is known that periodic orbits are created and never annihilated as A. increases. We show that this monotonicitv in the creation ol periodic orbits never occurs tor any one-parameier tamilv of' C; area coniracting ditfeiorphisms of the Euclidean plane. excluding certain technical degenerate cases where I I our analysis breaks down. It has been shown that in each neighborhood of a parameter value at which a homoclinic tangency occurs. there are either infinitely many parameter values at which periodic orbits are created or infinitely many at which periodic orbits are annihilated. We %how that there are both infnitelv many values at which periodic orbits are created and tnfiniteiv many -it which periodic orbits are anniilated. We call this'phenomenon antimonotoniwuv. U 1.INTRODUCTION The orbit of point v under a ditfeomorphism of the plane fis the sequence {f(x)}. where for k > 0. J de.,iotes the k-fold 1 composition of f'. J- k denotes the k-fold composition of Cf and I' is the identity map. Let p be a periodic point with period n.- The stable manifold IW(p) of the point p is the set {xr 0c) = p} . Similarly, the unstable manifold 11""p) of' PIs J.( lim~ -0 , p} We assume that p Is a hyperbolic saddle. that is. the eigenvalues e, . e, otf Df"(p) are such that lim~ -0 le1 Ie< I < le, I. Since f is a diffeomrorphism of the plane, both W'(p) and W"(p) are curves. There exists a homoclinic tangency I Received by the editors November 2. 1988 and. in revised form. May 1,. 1989. 1980,tMathemancs S~ubject C'lassificationt 1985 Revision;i. Primary 54035,581F713. Partial support provided by DARPA/ACMP program and by AFOSR-81-02 17. *1(D 469 1990 American Mathematical Society 0273-0979/90 $1.00 + $525 per page ; KAN AD J. A. YORKE .a70) of p at q if iV'pj and IW'p, intersect tangentially at q. The homoclinic tangency of p at q for a one-parameter family j at ,.= o is called nondegenerate if IV(p and IVO(pj have quadratic contact at q and IV (p) has nonzero velocity transverse to W"(p) at q as ,. varies [R]. Any value 4, at which this occurs is called a nondegenerate tangency value. A one-parameter family of maps g, is called monotone increasing (decreasing)on an interval J of parameter values if there are no bifurcations for . E J in which periodic orbits are annihilated as ,. increases (decreases. respectively). We say f is antimonotone at ;0 if periodic orbits are both created and annihilated as ,. increases in each neighborhood of the parameter value ,-, The only smooth family for which monotonicity has been proved is the quadratic family , x,- = ,vi I - i tDouadv. Hubbard. Milnor. Thurston. Sullivan. see jMT]. By contrast we have the following theorem. .. Antimonotonicity Thf rem. Each dissipatve ( 3 planardiffeomorphism .izmdv is antinonotone at each nondegenerate honocliw tangency value. Note that this result says nothing about what happens near degenerate homoclinic tangency values, but we believe this situation is essentially the same as for the nondegenerate case. We sketch the proof for a model case. A paper detailing the proof of the general result is in preparation. If two curves are tangent at . = 4 and move apart. so that they do not intersect as A increases (decreases) beyond .,. then we say contact is broken at 4,) %contactis made at 4-. respectively). and we say ., is a contact-breakingvalue lcontact-making value, respectively). Bubble Lemma. If 4. is a nondegenerate tangencv value at which contact is made, then there are nondegeneratetangenci' values arbitrarilv close to ,, at which contact is broken (and vice versa). The theorem follows immediately from the Bubble Lemma because in each neighborhood of a contact-making nondegenerate tangency value, infinitely many periodic orbits are created (and near contact-breaking ones, infinitely many are annihilated) [N. GS]. Thus, in each neighborhood of a nondegenerate tangency, orbits are both created and annihilated, as is illustrated in Figure 1 for the example of the Henon family. II I : NTMOOTONICITY I -0089 . x i -Q084 -14702 w I III 1147062:7"' FIGURE I. SMALL BUBBLE IN HENON FAMILY ,- -).3t'. x. .I. \'. V - 5.000 PREITERATES. .X-COORDINATE OF 30.000 ITERATES PLOTTED PER /. V-\LUE. I II.PRELIMINARIES For each Cantor set C,: R Newhouse INJ defines a number in [0. c, called the thickness -',C, associated with C. \ "rmiddle- 0" Cantor set C, = I \ G., is constructed inductively as follows: I = [0. I1 and I,.0 and I, , are the left and right component of 1, G, . respectively. where '', is an open interval of length .111 in the middle of I . The thickness of C,, is iI - 01)120. Newhouse proves the following lemma. Thickness Lemma. Let F and H be (antor sets in R. with If hull(F) and hull(H) r F toth nonemptv, and r H). r F) I Then H n F is nonempty. I A Newhouse horseshoe famdy N;. is defined as follows. (See Figure 2 on page 472 for symbols. coordinates, and the role of the constants. and see Figure 3 on page 472 for the first iterate I. K-AN AND J. A. YORKE B 1/2+1 1/2- C 1/0 A C x FIGURE 2, COORDINATES FOR . II N(A) N(B) -- N(C) 0 ----- ( X FIGURE 3. FIRST ITERATE OF N,.. V). Define .V,(.. r = d,,. /Jr) for (X., V) C .1' (...) = I /(I i-)) for .v V)E B *N'. ", ) = (". -. + ' -(j I 1/2) for (X.Y) E C: and continue N, smoothly to the rest of R2 We choose ,t# 2. Let A denote the maximal invariant subset of .4 u B; A is a Cantor set and is the product A, A of two Cantor sets. A, is the projection of A ' -NTIMONOTONICITY -'73 onto the x-axis and A, onto the v-axis. We assume that a and ,8are selected so that r(A) ,r(A) = .11 - )-Q(l - 2a) - i> I. A primary stable (unstable) segment is a line segment of the form [0, I]x {y} where v E A., ({x} x [0. 1] where x E A,. respectively). A primary unstable parabolais a parabolic arc of the form N.(x. [/2- e.I/2 + el) where X E A,. Newhouse and Robinson show in [N. R], that in effect, there exist parameter values ,. near homoclinic tangencies where for a proper choice of coordinates the map is similar to Figure 3. We are assuming that the map changes in a regular way as /".varies. thereby avoiding technical complications. III. PROOF OF BUBBLE LEMMA ASSUMING NEWHOUSE HORSESHOE FAMILIES OCCUR Let 4"0 be a nondegenerate tangency value, which we assume to be a contact-making tangency., We assume that on a small interval. arbitrarily near ;.,, there is a Newhouse horseshoe family. We rescale that small interval to be [0. 1]. The primary tangencies (the tangencies of primary parabolas with primary stable segments) are all conta making. We will show that arbitrarily near /. = 0. there is a nondegenerate tangency which is contact-breaking and is not primary. Theparabolicarcofthe (t.= l/2+t,/ "- -- t) -I -iform ,_ / 1- I for 0<,<(1-2//3) <fi-,. t. lies in a gap in the Cantor set of primary stable leaves as shown in .It" Figure 4. I I 4 FIGURE A 4. THE ARC ,OIt.5). 474 I. KAN AND J.k. YORKE Let "(cf) denote the set of parameters such that v't. :) lies or a primary parabola. For each , the vertices of the primary parabolas have y-coordinates at (-;. ,A,, so we see that F(¢) = -ToAu - - f-'" and the thickness of "(%)is equal to rtAU). The nth image of L'tm,) t' (t,,; under N;. is -)= -- -. ;' - .-1/ +I a-,l/2+ti2-" There is a c = ,, t = " at which the Y-coordinate has a stationary inflection point as shown in Figure 5b. and 1 and "satisfy 46ffn-V 11-I l1/2) = -31;',o'?-/3 -3I : ' and = Z,3 '461"-1 . Notice = I/2)( I - Zl / i- 3 - so for large n we have 0 < 7 < I - 2/p .'/I '' Clatm. For fixed , < . with sufficiently small, there exists a .E F( ) such that the nth iterate of the primary parabola containing (t. ,) has a tangency with a primary stable segment. This tangency is contact-breaking and is nondegenerate for .. . The first part of this claim follows from the fact that the local maximum v( , ;.) (see Figure 5a) of the Y-coordinate of ",, U.t .,. depends linearly on ;.. That is. E o) - A; t and so {y(4, ,.)iA E r( )} has thickness rA). By the Thickness Lemma, there exists some A.E (n) such that vIs. A Note that A is00") . Since . isin Frl) , there is a primary unstable parabola which contains v(t . so v,,It. 4,. , is contained in the unstable manifold of A and is tangent to a primary stable segment of A. As ;.varies near 0. the position of this primary unstable parabola is vtt, + A). Nondegeneracy and contact-breaking can be verified by considering the .i-coordinate of d(v,(t, +A ))/dA and noting that for sufficiently small - > 0 and large n this derivative is negative for t sufficiently close to 7.1 We have shown that there is a primary stable leaf S and a primary unstable parabola U so that the nth iterate of U has a I -,NTIMONOTONICITY - S infletioon/ I I I FIGURE I 5. THE INFLECTION VALUE Z. _ I FIGURE 6. CONTACT-BREAKING TANGENCY (I AT a = A. contact-breaking tangency with S (see Figure 0i. Since the stable and unstable manifold of the fixed point p at (0. 0) contain curves arbitrarily close to S and U. respectively, we see that p will have contact-breaking tangencies at parameter values arbitrarily near .. Finally, for n large, this , is near 0. i REFERENCES [GSI N. K. Gavrilov and L. P.Silnikov. On three.dinensionaldvnamical wms close to systems with structurally unstable homoclinic curve. i. 11. Math. USSR-Sb. 88(4) (1972), 467-485. ibid. 9l t 1973), 139-156. [MT] J. Milnor and W. P. Thurston. On iteratedmaps orthe ,nierval. D.V namical Systems: Proc. Univ. Maryland 1986-87. Lecture Notes in Math.. vol. 1342. Spnnger-Verlag, Berlin and New York. 1989. pp. 465-563. .76 [NJ (R] I KAN AND J. A. 'ORKE S. Newhouse. The abundance ot wild hiperoolic sets and nonsmooth stable vets fbr ditffeonorphisms. Inst. Hautes Etudes Sci. Pubi. Math. 50 (1978). 101-151. C. Robinson. Bifurcation to innitelv many sinks. Comm. Math. Phys. 90 11983). 433-459. DEPARTMENT OF MATHEMATICS. GEORGE MASON !'NIVERSITY. FAIRFAX. VIRGINIA 22030 AND NAVAL SURFACE WARFARE CENTER., WHITE OAK. %IARYL-%%D INSTITUTE FOR PHYSICAL SCIENCE AND rECIINOLOY. I'%IVFRSITY OF MARN- [.AND. C"OLLEGE PARK. MARYLAND 20742 Volume 145. number 2.3 3 PHYSICS LETTERS A 2 April 1990 CHAOTIC SCATTERING IN SEVERAL DIMENSIONS 3) Qi CHEN. Mingzhou DING I and Edward OTT : 3 Received 20 December 1989; revised manuscnpt received 30 January 1990: accepted for publication 30 January 1990 Communicated by AP. Fordy Laboratory for Plasma Research. University of Maryland. College Park, MD 20742. USA For chaotic scattering in two-degree-of.freedom (N, 2). time-independent. Hamiltonian systems. scattenng functions (i.e.. plots of the dependence of aphase space variable alter scattering versus aphase space variable before scattering) typically display singularities on a fractal set. For N> 2. however. sc3ttenng functions typically do not have fractal properties (even when the chaotic invanant set isfractal), unless the fractal dimension of the chaotic set is large enough. Anumerical investigation of this I phenomenon is presented for a scatterer consisting of four reflecting spheres at the vertices of aregular tetrahedron. Recentty, there nas been much interest in the phenomenon of chaotic scattering (see reviews I I I) due to its appearance in a variety of applications, including fluid mechanics, celestial mechanics, and, systems that these functions are typically singular on a Cantor set of values of the variable characterizing the state before scattering, Here we consider whether this situation persists in systems with more than two especially, molecular dynamics. In addition, the implications of classical chaotic scattering for the corresponding quantum scattering problem is a subject of active research (2]. Another line of study concerns the question of how chaotic scattering comes about and evolves as a system parameter is varied [ 31.In all of these past works, when specific systems or examples are investigated, they have almost always been effectively Hamiltonians with two degrees of freedom. Since many situations that will arise in practice can be expc,.ted to involve Hamiltonians with more than two degrees of freedoms, it is important to see whether new phenomena, not present in two-degree-of-fr'±edom systems, can be anticipated in these situations, In particular, let is consider plotting variables characterizing the st ite of the system after scattering as a function of a single variable characterizing the state of the system before scattering (with the other "before-scattering variables" held fixed). We call such plots "scattering functions". It is a striking hallmark of chaotic scattering in two-degree-of-freedom degrees of freedom. We find that the scattering function does not typically display fractal properties in N.degree-of-freedom chaotic scattering systems with N> 2, unless the Hausdorff dimension D, of the fractal chaotic invariant set exceeds a critical value. In particular, if the Hamiltonian is time reversible, then fractal behavior of scattering functions can typically be expected only if II Also at Department of Physics. 2 Also at Department of Electrical Engineenng and Physics. Dc > 2N- 3. (I) Since Dc is greater than or equal to one, eq. ( I ) is satisfied for two-degree-of-freedom chaotic scattering systems (N= 2). For N> 2, fractal behavior of the scattering function istypically always absent even though the chaotic invariant set itself is fractal, provided that I < <D2N- 3. (Because the chaotic set lies in the DE-dimensional energy surface (DE= 2N- I ), we always have D, < 2N- I.) Since Dc depends on system parameters, one expects that aqualitative change in the scattering function can be observed as a parameter of the system is varied through the critical value at which Dc= 2N- 3. Eq. (I) is derived below. We consider N-degree-of-freedom, time-indepen- dent, open Hamiltonian systems, such that the dy- 0375-9601/90/S 03.50 © Elsevier Science Publishers B.V. (North-Holland) 93 Volume 145. number 2.3 PHYSICS LETTERS A 2 April 1990 namics is time reversible. That is, ifx=X(t),p=P(t) are solutions of Hamilton's equations (where x and p are the N-dimensional configuration and momentum vectors), then x=X(-t), p=- P(-t) are also solutions. The dynamics will be reversible if the Hamiltonian is an even function of p. For example, H--jp:+ V(x) (2) • )We ative (i.e., D(S) I ), then there is a positive probability that S,n S2 is not empty; and, furthermore, if S,r S2 is not empty, then D(St S2 ) is given by (4) with probability one. now apply (4) to the chaotic scattering situ- Let D,and D, denote the dimensions of the stable and unstable manifolds of the chaotic invariant set. Due to the assumed time reversibility of the dynamics. these dimensions must be equal, D,= D .. 3) ation. Since the intersection of the stable and unstable manifolds is the chaotic set, we see that (3) and (4) with D. = D,= 2N- I yield D,= N+ d, with (5) (Non-time.reversible dynamics occurs, for example, when magnetic fields are present and leads to Hamiltonians which are not even functions of p. In these cases, (3) need not hold.) We shall be interested in "iedimension of inter. sections of sets lying in the energy surface. As'background. we note the following. Let S,and S, denote two subsets of a D.-dimensional manifold, and let their dimensions be denoted D(S,) and D(S 2). If S, and S2 are smooth surfaces, then generically d, = (D, - 1). (6) D(SnS)= D(S,) + D(S)- D., (4) if the right hand side is nonnegative and SjnS2 is not empty. If it is negative, then S, and S, do not have a generic intersection. For example, two onedimensional lines in a three-dimensional space may intersect at a point, but .. iight perturbation of the position of the lines typically removes the intersection. Thus the original intersection is not "generic". We wish to apply (4) also to the case where S, is fractal and D(S,) is its Hausdorff dimension with noninteger value Si, For this purpose, we refer to the theorems in ref. [4 . As an example of these results, consider the case of a fractal set S,lying in a rectangular region of a plane (D. = 2 ). Now randomly choose a straight line S2 in the plane by first choosing a point with uniform probability distribution in the rectangle and then placing the line through this point at an angle chosen randomly with uniform probability in [0, 2n]. If the left hand side of (4) is neg,IFormula (4) applies ifS1 isa Souslin set and S2 is a smooth surface. A Souslin set isthe union of countable intersection of closed sets. See ref. 41. 94 We now observe that the fractal set of singular values for the scattering function corresponds' ints on the stable manifold of the chaotic s" .-he orbits originating from such points asymptote to the chaotic set. Orbits originating near these points will spend a long time "bouncing arouod" in the scat. terer before leaving the scattering region. that is, they stay close to the chaotic set for a long time and hence are sensitive to small perturbations of their initial conditions. Let d, denote the fractal dimension of the set of singular values of the variable in the scat. tering function which characterizes the orbit before scattering. Sweeping this single, before-scattering variable corresponds to moving along a curve in the Drimensional energy surface. Thus d, is the dimension of the intersection of the stable manifold of the chaotic set with a one-dimensional set, and (4) yields, d, = D,+ I -DE, or d,=d,+2-N, (7a) (7b) di = ID,+ 2- N, where in (7b) we have used (6). (Note that (7a) applies whether or not the Haniltonian is time reversible, while (6) and hence (7b) require time reversible dynamics.) If the right hand side of (7) is negative, then there is zero "probability" of intersection, and we will typically never observe fractal properties of the scattering function. Requiring d, > 0 in (7b) yields the previously stated condition for fractal behavior in the scattering function, eq. ( I). We emphasize that the critical value, D,= 2N- 3, Volume 145, number 2,3 3 PHYSICS LETTERS A for observation of fractal behavior in the scattering function results under the assumption that the scattering function is obtained by varying a single before-scattering variable holding all the others fixed. If instead, we choose to consider scattering functions which depend on n independent before-scattering variables with the others held fixed, then similar considerations can be applied. In this case, fractal behavior in the n-independent-variable scattering function is typically observable if D,> 2(N-n)- I (for time reversible systems), and the fractal dimension of the set on which the scattering function is singular is d. =d,+n+ I -N. In such cases we say that the chaotic scattering is an "n-dimensional ob. servable". Since, as a practical matter, it is much easier to examine a function of a single independent variable, we expect the one-dimensional observable case to be of most interest. We check the above qualitative features in a simpie system exhibiting chaotic scattering. It consists of a point particle of unit speed bouncing between four identical hard spheres. The centers of the spheres are located at the vertices of a regular tetrahedron (fig. I ) of unit edge length. The spheres are labeled by (0, I, 2. 3). The coordinates of their centers I (.v,, v, 1=O. 1, 2. 3) are: o), (.ro,. -U)--(0, 0. 01,). (x,,, : 1 )=( - O) 1, l//3, Z 0. r~ Y / -pact 1 i X Fig. I. The geometry of the scatterer, four reflecting hard spheres sitting at the vertices of a regular tetrahedron, (x 2, y 2, :2) = ( - 2April 1990 I/2,/3, 0) 1- (x 3 ,y 3, :s)= (0, l//0). Thus the bottom of the tetrahedron sits on the plane :=0. The radius of the spheres R is the only adjustable parameter in the system. and the spheres do not intersect as long as R < 1. There are an infinite number of trapped orbits. periodic or aperiodic, in our system. These orbits are all unstable since small displacem.nts from a trapped orbit are magnified exponentially by the defocusing effect of the spheres. All trapped orbits can be uniquely coded by a bi-infinite sequence (a,) of four symbols J0. I. 2. 31 in the following way. We introduce a discrete time as the time of collision of the particle with one of the four spheres. The symbol a, is set to k if the panicle collides with sphere k at time i. Obviously, the particle cannot hit the same sphere it collided with at the immediately previjus time. Therefore, when R is small enough, the sole constraints on the symbol sequence of trapptd orbits is a, ta,. 1. If the symbol sequence is periodic. the corresponding orbit is also periodic. For instance, the orbit bouncing between sphere one and sphere two is of period two, and its symbol sequence is 1.... 1, 2. 1. 2. ... = [1, 21, where the square bracket denotes the periodicity. There are a total of six period-two orbits: (0, , 0.21, 10, 31, [1. 21, (1, 31, [2.31. There is no penod-one orbit due to the constraint a,*a,_1 . The number of trapped periodic orbits grows exponentially with the period. The exponent is the topological entropy of the set of trapped orbits. For our system, when R is small enough, the topological entropy is log(3). To proceed, imagine the following situation. We choose a plane below the scattering tetrahedron of spheres,. = - K, K> R. We then consider trajectories originating from initial conditions (.re, Yo) on this plane and with initial velocity straight upward (i.e.. parallel to the z-axis). We refer to (x, yo) as the imparameters. For all initial conditions (x.r,yo' we define a nonnegative integer valued function T(xo, Yo) which we call the time delay function. Its value is given by the total number of collisions with the hard spheres experienced by the particle with impact parameters (xo, Yo). For almost all impact parameters, this function is finite, corresponding to a finite trapping time of the particle in the system. 95 Volume 145, number 2,3 PHYSICS LETTERS A 2 April 1990 1 a0.4 - 1 .0.40.4 Xo 0 0.4 XO Fig. 3.The intersection of the stable manifold with the hyper- 0 plane plane:= -K. p.puO. R=0.48. -0.3 -0.4 -0.1 Rc 0 0.1 0.2 0.3 112 R 0.4 X0 Fig. 2. (a) Hierarchical construction of the Cantor set structure of the stable manifold. R =0.4; (b) blowup of (a). Fil. 4. Schematic illustration of the dimension d. as a function of R. However, there are certain trajectories which remain in the system for an arbitrarily long time. Initial conditions (xo. Yo) for these trajectories are distributed on a Cantor set. This Cantor set is the intersection in the five-dimensional energy surface of the stable manifold of the trapped unstable set with the two-dimensional plane z= -K, p=p=O. The time delay function is singular on this Cantor set. To see the Cantor set structure of the stable man- detail. For some impact parameters, the particle will not hit any of the four hard spheres and will go straight off to infinity. Those initial conditions from which the particle hits one of the four spheres at least once are the vertical projection of the four spheres onto the plane of initial conditions. They are the four big circular disks in fig. 2. We denote this set from which orbits experience at least one bounce by C1. Inside each big disk, there are three small deformed disks, from which the particle hits the four spheres ifold, we consider the particle trajectories in more at least twice. These are images of the other three 96 I I Volume 145., number 2.3 PHYSICS LETTERS A spheres in the mirror of the first sphere. Thus we have a set C2 of nine small disks from which orbits bounce at least twice. Within each small disk, there are three smaller disks C3, from which the particle hits the hard spheres three or more times. The resulting set of this hierarchical disk organization, given by n-,, C,, is the Cantor set illustrated in fig. 3. Starting from any point in this set, the particle bounces between the four hard spheres forever, never escaping to infinity, The fractal dimension of this Cantor set is d, and 2April 1990 is related to the dimension of the stable manifold D, by D,= 3+d . It is reasonable to presume that d, is a monotonically increasing function of the radius R. When R is zero, there is no strange set on the plane of initial conditions, and hence d, is zero. For small R,the dimension d, increases sharply with R, d, -/ln(R-'' as can be shown by an argument similar to one given in ref. 13 ). On the other hand, if R>,1,5, the region 16 25 J 20 II 10 T T 10 ---n__ 0 , -0.2 0 0.2 0. I 0.4 0.6 e 0.8 1 1.2 0.64 0.56 0.58 e 0.6 0.62 0.64 25 ,, 20 h 15k 10F~!~ 5 0.59 0.595 0.6 0.605 0.61 Fig. 5. (a) The time delay as a function of the distance I alont the one-dimensional line cut in a case exhibiting chaotic scatter ing, R=0.48; (b) blowup of (a); (c) blowup of ko). 97 Volume 145, number 2,3 PHYSICS LETTERS A 2 April 1990 between the four spheres is closed to the outside (in this case, the spheres intersect since R > j ), and all the points in this closed region are trapped. Hence, all the points in the closed region are on the stable manifold (i.e., the chaotic set and its stable manifold are the same set). The dimension of the stable manifold in this case is equal to the dimension of the energy surface, D,=5, and thus d,=2. Therefore, if we vary R between 0 and I/,/3 the dimension d, in. at which d,= I, and the scattering will change qualitatively as R increases through R. Below R,, we will not see chaotic scattering from a one-dimensional cut in the plane of initial conditions. A question of prime interest in this context is whether R, < J. If it is, then we will be able to see chaotic scattering for typical one-dimensional cuts for R in a range of values (t< R< j ) iuch that the spheres do not intersect. We used a box counting algorithm to determine creases from 0 to 2. Thus there will be a value R- R the fractal dimension d,. We cover the Cantor set Ilk - 0 .0.2 0.2 0.6 0.4 0.8 1 1.2 1 0.54 0.56 0.58 0.6 VI 0.62 I 0.641 I C 0 - 0, I I Fig. 6. (a) The cosine of the angle to the :-axis made by the ex0.59 0.595 0.6 e 98 0.605 0.61 iting direction of the particle as a function of the distance I for the same one-dimensional cut as in fig. 5; (b) blowup of (a); (c) blowup of (b). Volume 145. number 2.3 2 April 1990 PHYSICS LETTERS A generated above by squares of edge length e. then in the limit e-.0. the number of squares N(E) needed I The exponent d, can be determined by a least-squares fit of N(W). When R=0.48. we found d,is approximately 1.4. J hus we verify the important result that R,< 1.See the schematic illustration in fig. 4. We also computed d,at a smaller R value. R=0.4. at which we obtain d, 1.07. Using a linear extrapolation from these two computed values of d,. we estimate & 03.0 R 0.38. 10 a 1 for the covering scales as We now describe some of our numerical results at K 01 I 001 oO, R=0.48. Since d,z 1.4> 1. we expect, with positive probability, to see chaotic scattering from a randomly chosen one-dimensional cut in the plane of 1000 es~aping set on this one-dimensional line should ',e equal to d, =d, - I = 0.4. We check this by generating one-dimensional random cuts in the plane. We pick a random point in the square centered at the point lo x=v=O, of edge length 2R. Then we draw a line at a random angle through this point. Restricting initial '=11 i 12 =13 L\ 0 initial conditions. The fractal dimension of the non- =10 _/s=o =14 , 02 I 1)6 ,8 10 b T K conditions to this line, we then plotted the "time de. lay" (i .. , the total number of bounces from spheres experienced by a particle) as a function of distance I along this line. Out of thirty such lines, we found nineteen cases exhibiting a fractal set of singularities of the time delay function. A typical form of the time delay function restricted to the one-dimensional line in cases where we observe chaotic scattenng is shown in fig. 5a. From the blowups plotted in figs. 5b and 5c, we conclude that the singularities in the time delay function are apparently distributed in a fracial set. Another way to confirm this is to examine the dependence of the scattering function giving the exiting particlh direction. Fig, 6a shows plots of the cosine of the aogle 0 to the :-axis made by the velocity of an exiting particle as a function of distanc. I along the same randomly chosen line as was used for fig. 5. In regions near singularities, this function oscillates wildly. Successive blowups of this function (figs. 6b and 6c) show qualitative similarity, again indicating fractal singularities. To determine the fractal dimension of the set of singularities on a one-dimensional line, we use the following algorithm (the usual box counting method i =10 01 t/ 0001 06 04 02 =12 =14 = 15____ _--__= 0 /-/ =1 =12 08 1.0 S Fig. 7 (a)The Hausdorff sum K'(s) as a function ofs for different level t; (b)the same plot for a diffe ni one-dimensonal cut yields an error comparable to the fractal dimension). The time delay function assigns naturally a level structure to the one-dimensional line. At level t, we measure the length of all the intervals where the time delay function is greater than or equal to i and denote them by I,. Then we form the Hausdorff sum K'(s)= X (I ) , (8) where the sum is taken over all intervals at level i. When i tends to infinity, this sum should give the Hausdorff s-dimensional measure [ 5]. Therefore, it 99 VomI 3I45. num to 2. PHYSK"S LETTERS A 2Apd 3 99 is ,-afinitewhen s is less than the Hausdorffdimensiondofthefrac- v- and is zerowhen sis greater than d,. Hence. , expect that for sufficiently- large fractal behavior in the scattering function. level i. the sums K'(s) for different levels will all intersect with each other at approximately the same point s=d,given by the Hausdorff dimension of the one-dimensional fractal set. supported by the Office of Naval Research (Phwics). by the Department of Energy (Basic Enerpy Sciences) and by the Advanced Research Projects Agemcy We thank Itmi Kan for discussion. This 'ork was For R=0.48. numerical calculation indeed shows that the sums K'(s) for large levels all intersect at approximately the same value, thus yielding an approximation to d,. Figs. 7a and 7b plot K'(s) as a function of s for different i for two one-dimensional line cuts of the plane of initial conditions. (Small i data are not shown here. since they do not reflect the fractal property of the singular set.) Within numerical errors. the intersection points are all centered at d, =0.4 ± 0.05. This value is also consistent with resuits obtained for other cuts exhibiting chaotic scattering and is also consistent with our box counting result d,- 1.4. When R=0.25, the fractal dimension d,is less than one. Consistent with this. from 100 random line cuts of the plane of initial conditions, we did not see any 100 iefeences I U. iB. Eckhadt. D 33 (9) t 39: Smilanskv. TPhlsaa The classical and quantum theeey oof chaotic sauag.Lectures at es Houchtm Sesmson Ut. Chaos and quantum pis. eds. M.-J. Giamnom. A. VofosandJ. ZionJmm tEke,,e..Amst 1990. to be Puished. Im. .Phy& Rey. Lett.60 (19U) -3777: [2 It- Blmd and U. Su P. Gasprd and S. Rice.J.Chem. Phy.L 90 11999) 2242.2255: p. C-itanovi and B. Eckhardt. Phvs Rev.Lett. 63 (1959) 823. (31 S.Weher. E On and C. Grebr.;L Phy. . cv. Lett 63 (1999) 919. Mattda. Acta %JatIh 152 (194) 77. Ann. Acad.Sci. [41pFewwcA 1 (19751 2-7. 51KJ.Falconer. The Igomcty of factal sets (Cambridge Univ. Prss. Cambridae. 1995). ! I PHYSICSILETERSA Voline 147. Number &9 I Cross-sections of chaotic attractors I Qi Chen and Edward Ott ' LabovamforrPlasnr a Rmaiwr 30JY 1990 Unn1wm u ofManianL Colge ParAL MD 20742. USA Rccei ed 26 March 1990: accepted for publication '8 May 1990 Communicated by A.P. Fordy We present an e'fictent algorithm for constructing cross-sections of chaotic attractors. The technique is particularly useful for studying the structure and fractal dimension of higher dimencional attractors. 3 One of the central topics in nonlinear dynamical systems theory is the study of the structure and organization of invariant sets under the -lynamics. In chaotic attractor is followed until it comes near the desired cross-section plane. Through a subsidiary calculation, a local approximation to the 'astable particular, the geometry of strange attractors [I I is of particular interest. For such studies, the visualization of the strange attractor is important for revealing structure as well as characterizing the attractor, This presents problems when higher dimensional attractors are encountered. For exampie, the projection of an attractor whose fractal dimension is greater than two to a plane yields a fuzzy blob. Questions such as whether the local structure of a typical higher dimensional strange attractor is the product of a continuum with a Cantor set 121 or is more complex than this cannot be answered by simply taking a projection of the attractor. In addition. numerical determination of the dimension of higher dimensional fractal sets by box-counting algorithms can require enormous memory storage and CPU time. If feasible. taking cross-sections of the attractor (i.e.. intersections of the attractor with a surface) might offer a way of both elucidating the geometry of the attractor and of estimating its dimension. In this regard, two procedures for taking a crosssection of a chaotic attractor were proposed by Lorenz 12 1. and the first of them was extended and further developed by Kostelich and Yorke [ 3 ]. This latter procedure is basically as follows. An orbit on the manifold through that point is found. Then the intersection of the approximate unstable manifold and the desired cross-section plane is determined, thus projecting the orbit point onto the cross-section plane. Assuming the attractor is smooth in the unstable direction (or directions), this intersection approximates to a point in the cross-section of the attractor. Repeating this procedure many times as an orbit is followed, a cross-section picture of the attractor is built up. In this note. we consider Lorenz's second procedure for taking numerical cross-section. Compared to the first procedure. this procedure can be easier to implement and yield faster computer computation. On the other hand. the method has certain limitations which will be discussed. Consider an N-dimensional invertible map, x.+ I= F(x,). Choose a compact volume V which contains the chaotic attractor. We shall find the cross-section of an m-dimensional hyperplane with the unstable manifolds of the invariant sets contained in V. This will typically include the attractor. By inverting the map, the attractor becomes a repellor. Consider a point x in V and examine its preimages F - I(x), F -2 (x) .... F - "(x). Let T(x) denote the smallest value of n such that F - "(x) is not in V. We call T(x) the inverse escape time from V.Under the inverse map, all points in the region V will finally escape except for those on the unstable manifolds of the invariant sets con- Also at Department of Electrical Engineenng and Department of Physics 450 0375-9601 /90/S 03.50 © 1990 - Elsevier Science Publishers B.V. (North-Holland) Volume 147. number 8.9 PHYSICS LETEr A 30 July 1990 tained in V. This set, of course, includes the repellor of the inverse map originating from the chaotic attractor of the forward map. Points on the unstable manifolds of the invariant sets in V correspond to singular points of the inverse escape time function (T(x) =oo). (We assume that the inverse map has no attractors in V. For example, the inverse of a strictly contractive map (e.g., the HWnon map) can have no attractors.) Thus, if we start initial conditions from a hyperplane and collect all the singular points of the inverse escape time function of the map, we find the intersection of the hyperplane with the unstable manifolds of the invariant sets in V, and this typically includes the cross-section of the attractor with the hyperplane. In practice, we not not determine the singular points but rather we determine a succession of nested sets containing the singular points. We do this by computing x values for which T(.r) > N for successively la; ,er values of N. To obtain the intersection with the attractor, one should reject points which satisfy T(x) > N but do not lie approximately on the attractor. In principle, thiscan be done by calculating theofLyapunov exponents (or other ergodic quantities) F -'tfor each .r satisfyotr qthat ing T(x) >N along the orbit x, F -counting F-(-i)(x). For large N, these exponents will aproximate the negatives of the Lyapunov exponents of the forward map on the attractor, provided thatreon the attractor. If x does not x lies approximately he ttrator thn th Lypulie pprximtelyon lie approximately on the attractor, then the Lyapunov exponents for the inverse map starting from x sions agree with previous calculations which resolved only the attractor set. The dimension of the intersection set in the crosssection plane is related to the dimension of the unstable manifold set by a result of Mattila [6 ]. If the Hausdorff dimension D of a bounded fractal set lying in an N-dimensional space is greater than N- m, then a random cut by an m-dimensional hyperplane intersects the set with positive probability; if it does intersect the fractal set, the dimension d of the intersection set is related to D by D=d+ (N-rn) (m) will approximate those for another invariant set in V and will differ substantially from the exponents of the attractor. In this case the point x is rejected. It will not always be possible to apply this Lyapunov exponent test, because N must be sufficiently large to obtain reliable estimates of the Lyapunov exponents of the inverse map. Alternatively, one can omit the Lyapunov exponent test altogether. In this case, the set obtained may be larger than that for the attractor. Thus a calculation of the fractal dimension of this set yields an upper bound for the fractal dimension of the attractor. In our numerical examples, we have not applied the Lyapunov exponent test. Nonetheless, as shown below, for these examples, the method appears to yield very good approximations to the actual attractor, and the calculated dimen- the dimension takes values between 1.22 and 1.30. Fig. I shows the Hdnon attractor. It can be shown that the attractor is included in the square [ -2.0, 2.01 x I -2.0, 2.0]. This is the region V which we use for calculating the inverse escape time function. We take a horizontal one-dimensional cross-section through the point x=O, y=0 and calculate T(x) at regularly spaced intervals along this line. This is shown in fig. 2a. We see there is a natural Cantor set level structure in the inverse escape time function. At level 0, there is one interval from which it requires at least one backward iterate to escape the square; at level 1, there are two intervals from which it requires at least two backward iterates to escape the square; etc. The intersection of all these intervals is the cross-section of the Hnon attractor. Fig. 2b with probability one. Hence, by generating the crosssection of the attractor and measuring the dimension of the cross-section set, we determine the dimension of the strange attractor. To illustrate out algorithm, we first calculate onedimensional cross-sections of the Hdnon attractor. The Henon attractor is generated by the following map, x,+1 =a-x2+by., y,+ =x,. (2) At parameter values a= 1.4, b=0.3, H~non observed there exists a chaotic attractor. Numerical box techniques for the calculation of the dimension of a strange attractor were first applied by Russell et al. [4], who obtained a result for the diy rabre who fou at te tain sI wa sult was obtained by Grassberger who found that the capacity dimension is approximately 1.28 ± 0.01 [51. However, from different least squares fits of the slope, 451 I I I 30 July 1990 PHYSICS LETTERS A Volume 147, number 8,9 -4. I 7 y0 I T5 - -2 [ 3 1 -1 0 1 2 -2 0 2 0 I 2 X8 I Fig. 1.The Hnon attractor. shows the same function for the vertical cross-section through the same point x=O, y=O. To get the fractal dimension of these cross-section sets, we use the following procedure. We denote the lengths of the intervals at level i by I,. Then we form the Hausdorff sum K'(s)= t),(3) J where the sum is taken over all intervals at level i. When i tends to infinity, this sum is the Hausdorff s-dimensional measure [71. Therefore, it is infinite when s is less than the Hausdorff dimension d of the fractal set, and is zero when s is greater than d. Hence, we expect that for large i, the sums K'(s) versus s for different levels will intersect with each other at approximately the same point s=d given by the Hausdorf' dimension of the one-dimensional fractal set ". In fig. 3, we show results for the Hausdorff sums for different levels for a typical one-dimensional cut. The lines for this case have intersec'ions in the range Ithe to find sum (3) Hausdorffused of thepreviousy application numerical The fractal to study chahas been dimension 3 I and Yorke 191 guarantee otic scattering [8 ]. Results of Nusse that for hyperbolic horseshoes, an interval with succcis:ve nested increasing T(x) contains a point where T(x) = o. 452 11 T 5 1 -2 b Fig. 2.Inverse escape time function for the Henon map. (a) Hor- izontal cut through x=.0, y=0. (b) Vertical cut through x=O, y=o. d 0.24 to 0.30. Examining many different one-diand vertical cuts, we estimate mensional horizontal d to lie in the range 0.20 to 0 34. From formula (I1), the dimention of the Hdnon attractor is approxi- mately D; 1.20-1.34. The whole calculation for a Volume 147, number 8,9 PHYSICS LETTERS A 30 July 1990 cross-section plane, we collect those points from which after some chosen maximum number of it- 10.0 trates of the inverse map nmax, the point remains in -To Ki 1.0 -i= the hypercube region. Fig. 4 shows two two-dimensional cross-sections of the attractor using our algorithm (ym,,=0.5 and nm,,= 15). The pictures in fig. 4 appear to be identical to those in ref. (31. find the fractal dimension of the chaotic at1.0 2 0.8[- 0.11 0.2 0.0 0.6 0.4 0.8 0.61.1;- 0 1.0 X2 Fig. 3. The Hausdorff sum K' (s) as a function of, for different 0.4 levels ifor the one-dimensional vertical cut through x=z0.8. Y=o,. cut involved very little computer memory and took less than 5 seconds on the Cray XMP computer.:' . Our second example is the double rotor attractor 0.0 ' generated by the following four-dimensional, vol. ume-contracting map (101, 0.0 X1+l ",:=M, / (.I 2+lMt i0.2v+ M Y'n ) + (( c, /2 7 ) sin(2nx 'i+ +' lI 2~ (c2/2n)sin(2xx"2 5.8 \-6.602 M =(0.7496 M- 0.1203 a 0.2 0.4 x, 0.6 0.8 1.0 mod I 4) 4 [ Here x, x 2 take values from the unit interval [0, 1), and y, and v2 'ike values from the real line. At parameter valucs iven by M,( .",. . -6.602') - 12.40,' 0.1203) 00 Y 4 , -' . 02,-- 0.8699J' c, =0.3536, c,=0.5, Kostelich and Yorke [ 31 find that there is a chaotic attractor. Since the two x-directions of the double rotor map are compact, we choose for V the hypercube box given by max( lyil, 1Y21 ) <Y., Starting from a uniform distribution of initial points in the -0.4 0.0 __ 0.2 0.4 _ 0.6 0.8 1.0 b X2 Fig. 4. Cross-sections of the double rotor attractor. (a) Crosssectionaty;=0., y2=0. (b) Cross-sectionaty,=O,x,2 2/2 453 Volume 147, number 8,9 PHYSICS LETTERS A 30 July 1990 squares N(e) needed for the covering scales as 2 1s N(E) -E-d,. 21 The exponent d is determined by a least squares fit 22 of a straight line to a log-log plot of N(e). In fig. 5, we calculate the capacity dimension d for the crosssection sets of figs. 4a and 4b. The two values of D=d+2 determined from least squares fitting are 3.67 and 3.63. According to the estimates of ref. [3 ], the information dimension lies in the range 3.61 to N (2"r) 2' 3.68. Thus we find that the values of the capacity and 2' information dimensions (the latter must be smaller) are apparently quite close to each other. In conclusion, we have presented an efficient algorithm for calculating cross-sections of strange at- 201 4 a4 5 5 6 6 7 8 8 9 10 2* tractors. This method may be useful for the esti- mation of the fractal dimension of higher dimensional chaotic attractors. 2 tl ; - 0 We acknowledge helpful conversations with Ming- 212 N(2") 28s I zhou Ding and James Yorke. This work was supported by the Office of Naval Research (Physics), by the Department of Energy (Basic Energy Sciences) and by the Advanced Research Projects Agency. The computation was done at the National Energy Research Supercomputer Center. References 2 I (5) [II J.D. Farmer, E. Ot and J.A. Yorke, Physica D 7 (1983) 153. [21 E.N. Lorenz, Physica D 13 (1984) 90;,17 (1985) 279. [31 E. Kostelich and J.A. Yorke, Physica D 24 (1987) 263. 20 LO [41 D.A. Russell, J.D. Hanson and E. Ott, Phys. Rev. Lett. 45 3 b4 5 6 7 8 9 10 (1980) 1175. b[51 P.Grassberger, Phys. Lett. A 97 (1983) 224. (61 P. Mattila, Acta Math. 152 (1984) 77, Ann. Acad. Sc. Fenn. Fig. 5. (a) N(E)as afunctonoe for thecross-section set mnfig. A 1 (1975) 227. 4a. The least squares fit dimenson dff 1.67 _0.05. ivesacapacaty [71 K.J. Falconer, The geometry of fractal sets (Cambridge (b) The same plot for fig. 4b. The least squares fit gives Univ. Press, Cambndge, 1985). dffi1"63+0"05 [8Q. Chen, M. Ding and E. Ott, Phys. Lett. A"145 (1990) tractor, we used a box counting algorithm. We cover the resulting cross-section set with squares from a grid of edge length e. In the limit e-.0, the number of 454 154. 193 H.E. Nusse and J.A. Yorke, Physica D 36 (1989) 137. (103 C.Grebogi, E. Kostelich. E. Ott and J A. Yorke, Physica D 25 ( 1987) 347 I I I I Nonlinearity 4 (1991) 961-979. Printed in the UK Rigorous verification of trajectories for the computer simulation of dynamical systemst Tim Sauert and James A Yorke§ USDepartment of Mathematical Sciences, George Mason University, Fairfax, VA 22030. USA § Institute of Physical Science and Technology, University of Maryland, College Park. MD 20742. USA Received 30 July 1990, in final form 29 January 1991 Accepted by J D Farmer Abstract. We present a new techniq e for constructing a computer.assisted proof of the reliability of a long computer.generated trajectory of a dynamical system. Auxiliary calculations made along the noise-corrupted computer trajectory determine whether there exists a true trajectory which follows the computed trajectory closely for long times. A major application is to verify trajectories of chaotic differential equations and discrete systems. We apply the main results to computer simulations of the Hinon map and the forced damped pendulum. AMS classification scheme numbers: 58F13, 58FI5, 65G05, 65L70 I. Introduction Are numerical studies of chaotic systems reliable? More specifically, do computer trajectories 'correspond' to actual trajectories of the system under study? The answer is sometimes no. In other words, there is no guarantee that there exists a true trajectory that stays near a given computer-generated numerical trajectory. The question is especially pivotal for chaotic systems. Chaotic trajectories exhibit sensitive dependence on initial conditions: two trajectories with initial conditions that are extremely close tend to diverge exponentially from one another. At the same time, a great deal of phenomenological research on chaotic systems relies heavily on computer simulation. Therefore, the use of an ODE solver on a finite-precision computer to approximate a trajectory of a chaotic dynamical system leads to a fundamental paradox. Because of sensitive dependence on initial conditions, a small truncation or rounding error made at any step during the computation will tend to be greatly magnified by future evolution of the system. Under what conditions will the computed trajectory be close to a true trajectory of the model? Consideration of simple examples of nonlinear maps illustrate that there are critical points of trajectories where round-off error or other noise can introduce new behaviour. We discuss typical examples in section 2. At such 'glitches' the true trajectories all i 't Research supported by the Applied and Computational Mathematics Program of DARPA. 0951-7715/91/030961+19S03 50 © 1991 lOP Publishing Ltd and LMS Publishing Ltd 961 962 T Sauer and J A Yorke diverge from the numerical trajectory. In this case, there will be no true trajectory that stays near the numerical trajectory. In other cases, the numerical trajectory can be shadowed: some true trajectory remains close to the numerical trajectory. In the present work we state a result (theorem 3.3) which says that if certain quantities evaluated at points of the computer-generated trajectory, called a pseudotrajectory. are not too large, then there exists a true trajectory near the .computergenerated one. Rigorous upper bounds for these quantities can be generated by the computer as it produces the pseudo-trajectory. If these quantities satisfy the hypotheses of the theorem, which again can be rigorously checked by the computer, the result is a computer-assisted proof of'the existence of a true trajectory near the computergenerated pseudo-trajectory. For example, if the one-step errors in the pseudo-trajectory occur in the tenth decimal place, then the true trajectory that results from the theorem differs from the computer-generated trajectory in approximately the fifth decimal place. In particular. the initial point of the true trajectory can differ from the initial condition of the pseudo-trajectory at most in the fifth decimal place. A typical application of the theorem is to the forced damped pendulum y+a' +sin y- bcost. Setting the parameters a = 0.2 and b = 2.4, we prove the existence of an apparently chaotic trajectory with initial cor itions y(0) = 5(0) = 0 for time t ranging from t = 0 it0,lies within 10-' of an explicit to t = 101n. This trajectory, for all 0 4 t computer-generated (noisy) trajectory produced with a one-step error of 10-11. There are similar results for other initial conditions and other choices of a and b. To describe the theorem, we make a distinction between discrete and continuous models. Computational methods for approximating trajectories of systems of ordinary differential equations work by a series of small, discrete steps. We can therefore consider computer simulation of discrete systems and autonomous differential equations at the bame time if we define a dynamical system to be an invertible map f on R'. (We actually define dynamical system a little more generally, as a sequence of maps (f.) on R", tD also cover the non-autonomous differential equations case.) We will try to keep this distinction clear by using the word trajectory for continuous systems and orbit for discrete systems. Consider then a 6-pseudo-orbit of a discrete system f, which we can imagine having resulted from applying a one-step quadrature method with truncation error 6 to a system of differential equations on R1', m > 2. Assume that we have subspaces S. and U. at each point x. of the pseudo-orbit, which are self-consistent with tolerance 6. By this we mean that S. and U. are complementary subspaces of the tangent space R" at xn (see figure 1), that unit vectors in U. are mapped by f to within 6 of U.+ ,, and similarly for S. Define the positive number r,, to be an upper bound for the expansion rate of the linearization Df along Sn, and t,, to be an upper bound for the expansion rate of Df - I along U.. See section 3 for precise definitions. The quantities which need to be measured to assure the existence of a nearby true orbit are most easily expressed as recurrence relations. Set up a recurrence relation C, by beginning with Cc = 0, and recursively defining C, = csc 0 +r,_IC ,, where 6. is the angle between S. and U. Define Dn similarly: DN = 0, where N is the length of the pseudo-orbit, and D,, = csc 0 + tD+ , for n < N. Then as long as the quantities C. and D, are not too large for all n, there is a true orbit of f near the pseudo-orbit. More precisely: I I Rigorous verification of trajectories 963 U, 1l Sn Fiure 1. The splitting of the tangent space at the nth point of the pseudo-orbit. Theorem 3.3. Assume 6 < 1/20m2 and let B be a bound on the first and second partial derivatives of f and f -. If max{C.,D ,J mS/2 B2 ,/- for all n = O.....N. then there exists an orbit {w.} of f such that Ix. - wjI < v57 for n- 0,.... N. Note that we do not need to assume uniform contraction and expansion along the directions S. and U1. In other words, r. and t. do not need to be less than one for all n. The proof of the theorem is constructive, in the sense that it uses a procedure for refining noisy orbits originally given in [6]. The essential point of the proof is to show that under the conditions of the theorem, the iterated application of the refinement procedure, beginning with the pseudo-orbit, results in a sequence of refined pseudoorbits with decreasing noise level, and whose limit is a true orbit. In addition, the true orbit isnot too far from the original pseudo-orbit. The proof can also be considered a justification for using the refinement process computationally on the actual noisy orbit to reduce noise to near machine-precision, but that is a separate issue from the main question we are answering here. This direction is taken up in [7]. A true orbit that stays near the pseudo-orbit is said to shadow the pseudo-orbit. Several years ago, Anosov and Bowen proved shadowing results for hyperbolic maps on a differential manifold. The conclusion of Anosov [1] for a hyperbolic map says that, given any prescribed shadowing distance c (between the pseudo-orbit and true orbit) there exists a 6 > 0 so that any 6-pseudo-orbit can be £-shadowed by a true orbit. Bowen [2] showed that the same result holds if the map is required only to be hyperbolic on a basic set containing the orbit. Other proofs have been given, and one more is aconsequence of the present work. There are two factors that make the approach of Anosov and Bowen impractical for use in computer experiments. First, the 6 that isproduced can be orders of magnitude smaller than the machine epsilon of existing digital computers. Second, most interesting dynamical systems currently being studied are not hyperbolic. Theorem 3.3 does not assume that the dynamical system is hyperbolic. Our approach is to prove that as long as the system is sufficiently hyperbolic along the (finite length) numerical trajectory, then that piece of the numerical trajectory can be 964 T Sauer and J A Yorke shadowed by a true trajectory. On the other hand, when f is hyperbolic, C, and D', stay uniformly bounded for all iterates n, in whichi case arbitrarily long shadowing trajectories are constructed by the theorem for sufficiently small 6. Thus the shadowing theorem of Anosov and Bowen is a consequence of theorem 3.3, as is noted in [9]. In (5,6] a method is developed which creates computer-assisted proofs of the existence of finite length shadowing orbits on a case-by-case basis. In two dimensions, a small parallelogram is constructed near each point of the numerical orbit in such a way that there is a guarantee of a true orbit whose nth point lies in the nth parallelogram. They apply the method to one-dimensional maps and the two-dimensional Hinon and Ikeda maps, none of which are hyperbolic. These papers use auxiliary calculations in 96-bit precision to verify that there are true orbits near the pseudo-orbit, which was produced in 48-bit precision. The advantage of the present method over [6) is that the auxiliary calculations can now be done in the same precision in which the orbit was calculated. For the maps mentioned above, only 48-bit precision is needed to verify the existence of a pseudo-orbit produced in 48-bit precision. This fact is especially important when attempting to shadow differential equations. We found that the methods of [6] were not practical, at least for the differential equations we tried. For example, in orwer to produce long shadowable pseudo. trajectories for the forced damped pendulum. we needed to use a one-step error of no more than 10-19, which already requires 96-bit precision. In this case, there is no extra precision available for the auxiliary calculations of [6). Thus the new method, superior even for maps, is evidently essential for shadowing differential equations. The improvement is largely gained by sublimating the refinement process, done explicitly in a computer-aided proof in (6), into the proof of theorem 3.3. It is proved here that under the hypotheses of the theorem, the refinement process, when iterated, theoretically converges to a true trajectory. The main result of this paper was announced in (9), in a slightly less streamlined form. Other work along these lines for the one-dimensional case is reported in [3). In the next section, it is shown by example that shadowing can fail for some pseudo-trajectories. The details of the main theorem (theorem 3.3) are presented in section 3. Section 4 consists of a number of remarks relevant to the implementation of the computer algorithm based on theorem 3.3. Examples are given in section 5, and section 6 contains the proof of the main theorem. 2 Why shadowing works What makes it possible to find a true orbit near a pseudo-orbit in the presence of sensitive dependence on initial conditions? The short answer is hyperbolicity along the pseudo-orbit. Even for a non-hyperbolic dynamical system, as long as the pseudo-orbit avoids areas of phase space that lack hyperbolicity, it may be possible to find a nearby true orbit. Of course, on typical ergodic chaotic attractors, this avoidance is only done as a matter of degree. Roughly speaking, the pseudo-orbit must stay far away from non-hyperbolic areas compared with the size of the errors being made. Our method essentially relies on measuring how successful the trajectory is in staying hyperbolic. As a simple example, imagine a map which contracts distances. Assume that the distance between any two points x and y is decreased by a factor of K by the map f, where 0 < K < 1. Thus If'(x) - f"(y)l < K"Jx - yl. It follows that any pseudo-orbit I 965 Rigorous verification of trajectories can be shadowed by the true orbit beginning at its own initial condition. All distances are contracted, including errors that are made along the pseudo-orbit. To be more precise, let a 6-pseudo-orbit be denoted by {X,X If(xo) -. II< 6,and further If2 (xo) - x.1 4 1 ,.... ,xN.. Then If2(Xo) -f( 1 )I+If(x1 ) -x 21 KIf(x0 ) - x1j +If(xl) - X21 < (K + 1)6. Continuing in this way, If"(xo) - x.i < (K - I+ K1-2 +... + 1)4, and we can see that the true orbit {x0,f(x 0 ),..... N(x0 )) shadows the pseudo-orbit within 6/(l - K). Although this hyperbolic map is not sensitive to initial conditions, it is an instructive example. C-nsider next a diffieomorphism which expands distances, so that If (x) - f"(y)l > K"Ix - yI for K > I. This map is sensitive to initial conditions, yet any pseudo-orbit {xo,x1..,...xs) can easily be shadowed. The inverse of the map contracts distances, so the true orbit (f-N(xN),f-N+i(xN).... xN) will shadow the pseudo-orbit within 41(1 - I/K). A general hyperbolic dynamical system is a combination of the above tivo examples. At each point, some directions are expanding and the rest are contracting. To construct a true orbit, one needs to use information from the beginning of :he pseudo-orbit in the contracting directions and from the end of the pseudo-orbit in the expanding directions. This idea is the basis of theorem 3.3. On the other hand, not every pseudo-orbit can be shadowed. This is not a failure of any particular shadowing procedure. The simplest examples of nonlinear maps provide cases of pseudo-orbits for which there is no corresponding true orbit nearby. Consider the one-dimensional logistic map f(x) = I - 2x2, shown in figures 2 and 3. The interval I = [-I, 11 maps onto itself under f and so is an invariant set. True orbits which begin in I remain in I for all time. Figue 2. A pseudo-orbit of f(x) = 1-2x2 which cannot be shadowed. The initial condition is the dot at the origin. An error of size 6 is made in computing f(0), which causes the orbit to eventually approach -o. Now consider the 6-pseudo-orbit which begins with x0 = 0, x, = l + 4, and which from then on is computed without error. Then x, = f(xi) < -1,and the pseudo-orbit 966 T Sauer and J A Yorke I I Fitwre 3. For the map 1x) 1- .2x. an initial condition in the open interval Li"lcngth V86 around zero can be attracted to -x if an error of size 6 is made. diverges to -x. See figure 2. Clearly, there .io true orbit of the system f which shadows the pseudo-orbit by a distance of less than I. Any true orbit within I unit of .- 0 must stay witain I for all time. In this simple case, points escape from true behaviour near the critical point, or fold, of the map. Informally, we call such a divergence from legal behaviour a glitch. In general, the logistic map f(x) = a(l - x") - I will have pseudo-orbits that cannot be shadowed not only for a = 2 as above, but when the parameter a is less than and within 6 of 2. where 6 is the noise level of the process. (This corresponds to the critical value of the fold in figure 2 being between I - 6 and 1.) Thus the occurrence of a glitch is a robust phenomenon. The same phenomenon occurs in higher-dimensional chaotic dynamical systems, because of the folds caused by homoclinic tangencies and near-tangencies of stable and unstable manifolds. How often should we expect glitches? The answer should depend on the noise level 6. In the logistic map example fix) = I -2x-, there isan interval of length v,8 around for which it is possible for an error of size 6 to cause a glitch. This is illustrated ,ure 3. Any initial condition in the designated interval around 0 is susceptible to .1g mapped to a value greater than I, and therefore mapped out of 1, towards -)0. , computer-generated orbit of that type does not correspond to any true orbit. If we assume that this interval of length 03- is sampled by the dynamical systew. approximately in proportion to its length, we expect a glitch to occur on the order of every l/v'3 steps. Numerical studies in [6] on two-dimensional maps and thc studies of differential equations undertaken for this work roughly support this scaling. I I I I I 3. Shadowing theorem The theorem can be used to shadow diffeomorphisms or differential equations. To include both cases, we will consider a dynamical system to be a sequence f 0 ... fN of C2-diffeomorphisms on R" for some positive integer N. When attempting to shadow a discrete map f, we will use f, = f for all n. For a non-autonomous differential equation t = F(t, x), we would let f, be the map on phase space which takes an initial point x at time t to the point on the trajectory time at time t + hn , where h, is the current step size of the ODE solver. If we assume, for simplicity, I I I I I I 967 Rigorous vei fication of trajectories that the differential equation is being solved with a constant step size h, then h. = h for all n. In this case, the ODE solver induces a map called the time-h map of the system. In the case of an autonomous differential equation, the induced time-h map will be the same for all t.On the other hand, if the differential equation is non-autonomous, the time-h map towillinclude dependboth on the t. The following and definition of an orbit cases. of a dynamical system is made autonomous non-autonomous Definition 3.1. Let N be a positive integer, and let f, : R" -. R" be a C2diffeomorphism for each 0 4 n < N. The finite sequence {y}, n = 0,..., N of points i'. R" is called an orbit of the dynamical system {f.}, n 0,..., N - I if f,,(y.) - y.,+, !or n = 0..... N - 1.An orbit is sometimes referred to as a true orbit to contrast with the notion of pseudo.orbit. The finite sequence {x.J is called a 6-pseudo-orbit of {ff. if If,(x,) - x,,Ill <6 for n = 0..... N - I. The 6-pseudo-orbit fx,g is r.-shadowed by the orbit y of the dynamical system {fJ if Ix. -y, < c for n = 0,..... Here, as below, we use the Euclidean norm: I - 1/2 1 for a vector v = (v,... v). We also need to define the concept of moving frame from the point of view of computer simulation. Im The moving frames we will require will be numerical approximations S.and U. to the stable tangent space and the unstable tangent space at x., if they exist, and the next best thing, if they do not. Let N and k be positive integers. For each n = 0...., N -- I, let J,be a non-singular x mmatrix. For each n = 0., N let ,, v () be a set of k vectors in R", and define 4,, to be the m Y k matrix with columns Definition 3.2. The set 1,, "..... g,}N .ois 7,,. called a 6-pseudo-frame for the dynamical system {J.I if for all 0 < n < N, I., The entries of the k x k matrix ATA. - lk are no larger than 6 in absolute value; 2. J,,t....Jnv, are each within 6 of range(A,, ,). Informally, we call property I of the definition almost-orthogonality, and property 2 consistency. The usefulness of this definition for computer-assisted proofs lies in the fact that a 6-pseudo-frame consisting of machine-representable numbers can be constructed using standard computational procedures. Assume that we begin with a set of k vectors {to 1 ,. vk} in R"' which form an orthonormal set. (That is, the vectors in the set are mutually orthogonal unit vectors.) Assume further that the components of the vectors vol,.... vok are machine-representable floating point numbers. Then we use the Gram-Schmidt orthogonalization procedure on the set (J0 (vo) .. J0 (vok)}, and define (v, ..... v1k} to be the machine-stored vectors that result from this finite-precision computation. (In some cases we found thai a more stable form of orthogonalization [4] improved this step.) Continuing in this way for 0 < <n N we define a 6-pseudo-frame for a small number 6, such that each vector v,, in the frame is machine representable. T Sauer and J A Yorke 968 We can now describe the main theorem. For each 0 < n < N, let f,,' R' - R. be a C 2-diffeomorphism. Let {x,,}.o be a 6-pseudo-orbit of the dynamical system {f.). Define J,, = Df(x) to be the matrix of first partial derivatives of f. Let B, (respectively B,) be an upper bound for the absolute values of the first (respectively, second) partial derivatives of the component functions of f, and f.-; on the union of balls of radius 61/2 centred at x. for n = 0,...,.N Set B = max{2,B1,B 2}. For positive integers k+l = m,let {sj,. ,Skob (resectively,{u,,,.. . , } be a0 -pseudo-frame for {J; '} (respectively, {Jj}) such that {S. .. ,,. .. u~ } spans R" for each n. 1 Define the subspaces S, = span{s., ... SM), U, = span{u,,,.. .,u.}, and define 0 to be the angle between S,, and U. Let r, and t, be numbers satisfying IJ.yl < rjlyI IJ,':l < t,,I:l for y 1ES for z e Un+i. Define Co = 0.C = csc0,, + rnC.,- for n > 0. Similarly, define Dj = 0,Dn = escOn,+ tDnI for n < N. Theorem 3.3. Let {x,}. o0 be a 6-pseudo-orbit for the dynamical system {f,} on R m'n > 2, and assume that 6 < 1/20m . If max{C.,D.} < for all n = 0. n= 0..... N. N, then there exists an orbit {wv} of {f,,} such that Ix,, - w,, < V6 for Theorem 3.3 gives an alternative approach to Bowen's shadowing lemma (2]. Let f R'" - R'" be a C 2-dff'eomorphism. A compact invariant set A is called hyperbolic if there is a continuous splitting of the tangent space TR = El Eu for x E A, and positive constants A.< 1,C > 0 such that 1. Df (x)(E ,) = E' 2. Df(x)(Eu) = 3. IDf"(x)(v) < CA-IvI tbr v E E1 4. IDf-"(x)(v) < CA-vi for v E Eu for allx EAandforall > 0. Theorem 3.4. [2]. Assume A is a hyperbolic set for f. For every e > 0 there is a 6 > 0 so that every 6-pseudo-orbit in A can be c-shadowed. Theorem 3.4 is a direct consequence of theorem 3.3 (see [9]). 4. Computer-assisted shadowing In this section we describe a computer algorithm which uses the above theorem 3.3 to verify the existence of true orbits of a dynamical system near the pseudo-orbit determined by a numerical computation. Along with the pseudo-orbit being computed, there are some auxiliary calculations to be made to check that the hypotheses of the theorem are satisfied. Namely, it is necessary to find upper bounds for the constants B, cscO, r,,,t, and finally C,, and D,. We next describe these auxiliary calculations, which if successful provide a computer-assisted proof of the existence of a true orbit. P:oous rerio. i 5 of ir-jecniaes 969 4.1. Conurucuon of sutable and ummable fraes I... The algorithm wods best when J-pseudo-frame the 's . and . are chosen to approximately encompass the tb and unstable directions, respectively, for the dynamical system If. at the particular map f,. One way uO_of vectors to accomplish this is as follows. Begin with an orthonormal set ,. in R" chosen arbitraily. Inductively define the orthonormal set Iu, +,.. uj} to be the computed results of applying the technique of Gram-Schmidt orthogonalization. followed by normalization. to the set Df,(x,)u,..... Df .(,,ud!. Because of computer will be only approximate, which is not important these6computations round-off, v a 6-pseudo-frae It is straightforward to fin a is"=,"..",,,f" For which ,6 for which both parts of definition 3.2 are satisfied. Part I is easily checked with the computed !.--- - and depends on the residual error of the Gram-Schmidt orthogonalization. In most cases of following a trajectory of a system of ordinary differential equations, the , will be determied by part 2 of definition 3.2. which depends on the error bound of the ODE solver being used to follow the tangent vectors along the pseudo-orbit. Begin with an arbitrary € is defined analogously. The frame ,. Svk, fl.m R". Give.. . for n N. apply orthonormal set . (x3 _.)s,,k. The stored values of Gram-Schmidt to the set {Dxf;t(x _t,. the resulting computation are I .s....._, by definition. The calculation of csc O., where 0, is the angle between S. and U,. is simple if the dimension m is small, but for higher dimensions the following scheme may be helpful. Define A. to be the m x m matrix whose columns are {s,_ ... s,u,, ..... u}, and let B. = A-'. Let B. be the m x m matrix whose top k rows are the same as those of B. and whose bottom 1 rows are filled with zeros. Let B, be the m x m matrix whose top k rows are the filled with zeros and whose bottom I rows are the same as those of B,. Note that B,, = B, + B. . Now define S,, = A,,B, and U,, = .4,Bu. It is clear that S, and L,, are projections onto S,, and U,. respectively, and that S + U,, = A,,(Bs +Bu) = 1 Further. S,, and U,, are the unique m x m matrices with these properties. It is a standard fact that csc 0 = IS,[ = IU., where as usual we use the Euclidean matrix norm. This scheme provides a computationally stable method for computing a strict upper bound on csc O,,. which is necessary for bounding the C,, and D,. 4.2. Calculation of r,, and t, We have dcfined r,, to be a positive number that bounds the growth of f,, in the direction S,, at x,. That is. r,, satisfies IJ,,yJ < r,,Jyl for vectors Y in S,. Such a number is impossible to find by measuring JJ,y on a general basis of S,. This is the reason that almost-orthogonal frames are needed in the calculation. Lemma 4.1. using A = J, and W = S, shows how to find an upper bound on r,, solely using information about the action of J, on the almost-orthogonal basis of S,. Analogously, t, can be found by using lemma 4.1 with A = J;-, on the subspace U,,+,. Lemma 4.1. Let A be an m x m matrix and W a subspace of R' with basis Let W be the m x k matrix with columns .... Wk}. Then max ,Ew.1,,=i AvI << AW VI - WJVTJW - 1 ivw. 970 TSauer and J AYorke when the right-hand side exists. Lemma 4.1 is proved in section 5. 43. Calcudaion of C.and D. Computing C. and D, appears simple once cscQ0, r. and t. are known. There are two more details, however, that greatly reduce the data requirements of this task. In applications of this algorithm, it is typical for N. the number of points in the pseudoorbit- to be of the order of several million. On the other hand, we have previously suggested that the computation of the stable frame s,,.sJ' (and therefore 0.) be done by beginning with a random frame at n = N. and applying J.-' to create frames N - I.....0. To avoid the problem of storing all frames simultaneously. we iuggest buildine.... . in pieces of length N, < N. For example- we found N, = 5000 to be reasonable. The idea is to find each block of 5000 nearly-orthogonal bases by stopping after each b -k of 5000 points in the pseudo-orbit, finding the next 1000 points, and then applying J, ' 6000 times to a random starting orthogonal basis to produce stable directions, and then go on to the next block of 5000. In all cases we ha.e tried, the stable frame produced this way satisfied definition 3.2 within the prescribed 6. The second problem is deciding whether the recurrence relation D, stays within the bound of the theorem, given that it is defined beginning at the end of the trajectory. The following simple lemma shows how to verify the bound on Dn in forward time. In short, a new recurrence relation E, is defined which is computed in forward time. The lemma shows how to tell by computing E, whether D. violates a given bound. Lemma 4.2. Let D., = 0: Dn = a, + bD., be a recurrence relation for n = 0,.... N and let .4 be a real number. Define another recurrence relation E0 = A; E.+1 = min{(E. - a,)/b,A for n = 0,...,N. If E, > 0 for n = O....,N, then D.<< .4forn=. .... N. 4.4. Calculation of B The calculation of B, the upper bound on the magnitudes of the first and second partial derivatives of the f., is normally trivial if we are given the explicit map. In more interesting cases, we are following the (possibly time-dependent) flow of a system of differential equations, and need bounds on the derivatives of the time-h map for step size h.It is this map which is being approximated by the numerical ODE solver. To this end. consider the first-order system = F(t,y) where v is a vector in R"' and t denotes the independent variable. Define g(t,s,Z) to be the value of the solution with initial condition y(s) = : at time t.Then the time-h map of the differential equation which maps the value at time to to the value at time to + h is given by y). fin.h(Y) = g(to + h, to, The following lemma establishes upper bounds on the partial derivatives of the in component functions of f = f,,.h = (f . f 971 Rigorous verification of trajectories Lemma 4.3. 1. Define E, = m Then m 2. Define E. = max <e h O E I0FL. ,I2 f , -, IhE,eI 'CmI E' I. Then max 1Y- IYY '-' eIy ey C.YJkI The proof is an exercise in using the Gronwall inequality (see for example lemma 4.1 of [8]) on the first and second variational equations of the system. 4.5. Quadrature method To apply theorem 3.3 to a differential equation such as the forced damped pendulum. we need a quadrature method which has high accuracy. and which has an explicit error formula. The former is necessary to allow application of the theorem with a reasonably small 6 (and therefore a long shadowing time). The latter is necessary to assure that we have a rigorous bound on 6. The simplest method that satisfies these two criteria is the Taylor method. The formula for the one-step error is explicit, being essentially the Taylor remainder. However, the major diffict.lty with implementation of the Taylor methoes in general is that they require explicit differentiation of the right-hand side of the differential equation. Thus, applying the seventh-order Taylor method to the differential equation (I) j + a,+siny = bcost • evidently requires differentiating the differential equation five times. The formulae fill a few pages. Fortunately, there is a trick which allows application of the Taylor method as an ODE solver without doing the symbolic calculation of higher derivatives of the differential equation. We illustrate the trick in terms of equation (l).Set z sin y and z, = cos y. Then = (cosvyp= :, z,=(-sinyAy = -z ,. calculate the higher Now given of a point (y, :) inFirst phase space time t, we show derivatives ; at time t. of all, we atcan calculate z,z, how fromtothe definitions and V from equation (1). Then, for i > 1, we recursively calculate ) z,0 = 1 - I-- (11=0 yj+2) =_-ay(+1 - I ) (If . ) I ,I+ b(cos )"1 using the differential equation and the product rule of Leibniz. The higher derivatives of y at time t are therefore known, so we can apply the Taylor method of arbitrary order with no symbolic calculation beforehand. A similar trick applies to the variational equation of (1). We applied the seventh-order Taylor method to follow solutions of both the differential equation and the variational equation. The latter is necessary for calculating a rigorous 6-pseudo-frame for the computer-generated trajectory. 972 T Sauer and J A Yorke 5. Examples As a first example, consider the Hinon map f(x,y) = (a - x2 + by, x) of the plane. For parameter values a = 1.4, b - 0.3, this map has an apparently chaotic orbit. Using the method described above, a computer-generated 6-pseudo-orbit with initial condition (0,0) and 6 = 10-' 4 was found to have a true orbit within 10- 7 for over one million iterates. Similar statements apply for other initial conditions, and for other parameter values. The pseudo-orbits generated by our computer satisfied lxi < 2, IYl < 2 in every case. In this range, the magnitudes of the first partial derivatives of f = (f1,f2) and the easily-computed inverse f-I = (g, g 2 ) are bounded above by 4. The magnitudes of the second partial derivatives are bounded by 2. Therefore we used in = 2, B = 4 in the hypotheses of theorem 3.3. This map was originally shadowed in [6], and similar results were reported. In that paper, a different approach was taken, which uses 96-b:t arithmetic (machine-epsilon = 10- 28) to verify shadowing of a 6-pseudo-orbit c Icuh' in 48-bit arithmetic, i.e. with 6 = 10- 14. The method of the present paper does not require such higher precision for this map. This point becomes especially relevant when systems are studied that are inherently more difficult to shadow. Consider the forced damped pendulum, which satisfies the . .,Jerentiai equation ; +av +siny = bcost. To achieve good shadowing results for this differential equation we needed to generate a 6-pseudo-trajectory with 6 = 10-18. We accomplish this by using a seventh-order one-step quadrature method with an explicit truncation error formula, using a step size of h = ir/1000. The implementation details of the quadrature method are given in section 4.5. The fact that the quadrature error formula is explicit is critical. Without it we could not get a rigorous bound on 6. For the forced damped pendulum with parameters a = 0.2 and b = 2.4, there is an apparfntly chaotic trajectory with initial conditions y(O) = y(0) = 0. Using theorem 3.3, we proved the existence of a true trajectory within 10- 9 of the computer-generated trajectory for time t ranging from 0 to 104ir. This trajectory corresponds to 107 time steps of the ODE solver. Again, there are similar results for other initial conditions, and other values of a and b The mapN f,, used ii rem 3.3 were the time-h maps of the non-autonomous differential equation, where = t/1000. The derivation of B for the forced damped 'duluri uses lemma 4.3. Write the pendulum equation as a first-order system. Then in lemma 4 3 is F(t,y 1,y2) = (' 2, -- sin v, - av, + b cos t). It is easy to check that the first and second partial derivatives of F with respect to Yi and Y2 are bounded in absolute value by 1, so that E, = E2 = 1. Lemma 4.3 says that B = max{2,e 2 h,4he6h}. Since h = 10- 8, we use B = 2,m = 2 in the hypotheses of theorem 3.3. Note that the inverse of a time-h map is a time-'mirnus h' map, so that the same B works for f '. 973 Rigorous verification of trajectories 6. Proof of theorem The convention in this section, as in the entire paper, is that all vector and matrix norms are 12 (Euclidean) norms. The norm of an m x m matrix A is defined interms of the vector norm, as follows: AI = VERI.lrl=l max IAvJ. It follows from the definitions that JAI = V1/ai(TA, where o(B) denotes the maximum absolute value of the eigenvalues of the symmetric matrix B. Lemma 61. If A is an m x m matrix whose entries are at most 6 in absolute value, 3 3 then JAI < m6. Proof. 141 < IAIF, where l.412 =-Lemma 6.2. IfW isan m x k matrix and x = WY,then < .,/ I - w IYI V1l- IWTi 3 I 3 3I ,=, a-. See [4]. -I I 'when the right-hand side exists. Proof. 2 Iyl= V,T WT Wy + IX1 - = yT(! - wTW)y + Ix12 IyI(I - WTW)yI + Ixl2 ~< 1 - wT1WVyI 2 + 1X12. < Proof o1 lemma 4.1. Let x E W. Then x = Wy, and Ax = A WY. By lemma 6.2, max and '%I=IXEW AxI = I II V max IAWYI 2 = max yERk lyI= max 1I1 ER'.!1 =1 r 117 .4Wv2 v.(AW)TAWy =a((AW)TAW). yERIIhl=1 Let (v,, Vk}__o be a 6-pseudo-frame for the matrix J, where < 3/(4k). Let An be. the matrix with columns {v,, . vk}. For each v E range Ao there is a w E range A, such that dt, - wI < 2vA6lvl. Lemma 6.3. 6 I Proof. Let v = 'J iv- = c, vo,; that is. v = Aoc. Define w = A c ==l cU,. Then wI = Zk ci(J I °- Ch) C Ik 6<vk< V1 I 26 v 'klvl 6vAOIv - where the last line follows from lemma 6.2. 0 U I 974 T Sauer and J A Yorke The next two lemmas refer to a C 2-map f which maps a convex subset S of R to R"'.. Define B, (respectively, B,) to be an upper bound on the magnitude of all first (respectively, second) partial derivatives of all component functions of f on S. Assume that x and x + It lie in S. i IU Lemma 6.4. 1. If(x + h) - f(x)l < m /BlhI. 2. 1Df (x +t)- Df (x) I < im/"mmB,Ih 1. Prool. For a scalar function g, Ig(x + It)- g(x)l < max 211 , I/hf vimax = I OX 1111OX) Applying this to each entry of the vector f, and the matrix Df, respectively, one gets the stated estimates. i 1 Lemma 6.5. m\/'mB2IhlI2 If(x + i)- J(x) - Df(x)hl <m 2 Proof. Each component g of f satisfies Ig(x +/t) -g(v) - Df (v)hl < from which the result follows easily. mh'B 2 0 Now assume that ... Snk}n() and u,, U}n=O are 6-pseudo- frames for the dynamical system on Rm,. where k + I = m. Let B be an upper bound for the magnitude of all entries of the J,,. Let S, and U, be the subspaces spanned by the moving frames and let S,, and U,, be the projections onto the subspaces such that S,, + U, = 1. U I Lemma 6 6. I. For u ( U,, (a) IS,,J,,ul < 21ml"26 iSn+1i uI, and (b) IS,,_lJnl1uI < 2m'i" B6IS,_JIluI. 2. For s E S,, (a) IUn1Jn_1si 2mn'26JU,+IIsi, and (b) iJL +,1J sl < 2m3 '2B61U,-, 1 sIi. I 975 Rigorous verification of trajectories N isa-pseudo-frame for {J.}N-' we use the fact that {u,,,..., Proof. To prove 1, Ifwe are given u E U., there isw E U,+, such that lJu - wl < 2vJ61ul, by lemma 6.3. IS,.+ 1Jul = IS.N+Jnu - S.+ w + Sn+w 1 = ISn+iJnu - Sn+IwI <2V 6S,,+,llu!. Secondly, we use the fact that {un,, . _ ut1}N is a mB-pseudo-frame for j-}N< 2v'imBIlul. Given u E Un, there isw E U,,-.. such that JJ 1u IS._-J',ul = ISn._IJ -,u - Sn._ Iv +S,,-_Il = ISnldJn_!u - S._tw[ < Sn_1llJA-_,u - w1 I4 . 2ma/2 B61Sn_,lluJ. . Part 2 issimilar. + R" is a C2-diffeomorphism. Let Proof oj theorem 3.3. For each 0 < n < N, f.:R" -(x.)= 0 be a 6-pseudo-orbit of the dynamical system {f.}. Define J,= Df(x.). Let BI (respectively, B,) be an upper bound for the absolute value of the first (respectively, second) partial derivatives of the component functions of f.and fi on the union of balls of radius 61/2 centred at x,for n = 0,..., N. Set B = max{2,BIB,). For positive unI,. ...unt} 0 ) be a 6-pseudo-frame (respectively, {= integers k+1 = m, let . SfkunI.u} spans RI for each n. for JJ;"' (respectively, {Jn}) such that {Sn| . 0 Unt}, and define n to be the angle Snk}, Un = span{un1 . Define Sn = span s. between S,, and Un. Let S. (respectively, Un) be the (unique) projection onto S. (respectively, U.) such that Sn + U, = . Recall that ISI = IUni = csc On. Let r. and in be numbers satisfying IJhy for y E Sn rnlyl for: E Un+ IJn': < tn1': Define x0 = x, y = :v= 0 for . = 0,1,2,..., and define Y', = Sn(f(x,_-d - x,' +J.-l,,-) = U,,f (x,+1 ) - x,+ J,T-,+n) . i1+1 += V1.+ z' for n = 1... N. (2) for n =0....N - 1. (3) for n = 0,..., N. (4) The sequence {x'} =0 isthe result of applying the refinement technique i times to the Define p by 6 = m/ 2 B2. Let CO = 0, Cn = ISnI+rnICn-I original pseudo-orbit ( for n > 0. Similarly, let D. =0,Dn = IUI +tDn+ I for n < N. 976 T Sauer and J A Yorke Assume that C,, < 6P- 112 and D,, < V-1/ 2 for n = 0,..., N. Then for Lemma 6.7. n=0,..., Nandi> O: + 1/2 < 2-'6C,, 4 2-'pp+ l/2 (a) IY'I < 2-'6D, < 2-i6 (b)lz'l 2 + 1 (c) lx'+1 - X"I < 21-,6p / 2 (d) Ix', ' - x°l < 46P+11 < 40B < Proof. Statement (d) follows from (c). Statements (a), (b) and (c) are proved by double induction on i and n. If i = 0: (a) 1Yl =0. and for n > 0, ly~~l1Sl6+r._i bmll < IS,,16 +r,,_16C,,_1 < C.b. (b) I-O< D.6, by reasoning similar to (a). < 5(C, + DN) < 626P-I/z = 26 + tz.l (c) Ix"- Vol '<ly.°I + 1/2 . Now we assume that (a) holds for i - 1, and prove it for case i. We induct on n. The n = 0 case is trivial, since lyll = 0. Assume that (a) holds for the case i, n - I and prove it for i,n: xn YA -_l f,.(x' -n +J-I1) +f Y. =" = S.- = S.f.- 1(x .- 1) -f..-I(x - 1- - (x+.-I)-i -) 1 - Df-(x'-,)(X' 1 ') - x Sn-l,,-In-I where we have used the facts that S,(:z'') = 0 andI n +A - , 1(4 1 = by the definition of y' '-- We will bound the Euclidean norm of each of the four terms of the last sum separately. xj- l1 _)- A. ln-1 t S"Y ( _ -,, " - I( Df n,,- 1,x 'n-I ' - xI-11))I< -1 n- -2-B I n'3/2" m3/ 2 n-1 - 32 2 -2 4 ISI2- ' 32B,2'm / 6 p -2 -, 4< ISJI2 since m > 2, B >S2 implies that 6-2P > m5B4 = 32Bm31 2 (m/ 12B3)/32 > 32Bm3 /2. I 977 Rigorous verification of trajectories * 2. IS.(Dfn. 1(x'-1) - i+Z!:,)I -J)(Y nz-1~ J.-t)- IS"Im3/ 2B2zx-1t X Ixi-jI i-:I -< - n-1-+Zn-,1 2 < ISnIm3 B246P+/ 326B62pm IS.162' 3<ISI62 2-'6" 2 3/ 2 - '- 2 since 6-2P > 32B2m/ 2. * 3. n-1Slv 1 IS.J._i-._t[, 3 <26P-1/2 v, 62-'+16p+1/2 3 = 16624 v <2-'-6 2 26 since 6- 2P > 16v/4i. 1 4. 1J-11-- 1 IUAJ-14-11 ISA-14 < r._tlyy,_- I + 2m3/ 2B6UI2-U'4 32 > 8m B. since 6-2P < r, -tIYn-II +86 ,ly' 2 i/2 3 Pm /2B 2-o-26 26 + 2 _I Adding up the four bounds we have "1In'1 Iy.I 4< 41S,162 - ,- 2 +r I < S.1b2-' +r,_12-'6C.,_1 =C,62-'. This proves (a) for the case i. The proof of (b) is similar, except that we use descending induction on n. The n = N initial case of (b) is trivial, since I.I I = 0. Finally, (c) is a 0 simple consequence of (a) and (b). Lemma 6.7 shows that for each n, y, -- Ozn - 0, and that {x'},' is a Cauchy sequence. Therefore xn converges to some wn in R'. The sequence Nw~}' 0 is the limit of the refinement process applied to (x}_.0. Moreover, lemma 6.7 (d) implies that Iw - xI < 61/2 We will complete this section by showing that f.(w.) = wn+1 for n = 0,...,N - 1, so that the {w.} represents a true shadowing orbit of {x.}. According to equations (2) and (3), S+ 1(fn(wn) - wn d = 0 (5) T Sauer and J A Yorke 978 and U"U. "'(w ,+ ) - w) = 0 (6) for n = 0... N - 1. Furthermore, we have If"(wO) - w,+1< lfn(w n) - fn(x,)l + If (xn) - x,+Il + Ix,.+ - W,+11 < mBlw - x,1 l +6 +46 p "2 < (mB, + )46P+'/ 2 +6 /: 6 6i2 +m5/2B2 (6'/ + 4(mB+ )) 61/<< 14(4+1,) 78=o=+ 25/z222--T '' < 61/2, A similar calculation shows that if; - w I < 61/2. I Secondly, corollary to this calculation are the facts that IfI,(w ,) - ',,+,I < 6"i / and R'-IOw,.+i0 - X,, I < 6 1i 2 Thereibre J.,,(w,) and f,,'(wn+i) are within the balls around x,,,x,,, respectively, for which the lemmas 6.4-6.5 concerning growth bounds on f. apply. Lemma 6.8. The sequence {wv}' o is an orbit of the dynamical system {fj}. That is, = w,+, for n = 0,..., N - !. f,,w$) Proof Equation (6) says that U,(w. f;'(w +i)) =0. Since S.+U. = 1, w.-f;I(w,+ 1 ) belongs to the subspace S. We evaluate IS.+,(J,(w - f-'(w,+1 )))l in two ways. First, using the fact that S,, + U,,+l = I, i)1 ->S, .fC-i(wn., Iw,, > I,(w,, ';7i(w~l+i))I > - IU~+ 1Jg 1 (wn - f'(w~l+))lI w, - f,-i(w.i)I - 2m 3i 2B6tU,,+Illw,, -f f'I(w+ 1 ) 1 -- 2Bm321vn+u16) lw(mB where the last inequality uses lemma 6.6. '(w,+I)I I I 979 Rigorous verification of trajectories I On the other hand, = f=(w.) - w.+, - (f.(wn) - w.+, - Df.(w.)(w -f J.(w. -y-f'(w.t '(w.+l)) + (Jn - Df,(wn))(wn - f !(w,+l)). 3 Since S,+t(f.nw) wn.+') = 0 by - equation (5), we have IS"+ I(V . - f,-' (w,+ 0)))1 < 4m/ 2BlIS"+IIIw" - f'I(w.+1)I2 + m31 B2 IS,.+jIx - w"Iw, =m'/2B,1S,,+,IlIlw,.- f- j (w,,+1) + 46P+1/2)w,-f-Iw+) Putting the two inequalities together. we have that either w case we are done. or else -f = f.-I(w,.,. ), (w,+1) in which 1 t -~2Bm3/21U 1+15 < m3/2BiS.+1l(1lw,, _f,-(w,,+1)I +46P+1/2 M1 I8 I **< Bm3 /26 (26"/ ' ) + 6P where we use the bound 6P-1/2 on IS,+, and IU,,+,I. This inequality implies that 3 6- ' < MS/2B2 -+1 +7,OM+ 4 , ) < m 51 2 B2 I 2B2 . Therefore since m > 2,B > 2. This contradicts the assumption 6-P = mS5, 0 , N - I. +) for n =0. w, =f'(wn References (I] Anosov D V 1967 Geodesic flows and closed Riemannian manifolds with negative curvature Proc. Steklov Inst. Math. 90 121 Bowen R 1975 w-limit sets for Axiom A ditffeomorphisms J. Diff. Eq. 18 333-9 131 Chow S-N and Palmer K 1990 On the numerical computation of orbits of dynamical systems: the one-dimensional case Preprint (41 Golub G and Van Loan C 1989 Matrix Computations 2nd edn (Baltimore. MD The Johns Hopkins (51 (6 [71 University Press) Grebogi C, Hammel S and Yorke J 1987 Do numerical orbits of chaotic dynamical processes represent true orbits? J. Complexity 3 (1987) 136-45 Grebogi C, Hammel S and Yorke J 1988 Numerical orbits of chaotic processes represent true orbits Bull. Am. Math. Soc. 19 465-70 Hammel S 1990 A noise reduction method for chaotic systems Phys Lett. 148A 421-8 Hartman P 1964 Ordinary Differential Equations (New York. Wiley) Sauer T and Yorke J 1990 Shadowing trajectories in dynamical systems Computer Aided Proofs in Analysis ed K Meyer and D Schmidt (Berlin: Spnnger) pp 229-34 [8] [91 I Ergod. Th. & Dynam. Sys. (1991), 11, 189-208 Printed in Great Britain 1 * 3 Analysis of a procedure for finding numerical trajectories close to chaotic saddle hyperbolic setst HELENA E. NUSSEt* AND JAMES A. YORKE*§ University of Mavland, College Park Maryiand 20742, USA I (Received I February 1989 and revised October 1989) Abstract. In dynamical systems examples are common in waich there are regions containing chaotic sets that are not attractors, e.g. systems with horseshoes have such regions. In such dynamical systems one will observe chaotic transients. An important problem isthe 'Dynamical Restraint Problem': given a region that contains a chaotic set but contains no attractor, find a chaotic trajectory numerically that remains in the region for an arbitrarily long period of time. We present two procedures ('PIM triple procedures') for finding trajectories which stay extremely close to such chaotic sets for arbitrarily long periods of time. 1. Introduction Studying dynamical systems, one often observes transient chaotic behaviour, apparently due to the presence of horseshoes. For example, for suitably chosen parameter values, the H6non map has an attracting period orbit with period 5 and also a non-attracting chaotic set, and one observes that the duration of the transient chaotic behaviour of many trajectories is rather short before they settle down on the period 5 attractor. Other famous examples with chaotic transients are: the Hinon map for large parameter values where almost all trajectories go to infinity and there is a bounded non-attracting invariant set; the forced damped pendulum; and the Lorenz equations for values of the Rayleigh number below the standard values that have a chaotic attractor. Transient chaos is also present whenever there is a fractal 3 boundary separating the basins of two or more attractors. Let M be a smooth n-dimensional manifold without boundary, and let F be a C3.diffeomorphism from M to itself. We denote by p the distance function on M. t Research in pan supported by AFOSR, by DARPA under the Applied & Computational Mathematics Program, and the Netherlands Organization for the Advancement of Pure Research (N.W.O.). t Institute for Physical Science and Technology, University of Maryland. * Rijksuniversiteat Groningen, Fac. Economische Wetenschappen, WSN.gebouw, Postbus 800, NL-9700 AV Groningen, The Netherlands. § Department of Mathematics, University of Maryland. 190 H. E. Nusse and J. A. Yorke A region R isan open and bounded set in M. We say a region R is a transientregion if it contains no attractor. We will be studying these regions in cases where the trajectory through almost every initial point eventually leaves the region. We investigate special trajectories that remain in such a transient region for all positive time. For example, the horseshoe is usually pictured mapping a rectangle to a horseshoe shape; the rectangle is a transient region. The great majority of the trajectories of the horseshoe map will leave the region after a few iterates. We are looking for numerical procedures for finding chaotic trajectories that stay in the transient region as long as we wish to compute them for t - 0. The main problem that we would like to address is: The dynamic restraint problem. Find a (nonperiodic) orbit numerically that remains in a specified transient region for an arbitrarily long period of time. The above problem explicitly concerns numerical (i.e. computer) procedures of finite precision. It leads to the following problem where it is assumed all computation. can be made exactly. The static restraint problem. Find an initial point whose orbit stays in a specified transient region for an arbitrarily long period of time. We will establish a procedure (the PIM triple procedure) for finding points whose orbits will stay in specified regions in M for dynamical systems in ideal cases that are uniformly saddle-hyperbolic systems. The unstable manifold of each nonwandering point in the transient region is assumed to be one dimensional. Let R be a transient region for F. The stable set S(R) of F is (xE R: F'(x)E R for n =0, 1,2.... }; the unstable set U(R) of F is tx rR: F-"(x)E R for n = 0, 1,2.... ). The set of points x for which F"(.) is in R for all integers n is called the invariant set lnv (R) of F in R, that is, lnv (R)= S(R) n U(R). A component of S(R) (resp., U(R)), which contains a point of lnv (R) is called a stable (resp., unstable) segment. We call Inv (R) a chaotic saddle when it includes a Cantor set. We assume that for the transient region ' the set Inv (R) is nonempty. We will refer to R\S(R), the complement of the stable set S(R) in the transient region R, as the transient set. We will say that a point p in S(R) is accessible from the transient set R\S(R) if there is a continuous curve K ending at p so that K\(p) is the transient set R\S(R). For uses in dynamics, see [GOY] and (AY]. We wouiu like to address the following problem: Accessible static restraint problem. Given a segment J that intersects the stable set S(R) transversally, describe a procedure for finding a point (in J n S(R)) which is ible (from R\S(R)). ac %c will establish a procedure (the Accessible PIM triple procedure) for finding such accessible points in M for the same class of dynamical systems as above. Both our procedures are based on our presumed ability to specify an initial point p and compute the time TR(p) its trajectory takes to escape from R. In the PIM (Proper Interior Maximum) triple procedure, we seek out triples of points a, c, and b on a curve segment with c the 'interior' point, that is, c is between a and b.The I Chaotic saddle hyperbolic sets 191 triples are selected with an 'interior maximum' of the escape time, which means TR(c) > TR(a) and TR(c) > TR(b). We then look for new triples that lie in the a, b segment but are closer together and so are 'proper'. The most challenging cases are those in which the average escape tin. is short so that the transient trajectories of typical points in R do not come close to the unstable chaotic set. The organisation of the paper is as follows. In § 2 we present the PIM triple procedure and the Accessible PIM triple procedure; the main results for the validity of these procedures for hyperbolic systems are stated precisely in § 3. § 4 is devoted to the proofs of the results in § 3. In § 5, we will discuss the associated numerical procedures (including the shadowing of the numerical orbits by real orbits of the dynamical system). Finally in § 6, we will explain why the PIM triple methods also can be used for basin boundaries, we will describe how the results carry over to higher dimensional systems; and we also will argue that it is sufficient to assume that F isof class C2. 1 3 2. The procedures Let the manifold M, the diffeomorphism F, and the transient region R be as in the introduction. The escape rime TR(x) of a point x in M for R is defined by Ts, W f=min (n =0: F"(x)z R} T lo ifF"(x)ER for alln=0. For the example of the horseshoe map, the escape time function T, has the following properties: (I) TR(x) = cofor x on a Cantor set of stable segments; (2) if a, c, and b are three points on a segment L of an unstable segment J so that: (i)c is between a and b and (ii) TR(c)> max {TR(a), TR(b)}, then the segment [a, b] c J from a to b intersects the stable set S(R. These properties pla) a crucial role in the PIM triple procedures, and lead to the following definitions. Let J be an unstable segment in R. Then J is homeomorphic to an interval, and we may assume it has the ordering of an interval. The notation (a, c, b) for a triple means that a, c, and b lie on J and c is between a and b. Let L c J bt a 'segment', that is, a connected subset of J. Assume L intersects the stable set S(R) transversally, and let (a, c, b) be a triple on L. Since L is homeomorphic to an interval it has an ordering. We assume that the ordering on J (and hence on L) is such that a < c < b; and for points x and *y in J we write [x,y]s for the segment on J joining x and y. The triple (a, c, b) is called an Interior Maximum triple if TR(c)> max {TR(a), TR(b)}; and (a, c,b) is called a Proper Interior Maximum (PIM) triple on L, if (a, c,b) is an Interior Maximum triple and at least one of the points a and b is in t,,e interior of L. For each e > 0, an e-refinement of {a, b} (w.r.t. J) is a finite set of points a = go< gl< ...< gN= b in [a, b]s such that (E12). p(Ca, b]s) <-p([gk, gk,]S E p([a,b]s) H. E Nusse and . A Yorke 192 for all k 0 s< k <- N- 1. and an E-refinemem of (a. c. b) is an e-refinement of fa, b} as above so that c=g, for some k. I -k!-N-I The outline of the PIM triple procedure is the following. Let R be an appropriately chosen transient region for F and let L be a segment on an unstable segmeni J (intersectinf the stable set transversally). Let E>0 be sufficiently small. Given a PIM triple (a., c., b,) in L. starting with n =0. choose some -refinement P. of the triple (a.. c., b,) in [a.. bIi, select any three not necessarily consecutive points from P. which constitute a new PIM triple (a,.,, c,.,, b..,) on [a.. b.],. The new triple must be "proper; proper here means [a.,. b,., ], is a proper subset of [a,., b. ],. The condition guaranteeing the existence of such a PIM triple when E is sufficiently smail. will be described in § 3. Note that. according the definition of PIM triple. p([a..,. b...,b) is (I -0.5e)p([a.. b.], ) Thus the nested sequence of the intervals {[a,, b.].)... converges to a point which we will call a PIM limit poain. The E above can be chosen small enough that it is independent of n. We will show that under reasonable conditions the orbit of the PIM limit point stays in the transient region X. The choice of the PIM triple i3 typically not unique and different choices will result in different PIM limit poin's. This 'static' problem's solution is not directly implementable on a computer because computations are made with finite precision. but it lead to a practical solution of the dyna.;,ic restraint problem as discussed in § 5. The idea of the Accessible PIM triple procedure is like the PIM triple procedure except that the PIM triples (a,, c,. b,) are selected more precisely so that [a., a,.,]s does not intersect the stable set S(R) for all n =-N for some NE N. The difficulty here is that we o-ily compute the escape times of the grid points and yet we must be sure that [a,, a,,,], contains no points of S( R). We must guarantee the procedure will succeed if E> 0 is small enough, where e is fixed, depending only on the diffeomorphism and region. Our objective is to describe the Accessible PIM triple procedure that selects in a unique way a nested sequence of PIM triple intervals on J which leads to an accessible point in S(R) on J. The accessible point p in J S(R) that we will find, will be accessible using the curve [r, p]s for some r in J,so we say p will be 'accessed from the left', that is from the side containing r. We could alternatively have chosen to approach from the right and we would expect to find a different point. Given an e/3-refinement P. = {x,: 0- i s N(e)} on J of a PIM triple (a,, c., b.) in J with a. = xo< x, <-. <xN,) = b,. Assuming that P. includes a PIM triple. c,,, b, ) in P, in the following way: then we choose the next PIM triple (a,, () Select h,+, to be the leftmost point in P. such that it is the right point of a PIM triple in P.: (2) Select c,, to be the adjacent point to the left of b,, in P.; (3) The systematic choice of a,+, in P. is the following: Let M. be the minimum value of {TR(x,): x, E P,,x, < c,, ).We write: a +, is the rightmost point of {x, E P.: x, < c,,, TR(x,) = Mn}; a,+, is the adjacent point to the right of ao,. in P,; I I 3 Chaotic saddle hvperbolic sets 193 3Case a' ., is the rightmost point of Ix,E P.: x-c.-,, TR(Y)= T(a;.,)}. ii). If either M. < T,(a.) or M. > min { TR(x,): x, e P.), then choose a.., = oa.., ; otherwise, Case (ii). If M. = TR(a.) and P. is not an c-refinement of (a., c..,, b..,), then choose a.., = a.; otherwise, Case (iii). If M.= TR,.a.) and P. is an E-refinement of (a., c..,. b..,), and if a., > a. or a.., = c,.,, then choose a,,.. = a,; otherwise, 3 Case liv). If M., TR(a.) and P. is an c-refinement of (a., c..,, be.,), and if a..,=a. and a', < c..,, then choose a,,.. = a,., Repeatedly applying the Accessible PIM triple procedure leads to an accessible point on S(R). To understand rule 3. notice that rules I and 2 imply that the graph of TR is rather simple on P. n [a., c.., Jnamely., TR is monotonic increasing on P. between ao., and c.,,, and TR is non-increasing on P. between a. and a.., .These properties follow from the fact that b.,, was chosen as far left as possible. We will show that after the first few iterates TR(a.) =min { T(x,): x,E P}. 3. Results In § 2 we presented the idea of the procedure for finding a point whose orbit stays in the transient region. In this description, we assumed that there exists an E> 0 for which every E-refinement of a PIM triple includes a new PIM triple. Furthermore, the associated curve segment from a.., to b,,, has a length at most (I - L/2) times the length of the previous one (from a. to b). We will justify these concepts. Let the manifold M and diffeomorphism F be as in the introduction. A subset A of M is hyperbolic if it is closed and F-invariant and the tangent bundle TM splits into dF-invariant subbundles E' and E" on which dF is uniformly contracting and uniformly expanding respectively. A hyperbolic set A,is called a saddle-hyperbolic set if dim F' - 1 and dim E" - I. We will call a region R a saddle-hyperbolic transient region if R satisfies all the following conditions: (AI) R is a transient region; (A2) Hyperbolicity property: nv (R) is a nonempty saddle-hyperbolic set; (A3) Boundar, property: U(R)cn8R is mapped outside the closure of R; (A4) Intersection property': each nontrivial component y of U(R) is an unstable segment, that is, y intersects Inv (R); note that such a segment y must intersect S(R) transversally. We assume throughout that dim E" = I. For the sake of simplicity, we Issume that n = 2; the more difficult case n a 3 will be discussed in § 6. For a saddle-hyperbolic transient region R and E>0, the properties (A1) and (A2) imply that the escape time of almos' every point on an unstable segment is finite. (A result due to Bowen and Ruelle [BR] shows that S(R) has Lebesgue measure zero.) Hence, one may assume that such a refinement does not intersect the stable set S(R). I 194 H. E Nusse and J. A. Yorke If R is a saddle-hyperbolic transient region, then the escape time map T restricted to an unstable segment J c U(R) has the following two properties, which follow from Proposition 1 and the T-Jump Lemma below. (i) All the points in a chosen segment [a, b]., on J will escape from R if and only if no e-refinement of {a. b} includes a PIM triple; (ii) TR is locally constant on an open subset of full measure of J. and if TR(x) < 00 and x is a point of discontinuity of Tl,, then liminfTR(y)=TR(x) and limsup TR(y)= TR(x)+l. Application. The objective of the paper is to present procedures which enable us to obtain numerical trajectories lying on chaotic saddles, and to justify that these procedures work in ideal cases. The examples of interest will rarely satisfy all our hypotheses, and yet we observe that frequently we can successfully use the procedures to obtain pictures of Inv (R) by plotting the numerical trajectory. Consider the following example. Let t..- difteomorphism F acting on the plane be given by F(x, y) = (A-x 2 + M. Jx). It is well known that the map F is equivalent under a linear change of variables with the Hinon map. We choose the parameter values M = 0.3, and A = 3 in figure I(a), A =4.2 in figure I(b) (and figure 2) and A =2.0 in figure 1(c). Then a result due to Devaney and Nitecki [DN] implies that B = {(x, y): -3 < x < 3, -3 < y < 3} includes the nonwandering set of F, so we select B for the transient region. When A = 4.2, the nonwandering set is a uniformly hyperbolic chaotic saddle. We start the numerical procedure with the horizontal line segment with y = I extending from the left side of B to the right side. By using the PIM triple procedure' we obtain a numerical trajectory consisting of tiny intervals. The result is presented in figure 1. When A = 4.2, the region B is a saddle-hyperbolic transient region: the results due to Devaney and Nitecki [DN] imply that B satisfies the conditions AI)-(A4). When A = 3 we do not know if condition (A2) will hold, and for A = 2.0 we have a non-fully developed horseshoe. In figure 2 we present the sets U(B) and S( B) for A = 4.2 (the chaotic saddle is the intersection S(B) n U(B)), and the accessible fixed point on the chaotic saddle is indicated by an arrow. The reader is referred to [NY] for other applications such as the Lorenz equations, the pulsed rotor map, and the forced pendulum equation. Rather than state one or two theorems the results seem best stated a progression of ideas: (1) PIM triples exist, (2) refinement of PIM triples incluac PIM triples, and (3) the resulting sequence of PIM triples conveige to a desirable point. The special case of accessible PIM triple sequences must be discussed separately. From now on, we will assume that R is a saddle-hyperbolic transient region for F with dim E" = 1, and that J c U(R) denotes an unstable segment. That implies that both ends of J are in the boundary of the transient region R. We know by the Intersection assumption that J intersects the stable set S(R). Clearly, this property Mw 195 Chaotic saddle hyperbolic sets .1 5 , • -- (a) z (a) 1 1 *.'. It , F [ ... (b) FIGURE 1. (a) Numencal trajectory obtained by the PIM tnple procedure for the Hinon map in the transient region -3 < x, y < 3, and parameter values A = 3, M = 0.3 (b) Numencal trajectory obtained procedure for the Henon map in the transient region -3 < x, y < 3, and parameter by the PIM triple values A =4 2, M =0.3. (c) Numencal trajectory obtained by the PIM triple procedure for the HWnon map in the transient region -3 < x, y < 3, and paiameter values A = 2 0, M = 0 3. H. E. Nusse and J. A. Yorke 196 [" I FIGURE i-continued. will not hold for each subsegment L of J. since J n~S(R) is nowhere dense in J andI one can choose the segment L lying entirely in the complement of J n S(R). Our first restilt 'PIM Existence Proposition* characterizes the segments intersecting the stable set S(R).I PROPOSITION 1, (PIM existence.) Let L statements are eq~uivalent: = [a, b]j be a segment in 1. The following f" (i) there exists E > 0 such that everv r-refinement of (a, b} includes a PIMI triple;I (fi) L contains a point of Inv (R) in its interior. In the PIM Existence Proposition the segment L can be chosen so that it intersects " R) only at points extremely close to one of the end points of L and so E must1 extremely small, so e depends on the choice of L. However, the PIM Refinement Proposition, stated below, shows that a single e (depending on F and R) can be used, once we have found our first P1 M triple. One might expect that our assumptionsI of uniform hyperbolicity would imply that the uniformity of E would be an easy corollary. In fact, the existence of an E for each PIM triple is much easier than E can be chosen uniformly, and this uniformity is essential for the PIM triple procedures. In principle it can be difficult to find the first PIM triple if the initial interval L is chosen badly. PROPOSITION 2. (PIM refinement.) There exists E> 0 (depending on F and R) such1 I Chaotic saddle hyperbolic sets 197 FIGURE 2. The stable and the unstable manifold for the fixed point at approx. (I 729, 1 729) for the Hinon map in the transient region -3 < x, ' < 3, and parameter values A = 4.2, M = 0.3 The accessible fixed point on the chaotic saddle is indicated by an arrow. that there is a PIM triple in each e-refinement in J of each Interior Maximum triple in J.for every unstable segment J c U( R). The next result deals with the convergence of the sequence of nested PIM triple segments [a,+,, b,+,]j c [an, b.]j on J, in other words, the PIM triple procedure is valid. A sequence of PIM triples {(a., cn, b.)}.n 0 on J is called a PIM triple sequence if (a, , c. , b.,,) is in an e-refinement of the Interior Maximum triple (a., C,,, bn) for all n. We say {(an, cn, b.)}..-, is the accessible PIM triple sequence if (an, C., b.) is selected using the Accessible PIM triple procedure for all n. 3. (PIM convergence.) Let E > 0 be as in Proposition 2. Every sequence of nested segments {[an,, .]j},,, that is associated with the PIM triple sequence {(a., c,, bn)}.n,, on J, converges to a point on S(R). PROPOSITION The next result is the key in proving that the 'Accessible PIM triple procedure' is valid. PROPOSITION 4 (Accessible PIM Refinement.) Let E > 0 be as in Proposition 2. Let {(an, cn, bn)) .0 be an Accessible PIM triple sequence on J. Then there exists integer N >-0 such that [an, an,,+]j does not intersect S(R) for every n : N. Recall that a nested sequence of PIM triple intervals obtained from e-refinements will converge to a PIM limit point on S(R). Note that the PIM limit point of the PIM triple intervals associated with PIM triples in Proposition 4 is an accessible 198 H. E. Nusse and J. A. Yorke point on S(R). The next result implies that the Accessible PIM triple procedure is valid. 5. (Accessible PIM convergence.) Let e > 0 be as in Proposition 2. If the PIM triple sequence {(a., c., b,)} . 0 in Proposition 3 is an accessible PIM triple sequence, then the sequence of nested segments {[a,. b.]j},o on J, converges to an accessible point on S(R). PROPOSITION I 4. Proofs 4.1. Preliminaries We assume that R is a saddle-hyperbolic region for the diffeomorphism F By Smale's 'Spectral Decomposition Theorem' [S] we know that we can decompose the nonwandering set fi into a finite collection of disjoint closed invariant subsets and on each of these subsets F has a dense orbit; these maximal invariant subsets of 0i appearing in the decomposition are called the basic sets (see e.g. [Ni] and [GH] for the definitions and several properties of uniformly hyperbolic systems). From now on, let r denote basic set of F From the definition of lnv (R) it follows immediately that either ra Inv (R) or r Inv (R) = 0. This implies that we can decompose lnv (R) into finitely many basic sets. Note that r n lnv (R) = 0' does not imply 'n R = 0', and r n R #0 ' does not imply 'rnlnv (R)* 0'. Recall that for Z El the stable manifold WS(=) (resp. unstable manifold W"(z)) of z is the set of points x forwhich p(F"(z), F"(x)) - 0 (resp. p(F"(z), F-"(x)) - 0) as n-oX; the local stable manifold Wlo,(z) (resp. the local unstable manifold Wl'o,(=)) of z of size)3 is the set of points x in W(z) (resp. W(z)) so that p(F"(z), F"(x)) -3 (resp. p(F".z), F-"(x)) - 1) for all integers n a 0, where 0 > 0. When the stable or unstable manifold is a curve, we writes orWl(z) and W;oc(z) for the u. where a is either two components of W1o0(z)\{z), We will call r a trivial basic set if r consists of one periodic orbit, and we call r a nontrivial basic set if r includes more than one periodic orbit. Assume that r is nontrivial, we call U periodic if there exists m EN such that F'"has no dense orbit on r, and we call r nonperiodic if it is not periodic. The following results 4.1, 4.2, and 4.4 are reformulated from [NP] and [PT]. 4.1. There existsfinite sets P, P', and P" of periodicpoints, P = P' U P", such that for all x lnv (R): (1) If x is not a limit point of both W" 3 (x) n!Q and W"c(x) n fl, then x is in Ws(p) p E P". for some (2) If x is not a limit point of both W o(x) n f1 and W j-(x) rn fl, then x E W"(p) for some p E P'. PROPOSITION Proof For a proof, see Newhouse and Pali. [NP]. 4.2. Let P' and P" be as in Proposition 4.1. Let U be a nontrivial nonperiodic basic set in Inv (R). Then there exist finitely many disjoint regions R, being diffeomorphic images of the square B = [-l, lx [-1, 1], say R,= g,(B), 1 - i:5 PROPOSITION Nfor some NEN, such that: (1) rUnR, 0 0 for all i; (2) rc U , R.; (3) F(05R,)c = g,({(x, y): Ix= l,-l-y1})resp. U, 1 a,RandF-'(aR,)cU.R,.1 aRwhere aR, I I Chaotic saddle hyperbolic sets = 8=R,=g,({(x,y): -1- x-5 1, yI the unstable set W"(P'). 199 1}) are segments in the stable set W 5(P") resp. Proof For a proof, see Palis and Takens [PT]. Remark. The intersection of r with the union of the regions in Proposition 4.2 is a Markov partition for F, see Bowen [B] for the notion of Markov partition. PROPOSrTON 4.3. Let P" be as in Proposition 4.1. Then we have x E S(R) is accessible ifand only if x e W'(p) for some p E P'. Proof Apply the Propositions 4.1 and 4.2. From now on, let z e F anv (R) be fixed, and let I"c W"(z) be a segment such that I" crosses each region Rk at least once, where Rk, 1 - k -sN, is as in Proposition 4.2. Palis and Takens [PT] have shown that there exist finitely many disjoint regions denoted RM(IU) in UN., R, that have the same properties as the R,'s such that I' crosses each R(I") exactly once, 1 t<-jsN(Iu), for some N(Iu) EN. Therefore, we will assume that I' crosses each region R, from Proposition 4.2 precisely once. There exist a CI' a stable foliation l' on a neighborhood U, of F for some a > 0, and it is no restriction to assume that every region R, is contained in U1., 1: i:5 N; see [PT]. Let 7:R - Wu(z) be a C' parametrization, and define a projection 7r:F-* U.NI R,r I' by taking in each region R,, 1 si-5 N, the projection along the local stable manifolds into the intersection I" with that region. This projection can be extended from r to the union of the regions R,, by projecting along the leaves of the foliation ?'.This extension will also be denoted by 7r. We obtain (see [PT]) the following result that says that for some iterate Al, the map F can be viewed as expansive along unstable segments. PROPOSITION 4.4. There exist a positive integer M and a C' ' map ,P:U ' . T _(JI R,) -R defined by (x) = T-- o oFo(x) such that Iq'(x)l > 1,for some a >0. From now on, let I,,., N be N disjoint compact intervals on the real line, and we write Y=.)JN- I, Let f: Y- R be a C ~'map, for some a>0, with the following properties: (1) f is C'*+ on an open neighborhood U of Y; (2) Y" Interior(f(Y)); (3) there exists numbers A,> 1 such that If'(x)I -A, for every x E1, 1-j- N; (4) the transition matrix A ,. is primitive, that is, there is an integer Q>0 so that all the entries of (A y.j)Q are positive; where A¢y. is defined by A,, I(j, m)---I if f(lj) nI 0, and Ay.j(j, m) = 0 if f(Il) n , = 0, 1 75j, m - N. Note that condition (2) implies that either 1,rf(Im) = 0 or 1,n Int (f(Im)) = 1, for all l -j,m - N. The escape time Ty(x) of x E Y underf is the minimum value n with the property f (x) Isnot in Y We define for every integer k -1: Ak={XE Y: Ty(x)>-k} Dk = xE Y: T,(x)=k}. H. E. Nusse and J. A. Yorke 200 In particular, A, = Y. Hence, for each integer k a 1 we have Ak+, is the set of points in Ak whose escape time from Y is at least k; hence Ak.1 is the set of points in Y that stay in Y under f k. The points in Y which will stay in Y under all iterates will be denoted by A ={x E Y: Ty(x) = o} For every integer k a I we have: (a) Ak = Ak+, u Dk ; (b) Y=Ak+,Ul k., D, that is, Y is the union of the set of points Ak+, whose escape time from Y is at least k + 1, and the set of points D, whose escape time from Y is j, where Il--j - k. Denote the length of an interval L by LI. 8 GEOMETRIC LEMMA 1.There exists f > 0 such thatj r every integer k : 1,the following holds: (i) Every component of Ak contains components of Dk and of Ak+,; (i For each component D of Dk n A, one has IDI/IAI - 3f, and each component U of Ak+, - A, satisfies IUI/IAI 8, with A an arbitrarily chosen component of Ak. Proof of the Geometric Lemma L For each integer i 1, we write R, for the sum of the entries on the ith row of Ayj, 1:- i:5 N. Assumption (4) implies R, is at least I for all i, and the sum of the R,'s is greater than N. Proof of (i). Let k a I be a given integer. First, we assume k = 1. Let L be a given component of At = Y, say L = I, for some j, I sj s N., The assumptions (1)-(4) imply f(L) contains R,+1 components of D1. Since L = {x E L: Tv(x) a 2}u {xE L: Ty(x) = 1), we have that L contains R, components of A,. Now we assume k > 1. Let A be a given component of Ak. By the definition of Ak and the assumptions on J, we havej'-'(A) is a component of A,, sayJ'-'(A) = 1, for some j, I !-j - N. Therefore, A contains R, + I components of Dk and R, components of Ak+. Proof of (ii). We are looking for 8 > 0 such that for each integer k - 1 and for each component A in Ak, we have A(D)/A(A) - 35,and A(U)/IA (A):>- 6f, for each component D of Dk o A, and each component U of Ak,+ n A. From (i) and the assumptions on f we obtain that for each k 2 1, the number of components of Ak and that of Dk is finite. Let, for each integer k !1, N(Ak) be the number of components of Ak, and let N(Dk) be the number of components of Dk. We write, tor each k - 1, the sets Ak and Dk as the union of their components in the following way: N (A,) Ak =U N(V~ A,,, Dk = Dk ,, For each k - I and each component A in Ak, we define IVI/IAI, Sk(A) = min V where the minimum is taken over all components V of the sets = m i k(A), we define A Dk and Ak+,; and Chaotic saddle hyperbolic sets 201 where the minimum is taken over all components A of the set Ak. We are done if there exists 8 f > 0 so that Sk a 8f for all k Let k > I be a given integer. Let A be a given component of Ak, and let D be an arbitrary component of either Dk or A, such that A includes D. From the foregoing, we can fix an integer n(k), I :s n(k)<- N(Ak), such that A = Ak.wk,, and an integer re(k), 1 <-mik) 5 N(Dk) if D is a component of Elk, and 1!-5 r(k)!E- N(Ak ,) if D is a component of Ak~,, such that D= Set for each integer i, 2si:- k: Dk.,,,k. Applying the mean value theorem, we tan find for every integer i, 2 ! i s k. real numbers a, in A,.,, and d, in D,,,,, such that Ij'(a,)l .A, ,,, ' iA,-1,11-1 J, If (d,)i IO....... = ID_1,,,11I. This leads to: ID,. ,A,I/IAA,.i,,,I= (iI If a,)/J'(ld,)I)' (ID,.m,,,/IA,.e.,,. (I) From now on, we can mimick the proof of Lemma 5.5 in [Nul, and we obtain: lim H If(a,)iJf(d,)I>0. (2) The results (1) and (2) imply that there exists y .0 such that ID,.,,I/fAnIAklk V. Therefore, , :- y for each k 2. We conclude: IDI/IAI >-8, for every component A in Ak, for every component D with Dc A, where D is either a component of Dk or D is a component of Ak,, for 6, = min {5, y}. This completes the proof of the Geometric Lemma I. 4.2. Prools of the PIM propositions Let J c U(R) denote an unstable segment. Recall that both end points are on the boundary of the transient region R, and that J intersects the stable set S(R). We define for every integer k : 1: Ak(J) ={xEJ: TR(x) -k} Dk(J) ={xEJ: TR(x) = k}. In particular, .4,(J) = J. Hence, for each integer k 2t I we have Ak.,(J) is the set of points in A(J) whose escape time from R is at least k+ 1; hence, Ak.,(J) is the set of points in J that stay in R under F'. The points in J which will stay in R under all iterates will be denoted by A,-(J). For every integer k a I we have: A (a) A(J)=AkA+(J)UDk(J) (b) J=AkA,(J)u U D,(J), that is, J is the union of the set of points Ak,,(J) the escape time of which from R is at least k + 1, and the set of points D,(J) the escape time of which from R is j, where lj:5. We write D,(J)=U t, Dk(J). Note that A (J)=n'qk.oAk(J), and J = A-,(J) u D(J). In the lemma below we will state that, if the value of the escape time map TR changes then it changes in steps of 1. I 202 H. E. Nusse and J. A. Yorke For every x = DA(J) there exists an e >0 such that for each y E J with p([x, y]j)<e one has ITR(x)- Tr(y)1:5 1. T-JUMP LEMMA. Proof of the T-Jump Lemma. Let xE D,(J) be given. We will write D '(J)= U'., Int(Dk(J)), where Int(Dk(J)) means the interior of Dk(J) for each k-2 1. First, consider the case where x e D'(J). Then, by the definitions, TR is constant on the component of D~'(J) including x. Consequently, there exists an E > 0 so that TR(y)= TR(x) for all Y in J with p([Xy]j)< F. Now we consider the case where x E D,(J)\D,'(J).Let M a 0 be the integer for which FM(x)E Bndy (Rl, where Bndy (R) means the boundary of R. From the fact that each point in Bndy ( R) is mapped outside R it follows that TR(x) = N + I. We obtain that there exists E > 0 so that for each y E J with p([x, y].j) < - either TR(y) = M or TI(y) = A + 1. We conclude: there exists e > 0 so that for every y E J with p([x, yvs) < e either TR(x) = TR(y) or ITR(x) - TR(y)j = I. This completes the proof of the T-Jump Lemma. Denote the length of a segment L c J by p(L). GEOMETRIC LEMMA II. There exists 8 > 0 such that Jor ever, J in U(R), and .for each integer k - I, the following holds: (1) Each component of Ak(J) contains components of Dd(J) and A , (J); (2) Let A be an arbitrarily chosen component of Ak(J). For each component D of Dk(J) nA, one has p(D)/p(A) -8, and each component U of Ak,,J)nA, satisfies p( U)/p(A) - S. Prool oJ Geometric Lemma I!. Let J E U(R). Proof of 1). For k = i, the assumptions (AI)-(A4) imply that AI(J) = J contains at least two components of DJ), and it contains at least one component of A_(J). Now assume k > 1, and let A be a component of Ak(J)., By the definition of Ak(J) and the assumptions on F,we have F&-'(A) is a component of U(R). Hence, A contains at least two components of Dk(J) and at least one component of Ak ,(J). Proof of (2). Let U be a neighborhood of Inv (R) on which a C' ' stable foliation R exists, for some a > 0. The case that a basic set is nontrivial periodic is similar to that of a nonperiodic basic set but the notation is more complicated. Therefore, we assume that every basic set in Inv (R) is either nontrivial nonperiodic or trivial. For each nontrivial nonperiodic basic set F let I" and the regions R,(F), 1:5 i S N(F), be as in Proposition 4.2, and let U, be an open neighborhood of F such that (1) U., R,(F)c U, c U, (2) the set r-'(I" r Ur) and its closure consist both of N(F) components, and (3) the map p in Proposition 4.4 may be extended to T 1 (J "n U, ). For each trivial basic set F, let U, be an open neighborhood of F in U such that Urn U% is empty, fo, each basic set A in Inv (R)\I'. Select an integer K ->I such that the union of the Ur's include AK(J); the existence of K is guaranteed by the fact that Ak(J)- W'(lnv (R))nJ as k-coo. From the assumptions on F we obtain that the number of components of both Ak(J) and Dk(J) is finite for al k For I I I Chaotic saddle hyperbolic sets 203 every k > I and each basic set r in Inv (R), we define 6k(J) = minA minv p(V)/p(A) and sk(J; F) = minA {minv p( V)/p(A):" A r is nonempty}, where the minimum is taken over all components V of the sets Dk(J) and Ak.,(J), and all components A of the set Ak(J) such that VC A. Obviously, 8 k(J) s (J; F), for all k Let F be a basic set in Inv (R). Write Ur(R) = {J e U(R): J0F is nonempty. We first show: there exists 31.>0 such that for each JE Ur(R), for all k- > 1, and for every component A of Ak(J) that intersects F, one has every component D of Dk(J) r) A satisfies p(D)/p(A) a 8,., and each component U of Ak ,(J) r A satisfies p( U)/p(A) a: 8.. The case that I' is a periodic orbit is left to the reader. We assume that I' is a nontrivial nonperiodic basic set. Applying Proposition 4.4 and Geometric Lemma I we obtain that there exists 5,,(J;l')>0 such that 8,(J')-&,(J;F) for all k>K. We write 8,(J;F)= minIk.K8k(J;F), then 6k(J; ')->8p(J;l ") for all I --k sK. Now we define s(J; F) = min {q(J, F), s,(J; ')J >0 and get p(V)/p(A)a u5(J; F) for every component A of Ak(J) and every component V with Vc A, where V is either in Dk(J) or in Ak.,(J). Now, we define 61. = inf {6J; F): J e U.(R)}. Since U, (R) is compact we obtain 61. = min {(J; r): J E Uj,(R)} > 0. Finally, since F was arbitrarily given, we define 6 = min {8,: F basic set in Inv (R)), and conclude 6 k(J) ->6 for all k a I. This completes the proof of Geometric Lemma II. Proof of Proposition 1. Let L be as in the proposition. (i) (ii)': We assume that there exists e >0 such that every F-refinement of (a, b} includes a PIM triple. Ifthe interior of L does not include a point of Inv (R), then A (J) 0 L is empty, and thus no e-refinement of {a, b} includes a PIM triple. Hence, the interior of L contains a boundary point of Dk (J) for some integer k a 0. Therefore, the interior of L intersects A,(J). '(ii) = i)': Now we assume that the interior of L contains a point q of A,(J)o 1', for some basic set I' in Inv (R). Select integer M- I, such that L contains a component A of Am (J) that includes q. Let S > 0 be as in the Geometric Lemma II. Now we select E = 82 . p(A). From the Geometric Lemma 11 we know that A contains at least two components of D (J) whose length of each of them is at least 8. p(A), and A contains one or more components of A 4 1,, whose length of each of them is at least 8. p(A). We obtain that each E-refinement of a and b includes a PIM triple in A. This completes the proof of Proposition 1. From now on, we fix 6 as in Geometric Lemma 11 and e = 82. Proof of Proposition 2. Let (a, c, b) be an Interior Maximum triple in J. First, we assume that TR(a):- TR(b)< TR(c). Case 1. Assume k=min .... b TR(x)< T(a). Let D be the component of D&(J) containing at least one point of [a, b]j, for which TR(y) = k for all v in D. Then Dc int ([a, b]j)c A, where A is the component of Ak(J) for which Dc A. Since p([a, b] ) < p(A), applying the Geometric Lemma II gives p(D)/p([a, b]j) a 8. Then, for every (3-refinement P of (a,c, b), with 0< P.s3 we have PqrD#0. We obtain: for each p E P, r- D either (p, c, b) or (a, c, p) is a PIM triple in Pq. 204 H. E. Nusse and . A. YorkeI Case 2. Assume mi.. TRW?_ TR (a) and TR (c) Z-TR(a)+ 2=m +1.Then, by the T-Jump Lemma, there exists a component D of Dm.(J) in the interval [a, c],. Since (a, b]j lies in a component A of A,,-i(J), the Geometric Lemma 11 impliesI p(D)/p((a, b]i) Z-8. Hence, every (3-refinement of (a, c, tb) includes a point p of D, so (p, c,b) is a PIM triple, where 0 < P3 . Case 3. Assume TR(c) = TRt(a) + I = m and that Case I does not apply. This impliesI TR (b) =TR (a). Set p3= 82; let Pp be a 13-refinement of (a,c,b), say PO = fx,: 0sis N(P)) cJ with a = x0 <x < ..*< xhN~ =b and xk =c for some ~ N(P3)- 1.I From the Geometric Lemma 11 we get that (a, b]i contains a component D of D.,.iV), and p(D)/p(fa, b~j) a-6*. We obtain that every 1-refinement of (a, c,b) includes a PI M triple for each 0<1 P S. The case TR (b) t. TR(a) < TR(c) is similar. The conclusion is: For E = S we have: every e-refinement of a PIM triple in J includes a PIM triple. This completes the _~ proof of Propositio'n 2.I Proof of Proposition 3. Left to the reader. Before we will prove Proposition 4, we will present a monotonicity property for the escape time map as well as an auxiliary observability result for Accessible PIMI triple sequences. MONOTONICITY LEMMA. Let a and cbe two points on J, and let Pc: a,c]j be aI (3-refinement of a and c, saY P = (x,: 0!5 i~s N(P3)) and a = x0 < x, <, < xNp C. where (3> 0. Assume that TR is monotonic on P (that is. TR(x&, I) a TR(xk), 0!5 V NO) - 1), and TR(c) > Tt(a). Write mn= min fTR(x): XE(a, Cj} .I < 8, Dm(J)n (~a,c~j consists oj one component, and it Then, JOr ever), (, 0 < 13 includes a Proof of the Monotonicit)' Lemma. Let (3,a, c, P, TN, and m be as in the Lemma. By the definition of mn, we know that [ a, cI., is contained in a component A of A,,MJ. Assume that 0 <1P < 8. Suppose that TR(a) > mn. Then there is a component D of D,,AJ) such that Dc [a, clu. (Note that neither a nor c is contained in D.) From Geometric Lemma 11 we know that p(D)/p([a, ci) 2:p(D)/p(A) a 8 >1P; this implies Prn D 0. But this contradicts the assumption TR is monotonic on P. Hence, m = TR(a).I Suppose Dm(J)cr-, (a, c]j consists of two components, say D and D'. We will assume D' includes a. The Geometric Lemma 11 implies there exists a component U of Am,,(J) between D and D' such that p(U)/p([a,ciO) p(U)/p(A) 2 3> . We obtain that P includes a PI M triple (a, c', b') with C'E P r) U and b'E P n DI (since both p( D)/p([a, C].,) >1P and p( U)/p([a, C].,) >1P), which coniradicts the monotonicity of TR on P. This completes the proof of the Monotonicity Lemma.I Let Pcai be an e/3 -refinement of an Interior Maximum triple (ao, co, b) in , and assume TR(x) 2:TR(aO) for evenv, X EP. Let (ao,cj, bj) be the PIM triple in P, in which b, and cl are selected as in the Accessible PIM triple OBSERVABILITY LEMMA. procedure, and let a,) and a: be defined as in the Accessible PIM triple procedure. Chaotic saddle hyperbolic sets 205 I fP is an e-refinement of (an, cl, b,), then (il If a'> ao then [ao, a']. does not intersect S(R); otherwise, (ii) if a I= a0 then TR(b,)> TR(ao), a, < cl and [ao, afl] does not intersect S(R). Proof of Observability Lemma. Let P, (an, cl, b,), a' and a be as in the Lemma, and assume P n [ao, b], is an e-refinement of (an, cl, b,). Note that from this latter assumption it follows that Pr)ao, c,].j is a -refinement of {ao, c'} for some 0 03<8. Let m =min{TR(x): xe [ao, b,]s}. The assumptions 'TR(x,)a TR(ao) for all x,E P', 'Pn[ ao, bl]j is an E-refinement of (an, c,, b,)', and the Geometric Lemma 11 imply that m = TR(ao). Proof of (i). Assume that a'l'> an. By the Monotonicity Lemma we obtain that TR(x) = TR (ao) for all x E [ an, a' ]s ; hence, [ an, a'l] does not intersect S(R). Proof of (ii). Assume that a'= a0 . Suppose TR(b,) = TR(aO) = m. From the Geometric Lemma II and the assumptions we get that the interval [aO, b]s contains one component A of A,.,i(J), and p(A)/p([a,,, b,]j)> 8. Applying the Geometric Lemma II again, we get that there are at least 2 components U, and U. of D..,(J) and at least one component U3 of A..2(J) in A, and for each k, l<--k-3, p(U)/p((a, b,]s) =(p(Uk)/p(A))(p(A)/p([a., b,]s)>8 2=r. Hence, each U", I < k - 3, contains at least one point of P This implies b, is not the leftmost point in P that is the right point in a PIM triple, which contradicts the assumption. Conclusion: TR(bj)> TR(ao). The facts "(ao, cl, bl) is a PIM triple' and 'TR (ao)< TR(bl)' imply TR(cl) TR (a,,) + 2. We obtain from the Geometric Lemma II that there is a component D of D ,,(J) in [a,,, cj]j such that p(D)/p([a), b,],)- :S. Using the T-Jump Lemma, we obtain that there is a point qE Dn P with TR(q) TR(a,,)+ I and for all x in P between a,, and q one has TR(ao)- TR(x)- TR(aj)+ 1. It follows that the point a exists. Applying the Monotonicity Lemma we obtain m = TR(ao) <- 7R (x) TR(a ) = m + I for a!l xE [aO, a. ],; hence, [a,,, a] does not intersect S(R). This completes the proof of the Observability Lemma. Proof of Proposition 4. Let e be as in Proposition 2, and let {(a,, cn, bn)}n.;u be an Accessible PIM triple sequence in J, that is, (an, co, b) is an Interior Maximum triple and for n a 1, (a., c,, b,) is obtained by the Accessible PIM triple procedure. For n a0, let P. be an e/3-refinement of the Interior Maximum triple (a., c., b.), and recall that M, = min tTR(x,): x, E P., x, < c.,}. Further, we write m.=min{TR(x,): x, P.,). Note that the Geometric Lemma 11 implies m= min { TR(x): XE [a., b.]s}. We will show that there exists an integer N 0 such that for every integer n -N: TR(a)=AM.;IT(a.,l)-TR(a.)l1; and [an,a,,b] does not intersect S(R). From the T-Jump Lemma, the Geometric Lemma II, and the assumption that {(a., c., b.)}.,, is obtained by the Accessible PIM triple procedure we obtain for each n a0, the following properties: (1) if TR (a.,)>M, then TR(a,,.:)=M,,; (2) if TR(b,,)=m, then TR(b, , )- TR (b )+ 1; H. E Nusse and J. A. Yorke 206 t3) if T(b.)=m. and M.> m, then m,a:_n.f+l (4) if T(b.)> m. and M.> n, then Tt(b..,)>-m.. These properties imply that there exists a minimal integer N -:0 such that TR(x,) z M, = m., - T(aN ) for each x. E P. Case 1. P,. is 11o E- -finement of (a,. c..., fb.). Since a.., = a., we have t i) T(v.) > It.., = m.v+, = T(a,,) for each x,E Pv,, and (2) (aN . .j], does not intersect S(R). Obviously. TR(x) = TR(a.v) for all x in [aN.a..-,1. rst. assume that % Case 2. P.. is an E-refinement of (a By the Monotonicity Lemma. and the Observability Lemma we obtain for a., = TR(a,) for all x ta.,.a.,_,j, a'v.,: 11) TR(X)= (2) TR(,) 2:%, ., = m.ni, = TR(a_... for each x,E P..,. and (3) [a... aN.,,], does not intersect St R). Now assurne that a%,., = a. Applying the Monotonicity Lemma. and the Observability Lemma yield. for a ., = a-.,; (I) TR(x) = TR(aN) for every x E [a,. a.,J, (2) -,tx,) M. m. , = T. 'a..,)= T(a.,,)+I for each x,E P.-,, and (3) [aN, aN ,], does not intersect S(R). By induction, one obtains the desired result. This completes the proof of Proposition 4. Proof of Proposition 5. Left to the reader. 5. Discussion of the numeri al procedures Now we will return to the "dynamic' question addressed ir. the beginning, namely, how can you numerically follow a trajectory on an invarian: set for an arbitrarily long period of time? A line segment [a, b] straddles the stable manifold of a point P i^ [a. b] intersects this manifold transversally. In tl.e cases we zwudy, P will be replaced by chaotic saddles (nontrivial basic sets) and [a, b] will straddle a subset of S(R). Furthermore, in practice [a, b] will be very short and will be extremely cl-hse to the invariant set lnv (R). The numerical procedure goes as follows: (1) Choose 1-,ith some experimenting) efine and choose PIM a straight line segment 1; (2) Apply PIM triple proced triple int.rval) repeatedly until the length of the PIM triple interval is less than some distance o (e.g. a = 10-8); call this interval 1= PIM,,(I); (3) For a straight li'ne se.nent L with end points a and b, we write PIM,7 (L) to denote either [a, b] if 1[a, bil < o- or the resulting interval when some PIM triple procedure is applied . ,itil an interval of length less ihan a is reached. Note that this operator depends oniy on the end points of L. The basic process then is itera-:1g PIM,(F(L)). While F(L) is an interval, only Fla) and F(b) are relevant. Thus we obtain I.., = PM,(F(I,)), a sequence of straight line segments. We thus obtain a traJectory of tiny straight line segments I, and to the precision of the computer (..bout 10") we typically have I,, c: F(/.), and selecting any point x. from I,, perhaps the midpoint, we have that Ix 1 - F(x,)j is small, typically of the order of oa. Since computers can never be expected to produce true trajectories I I I I Chaotic saddle htverbolic sets 207 (except in trivial cases such as fixed points), we may say {x,.}.o is a numerical trajectory. We call the sequence of intervals {I.}o a saddle straddle trajector because the interval straddles a piece of the stable set S(R) of a chaotic saddle set. It typically approximates (after a few iterates) a basic set in the invariant set (which is a chaotic saddle) in the interesting cases. Furthermore, a saddle trajectory approximates the trajectory of a point in the Static Restraint Problem. Despite the complexity of the construction, we will refer to x., as the iterate' of x,,. Remark In -ractice we find that every L-refinement of two points {a, b), with r= 1/30, includes several PIM triples. In computing the sequence of PIM triples (a,, c., b.) defined by the Accessible PIM triple procedure, once either case 3iii or 3(iv) holds, and if c is more than r - lb - al from a and b. then it can be shown that every E-refinement of the end points {a. b} of a PIM triple (a. c, b) includes a PIM triple; in the computer program we do not use c at all. For the examples in this paper and in [NY] we find that the Accessible PIM triple procedure leads to accessible fixed points or periodic points, which is in agreement with the fact that all the accessible points for two dimensional hyperbolic systems are on the stable manifolds of finitely many periodic points. In this paper we have shown that our procedures are valid in ideal situations. We find it works well in practice even in less than ideal cases. From the examples in fNY], we have seen that the PIM triple procedure works quite well for a variety of dynamical systems. It is important to ask if such straddle trajectories represent true trajectories of the system. In other words, does there exist a true trajectory of the system that shadows (i.e., stays close to) the numerical trajectory obtained by the PIM triple procedure? When a map is sufficiently hyperbolic on the invariant set in question, Bowen [B] obtained a result saying that each noisy trajectory in the nonwandering set can be shadowed by a true trajectory if the perturbation is small; see [B] for the precise statement. We will say that Inv (R) satisfies the 'no cycle condition' if for every family of basic sets F , k...F,,in Inv (R) such that the stable set of has a nonempty intersection with the unstable set of F,. ,, for all I <-i< M, the stable manifold of rk,., does not intersect the unstable manifold of r,,,. Assuming lnv(R) satisfies the 'no cycle condition* and 6 is sufficiently small, we can show that every saddle straddle trajectory of a two dimensional uniformly hyperbolic system with a chaotic sadole obtained by the PIM triple procedure, can be shadowed by a true trajectory for as long as the saddle straddle trajectory can be computed. Ir(, I 6. Concluding remarks 6.1. Higher dimensional systems. One of the ingredients in the analysis of the validity foliation of the PIM triple procedures in dimension two, is the existence of a C 9;' on a neighborhood of a basic set. The existence of such a foliation for the two dimensional case, is guaranteed by a result due to De Melo [M]. Unfortunately. the existence of a foliation 7-'on a neighborhood of a basic set in higher dimensions is not known, see e.g. [PT]. 208 H. E NVusse and I. A. Yorke Let from now on, the dimension n =-3. Let F be an Axiom A diffeomorphism, let R be a saddle-hyperbolic transient region for which dim E" = 1, and assume that for each basic set F in Inv (R) there exists a C ~ stable foliation AV' on a neighborhood of 17, for some a > 0. Then the Propositions 1, 2. 3. 4. and 5 are still valid. The proof is almost the same, except instead of Propositions 4.1 and 4.2 one should use the properties of Markov partitions of basic sets. see Bowen [B]. 6.2. Order of Tiferentiabilit of the Daffeomorphism. We assumed that the difteomorphism F is C". This assumption implied the existence of a C' - expanding map. for some a > 0, in Proposition 4.4. If F is of class C2. then it is known that such an expanding map is C'. WVe would like to point out, that the Haider exponent a is only used to obtain 12) in the proof of the Geometric Lemma 1. Fortunately, we can prove the Geometric Lemma I tin particulk: property IN) for the C'-map ,p of Proposition 4.4 by combining the techniques of the proof of Proposition 6 in [Ne] and Lemma 5.5 in [No). Thus in fact, it is sufficient to assume F is C2 to guarantee the main results of the paper. REFERENCES [AV] 1B] [BR] [DNI [coy] [GNOVI [GH] [m) (NP] [Ne] [Nil [Nul K. T. Alligood & J1.A. Yorke. Accessible saddles on fractal basin boundanies. Preprnt 1989. R. Bowen. Equilibrium States and the Ergodic Theory _f Anosov Diffeomorphisms. Lecture Notes in Mathematcs 470, Springer Verlag: Berlin. 1975 R. Bowen & D. Ruelle. The ergodic theory of Axiom A flows. Intent. Math. 29 (1975). 181-202. R. Devaney & Z. Nitecki. Shift automorphisms in the Hinon mapping. Commun. Math Ph vs. 67 (1979). 137-146. C. Grcbogi. E. Ott & J A. Yorke. Basin boundary metamorphoses: changes in accessihle boundary orbits. Phrsica 24D (1987), 243-262.I C. Grebogi. H. E. Nusse. E. Ott & J. A. Yorke. Bassic sets: sets determine the dimension of basin boundaries In: Dvnamical Sistems., ed. J. C. A~lexander. Proceedings oj the Universityof Man/and 1986-87. Lecture Notes anMath. 1342. pp. 220-250 Spnnger.Verlag. Berlin. Heidelberg, New York, Lonlon. Pans. Tokyo, 1988. J. Guckenheimer & P. Holmes. Nonlinear Oscillations. Dynamical Svttems, and Btfurcationf of Vector Fields. Applied Mathematical Sciences 42, Springer Verlag: New York, 1983 W. de Melo. Structural stability of diffeoimorphisms on two-manifolds. Inrent. Math 21 (1973). 233-246.I S. Newhouse & J. Palis Hyperbolic nonwandenng sets on two-dimensional manifolds. In Dynamical Systems. pp.293-301, ed. M. NI. Peixoto. Academic Press New York and London. 1973. S. E. Newhouse The abundance of wild hyperbolic sets and non-smooth stable sets forI diffeomorphisms. Pub. Math. I H.ES. 50 (1979), 101-15 1. Z. Nitecki. Differentiable Dynamics. %IIT Press. Cambridge. 1971 H. E Nusse. Asymptotically periodic behaviour in the dynamics of chaotic mappings SlAM J AppI. Math 47 (1987), 498- 513. [NV] (Pr] IS] IV) H. E. Nusse & J. A. Yorke. A procedure for finding numerical trajectonies on chaotic saddles. Physica D36 (1989). 137-156. J. Palis & F. Takens. Homoclinic bifurcations and hyperbolic dynamics. 16' Coldquio Brasileiro Matematdica, IMPA, 1987.I S. Smale. Differentiable dynamical systems. Bull. Amer. Math. Soc. 73 (1967), 747-817. J. A. Yorke. DYNAMICS. A Program for IBM PC Clones. 1987, 1988. Reenms Eram JUL*.as. to :iTATWXi4L Pinw 164 *'5. %moi. 4. NoveUSU iV91 Embeology run, Sauer.' James A. Yorke.-- and Martin Casdagli" Receired Afarit). 1991 I Mlathematicai formuviations oi the cmoeddint methods commonlh umeo for the .econstruction . t attractors tram data Neries aire *J~sculscd. F.-becioina -neorems. casea tin previous ~%r(r hte iuFT'. -.'C csao6ihea for compact suosets .i oi Euctidean bvace R' If it i% an integer iurger ian twice the box-counting dimension of 4, then almost every map irom R. to R". in the %cnseof prevalence. n, one-to-one on. A. and moreover is an embeddine on smooth manifolds contained within 4. If .4 is a chaotic attractor of a typical dynamical system. Lhen the same is true for almost every delav-c vrdiate it? 1 from R, to R*. Thewe results are extended its two other directions. Similar results areproedin the more general case ot reconstructions wtich ub ov 4vea~c ofdelaycoriae.Scn.ifrainvgieonh sf-trclinstthat exists %hen it is less than or cqual to twice the hox-counting dimension of .4. KEY WORDS: rmbeddine. ..haotic tittractor: tttractor %cconstruction. probability one: prevalene. box-counting dimension: delay coordinates 1. INTRODUCTION In this work we give theoretical Justification ofl data embedding techniques used by experimentalists to reconstruct dynamical information from time series. 'We focus on cases in which trajectories of the s~ystem under study are asymptotic to a compact attractor. We state conditions that ensure that the map from the attractor into reconstruction space is an embedding. meaning that it is one-to-one and preserves differential inforrnati r-O~F approach integrates and expands on previous results on c~aibedi by 3Whitnev,9 and Takens.' 1 27 Department of Mathematical Sciences. George Mason University. Fairfax, Virginia 22030. Institute of Physical Science and Technology, University of Maryland. College Park. Maryland 20742. Santa Fe Institute. Santa Fe. New Mexico 87501. Current address. Tech Partners. 4 Stamford Forum. 8th Floor. Stamford. Connecticut 06901. 579 W22-4715 91 1100.0579106 50.0 t' 1991Plenum ilublisiin; Corporation Sauer et ai. 5a Whitney showed that a enenic smooth map F from a d-dimensionai smooth compact manifold .11 to R- " 'is actually a diffeomorphism on . That is. .11 and F(MI are diffeomorphic. We generaiize this in two ways: first. :y replacing "generic' %th "probability-one" tin a prescribed sense I. and second. by replacing the manifold If by a compact invariant set . contained in R" that may have nonmnteger box-counting dimension iboxdimi. In that case. we show that almost every smooth map from a neighborhood of A to R" is one-to-one as lone as I 1 n > 2 boxdimu.4 ) We also show that almost every smooth map is an embedding on compact 'iding subsets of smooth manifolds within .1. This ,u-gests that 'but ..s techniques can be used to compute positive Lyapunov ex; .ot necessarily negative L'apunov exponents u. The ?o)itixc .j.ouno% -xponents are usually carried by smooth unstable manifoids on arctors. We give precise definitions of one-to-one. embedding. -nd aimost e,erv in the next section. Takens di..it with a restricted class of maps called dela -coordinate maps. A delay-coordinatt map is constructed from a time series of a single observed quantity from an experiment. Because of this. a typical delaycoordinate map is much more likely to be accessible to an experimentalist than a typ:ca! map T.kens:i" showed that if the dynamical system and the observed quantity are generic. then the dela. -coordinate map from a * d-dimensional smooth compact manifold .11 to R " ' is a diffeomorphism on Ml. Our results generalize those of Takens"'" in the ,aime two Was as for Whitnev, theorem. Namei'. -.%e replace ,!Qneric %ith probabdity-,ne .and the manifold 11 by a possibly iractal ct. Thus. tor a compact u,,ariant subset .1 of R", under mild conditions on the dynamical s~stem. almost every delay-coordinate map F from RA to R" is one-to-one on . provided will be that n> 2 boxdimi.-I). Also. any manifold structure within preserved in F(A). These results hold for lower box-counting dimension (see Section 4) if boxdim does not exist. The ambient space R can be replaced by a k-dimensional smooth manifold in the general :ase. In addition. ,.ve have made explicit the hypotheses on the dynamical ,%stem (discrete or continuousi that are needed to ensure that the delay-coordinate map gives an embedding. In particular. oniv C' smoothness is needed. For flows, the delay must be chosen so that there are no periodic orbits whose period is exactly equal to the time delay used or twice the delay. (A finite number of periodic orbits of a flow whose periods are p times the delay are allowed for p > 3.) Further. we explain what happens I I I I IU I I I logy boxdiml A). In that case we put bounds on the case that n < -• on of the self-intersection set. which is the set on which the oneproperty fails. Finally. we give more general versions of the delayate theorem which deals with filtered delay coordinates, which tore versatile and useful applications of embedding methods. ere are no analogues of these results where the box-counting on is replaced by Hausdorff dimension (see Theorem 4.7 and the on that follows 1. In an Appendix to this work written by I. Kan. %sare descnbed of compact subsets of R*. for any positive integer Iihave Hausdorff dimension d = 0. and which are difficult to project e-to-one way. The requirement n > 2d discussed above translates in e to > 0.However.ecery projection of such a set to R'. i < k. fails u ne-to-one. Section 2 we explain the new %ersion of the Whitney and Takens ig theorems. In Section .3 we discuss filtered delav coordinates. 4 contains proofs of the results. W TO EMBED MANIFOLDS AND FRACTAL SETS U -actal Whitney Embedding Prevalence Theorem 3ltici |ume ,Pis a flow on Euclidean space R*. generated. for example. by ,nomous system of k differential equations. Assume further that all ries are asymptotic to an attractor .4. The study of long-time ,rof the system will involve the study of the set .1. a typical scientific experiment, the phase ,pace R' cannot be lv seen. The experimenter tries to infer properties of the system by measurements. Since each state of the dynamical s stem is uniquely I by a point in phase space. a measured quantity is a I.unction Irom pace to the real number line. If it independent quantities Q . Q. measured simultaneously, then with each point in phase space is ted a point in Euclidean space R". We can then talk about the F(state = (Q . Q,,) naps Rk to R". r example. suppose all trajectories in phase space R' are attracted riodic cycle. Thus. .1 is topologically a circle lying in R". Imagine o available measurement quantities Q, and Q2 are plotted in the Then there is a measurement map F from A to R' given by )= (Q1, Q2). To what extent are the properties of the hidden )r A preserved in the observable "reconstruction space" R2? I 582 Sauer et al. The answer depends on how the circle is mapped to R2 under F Consider the case where R* = R3 and Q and ( ire simply the two coordinate functions x, and x,. In Fie. ' a. the relative position of the points is preserved upon projection. and we may view F(A) as a faithful reconstruction of the attractor A. If distinct points on the attractor .4 map under F to distinct points on FA). we say that F is one-to-one on A. In the case of Fig. lb. on the other hand. two different states of the dynamical system have been identified together in RA ). In the reconstruction space. which is all the experimenter actually sees, the two distinct states cannot be distinguished. and information has been lost. The one-to-one property is useful because the state of a deterministic dynamical system. and thus its future evolution, is completely specified by a point in phase space. Suppose that at a given state Y one observes the %alue F(.' in the reconstruction space. and that this is followed I sec later )y a particular event. If F is one-to-one. each appearance ' the measurements represented by F(vi ,ill be followed I ec later by the .,ame event. This is because there is a one-to-one correspondence betmeen the attractor points in phase ,pace and their images in reconstruction .,pace There is predictive power ,n finding a one-to-one map. Perhaps the measurements F(.W would not be repeated precisely. Yet F is reasonable. similar measurements %ill predict similar events. if th, This approach to prediction and noise reduction of data is bcing pursued by a number of research groups. Although most of the interest lies in the case that .1 is an attractor of a dynamical system. the main question can be posed in more generalit. Let A be a compact subset of Euclidean space R'. and let F map R' to another Euclidean space R". Under \%hat conditions cara \e be assured that i'embedded" n R" by typical maps F' 4 is The precise definition of embeddiniz involves differential structure \ one-to-one map is a map that does not collapse points, that is. no ' points are mapped to the same image point. .\n embedding is a map that does not collapse points or tangent directions. Thus. to define embeddin,-g. we need to be working on a compact set .1 that has well-defined tangent spaces. Let I he a compact smooth differentiable manifold. (Here. as in the remainder of the paper. the word .vnooth %%ill be used to mean continuouly differentiable, or C'.) A smooth map F on .1 is an inrinersuon if the derivative map DF(.x) (represented by the Jacobian matrix of F at k) is one-to-one at every point x of .-I. Since DF(Rx) is a linear map. this i! equivalent to DF(x) having full rank on the tangent space. This can happen whether or not F is one-to-one. Under an immersion, no differential structure is lost in going from A to F(A). I I I I I I I I I 1 I U U 583 Embedoilogy An ernheddine of A is a smooth diffeomorphism from A onto its image F(A . that is. a smooth one-to-one map which has a smooth inverse. For a compact manifold .4. the map F is an embedding if and only if F is a oneto-one immersion. Figure la shows an example of an embedding of a circle into the plane. Figure lb shows an immersion that is not one-to-one. and Fig. Ic shows a one-to-one map that fails to be an immersion. Whether or not a typical map from .4 to R" is an embedding of A depends on the set .-. and on what we mean by "typical.- \ celebrated result o t this type is the embedding genericity theorem of Whitney.", which says that if .4 is a smooth manifold of dimension c. then the set of maps into R-'-' that are embeddings of .4 is an open and dense set in the C'-topologv ,of maps. The iact that the set of embeddings is open in the bet of smooth maps means that uien each embeddina. aroitrari.%:mail perturbations wll still he emoeuaings. Fle lact that the ,c[ ot emueddings h iOlt in the ,ct oi maps means that every imooth map. whether it is an emoedding or not. is arbitrarilv near an embedding. One would like to conclude from Whitney s I ,FF 1 A a F // 'C" Fig. I I FA) 2 ia) An embedding F of the smooth manifold .4 into R (b) An immersion that fails to be one-to-one. (c) A one.to-one map that fails to be an immersion. 584 Sauer et al. theorem that n = 2d + I simultaneous measurements are typically sufficient to reconstruct a d-dimensional state manifold .4 in the measurement space R". However. open dense subsets, even of Euclidean space. can be thin in terms of probability. There are standard examples. many from recent studies in dynamics. of open dense sets that have arbitrarily small Lebesgue measure, and therefore arbitrarily small probability of being realized. A well-known example is the phenomenon of Arnold tongues. Consider the family of circle dilfeomorphisms g,,,.(x)=x+w+ksinx mod 2-. where 0 < wj - 2.: and 0 < k < I are parameters. For each k we can define the setI Stab k i= )<,. < " : ., has a stable periodic orbit' For )<k < 1. the ,et Stab~k) is a countable union of disjoint open intervals of positive length. and is an open dense subset of [0. ]. However. the total lengt' i Lebcsguc measur-) of the open dense et Stablk) approaches zero as k -0. For small k. the pr,-baulitv of landing in this open dense set is %ery small. See ref. 3 for more details. With such examples in mind. an experimentalist would like to make a stronger statement than that the Net of embeddings is an open and dense set of smooth maps. Instead. one would like to know that the particular map that results from analyzing the experimental data is an embedding with prohabilit v one. The problem with such a statement is that the space of all smooth maps is infinite-dimensional. The notion of probability one on infinitedimensional spaces do,- -ot have an obvious generalization Irom finitedimensional ,paces. I is no measure on a Banach ,pace that corresponds to Lebesgue measure on finite-dimensional subspaces. Nonetheless. we would lilke to make sense of "almost every" map having some property, such as being an embedding. Following ref. 24. we propose the following definition of prevalence. Definition 2.1. A Borel subset S of a normed linear ,pace I' is prevalent if there is a finite-dimensional subspace E of 1",,uch that for each r in V. v+ e belongs to S for iLebesgue) almost every L'in E. We give the distinguished subspace E the nickname of prohe Npatc. The fact that S is prevalent means that if we start at any point in the ambient space V and explore along the finite-dimensional space of directions specified by E, then almost every point encountered will lie in S. I I I Embedology 585 Notice that any space containing a probe space for S is itself a probe space for S. In other words, if E' is any finite-dimensional space containing E. then perturbations of any element of V by elements of E' will be in S with probability one. This is a simple consequence of the Fubini theorem.' From this fact it is easy to see that a prevalent subset of a finitedimensional vector space is simply a set whose complement has zero measure. Also. the union or intersection of a finite number of prevalent sets is again prevalent. We will often use the notion of prevalence to describe subsets of functions. It follows from the definition that prevalent implies dense in the Ck-topology for any k. More generally, prevalent implies dense in any normed linear space. When a condition holds for a prevalent set of functions, it is usually illuminating to determine the smallest, or most efficient. probe subspace E. This corresponds to the minimal amount of perturbation that must he available to the ,ystem in order for the condition to hold ,.ith uirtual certainty. As stated above, for subsets of finite-dimensional the term prevalent is synonomous with "almost every," in the sense ,paces )f outside a set of measure zeru. When there is no possibility of confusion. we %.il say that "almost every"' map satisfies a property when the set of such maps is prevalent, even in the infinite-dimensional case. For example. consider convergent Fourier series in one variable, which form an infinite-dimcnsional vector space with basis ',e ' . In the ,ense of prevalence. almost ever' Fourier series has nonzero integral on EO. 2.]. The probe space E in this case is the one-dimensional space of constant functions. If E' is a vector space of Fourier series which contains the constant functions. then for every Fourier ,cries i. the integral of !' will le nonzcro tor almost every e in E' With this definition, we introduce a prevalence version of the Whitney embedding theorem. Theorem 2.2 (Whitney Embedding Prevalence Theorem). Let .Abe a compact smooth manifold of dimension d contained in R' \lmost every smooth map R' - R2 " is an embedding of .1. I In particular, given any smooth map F. not only are there maps arbitrarily near F that are embeddings. but in the .ense of prevalence. almost all of the maps near F are embeddings. The probe space E of Definition 2.1 is the k( 2d + I )-dimensional space of linear maps from R' to R2 , '. This theorem, which is proved in Section 4. gives a stengthening of the traditional statement of the Whitnev embedding theorem. It is quite interesting that Whitney later proved the different result that under the same circumstances. there exists an embedding into R2d. (This Sauer et al. 586 could be called the Whitney embedding existence theorem) However, an existence theorem is of little help to an experimentalist. who needs information about maps near the particular one that happens to be available.I Knowledgze that an embeddir: exists sheds little information on the particular F under study. The example of Fig. l b shows that the dimension -'.+ I of Theorem -1.2 is the best possible. The map F is not one-to-one on the twisted circle .4. thus does not embed .4 into R-. Further. no nearby map even in the ("-topology) embeds Ai.On the other liand. if a given map of the circle .1 into R 3 was not one-to-one. there would necessarily be a prevalent set of nearby maps that arc embcddings. The first main a021 of this section was .o express Whitney's embedding theorem (and Takens' theorem: see belowi in this probabilistic sense. Thc .second is to extend WVhitney's theorem to sets .1I that are not manifoldsI Here \%e usc tile fractal dimcnsion known as hox-countinu dimension. he hox-,.ounting i or capacit%i dimension oi a compact 'sct I :nl A" i be the set of all point,, defined as follo%%s. For a positi~e number .let within . of .1. i.e.. -1, = GeR"' denote the 'i-dimensional outer dinL'/sWio for some it A:. Let %oil , olume of .1, . Then 'ie ho.tounhim,. ~-i al. : ' of A is boxdim.A ) =it - 1im log voll .l -41 logz; if the limit cxists. if' not. the upper (rcspccti%-ely. loweril ox-Countiig dimension ciin be defined by replacing the limit by the lim inif iresp., lim sup). WVhen the box-counting dimension exists, the appro\imate \caling law ohl~ holds. where d = hoxdjm(. I). There arc ,everal equivalent definitions of box-counting dimension. For example. R" cuin be divided into .-cubcs by a grid based, say. at pointwhose coordinates are L.-multiples o1 integers. Let .V(o be the number oI boxes that intersect I Then boxdim(.1 =lim -o V l0! og -102gr with similar provisions for upper and lower box-counting dimension. ThleI scaling in this case is NWI Embedology 587 Even if we know the box-counting dimension of an attractor A. Theorem 2.2 gives no estimate on the lowest dimension f6r which almost every map is an embedding. Suppose we know that .4 is the invariact set of a flow on R"° . and that the box-counting dimension of .4 is 1.4. In the absence of any knowledge about the containment of .4 in a smooth manifold of dimension less than 100, the use of Theorem 2.2 to get a oneto-one reconstruction requires the use of maps into R oi In fact. the smallest smooth manifold that contains the 1.4-dimensional attractor may indeed have dimension 100. But as the next result shows. one can do much better: almost every reconstruction map into RI will be one-to-one on A. I Theorem 2.3 Fractal Whitney Embedding Prevalence Theoremi. Let .I be a compact subset of R " ot box-counting dimension d. and let z be an integer greater than Zd. For almost every smooth map /- R" - R". I. 1is one-to-one on .1 2. F is an immersion on each compact subset C of a smooth manifold ccntained in ..I., The proof of the one-to-one half o the fra, '.al Whitney embedding prevalence theorem may be sketched as follows. Choose any bounded finite-dimensional space E of smooth maps F so that varying F by elements of E results in perturbing F(.') - F(v) throughout R" for each pair v; v in A. For example. the probe space E can be taken to be the space of linear maps from R" to R". Then the probability (measured in E) that the perturbed F(v) and F( v lie within ;: is on the order of i:". Similarly. if B and B, are r-boxes on .1. the probability that F(B ) and F(B, intersect is on the order of ;". Here we assume that there i, a hound on the magnification oi F. ,is when F i,, a ,mooth map near the comtact ,ct I The ,,ct I can be covered by essentially i. " boxes ot size :.. ano the number ol pairs of boxes is proportional to o: -Y1 The probability that no distinct pair of boxes collide in the image F(A is proportional to k : -'l c=;.* " If Sthe n > 2d. this probability of choosing a perturbation of F that fails to be oneto-one is negligible for small i.. More precise details of the proof. as well as immersion part. are !n Section 4 2.2. Fractal Delav Embedding Prevalence Theorem Despite the beauty of Whitnevs embedding theorem. it is rare for a scientist to be able to measure a large number of independent quantities simultaneously. In fact. it is a rather subtle problem to decide whether two different simultaneous measurements are indeed independent. These problems can be sidestepped to some degree by introducing the use of 588 Sauer et al. delav coordinates. In this approach. only one measurable quantity i needed. In a typical experiment, the single measurable quantity is sam pied at intervals 7' time units apart. The resulting list of samples :Q is called a time series. Think of the measurable quantity as an observation function it on the state space R" on which the dynamical system 0 is acting. EachI reading Q, = iix)is the result of evaluatine the observation function it at the current state ., Definition 2.4. If (P is a flow onl a manifold Al. T is a positi'beI number (called the delavi and hi: At - R is a smooth function. define the dlaY-coordinatc' iiap FRh. '1P. T): Al - R" byI Rh/. 0P. T)(xY) = ti Mx). Itu0 1jn. /it0 if MO vi To start with a Nimple e,\ample. Ict I be a periodic orbit oi the fio\I P We iound above that in the aosence ot dynamics. three mfuereenUnft coordinates are rcquircd to embed I in reconstruction space. ,r more precisely, that Almost cverv mooth map F= i ,)romf neighborhood of .4 to RI is an embeddin - on .1. Now the situation is dif.ecnt. InstezA' of three functions 1. I, /, that must be independent. t .here is a single function It. and the corresponding map Fl/i. A. T) pictured in Fig. 2. We want to know that for almo-,t c~erV function It from . Ito the real numbers R. the delay-coordinate map1 RhI. AP T) from .1 into R" is an embeddine. It should be stressed that this does not follow from Theorems 2.2 and 2.3. The maps F/i. 'P. F) form a restricted subset of all maps: whether they contain enough \ariation it) perturb away self-crossinmzs of .4 needs to be determined. In fact. the Leneral Fig. 2. The attractor on the left is mapped using delay coordinates into the reconstruction space on the right.I I Embedology 5 589 answer is that they do not contain enough variation. Extra hypotheses on the dynamical system ( are required to ensure that almost every h ieads to an embedding of .-. To see the need for extra hypotheses. consider the case the, . is a periodic orbit of a continuous dynamical system whose period is equal to the sampling interval T. Topologically. .4 is a circle. In this case. F(h. . T) cannot be one-to-one for an', observation function h. Let x be a point on the topological circle I.Since the period of .1 is T. hWx)= IitP ':= . . ,- r(X)). so that F= F(h/.(P. T) maps x to the diagonal line Ix, ..... : .'v,= . =.,, in R". A circle cannot be mapped continuously to a line (in this case, the diagonal line in R") in a one-to-one fashion. See Fig. 3. The one-to-one property also fails when .4 is a periodic orbit of period Define the function = .oxxi .)l on .1. The lunction ,a i either identicailv zero or it is nonzero tor some x on .. in \%hich c'ahe it has the opposite ,ign at the image point 'P. txi. and changes )ign on .1. In any case. dx) has a root x,, on .A.Since the period of .I Is 2T. we have h(Xu) = 11(0 1 ()) = Il0 ;rtCi)) -. Then F(h, '. T) maps \,, and IP ,ix,) to the same point in A If x0 and 1P - I x,.) are distinct, this says that F is not one-to-one. If \,, = -.1.xn, then the orbit actually has period T. ard F fails to be one-to-one as above. In the presence of periodic orbits of period :T. Fh. 1P. T) cannot be one-to-one for an ohservation function h. On the other hand. when .I is a periodic orbit of period 3T. or an' period not .'qual to T or 2T. there is no such problem. In this case the dela.-coordinate map of a periodic orbit 1 into R" i, an embedding for .mot every observation [unction )! a Ione as the reconstruction dimen,ion i,, at least three. Uhe ,iatcment :or more ,-'-nerai attra.tor,, I i,, follows. 3 ", 3-T. I , F' I? R Fig. 3. A two-to-one map from a topological circle to the real line. 59 SaUW of &I. Theorem 2.5 4Fractal Delay Embedding Prevalence Theoremii..:t * be a flow on an open subset U of R". and let A be a compact subset of U of box-countine dimension . Let n > 2d be an integer, and let T> 0. Assume that A contains at most a finite number of equilibria, no periodic orbits of 0 of period T or 2T. at most -;.teiy many periodic orbits oi period 3T. 4T.- nT.and that the lineanzauons of those periodic orbits have distinct eienvalues. Then for almost everv smooth function h on U. the delay coordinate map F(h.4. TI: U-*R" is: 1. One-to-one on .4. 2. An immerrion on each compact subset C of a smooth manifold contained in A. Where Takens - :l ;howed that the delay-coordinate maps generically ,m the C-topology ,gveembeddines of smooth manifolds ot dimension ,. -e -substitute comoact -ets of box-countine dimersion ... .nd ::inacc genenc with prevalent. The assumption of Theorem 2.5 that there are no periodic orbits ,i period T or 2T can be satisfied by choosing the nme delay r to h suffiLiently small. In fact. ia we assume that the recto, field on . -atisfics a Lipschitz condition. that is. .= 'xl. where 1'(xI - i1y; < L.x- ,. then it is known' 1 ' that each periodic orbit must have period at least - L. Hence. if T< ,n:L. there will be no periodic orbits of period T or "7. Theorem .5 assumes u>_,l to avoid clf-intersecton .f the reconstructed imnage of A. To see that this requirement cannot be reiaxcd in general. consider the case d= 1. it 2d= 2 shown 1i Fig. 4a. Let the observation function h be the coordinate function v,. and consider the delay coordinate map R' -- R2 defined by F(x,.',P T" i v,ix. *,1 P viii In the situation illus, ..... in Fie. 4a. 1,4' ,jh))<.,((P ,ia))< v,(a)= vIh). and \,0 ,c)) < v,tP ;it)) < ic )= \,(. Setting F= F(x,. 't.T). this means that in the reconstruction ,pace R2. F(a) lies , directly above Fh). and Ffd) lies directly , e F(I See Fig. 4b. The map F is continuous on the tralectory. ,o there i, a continuous path. parametrized by \-,. connecting Fa and Fl( i. There i, also ,uch a path connecting F(h) and F(d). According to Fig. 4b. there must be a %alue of v, in between where the curves meet. and two different points on the circle map together under F Otherwise said. somewhere in between thete .,, an ,c coordinate such that the upper and lower parts of the trajectory advance the same amount in the x, direction during the time imerval T. and thus have identical delay coordinates. The map Ph. P.T) is not an embedding. I If thee observation function or flow is verturbed'a smnall amount. the saMe NTheorem 2.5 Is a special case of a sae ntabout diffeomorphisms. Blefore statinga that version. we redefine delay coordinate maps for diffeomorphisms. 3and U Definition 2.6. if g is adiffeomorphism of an opensubset U of R' h: L' - R is a function, define the dedaY twordinaze i"zap Ffh. -1to:U' R by (DtaJ 37 t a _ _ _ _ _ _ _ d): _ _ __T_ F(d) F(a) F(b) I 3 b Fig. 4 (a) A trajectory of a flow that cannot be mapped using two delay coordinates ina one-to-one way. (b) The point at which the paths cross corresponds to a et of delay coor- dinates shared by two points on the trajectoi' Sauer et al. We get the previous theorem by substituting g = statemenlt. r in the folLwing Theorem 2.7. Let g be a diffeomorphism on an open subset L of R' . and let .4 be a compact subset of U. boxdimi A i = d. and let i > _'d be an integer. Assume that for every positive integer p < In. the set .1. of periodic points of period p satisfies boxdim.tAi< P-2. and that the linearization Dg- for each of these orbits has distinct cigenvalues. Then for almost every smooth function h on U. the delay coordinate map F(h. g): U- R is: 1. One-to-one on .4. 2. An -immersion on each compact ,ubset C of a smooth manifold contained in .4. Remark 2.8. Fhe probe space ir this prevaient Ect can be taker it, 'e any set I... ot poiynomials in h .araoles %%nich includes .ill roi.nomials of total degrec up to 2n. Given any mooth function ,:. on I for almost all choices of x = (c, ....2f from R'. the function .',= k),-,-ZA" x,Ih, satisfies properties I and 2. Remark 2.9. The proof of Theorem 2.7 is easily extended to the more general case where the reconstruction map F consists o a mixture of lagged observations. The more general result ,ays that F = hI(.v '(.\'l)'- I ,0 ' satisfies the conclusions of Theorem 2.7 as long as ,, + -,i., 2,1 and the corre;ponding conditions on the periodic points are ,atisfied. Those conditions arc that boxdimi . p 2 for ;, - max n. The reconstruction of ch. :tractors using independent coordinates from a time series was advocateu ill 980 by Packard ct ,d.'' The delaycoordinate map is attributed in that work to a communication MIth D. Ruelle. The metho .ctuailv illustrated in ref.. 21 is somewhat different: namely, it is to use the value u, of the time series and its time deriva'es i,. ,.as independent coordinates. In 1981. Takens ' published the first mathematical result, on the delay-coordinate map. \round the .,Ame time. Roux and Swinne\ " exhibited plots of delay-coordinate reconstructions of experimental data from the Belousov-Zhabotinski reaction. In 1985. Eckmann and Ruelle'" took the idea one ,tep further and suggested examining not only the delay coordinates of a point. but also the relation between the delay coordinates of a point and the next point which occurs T time units later. In principle, one can then approximate not only 593 Embedogy 3 I the attractor. but the attractor together with its dynamics. Since ref. 9 it has become common practice to gath.;r points that are close in reconstruction space. and use their next images to construct a low-order parametric model which approximates the dynamics in a small region. This idea has begun to be used for prediction and noise reduction applications. See. for example. refs. 1. 6. 12. 13. 15. 16. 18. and 28. 1 2.3. Self-Intersection In the case that the reconstruction dimension it is not greater than twice the box-counting dimension d of the set A. the map F in the fractal Whitney embedding prevalence theorem (Theorem 2.3) will often not be an embedding. However. if d < n.most of .4 will still be embedded. In the case that .I is a smooth manifold of dimension d. almost every F will be an embedding outside a ,ubset of A of dimension at most "'- ,1. If d < n. then 2.1 - it d. and so this exceptional subset will ha~e positive codimension in A. If A is simply a compact set of box-counting dimension ,.then the situation is slightly different. We will call the pai: x. v of points 6-distant if the distance between them is at least 6. Then we define the 6-distant selfintersection set of F to be the subset of .4consisting of all .asuch that there is a 6-distant point Y with F(x) = Fly); that is. (F. j)= 'xe.4, F(x) = F(.) for some ive A.k -.II> 6 Then the result is that for everv ,J > 0. the lower dimension of the i-distant self-intersection ,et f (F.6) is at box-countine most 2 - itfor almost every F. \ precise statement i,, given bv the next theorem. 3 3 Theorem 2.10 (Self-Intersection Theoremi. Let A be a compact subset of R' of box-counting dimension d. let it < 2d be an integer, and let 6> 0. For almost every smooth map F: R - R": I. The 6-distant self-intersection set r(F.6) of F has lower boxcounting dimension at most 21 -it. 2. F is an immersion on each compact subset C of an in-manifold contained in A except on a subset of C of dimension at most 2m -it- I. For example, consider mapping a circle to the real line. In this case d= in = n = 1, and Theorem 2.10 says that a prevalent set of F are immersions outside a zero-dimensional set. This is clear from Fig. 3. where the zero-dimensional set consists of a pair of points. The map is at least 2 to 1outside this set. and hence nowhere an embedding. 594 Sauer efak. On the other hand. settig d n: I and it2 in the theorem we see that a prevalent set of maps F from the circle to the piane are immiersions. and are embeddines outside a zero-dimensionai subset. Thus. the maps shown in Figs. l a and lb are of the prevalent type. immersions w..hich are one-to-one except for at most a discrete izero-dimensionail set of poitits. Figure Ic. on the other hand. is nonprevalent. Almost any map near F will perturb away the cusp. There is also a ,ef-intersection version of the fractal delay embeddinea prevaence theorem I Theorem 2.5) which one ets by making the obvious changes. Thus, if it < :.d. then for each 6 > o there exists a subset 1 (F. )I. whose box-counting dimension is at most 2'- it. on which the delaycoordinate map fails to be one-to-one. Note that the result is independent of 6 >0. If .11 Is a closed subsed of an in-manifold contained in .4I. then there is a subset E, of .11 of dimension at most 1n - it - I on %khich thle map fails to be an immersion. 2.4. How Many Delay Coordinates Do You Need? When usinie a delay coordinate map (or filtered delav coordinate mnap. described in the next sectic' ) to examine the image R(AI in R" of I ,et .1 in R*. the choice of it depends on the objective of the investigation. Different choices of 11suffice for the different goals of prediction. Calculation of dimension and Lyapunov exponents. and the dctermination A thle stability of periodic orbits. To compute the dimension of .1. all that is required is that 12.1 dimP.4) = dim .I wvhether the dimension beiniz used is box-counting. [Iauhdortf. iniorrnatiofl. or correlation dimension. The latter two depend on a probability Jcnsit% on .4 and F(A). It is hhown in ref. 24 that for the case of Hausdortf diimension. the equality (2.1l holds for alrnozt every' measurable map F,. in the sense of prevalence. as lone as it >, dim. I . The probe space of perturba1 tions for this result ib the space of all linear transformations from R' to R" \v4attila' ' proved that equality (:.I) holds for almost every orthogonal projection F It is rewhat surprising that there are exampi-s for which 12.1 Idoes not hold lor anyv map F wvhen box-counting dimension is used. Q~en under the hypothesis ni > boxdim A I. An example of this type is given in ref. _'. However. in most cases of compact sets which arise in dynamical ,vstems. we expect Hausdorff dimension to equal box-counting dimension. In pr .ical situations. if attempts to measure boxdimt.) result in ariation would answers dependent on it. where nt > boxdim A4). then the % U 'IEmbedoiogy 595 seem to be a numerical artifact, since there is no theoretioa justification for which of the values of n greater than boxdimIA) gives the more accurate result. The usual technique is to increase n until the observed dimension of boxaim F(.A) reaches a plateau. and to use this -esult. The resulting number might be called the piateau dimension. While the plateau dimension may indeed give the best numerical estimate of the dmension of A. there does not seem to be theoretical or numerical justification of this bias. and the question needs further investiation. Notice that n > boxdimi.4 does o'it guarantee that almost every F is one-to-one. but that is not required for dimension calculation. If the objective is to use F(.) to predict the future behavior of trajec- tories, then it is sufficient to have the map F be one-to-one. In which case n > 2 boxdimt.4) is needed. Knowing the current state in Ft -11 is sufficient to predict the iuture of the traiectorv tat least in the ,hort ,uni. In the I I I 3The I ,ituation 0I Fig. lb. on the other hana. prediction on the periodic ornit would ,tlt be possible. except when the trajectory was at the rmrdpoint ot the "figure eight.' If the objective ,s to compute the Lyapunov exponents of the system. it is necessary to ask which exponents are to be cor.. ;uted. For a simple example. .uppose the attractor .4 is a periodic orbit. Then the best p.issible result of the examination o F(.A) is o observe that t is a Lvapunov exponent. The other exponents. presumably all negative. cannot be observed without introducing perturbations. IMore generally. if an attractcr .4 lies on a manifold of dimension ti (as a 2.2-dimensional attractor might lie on a three-dimensional manifold), it will certainly he impossible to measure more than in true exponents from an embedding, cven tf thL reconstructed image F.) lies in R" ith n > m. There are no criteria for determining, the smallest manifold containina 1 Theorems 2.3 and 2.5 say that if t > 2. boxdimt.l ). then almost every F is an embedding of all smooth manifolds that lie in .1. The ,mooth manifolds we have in mnd are the surface corresponding to the unstable directions on the attractor .1. that is. the unstable manifolds. Under an embedding, the differential information is preserved along ,ilooth directions. ,uch as unstable manifolds. indicating that positive Lyapunov exponents should be computable from the image F(.I). The stable manifolds, on the other hand. will be likely to intersect in a Cantor set. The image of a Cantor set in F(A) ma,, he quite com-.1 pressed. For example. a set which is the product of five Cantor sets whose dimensions sum to 0.5 might be mapped to a one-dimensional line in F(A). It seems difficult to recover any exponents in these directions from knowledge of the reconstructed dynamics in FA). self-intersection results in Section 2.3 are aimed at another kind of 59 Sauer et al. question. A relevant experiment involving a vibrating ribbon is described in refs. 8 and 26. In this case. the Poincare map has an attractor whose dimension was experimentally calculated to be 1.2. The investigators were interested in determining the eigenvalues of the linearization of a period-3 point on the attractor. Using a delay-coordinate map of the attractor into R 2 did not result in a one-to-one map, which is consistent with our results in Section 2.2. Theorem 2.10 of Section 2.3. which deals with self-intersection. suggests that the subset . of A on which the map into R 2 fails to be one-to-one should have dimension at most 2,= 2 x 1.2- 2 =0.4. They found that the self-intersection set looked like a finite set. If .1 indeed has dimension 0.4 or less. as we w..i expect. then the set . would be unlikely to include the periodic point in question. and the delay-coordinate map would be expected to be one-to-on- in a neighborhood of that orbit., Numerical investigations of the dynamics near the periodic orbit revealed that the dynamics did uppear to be two-dimensional, and the researchers \%ere ,1le to estimate numerically the eizgenvalues of the orbit at these points. 3. THE DELAY COORDINATE MAP AN.) FILTERS 3.1. Main Results So far. we have defined the delay coordinate map .- F(Ih. ,,.v from the hidden phase space RA to the reconstruction space R". Under Ntiitable conditions on the diffcomorphism ,, the delay coordinate map Fit. ,,i is an embedding for almost all observation functions 11. In this formulation. information from the previous a time steps is used to identify a .tatc of the original dynamical system in RA. For purposes of measuring quantitative invariants of the dxnamical systems, noise reduction, or prediction. it may be advantageous to create an embedding that identifies a state with information from a larger number of previous time steps. However, working with embeddings in R" I, difficult for large i. A way around this problem is to incorporate large numbers of previous data readings by "averaging" their contributions in ,ome ense. This problem has also been treated in ref. 7 To this end. generalize the delay-coordinate map F(h. ,r R' -- R". F(h, g)x = th(x). h(g(xn..... hig" where the superscript T denotes transpose. by defining the fi!tered deluvcoordinate map F(B, h, g): R' - R" to be F(B. h, g)x = BF(h. g)x (3.1) Embgdotogy , 597 where B is an i x w constant matrix. Thus. each coordinate of F(B. h, g)x is a linear combination of the w coordinates of F(h. gjx. Here we are considerng the case where . is a diffeomorphism. for notational convenience. Everything we say applies to a flow P by setting g equal to the time - T map of the flow. We will call it- the iwindow length of the reconstruction, since there are w evenly-spaced observations used. We call tnthe reconstruction dimension. since R" is the range space of the map. We may as well assume that n <, w and that B has rank n: otherwise we could throw away some rows of B without losing information. Assuming that B is a fixed matrix restricts the filter to be a linear multidimensional moving average MA filter. Autoregressive tAR) filters in general can change the dimension of the attractor.' ,o, If B is the identity matrix (denoted 1). the map is the original Takens delay coordinate map. \s stated in the previous ,ection. in that :ase. t(1. ' t h,. 11)is almost always an embedding ab iong .:b n i, greater than twice the box-counting dimension of the attractor Ind the periodic points of period p less than n have distinct cigenvalues and make up a set of boxdim < p,'2. Under Fl: ring, some complications are caused by the existence of periodic cycles. On the other hand. the next theorem states that in the absence of cycles of length smaller than the window length it. every movine filter B gives a faithful representation of the attractor. 3 I1 3average Theornm 3.1 (Filtered Delay Embedding Prevalence Theorem . Let U be an open subset of R' , g be a smooth diffeomorphism on C. and lt .1 be a compact subset of U. boxdimA.)=d. For a positive integer n > 11. let B be an it x it matrix of rank I1.Assume ,, has no periodic points Of period less than or equal to t Then tor almost c%erv \mooth function hi.the delay coordinate map Ft B. i. L - R"i. ' I. One-to-one on .1. 2. An immersion on each closed ,ubset C of a .mooth manifold contained in .4. The probe space for perturbing I can be taken to be any ,pace of polynomials in k variables which includes all polynomials of total degree up to 2w. Furthermore. in case it,< 2d. the results of Theorem 3.1 hold outside exceptional subsets of .4 precisely as in Theorem 2.10. For example. consider the3 x 9 matrix I ii i 0 00 0 B=I0 0 0 1 1 1 0 0 0 0 0 00 (0 0 (3.2) 598 Sauer et al. Then F(B. h.g x.,= ( lx) + hi gt.x n + hgit.) x). ( viI" Y" 1)+ hI g4( .x" )I + hh4i .( , O(h( .l '1)+ h( t.v)) + hi g'(x)))) Although the map F(B. It. ,., uses information from 9 different lags. the moving averaae*" reconstruction space is only 3-dimensional. According to the theorem, if the dynamical system ,has no periodic points of period less than c = 9. then FiB. h. ,,is an embedding for almost all observation functions I. Remark 3.2. When the diffeomorphism , has periodic !,,oints. ,:ertain special choices of filters 3 will cause seif-intersection to occur i,ti periodic points. However. under the genericity hiypotheses on the oA'narn!. cal system of Theorem 2.5. for example. almost all choices of an Ii , matrix B imply the conclusions of Theorem 3.1, This follows from Remarks 3.4 and 3.,. A more detailed 'oew of the effect of periodic points otf the dynamical system is given in Sections 3.3 and 3.4. 3.2. Examples of Filters In this section we will list some examples of filters that may he useful in given situations. The easiest example is a simple averaging filter. For any integers in. n. let B be a n x in matrix of form (111a. ) In I'm ... I in where there are in nonzero entries in each row. In the presence of' noise. this filter :,hould perform well compared to the more standard delay-coordinate embedding which uses every rith reading and discards the rest. A more sophisticated noise filter was suggested in ref. 5 for a slightlk different purpose. and elaborated on in the %ery readable ref. 2. where it i*S used for dimension measurements. It is based on the singular value decomposition from matrix algebra. also known as principal component analysis. Let Yt, .... Y.L be the reconstructed vectors in R". where L is the length of the Embedology 599 data series. Following Broomhead and King. 5 1 define the L x tv trajectory matrix where the i are treated as row %ectors, The ot'artance m"ctrux of this multivariate distribution is .41.4. The off-diagonal entries of .4'.4 measure the statistical dependence of the variables. The singular 'alue decomposition''"' of the L x it- matrix .1. where 3 I = V'SLUr .4 where 1 .. an L L orthouonal matrix. C I,, a it x it orthogonal matrix Ithis means that 1' 1=/, U '= i). and S is an L t diagonal matrix t meaning that the entries a., of S are zero if i - ). By rearranging the rows and columns of V and I. ,e can arrange for the . mnqdur ralu,. of. I to s rt, a :,. .. 0. The bottom L - rows of S are zero. The ,ingular value decomposition suggests the uhe of the lilter 8= (. That I.N.instead of plotting the %ectors V ..... i, in reconstruction space R". plot the ectors U, ..... .'.. One immediate positive consequence of this change of variables is the ,tatistical linear independence of the new variables. The covariance matrix of the new trajectory matrix () is .A,) r. IU= SrS. a diagonal matrix. In practice. one can do better than 8 = U . This 14 because some of the nonzero ,ingular Nalues are dominated by noise. .\ rule of thumb ih to ignore (by ,citing to zero I all singular \alues below the noise floor of the experimental data. Ignoring all hut the largest A ,ingular %alues Is equivalent to letting the filter B in Eq. IS.1I be the top A rows of Lr. The rows of L r are orthogonal. so B is still full rank. Theorem 3,1 implies that F(B. h. g)j will typically be one-to-one and immersive. This program was followed in ref. 2. in the context of measuring the correlation dimension of chaotic attractors in a stable way. They used a filter B that consisted of the rows of Lr that corresponded to singular values above lo - '. 600 Sauer et al. 3.3. Conditions on Periodic Orbits Which Imply One-to-One For special filters B,conclusions I and 2 of Theorem 3.1 can fail. but only for periodic points. That is. some periodic points of period less than ,'may be mapped together under the map F(B. It. .). For example. assume B=( (0 i , ) 1 4 4 4 I- -0 4 1 and assume that g has a period.4 orbit, that is. g.(x) = v. Then for any h. F(B.h. g)maps all four points of the period-4 orbit to the same point in R' so F(B. h. g) fails to be one-to-one. There is no way for any observation function to distinguish the four points. ,ince their outputs are being ,iveraaed omer the cntire cyclc. Thus. the filtered delay coordinate map luk. tor ail observation functions i. to be one-to.one. A similar problem occurs with the filter 0,000 0 3.61 t Now F(B, h, . "= t!h(.\') + h(g-(.\'))), I((¢.") + ht( g:\I1v M. \ssume that ihe period.four orbit o, , constts of \ =,t\,, . v.. V1 = .t Vol. and x,= &.\,). Now \,, and ,are mapped to the hame point in thc reconstruction space R' by F(B. h. v i. and the same goes for .\ and v'.Again. the map cannot be one-to-one tor any h. A second obvious problem can be illustrated when the dynamical system has more than one fixed point. No matter how Itis chosen. the filter -- I- B= 0 t-0 3 37 0 t) t 3.7) maps all fixed points to the origin in R', violating the one-to-one condition. In each of these situations, the underlying dynamical system g may dictate that some periodic points will become iu,...:fled under a particular 1'Enibedology ft '601' 'filter B. no, matter how- aeneric-' thd obser~aionri function I:.,On, the -other hand.,these identificationis, occur. only at periodic ,points. Further., even, in, ihe, case of, periodic points., it turns out, that the restrictions onl B edxem-; plified- by 'the, three cases aboveare the only restrictions. That-.is. ifthds&,are, avoided. then' F(B. Iz.g), is one-to-one for a-prevalent set of observation functions h~. To be more precise about tihese restrictions'. we need to -make, some, definitions. For each positive integer p. denote. by ..4. the set of period-p' pointsof g lYind on 4. That is. A,P' .\ .4,: g (x) =,. . 'Let 1,, denote the a x-i identity matrix and (.jdenote greatest common, divisor. WVe wvill use !he convention that (p. 0)= 0. For integers p > q> . define the P x Ip -(p. q)) matrix I IA2. I Iminn. 3 50 3 Define C " to be the r. x (p- (p, q1))) matrix formed by repeating the block (',., vertically, and for a positive integer it. define' C";: to be the matrix formed by ?he.top, ' rcws of C"~ Theorem 3.3, Let C be an open subset of. R'. let q be' a-- smooth diffeomorphism on C. and let .jI be a compact subset of' U of box-countinig dimension d., Let it and it be integers satisfying ivw n> 1. Assume (hat B is an it x it- matrix of. rank /1'wvhich saUtisFc-s:, Al.. rank' BC;:: > 2, ho xdim(. I, for all, 1I p < v rank bC~ oxdimAjib'r a litI p~w Then for almost, everv' smooth function/h. ,F(B. It. -' is one-io-one (in Remark 3.4. Note that 'rank C,,, = p -T(p. qj). and so rank. C"IN mill'wi. p- (-p, q) I It follows that rank BiC" min~n.-p: and rank BC". 3 2 p1 1 for B= /,,-and also -for almostuevery it x it matrix B. To illustrate the restrictions that Theorem, 3.3 puts on moving average Filters, assume that B) is the S,x 6 matrix 0.5). In particular. the filter B must satisfy condition \2 for'p =4. q = 1. which means I 0 o) 1 0 rank,.B - 0-I0 010 > boxdim .4 602 Sauer et al. The rank on the ieft-hand side is Zero. however. and-if there exists any period4 orbit. the filter t3.3) fails this condition. This is consistent with .hat we have already noticed: in the-presence- of-a- period4 orbit, the-map F(B. h..gJ is ndt one-to-one for any h. The filter 13.6) satisfies the above conditio-dhas lone as there:ate finiteiv many period4 orbits. However. it fails condition A2 for P = 4. q 2. which .equires I) tankB -1 0 1 boxdim.4 1 0 This is aiain consistent with our earlier observation. Finaily. if there exist fixed points, the tilter i. l'aiis the condmon XI for ) I-if there exist fixed points. That is because condition AI requires rank B I > 2-boxdim A1, Since the rank on the left side is zero. the cohdiigdn fals unlegs the .et of fixed points is empty. 34. Conditions on Periodic Orbits Which .lmo!V. an Immersion Therc are also rather obvious situations When. cttaui -filters cause F(Bh Iig) 16 fail' anan immersion. Asstiethav- -is a circle that has a fixed point x. Assume-that the derivative of if-qt v-is -2. Consider the filter B . 3.9) In this case. the map F(B. h.g) cannot be an immersion at x forany observation function h. For a tangent vector c in T,:.= RI. the.derivative map is 603 EmbaologY Vh(xI V h( " " xj D F( . h ., " \ V h lg - =6 I. "r , i _gtx x" I D" X I iDx OlVh(xjic. =t) I ) 7 Vh1( x1"1. - .,r I .. so-the tangent map of F(B. h. -, at x is the zero map. In the case of an ie-dimensional- manifold A! with a iixed potnt X. .t can be checked that fo- a filter B of this. type. F(B. I. -,I wlii fail to he an Iimmersion f r all.h as ionri- as -the iinearization " . . as- on_: .env'aiue ot .. .is n the one-to-one case. the immer ion-u%%il fa.i oniv ,,r ncriouic points. To be precise. given numbers c,.c,. define the . <rp matrix :1D'~... cjf.- "' (' 3.10) (310 where I,, denotes the p x p identity matrix. For a- positive integer %i.let ..... cj:If the .... matrix formed by the top w rows of D c,) be the c, are distinct. then rank D,(c, ..... c% = min, w. rp, .. I.- . . . . . Let C he an open subset of R'. !ct , n'e .I ,mouth d iffe9morphism onan C. and let .1 be a compact ,ubset_7-n-*,i a.. , ,mooth • L. , .ssume-0t,'. Let w and n b iefiid-NftKsfvingivw-. '.. -manif~fold in the linearizations Dg" of periodic orbits of period p less than or equal to w have distinct eigenvalues. Assume that B is an n, x w matrix of rank ,, which satisfies: Theorem 3.5. A3. 5 . rank BD;*(l.,. . > oxdim In<. and for all subsets linearization at a point in A., P 4.' r ... - for all I< p < iv of. eigenvalues of the Then for almost every smooth function h. F(B. h. g is an immersion on .4. Remark 3.6. See Theorem 4.14 for a- proof. Note that since rank ( = min w. rpl for distinct eigenvalues ,;,. it follows that rank BD= min{n, rp,} for the original delay coordinate case of B = 1,, and also for almost every n x ivmatrix B. 6~. Sauer er ak. TO illustrtethe- condition X) is asafixed point with Un eiOgenfaiue of -rank ~D(-2 >Q. but :~. o i~ 39 hn~z~ 2. .at -c6nditi~ii reciuiret' that. OD 2r1j 4. PROOFS This section contains the proofs of the esults stated above. After some fundamental- -lemmas, w6 !ivye -thc -proofs of the WVhitney forms of the qmbedding theoremts. These follow 'Umma-4-I1. The proofs.-of the -delay-coo rdinate- forms involvinia 11lters. Theorems 3_3 and 5. :oilow immediateiy frm. Theorems -',L3 and 14-- _rSpeCtIVeiy. FhIS ,:CEuon concludes with the proof of -Theorems 2.- and .-. which ard sed:ai, casds of Theorems 3.3 and 3'.5. Lemmra 4.1. points in R'. and.it, Let i ar's /. be positivc i.atesvers. .... it,, in R. c,.. j%...v, distinct L-in- R'. 1. there exists a Oolynomial Ii in A variables of degree -at most 1 - I such that l'or = I... it. hIv, ,= it.. 2.- There exists a polynomnial-/i in-k variables of dearce-at mnost n such that- for i .t.V/~,~v) Proof. L. We may assumc. by linear chanize of coordinates, that the first- coordinates of ..... ..are distinct'. Then. ordinary onfe-\ .nable inter- polation- guarantees such a- polynomial. 2. First assume A = 1. There exists I poi,nomial of dciiree at most n - I in one variable that inter-polates, the data. The antidcrivative- Is the In the general case. by a linear changc of coordinates. may assume that -for. each j L..k. the ifh coordinates of I,, are distinct. The above paragraph shows that for j =Ik -there is a polynomnial of degre at most n in the jth coordinate . whose derivative h\%. interpolates the 1ath coordinate of it, for i ~i.n The sum of all k of the-se polynomials is a polynomial of degiree at most n which satisfies-the conclusion. Lemma 4.2. Let Ft'x) = x + h be a map from R' to R". where Nt is an~tn x t matrix and R". For a positive integer r. let - - 0.be the r th largest singular value, 01 M. De'note by- Bp, the ball centen.- a..the origin 'Embedalogy -605, .of -radius,p in RK. and' by. B the bal c~rtrtd ar theiorigin of radius,6 in, R.Then Vol ;B% ) , Proof. Note that- -decreasing aiV 'v singular, value of A- does-not dtcrease the Ieft-hiand,,side. Thus we ntay asgumne that the singular values 'O jlf..atis*ya. LetJM=_ VSUrbe, the sinIgular--value tuicOmnposition of M. Here S is a diagonal: matrix with 'entries a and all, other. entries zero. V is an inx n orthogonal matrix. and" U is~a ux torthogonial matrix. Sihke the columns of 'U and J each- form an orihonormal, set. we recop-hizc AID., =jVSL'r ,_as an r-dirnehsional, ball of radius a olvinL!T in R1. In fact. the first r columjns of V-mdhtniied- by th-e factbr r,,) d di which, span _A1B,,. The set, F n-~ -Bp,consists of the, vectors in B0 whose image~by A! lan 'ds in a ball, of radius 6iin R". This is a-cyliridfrial subset of B',. with 'base dimcnfsion r and -base radius,a. The subset thus has i-dimrensionial volume less than W6aj'C~ ',, where , r~~,) eoe the volumeo ihe r-dimc-nsional: unit ~ball. The volume-OTAB, is p'C,. so . Vol 11,F(B,s) (Iap q Lemmiia 4.1. Let S be a.bounded subset of R'. boxdimi) =-d. and let-~ ,~G be 'Lipschitz maps from- S to R". A\ssume that for each v InS. the--rithiarg"s-sin ulaf-valtie of-he ,rx trnatrixM~ :G H GCJ.v is at least T>O0, For each :ce R' define G,, Gj-i-+ x.G.. then for almost every in R', ihe-set. G--'(0) has lower box-countina dimension at most, d- 1%If tr>d.L then, G;(01 is empty for almost every X.. Proof. For a positiv'e number p. define the set B, to be. the ball of radius p centered at- the origin in R'. For the purposes of proving the theorem, we may replace R' by, B.. For the remainder of the proof. we will, say that G, has some property with probability p to mean that the Lebesgue- measure of the set 'of a e B0 for which G%has the property is p times the measure of B.. For example. if xc-S, then Lemma 4.2 shows 606 Surta ,that '+ 'J e.iG~ for cC-3B., wh -Probabiltatios Ldt_ D > d'. and'let,.., - 0 be- such that-for 01 < c~ , the foll'owing,two, facts hold. First, S can: be covered -bV i: "~k-diriihsionAi balls B(x. c)o radius-e. cenitered, at xqS. Second.,by the zLipschitz- codition there -exists a constanit .C such that-the irbagt under any G'.~'..o~.~~bl nR intetse~cting S is contained' in a, -C-baliL i R*7. For ,thd-remaindif Of the prdof., we assume c <-j) The probabilit -y that the set Gj((.,: ij ii o is ,at most -the Probability 'that ;G,(x )I < Cc. which is a constaint, times L'. s ince 1)and' r, are fixed. For' any,positiv& number It. the probdbilitv that at least V! of the c'n' images- -G (B(x. cy)- contain 0, is at most, -C V'"f' Therefore. G,'0.can be covered by fewe& than Mt'='i: " of, the i:-balls. except with probability at -most re 1.As lone as h > D-- r 'this probability canbe-made as small as desired' bv deceasing i:. Let p >.0. There-is a sequence ;;a pprouching o such-that (J'0 can-,be-covered.-bv fewerthan zi: "halls~except--for~ probability at most, 1 Thus, -the lower'boX-Co untinpg' dimension of G. "(0), is at 'mo'st h.except for a probability p, subset, of :(. Sne/>0wa' arbitrar., lower ,boxdi'r1 fG;'01v "d lower bdxdimt G 7(0)) I-r 3 -r. Remnark4. In case boxdimi Sydoes not, exist. the hypotheses Oif the lemma can b'e slightly weakened by allowinii d to, be the lower hox counting, dirridnsion of' S. A slight- adaptation of -the' -proof sho s that boxdim can, be replaced 'throughout Lemhma 4.3 by Ha4usdorff dimension. In,particular.i ir > HD-S'). -thenG;7'M, isempty for altnost every c in R'. If in 'Lemmba 4,3 we assume that rank(M,) ;4dfor each xye.S-instead: of the assumption on the siniiular values, then 6,10V-Is empty for almost every x. That is because one can apply Lemma 4J3 to the' set S, c- S E rth largest, singular value, of' Af,,>_aT to gev -I~l)~~.then SU>S, implies G;"(0) =,. We state- this fact in the next lemma. ' Lemmna 4.5. Ldt S be a bounded subset of R'. boxdimiS) = d1. and let Go, G .. . G, be Lipschitz maps from S to R". Assume that- for each x in S. the rank of -the n x t matrix GVb 1 G, (x-. is at least r. For each i c-R' define G, =G + 1 xG,. Then for almost every 2 in R'. the set G,-'(0) is the nested countable union of sets of lower box-couritihg- dimension at most d -r. If r >d.1 then G-'(0) is empty for almost every 7. 60 tmb eddg-V I Lemma,,4'.6. Let A "be a compact subset ,oflRk."beLet.F, ,,., F, Lipschitz maps ftom A.to R-". For each, integer r ,., let S7. be the -set of" pafrs "-Y in.A for .which the trx-r, matrix 'a j-(X' Flo").... . F( -F 'has ,rank r. and, let d, =lower. boxdim( S, Define F.- F)- ._,, .4,7: R?. Then for =. .. outside a measure zero subset of R'. the ,flowing hold': I. fd .<r for Aintegers r >,O., then-the map F, is one-to-one. 2. If d , r for some integerr >. 0. then for every (i > 0. the lower box- II Proo. V:or t= 0.... deline G, I ( i. On the Net, S.. -i the rank of the n x t matrix ;s r. Ii If r> d,, Lemma 4.5 show that for almost every x e R'. the origin is not in the image of S, under the map G, = ,.",, or equivalently. ; F,(y)1Tor v ,i' in S,. If r > I, for all r. then F, i., one-to-one. since pair x les- in' some S,. H If ,.. d,, let (A x A ,, =, (x, y Iq.4 x A: ,x - Yl ti, bebthe subset of(-distant pairs ,of points in . x A. Since (A x A),, is compact for any J > 0. the minimum of the nth singular value of V,, in A< .I , ,. greater than 0. Lemma 4.3 shows that for almost every i. the orin i i ..i I 1)J) 1< for a subset of (..I x 4 ),, vith lower box-counting dimension at most a, - r. Therefore the (-distant self-intersection subset (F,,)of .A.which is the image of this subset under the projection of (A x .A), to A. has dimension at most d,.-.r. 3 o,(x) 3 1each '1. , Theorem 4.7. Let I be a compact subset of R' , lower boxdim(.-l) = d. If n > 2,d. then almost every linear transformation of R' to R" is one-to-one on A. Proof. This follows immediately from Lemma 4 6 and the remark following it. Let 'F,l be a basis for the nk-dimensional space of linear transformations. For each pair x - y, the vector x - v can be moved to any direction in R" by a linear -transformation. In the terminology of Lemma4.6. S,, =Ax-Aand S, is empty for r-#n. Since lower boxdim,) - 2d< it. almost every F, = P,F, is one-to-one on A. I Io -608,Sure~i Remhark 4;.. It :is initerdsting. tha t, -no statement similar toTheoremA47 can be made if 'box-coUnting, dirnesiii : epacedx' by, Hausdorfft dimension. In. an- Appendix -to -this, work 0provided- by. Kanh. examples are tofistructdd of' comfpact subses .4)6f. a&y .Efclid an, spacO Rk that'haveHausdorff, dimension, d,= 0. and, such that -no pojecion to -A"f~r it < k, 'i .one-to-one onA. This strikinfg, difference between, tboX-c6untintg, dimension, and' Hausdorff dimension, is -related to the fact that Hausdorff dimensionv does not work well With products.. Extra 'h vpo theses are needed on, C. in par-7 ticular on 'the. Hdusdorff -dimension- of the product, A x A. to prove- an' analogue, to Theorem 4.7. For example., M46i has shown (see ref 17 and its correction in. ref. 9. p.,611), that if ii,> HD(A 5<A) +J-. thenr the conclusion (if theorem 4.7 ag-ain 'holds, Of' course. using. Lemma 4;3 n Remark 4-.. ,it turns out that only it.> I-DA x<.4) is required: Theorem 4.9. Let A be, a compact, subset oft i.nd ]Lt,it > H-D( .{. -.1 ). Then almost eyery linear transformation- of P% o R is one-to-one on A. It- was showvn in ref. 10 that under the hypothese of Theorem 4.7 almost every orthwgonul'projection is one-to-one -(and -in-flact ha~sa, Hbider continuous inverse). Definition 4.10. For a compact, differentiable mnanifold 11. 'let =(X.Tu): Xv e Al, v T%At I be the tatkgew bundle (if Al. and 'let =(x, ',A) .v) e T(MA); Irl = 1 'denote the unidt tangent bundlile o' Nt Lemma 4.11. Let A1be a, compact subset of'a smooth mnanifold' embedded in R'. Let F, 1 . P,: R R" be a set of smooth. maps fromn an open neighborhood 1U of A to R'11 rt- each positive' tnteer r-. let S, b-,, the subset of the unit tangent bundle St.]) such that the it, t matrix has rank and let i, =l1ower boxdimiS'). Define: F U-R". Then the' following 'hold: = F,,- +, . ,: I.- If df- < r for all integers r > 0. then for almost, ever;' : e R?'. (Ihe map F, is an imtmersion on.. 2. If dl, >, r for some r >, 0. then for almost every 7 e R'. F-, is an immersion outside a subset of .4 of lower boxdim, <, (1 - r. Proof. For i =0,..., 1. define G,: S(A) - R" by G,(x. vti= DF,(x) v. If r> di, for all r > 0, then Lemma 4.5 applies to show that for almost every G (0) r)S, is the empty set. Since S(A) is the union of all S, G'() G.- Em-dolgy ,, . , A' ,3 009 ' i I.", SF, I. is'empty. Thus. no ufit.tangent vector issmapped to, the origin -anirhimersion. ,and F 1 is Incase,.;ld -for some ,r. there -is a.,ositiVe !lower bound -on the singular vaes, o'the Q., on S(A); Lemma,4,.3 imp!ies thatl therel-ss .s.bset of unittangent vectors-.. of lower- boxdim 4,-r, that, can map to zero., The projection 0f this subset into .4 has lower boxdim,;!,- . 3 Proof of'TheoremS 2.2. 2.3, and2.; 10. Theorem 2.2 is a special case -of Theorem ...To. prove the 'latter. we need to show that a prevalent set ofmaps areoie-to-one and immersive. Let F,...,.., be a basis for the set of linear transformations from the notation, of Lemma 4.6. the set S,, = .4x.4 .1 and S,= ,0 Rk - R", In for r z . Sincc'boxdim,4 x A) F7 is one-to-one on A for almost n,t2d< F, are. added. the rank of .1,, every ze R'. If any other maps F. .... cannot drop I'or any pair .\v. so almost every linear combination of F.is one-to7one on .. .... The -proof of the immersion half uses Lemma4.1 1 instead of C is a subset of a smooth manifold of Lemma 4.6. Since boxdiml{) = d, dimension at most d.'and therefore boxdim S(C) < 2d- I. In the -notat .on of Lemma4.11. S,,=S(C) and S,= 0 for r On. Since i>2d>2d- I boxdim S,, the proof follows from Lemma 4.11. The proof of Theorem 2.10 is similar, except that the second part of the conclusions of Lemmas 4.6 and 4.11 are used. For example. in the use x .4- J and S, = 0 for r ti as before, but now of Lemma 4.6. S,, = A4 boxdimt.4A x A)= 2d> it. Thus for each 5> 0. for almost every F, the j) has lower box-countin dimension at 6(F, 6-distant self-intersection set most 2d - i. The immersion half is again analogous. I Let ' be an open subset of R'. let g: U'- L be a map. and let h: U-, Rbe a function. Let w <i - be integers and set w= Iv"-iv- +1. For I <i<i, set g,=g,,--. so that .,=g" and Definition 4.12. g,, =g ". Let B be an i x w matrix. Define the filtered delay.-coordinatemap F,,:(B h. .0: U-.R"1 by r F,,I(B. h. g)(x) = B(h(gl x) ). hM g2(x) .... h(g,,x))) = B(h(g" (x)),.... h(g" Ix))) r Theorems 2.7, 3.1, 3.3, and 3.5 are corollaries of the next two results. for which we will use the following notation. Let g denote a smooth diffeomorphism on an open neighborhood U in R '.Let h ,.... h, be a:-basis for 4th6, polytnomials in, k.arables, of .degr6 at,. most l.' f. a srmooth funicition iZ) on i~adrR. deffine- Ii. -r7 - ',hFor, each p9l4eitgrp enote, by':. tias t, period-p -p6intsof lynonA That x.1~ txA: g~x . ~the~rniatries C b-e as in iheorem, _31.3 . Theorem 4.1,.Lx be a, smnoth diffeohqrphism-on an, open. ,neiahborhood- U 6f,' R ' a~dltAb b~atsbe fU odm K d. Let it and- w it r bhe-integers. iv< it w-- Assumne'that, the It' it X w maffix-ff'satisfie's: A2. tlank BC"' > 2 b6xd.imni.- )for a1lf I boxdimfi A~ for all I <, q~ <:P < Let ., It be a -basis for the polynomials in k variables of degree at most, 2W. Then for any smooth functionvi, on R', and for almost every x R'. 'the I'dilowinm: told: If it > U. -then FitB, It,. g j: U' R" isone-to-one_ of I-[ 2.If it -, 2. then for -every Ji > 0, the i-distant self-intersectiotv et 1;Fw h, ), 6) has lower box-counting dimension at most 2d- it 'Proof. For i I.. defin-. I. /hI, (g 1 By definition. F(B,., 2 )~ F, To use Lemma 4.6. we need to check for each x vthe rankoof the matrix Al"= I F,(.) - F,(y I.. .(xVI - .(yII which can be written as q<_ 2w. the :, are distinct, and J = iV is a ivx q matrix each of whose rows consists of.zeros except for one I and one 1. By part I of Lemma 4.1. the 3 EMbedblog'y61 rank of H is. q. We divide-the study oif"the rank of. Af ,, i-8JH into-three I cases. Case I: .vand.Yare notboth periodic with-period <-i, Invthis case. J,, is upper or lower triangular_and, ank,,,,) w.Since B.. I. and H are onto linear transformations, the product BJH is onto and has rank nt. The set of pairs x yv of case I 'has box-counting dimension at most 2d. and rank(M.,) = n.If g has no periodic points of period < w,we are done. and conclusion I (respectively.,'2) of Lemma 4;6 implies conclusion V: (resp., 2) of the theorem.. The remaining twocases are necessaryto deal with periodic points of period < ii, We show that conclusion I of Lemma 4.6 appliesin both cases. Case 2: v and y lie in distinct periodic orbits of period , Assume p and -q are minimal suchthat ,i;(.1 =x.. -,(yv= ; and that I < q 2. boxdim .. ,,which is the box-counting dimension of the set of pairs treated in case2. By Lemma4.6. for almost ,very xeR', '.(xJ-#F,(.-) for every such pair x # y. -Case3: Both v and v lie in the same periodic orbit of period < w. Assume P andq areminimal such that g(-x) = x.g(x)= y. and that q boxdim .4,p, which is the dimension of the pairs x # iyof case 3. Now Lemma 4.6 applies-,to give the conclusion. | aI Theorem 4.14. Let g,be a smooth diffeomorphism tn an open neighborhood L' in R'. and let .. I be. a compact ,ubget of a smooth "i-manifold in C.Assume that the linearizations of periodic orbits of period less than w have -distinct eigenvalues. Let a < itbe positive integers as -in Theorem 4.13. and assume that the it x it- matrix B satisfies: A3. rank D (;,, ,>boxdimtA +r- 1) for all I p< i. I<r<, and for all subsets ,..,, of eigenvalues of the linearization DgP at a point in .-1 . Let h ...... h,be a basis for the polynomials in' k variables of degree at most 2w. Then fr any smooth function h,) on R .and for almost every ;c e R'. the following hold: . If n > 2m. then F(B, h., g): U-- R" is an immersion on .4, 2. If n < 2n, then F(B, h, g) is an immersion outside an exceptional subset of A of dimension at most 21n -t- 1. Proof, To, app, 'matrix Lemma,4.l ,D F,.( \) -we need,to check the rank of the,,ii=x v .. DF.I x(.v )), (4 .1) for each Lin vx. the unit tangent bundle S(A )LFor a given observatlon ThAction h. the derivative of7F(B. h g)his hFBl. l'=&I'a, U,. Vh~g'-()) g (v)/ If x is not a periodic point of period less than w, then g""- v)..... (.x are "l.vI distinct poinis. The facts that ,, is a diffeomorphism and r ?0 imn'l that D9"(.V)L'?=( for all i, Thercfore by Lemma 4.1. part 2. the set of %cctors 'DF(B. hi, :O.x,: ;c-=_ R'' ipans R". In the notation of, Lemma 4 II. the subset S, contains allpoints of, S(.ij:that are not periodic with period, less than ,w, and d,, = lower boxdimiT,,) < 2m - I. If g has no periodic points of peribd less than w, the proof is finishcd. by Lemma 4.11. If x is a periodic point of period, p <u. then HP It'l v B DF(B. It. g .)t= IfD Hl D where •,=g 9 i '{IX)= vp, bi, = VIZ~x, ) wt'-"Dgt. , )..- Dg{."x1}Dg" (.xkt Dj= Dg(x, I)"".DgxtI) Dglx,,)... Dg(x,) Each matrix D, has the same set of eigenvalues ,. ... and by hypothesis, they are distinct. If u t.... I is a spanning set of eigenvectors for u,,, DI, then it checks that uji= Dg(x,_-1 )...Dg(x)u, for I <i<p, 1 <j.n defines a spanning set (uj, ....u;,, of eigenvectors for D,. Thus. if !1! EribedologV 613 lot =7 , auis the eigenvector expansion of wi then the eigenvector expansion of w is Z,". 'a~uq, which has the same coefficients. i Ii Thus DF(B. h. q)(x)Iv can, be written as Bftimes the is-vector V. " • l0 "! .. . 0 0 0 ... 0 /.,o, I'Io 0 0 ,. 0 4M 0 ... o 0 ... 0 HI .+ + 1011)0 ,,; ..t.;,0 0H0 l r (42) (0 0 .. 0 ;" ', ;.i ...0 To find the rank of the matrix (4.1),for (x, v) where x is periodic, we need to find the span of B times the vectors (4.2) for I = h, = T x,,. i e R'. Assume that the eigenvector expansion of v has exactly r nonzero ....a,. By Lemma4.1, part 2, the set of vectors 'Vht,(.,): coefficients ot e R', spans RI. Then because the it,, I < i n.are linearly independent. the vectors of form (4.2) span a space of dimension min[w. rp as x spans R. I,,, Therefore, for this iv. ri. the span of the vectors 14.1) has dimension By hypothesis. the hoxdim of such equal to the rank of 1_D.,. . pairs (x. v) in S(4 ) is boxdimt..I) + r- 1. By hypothesis. the rank of the it x t matrix (4.1) is strictly larger. so that Lemma 4,11 applies to give the conclusion. Proof of Theorem 2.7. Apply Theorems 3.3 and 3.5 with B= 1,5. According to Remarks 3.4 and 3.6, the conditions AI-A3 translate to p>2.boxdim(A.), p/2>boxdimA,), and min{n. rp' >boxdim.4,)+r- I. respectively, for I < p < n and I < r m. Thus, the hypothesis boxdim(A,.) < pi 2 guarantees that AI-A3 hold. Proof of Theorem 3. 1. Since Ap is empty for I < p < w. the conditions AI-A3 of Theorems 3.3 and 3.5 are satisfied vacuously. I 614 Sauer-er al. APPENDIX. HAUS,ORFF DIMENSON-'ZERO SETS WITH NO ONE-TO-ONE PROJECTIONS Intai Kan The purpose of this Appendix is to construct a Cantor set C-R" whose Hausdorff dimension is zero and which has the property that everyprojection of rank less than ti is not one-to-one when restricted to C. Definition A.1. The iautisdvrt s-drnetnstota outer iieasurec'uI a set K is "'(K)=limn inf where the L.!' U |.ntimum is taken over ail covers C ' .I' K vith the diameters o the C. uniormity less than o. The -.auurlt ad.wension ol a nonempty ,et K is the unique valu o1's such that 1K)= f. if t<s and ty"(K)=0 if '. Example A.2. We construct tl-.e subset C of R' as the union of two sets .4 = iJ',., , and B =,', 8, each of Hausdorff dimension zero. with the property that for any projection P of rank less than ti theimages under P of A and B intersect. and thus P is not injective when restricted to C. The set .1, lies on a face of the unit m-cube and a = (a1 .,a,..,,) is in .Af, if it satisfies the following restrictions on the binary expansion a, = a!aa(I ... of its coordinates: I. If i=,#. then a = 0. 2. If i:n and k i.o. then either Ia)a.=1) for allI =_0.k, .,t]: or (1:) at'= I for all Ie(.Vt, f. .. I]. Here the sequence 0 =Mo0 < .t ... increases sufficently rapidly ,o that limtfM,. 11 ,) = -..If i =n. then the orthogonal projection of .. , on the ith coordinate axis is a Cantor set which can be covered by 2" intervals of length - . where r, = k + 7". 1 (V,,- _ . Thus. 1., can be covered by 21'- "' cubes with edges of length 2 , Since r, Af,. we see that lim. - (i- I ),,. - I =0 and both the lower box-counting and Hausdorff dimensions of A, are zero. Since .4is the union of rn copies of A,,, we see that both the lower box-counting and Hausdorff dimensions of A are zero. 'Department of Mathematicai Sciences. George Mason University. Fairfax. Virginia 22030. EmnbeddobgY :615 'The-'set ' lies on ,a face of 'the .unit ,n-cube opposite A ,4, andb is -in B, if it, satisfies the -following, restrictions on tuhe' biniary e.xpansion of 'its, -coordinates:' V.Ifi~.then b,'=: 1. 2. If i t n and ,k 0, then either ta) h'1-'0,foral1-' IG( or (b) h'=,,l 'f6r~aI I,''k IM J1.k +]. k MA,21; -V'k Here H,,L is as above. The lower box-countine and *'Hausdorffdimensions of B are zero. The Hausdorff dimension of C = AukyB, is zero. Let P denote a projection ofi ranik less than in, Let~vc; i U1 , v. ...,)i the null,".spaice of P be chosen so that for all i and r, 1,for some particular n. We now show -that P-r estrictdd to C is not injective by 'finding some 1)= B, and a e.., such that r = h --. Using t~he 'binarv* expansiofr notation. %v,def-ine it and 1)as f'ollows: IfIi =u. then d (Yand h'= 1,. 2~ Ifj'11 and k >0, then '1a). a~ 0 and b" r' for all ,C VU., MA.I ; and. b) a',> (v'i. I-) mod'2 arid h>, I for all IcoordinateI Clearlv we have, v = b a.and ,by the definition,,of A,, and' B., we also have aeC 4 and 1,e B,.3 *additionially * ACKNOWLEDGMENTS The rdsearch of T.S. and J.:\.. was supportc' d by the Applied and Computational, Mathematics Program of DARPA. that %)f J.A.Y. by, AFOSR and the U. S. Department o1f Energy Bgasic Energyv Sciences ), "and'that if M.C. by grants to the Santa Fe Institute. including core funding from the John D. and Catherine T. MacArthur Foundation. the National Science Foundation. and -the U. S. Department of Energy. REFERENCES L.H. Abarbanel. R. Brown. aznd J. Kadtke. Prediction in chaotic nonlincar systems: Methods for time series with broadband Fourier spectra. preprint. 2. A. N1. Albano. J. Muench. C. Schwartz. A. Mees. and P.Rapp. Singular value decomposition and thc Grassbereer-Procaccia uaorithm. Piws. Rev'. .1 38:30-17-3026 (1988). 3. V. 1.Arnold. Geometrical Mtetlhods tin the Theori. ol Ordinart' Differential Equationis (Springer-Verlag. New York. 1983). 4. R. Badji. G. Bropgt. B. Derighetti. M. Ravani. S. Ciliberto. A. Politi. and M. A. Rubio. Dimension increase in filtered chaotic signals. Phiy.. Rev. Lett. 60:979-982 (1988). 5. D. S. Broomhead and 0. P. King, Extracting qualitative dynamics from experimental data. Plwsicat 2OD:.217-236 (1986). 616 Sauereuet al. 6., M. Casda Ii. Nonlitiear prediction of chaoic time series Phymrca 3-46:335-3S6 11989,1. -CasdaalL -S. -Eubank& -D. -Farmcr, ad J. -Gib~on. Stae-sbace rccbnsrc=.:on-_in..the presence oi noie. preprint. 3. W. Ditto. S. Rauseo.; an 'd M. Spano. Experimental control- of chaos. P!:. Re. left. 65:32 11-z--14 11990). .M. 9. i.-P. Eckmann and D. Ruelile. E-codic thecorv of chaos and stranize 2ttractors-, Ra-. Meutd Phfs. 7:6174~56_(1985). 10. A. Eden. C. Foias. B. Nicolaenko. and Rt. Temnam. HIder-continuity for ie.in'. erse of -'Maiiis-projection.-Compies-Rendus.-to aippear; 11. K.' Fdlconer.-Fractul-GeoirneirrM ile'.. New York-. 1990). 12. 1.- D. -Farmer. aind, J. Sidorowich. Predicing chaotic time efies. -Phui. Rer. :Leir. 59:845-848 (1987). I3 1 . Farmer-and J. Sidorowich. Exploiting chaos to piedict-the., Wuture and reduce noise. Technical Report LA-UR-88-901 Los Alaros National Laboratory u1988). 14. G. Gjolub -and t. Van--Loan. Mfairixr Ciiinputatiums..,nd cd. (Johns -Hopkins Uhi'.crsity Press. Baltimore. Miaryland. l91. 15. E. Kostelach and- J. Yorke. Notse reduction: Findine the- mimolet d.1namicat s%,icn consistent ".nh the data. 1-Pua 41 Di_" ,- 146 11990). 16. E. Kostelich and 1. Yorke. Nise reduction in dynamical -t.ic .':. - 38:14%-I 652-I l9881.17. R. Mafia. On thc dimen'sion (if-the compact invariant,- -ets off certain nonlineair nips in Lecture Notes in Mu:thilinuuc. No.-IS99 (SprinL'er-VerlaL'. I98!). Ig; P. Martcau and H; Abarbanel. Noise reluctivin in ch- ic. time Nec.N u-,inel 'caled probabilistic methods. preprint. 19. P. Mattila. HausdoriT dimension.' orthogional projections -and -intrsections- u'ith planes. Ann. A-cad Sc. Fcnn. MthI. 1. 2 "4 (1975 1. 20. F-. Mitsckc. M. \18ller. .ind W Lance. Meawuuifi filtered chaotic -Ienais. Piwi. Rer. 1 37:4 S-452 1- (I (AX ). 21. N. Packard. J. Crutchfield. D: F-armcr. and R. Sha..-Gcometry from ti line -crics. Phi. Rev. Lcit. 45:7 12 (1980). 2.W_ Rudin. Real- and (Ctptw Attahis. 2nd ed. iMcGraw-Hill. New York. 110741, 23. J.-C. Roux and H. Switncv. Topolocv Of chaos in a chemical reaction. in Vwtnar Phenomei~na tit Chemical Dv-ratturs. C. Vidal and A~. Pacault-. eds. i Spriket. Bterlin. 111.1(1 '4. B. Hunt. r. Sauer and J. Yorke. Prevalence: N\translation-in'.ariant -.lIm(N~ :%crv'Il in. iite-dimcnsional spaces. prcprintt. 25. T. Sauer and 1. Yorke. Statistically N.elf-simiiar %ets. preprint. 26. J. Sommerer. W'; Ditto. C. Grebom. E. Ott. and M. Spano. Experimental confirmaion (if the theory for critical exponents %;fcrises. Plhs. -Leit. .1 153:105 -109 11991 ), 27. F. Takens. Detectine ttraniie attractors in; turbulence. in Le-wure Notes~ tit 1lii',iitiic. X98 (SprinL'er.Verlae. 1991). 28. 1. 1ownshend.* Nonlinear prediction of speech signals., preprint. 29. H. Whitney. Differentiable manifolds. . Ign. M~ath,. 37:645-680 (1936). 30. 1. Yorke. Periods of periodic %,olutionsand the Lipschitz constant. Prnt (Im.Wtful. So. 22:509-5 12 (19691. Printed by Catherine Press. Ltd.. Tempelhof 41. B-8000 Brugge. Belgium -Nonlinearity 4 (1991) 1183-4212. Printed in the UK A numerical. procedure for finding accessible trajectories on basin -boundaries* U: 'Helena E 'Nusset§, and James A Ybrket + tlnstitute for Physical Science and Technology, University of Maryland. College Park, MD 20742;-USA tDepaftment of Mathematics, University of Maryland. College Park, MD 20742,. -I USA Received 291ianuary 1991 I, Accepted by J-Sinai Abstract. In dynamical systems examples are common in which two or more attractors coexist, and in such cases the basin boundary is non.empty. Thc asiri borndary is either smooth or fractal (that is. it has a Cantor-like structure). When ihere are horseshoes in the basin boundary, the basin boundary is fractal. A relatively small subset of a fractal basin boundary is said to be 'accessible' from a basin.However, these accessible points play an important role in the dynamics and, especially, in showing how th~e dynamics change as parameters are varied. The purpose of this paper is to present a numerical procedure that enables us to produce irajectories lying in this accessible set on the basin boundary, and we prove that this procedure is valid in certain hyperbolic systems. AMS classification scheme numbers: 58F12, 58F13, 65005 1. Introduction Dynamical systems often have:quite different behaviour in different open sets, each open sethavingits own attractor. These open sets maybe the basins of attractors. We are interested in the boundary on the common boundary between such- open sets. The common behaviour may be either smooth or fractal. A point p on the boundary of an open set U is accessiblefrom U if there is a cui-ve'lying in U U {p) which ends on-p. The'basin boundaryis the set of all points on the boundary of a basin of attraction such that each open neighbourhood of p intersects at least two different basins of attraction [GOY1]. If the basin boundary is smooth, then each point on the basin boundary is accessible'from two basins. In particular, if the basin boundary is a curve, then all of its points are accessible. When the basin boundary is * Research in part supported by AFOSR, and by DARPA under the Applied & Computational Mathematics Program. Permanent address: Rijksuniversiteit Groningen, Fac. Economische Wetenschappen, WSN-gebouw. Postbus 800, NL-9700 AV Groningen, The Netherlands. i 0951-7715/91/041183 + 30$02.50 © 1991 IOP Publishing Ltd and LMS Publishing Ltd 1183 184' IHfENusie and , A Yorke fractal, ontlyia rel tively msmall subset0f the: basin §6undary consists- of adcessible points. -and generhlly no points that are accessible' from a ,basin will be accessible frf Ather basin; A collection. of papers have assumed"- that investigators can :pro ... accessibleitrajectories onbasin boundaries [AS ], [AY], [GOYl], [HJ], but ,no rigorous procedures have been presented; For more details, see the discussion in section 6. Studying dynamical: systems; one often observes transient chaotic behaviour , apparently due to the presence :of horseshoes. It is well kn6Wn [MGOY]' that transient chaos is present Whenev -there isa fractal basin boundary separating the basinsof.two or more attractors. Foi example, for-suitably chosen parameter values, the-Hn6n map 'has attractiig periodic orbits with. period 3 and 5, and also a non-attracting chaotic invariant set in the basin boundary, and one observes that the duration of the transient chaotic-behaviour-of many trajectories is rather short before they. settle dowr 'o. one of these two periodic attractors, Other famous examples with chaotic traasicnts, due-to a-bounded non-aitracting invariant chaotic set in-the basin boundary, are the-forced damped pendulum and-the forced Duffing equation. Transient chaos is also present if. there is a chaotic invariant set-in the interior of the ,closure of' the basin. In this case, the basin boundary -can be either fractal , or smooth 1[KG], [NY1], [N t21. Let M be a smooth d-dimensional manifold without boundary with d ; 2, and let F-be a C3-diffeomorphism from.M to itself. "For xi y in Mwe denote by p(x, y) the distance betWeen x and y. A set S.c M is positively invaiantif .F(S) =S, 'aindis invariant if-F(s)=s. ,For xEM- and a closed set ScM, we write P(x,S)= min{p(x. ,,: y e S). An attractor A is an invariait, compact set in M such that0(1) there exists an open -neighbpurhood"U of A such that, for each x E U-the distance p(F-(x), A)-- when n- 4 c; and (2) there is a point x eA such that the closure of the- trajectory (F"(x)},,. 0 equals A. A generalized. attractoris the union of finitely many attractors. We say a~region'isan openand boundedset in M; a transientregion is a region thatcontains no attractor. F6r an attractor (or. a generalized attractor) A we-.say, -the domain of attraction of A is-the set of all points -x in M.-for which p(F(x), A)- 0'as n--*'. The basin boundary is the: set of all. points, ie M :for which each open neighbourhood has, a non-emptyintersection with at least two different domains of attraction, see [GOY1]. In the-literature, for-an attractor A the notions 'domain of attraction, of A' and- 'basin of A' -are often equivalent. On the other hand, in other studies of dynamical systems, the notion 'basin of A' is defined as-the region in M that is the interior ofthe closure of the domain of attraction-of A. Thereforefor an attractor (or generalized attractor) A we define basin {A} to be the interior of the closure of the-domain-of attraction of A. We would like to emphasize thatba~in(A). is associated with attractor A and may include Cantor sets of curves ,that are not in-the domain ofattaction of A; that is, the trajectories of all the points on these curves will not converge to the attractor A. In the forced pendulum example in section 3- we show numerically that basin{A} does include such an invariant Cantor set of-curves. We -will. be studying transient, regions- in cases where the trajectory through almost every initial point eventually leaves the region. We investigate special trajectories that remain in such a transient region for all positive time. In [BGOYY]i [GNOY] a numerical method (involving the bisection procedure) for finding trajectories on the basin-boundary was presented. The papers [NY1], [NY2] introduced the PIM triple (refinement) procedure and the accessible PIM triple Accessibletrajectorieonbasin boundaries 1185 (refinement) procedure.and'Both these numfeficAl-trajectbries refinement procedureg respectively, enabi6 us tothat obtain numerical ,trajectories, accessible stay (for positive, time) in a soecified transient region in M. 'In [NY2]. these two iefinemenr procedures were shown to be valid for uniformly saddle-hyperbolic dynamical: systems, for which the dimension of the, unstable ,manifold of any nonwandering point in the transient region was assumedto be one dimensional. Let R be a transient region 'for F. The stable setS(R)'of Fis (xe R :F"(x)'eRfor n,= 0,1,2,. .. ; the unstable set U(R) of F is x eR :F-"(x).e R for n = 0, 1, 2,... }. The set of points x for which F'(x) is in R for all integers n is calledthe, iniariantset Inv(R) of F in R, that is, Inv(R) - S(R) n U(R). A component of S(R) (resp. U(R)), which contains a point of Inv(R) is called a stable (resp. unstable) segment. We call Inv(R) a chaotic Saddle when it includes a Cantor set. These notions are illustrated in the following example. Example 1. An S-shaped horseshoe map is an invertible map that squeezes, stretches and folds a rectangle into an S-shape area as illustrated in the figurebelow. We consider the S-shaped horseshoe map g, which is defined on a neighbourhood of connected set W, where W is the union of a rectangle E and the two half disks DA and Da as indicated in the figure. Assume (1) g maps W into its intcrior, (2) the intersection g(E) n E consists of three horizontal strips, say H1 , H-2 and H3 , and (3) the half disks DA and Da include fixed point attractors A and B respectively. Let V1, V2 and V3 be the vertical strips in E (stretching the full width of E) such that I'j) =,Hi, I ---i -< 3; see figure 1. "'It is well known, see e.g. Guckenheimer andHolmes [GH], that under ', reasonable assumptions, almost every point will be attracted to either A or B, the stable set S(E) of g with respect to E is a Cantor set of vertical curves, and the unstable set U(E) of g with respect to E is a Cantor set of horizontal curves. All components of S(E) are stable segments, and all comiponents of U(E) are unstable segments. The intersection C of the stable set S(E) with the unstable set U(E) in E is a chaotic saddle. Note that all the points on the chaotic saddle C stay in the box E for-all time under all forward and all backward iterates of the map g. The set of points in E that are on the basin boundary is the stable set S(E), and the basin boundary of g is fractal. One might choose the transient region R to be the interior of W minus two small closed balls that are centred at the attractors A and B. Ia-c6mpact, 1 U 12 H13 '9,,: Figure 1. S-shape horseshoe map: vertical strips in the rectangle E are mapped into horizontal strips in E, namely -:V) = H1, F(V2) = H2 , and F(V3) = H3 . The half disks DA and DB each contains a fixed point attractor, and each is mapped into its interior. 1186 H E Nusse and TA Yorke We assume throughout that (1) foir the transient region R the set Inv(R)_is non-empty,, and(2), there exist two generalized attractors A and B; afid'each point ,in R'that escapesroffm R, under iteration-of the map Fis either in basin{A). or in :basin{B}, and the basin boundary. is the common boundary of basin{A} and, basin{B}. We will refer to R\S(R), the complement ofthe stable set S(R) in'the transient region-R, as the transieit-set.Recall that a point p,in S(R) is accessible from an open set V if there is a continuous curve K ending at p such that K\(p} is-in V. We investigate the cases where V iseither basin{A} (or basin(B}) or isthe transient set R\S(R). In-this paper we emphasize points accessible from basin(A} rather than from basin{B}, just to simplify notation. Obviously, if a point p in S(R) is accessible from the transient set R\S(R) and p is on the bas boundary, then p is-accessible from either basin(A} or basin{B}. On the other hand, S(R) can contain points which are notin the basin boundary, and such points might beso numerous that they block the access to the basin, boundary, that is, every curve in basin{A} that goes to an accessible point wouldnecessary pass through points of S(R). Thus no points of the basin boundary would be accessible from R\S(R). Naturally S(R) would have its own accessible points, but these would lie in basin{A} (or basin{B}). This situation occurs in- the previously-pmentioned pendulum example. Hence, S(R) might contain points on the basin boundary that are accessible from basin{A} (or basin(B}) but not.accessible from the transient set R\S(R). In example 2 below, S(R) contains such points in the basin boundary, Therefore, the accessible PIM triple procedure [NY2J for finding accessible points on S(R) is, generally speaking, not a-procedurefor finding accessible points on the basin boundary. We would like to point out that there are cases where S(R) equals the set (basin boundary l R), (though this condition may be hard to verify). In such cases the ASST method (involving the accessible PIM triple procedure) might be used for finding accessible trajectories on the basin boundary. Example 2. In this example, we illustrate the fact that S(R) can contain points that are not in the basin boundary, and for simplicity we present one-dimensional maps. Consider-two one-dimensional maps with attractor A (which is -,o) and attractor B (which is +-). Let f and g be the piecewise linear maps of which the graph is given in figure 2(a) and -2(b) respectively, such that g(y) =f(y) for all y < 1. 3 Figr(.On-iesonlmp) y adg(h grah f" .1 n-yregvnin2atn I 21 13 pq p y - Fgue 2. One-dimensional maps f and g (the graphs of f and -g are given in 2(a) and 2(b) respectively). When we choose the transient region R to be the interval (-2,3), the stable set S(R;f) equals the basin boundary, and the stable set S(R;g) is strictly larger than the-basin boundary. AcCessible trajectorieson b'sin boundaries -1187 Let p'and q denote the two fixed points of g in (1, 0), and write m = -min(g(y):y,>>p}. Assume, 1 <m <2 <p <q <3< g(m), see-figure 2. The maps are constructed-in such a way that basin(A},and basin{B} of g andf-coincide. Hence, both-maps have the sam6 basin boundary andit is contained in the interval [-1, 1]. .Note that the basin boundary is the set of all pointsih [-, 1].that stay inside [-1, 1] under all positive iterates of the map f (or g), and the basin'boundary is fractal. Onthe other hand we have, all points in (1, oo) go to attractor B under forward iteration of the-mapf, whereas basin{B} for g includes a chaotic saddle in the open interval' (2,3). When wechoose the transient region R to be ,the open interval (-2,3), the stable sets S(R;f) and S(R;g) are the sets of points that stay in R under all' forward iterates of f and g ,respectively. We have the basin boundary equals the stable set S(R;f), but,the stable set S(R; g) -is, strictly la'rger than the basin boundary. It can be shown thatpoints of S(R;g)-S(R;f) can be found arbitrarily close to each point of the basin boundary. We-would like-to address the following problem. Accessible basin boundary static restraint problem. Given a segment J that has one end point in basin(A} and one end p6int in basititB), describe a procedure for finding a point on the basin boundary (in I n S(R)) which is accessible from basin{A). We will state a procedure (the accessible basin boundary refinement procedure) for finding accessible -points in M on the basin boundary. We will show it is valid (guaranteed to work) for the same class of hyperbolic dynamical systems as in [NY2I, namely hyperbolic systems in which the unstable manifolds are one dimensional. All the procedures are based on our presum ed ability to specify an initial point p and compute the time TR(p) its trajectory takes to escape from R. For applications, we need a 'dynamic' version of the 'static' problem above, since we want to produce numerical trajectories that are accessible from basin{A}. The 'dynamic' problem that is associated with the 'static' one is the following. Accessible basin boundary dynamic restraint problem. Given a line segment J that has one end point in basin{A} and the other end point in basin{B}, describe a procedure for finding a numerical trajectory on the basin boundary that starts on J and which is accessible from basin(A}. The ideas of the 'accessible basin boundary refinement procedure', which solved the 'static' problem, can be applied to solve the 'dynamic' problem, in such a way that implementation is possible on a computer. For more details, see the discussion in section 6. The organization of the paper is as follows. In section ,2we present the 'accessible basin boundary refinement procedure'. Then, n section 3, we discuss some examples in which the straddle method involving this refinement procedure has been used. The main result for the validity of the refinement orocedure for hyperbolic systems is stated precisely in section 4, and this-result is proved in section 5. Section 6 is devoted to the discussion of ths associated numerical method (the accessible basin boundary straddle trajectory method or ABST method) and related 1188 H E Nusse and J A Yorke ntimerical methods. Finally in section' 7i the case- of d-dimensionAl hyperbolicsystems, d -_3, and smoothness of Fare discussed, 2. The accessible basin boundary refinement procedure Let -the manifold M, the diffeomorphism F; the tr" ient region R, and generalized attractors A andB be as'before. Recall that We as:. .- that each point that leaves R under iteration of F is either in basin{A} or in basin(B}. The escape time TR(x) of a point xin R is defined by TR(x) = min(n -_1 :F"(x) 0 R}, and TR(x) = if F(x) ER for all n :, 1. We say, TR(x) = 0if x o R. Let J be an unstable segment in R. The notation {x, y} for a pair, means that x and y lie on J. Since J is homeomorphic to an interval, we may assume it has the ordering of an interval. For {x, y} we always assume for convenience that the ordering on J is such that we may write x <y, and denote [x, y]i for the segment on J joining x andy. Let L cJbe any-connected subset of J. Assume L intersects the stable sei S(R) transversaily, and let (a, b) be a pair on L. For each e >0, an r-refinement of {a, b} is a finite set of points a =go<g 1< ... <g = b in [a, b]j such that (E/2).- p([a, hi) < p([gk, gk,+]j e -.p([a, bi,) for all k, 0 - k N- 1. We say the pair (a, b} is a straddle pair if a Ebasin(A} and b e basin{B}. We call (a, b) a proper straddle pair if (a, b) is a straddle pair, and at least one of the points a andb is in the interior of L. If (a, b} is a (proper) straddle pair, then we call the interval [a, b]j a (proper) straddle segment. Our objective is to describe the 'accessible basin boundary refinement procedure' that selects in a unique way a proper straddle pair from any E-refinement of a givenstraddle pair (on J., When we rcveatedly apply the procedure to the. end points of the ever decreasing straddle segments (with lengths converging to zero), the resulting nested sequence converges to an accessible point p in the basin boundary; of course, this point p is in i n S(R). The point p that we find is accessible using the curve [r, p]j for some r in I nbasin{A}, so we say p is 'accessible from the left' ('accessible from basin{A}'), that is, from the side containing r (in basin{A}). We could alternatively have chosen to approach from the right and we would expect to find a different point on the basin boundary. Since almost-every point on J has finite escape time (see section 4), we can assume that all points of all refinements are chosen with finite escape time. We now describe the accessible basin boundary refinement procedure which is the refinement procedure that generates a uniquely defined proper straddle pair from a given straddle pair. This procedure plays a dominant role in the method that generates a numerical trajectory on the basin boundary that is accessible from basin{A}. A slightly improved version is stated in section 4. Let (a, b} be a straddle pair on a curve segment J such that a is contained in basinfA), and b is contained in basin{B}. Let P=Cxi:0--i<--N(e)} be any E/3-refinement of (a, b), we of course have P c J and a = xo < xI <... < XN(e) = b. We choose the proper straddle pair {a*, b*} from P in the following way: (1) select b* to be the leftmost point of P that is in basin{B}; (2) define m to be the minimum of the escape time of the points in P to the left of b*, and write a° to denote the rightmost point to the left of b* that has the minimum escape time m. I I 1189 Accessible tiajectories on basin-boundaries (2a) If'm <TR(a) then choose a* =an; otherwise, (2b) if m =-TR(a) tihen the choice of a5 depends on the grid P*consisting of b* andallthe points in P to the-left of b* (that.is,P* = (X : P:x E (a, b*])). (i) If the grid P * is not an r-refinement of (a, b*}, then, choose a*=a; otherwise, (ii) if the grid-P* is an e-refineineht of (a; b*} then choose a*to be the-idjacent point in. P* to the rightof a', unless b* is thattadjacent point, in which case choose ,a* = do Remark. Assume that e >0is suitably chosen. In case, of step (2b) the equality a*= a0 does not occur and'one has a*> a0 . (1) As the accessible basin boundary refinement procedure is, -applied, re5 peatedly ,step (2a) only Occursat most finitely,,inany times, andthe segment-[a, a* ] in (2a) may include points that are in basin{B}, However, once step (2b)-occurs, step (2a) will, never-occur again. When step (2b) is applied, the entire segment [a, a*] (n0,'1,st the grid points) is in basin(A) .but,[a, a*]may include points that have escape, timeinfinity. We would, like to emphasize, that all'the points between a and,a* in step (-'b) whose~escape time is finite, go to attractor A. This-is why the method produces an accessible point as.the refinement is repeated. The problem ofcourse isto find E small-enough. (2) When aI and b* have been chosen, iftthe grid consisting of a*, b* and all the points in P between a* and b* is still an e-refinement of the pair then set a*= a and b*-b and apply step-(2b). Repeat this until the grid {x e P:xe [a*, b*]} fails to be an i-refinement of (a*, b*}. Notice that in cases when only step (2b), is repeated, the point b does not move. (3) Under hypotheses in section, 4, it is possible to repeatedly apply the accessible basin boundary refinement procedure obtaining a sequence of straddle pairs that convergesto an accessible point on~the basin boundary. I Example 3. The purpose of this example is to illustrate 'the accessible basin boundary refinementprocedure in a graphical way. We choose E = 0.1. Let (a, b) be a straddle pair, and let P be an E/3-refinemerit of (a, b}. We assume that P is on 3 TR TR , * ag, ;* --h*s | . , . . ab 3. The accessible basin boundary refinement procedure. In figure 3(a) the grid igrid on (a. b#] is not an e-refinement of {ab*) and so a does not move; in figure 3(b) the on (a, b*] is an,6-.finement of (a, b) and so a moves to the right. 1i90 HE Nusse and A Yorke the.str ght line segmeni, that joins, a with b and that the grid points are equalIly s-paced,.so P consists of 31 gridjpoints. In figure 3 the escape time of a grid point x ih Pis represented by,a stir, if x is'in basin{A), and it-is represented by-a dot if-x is in basin{B}. In, figure 3(a) We have b*=xq. The grid P* = {x e P:x e [a, b*]}. is not an 6-refinement of (a, b*}, since-the'distance between two adjacent points equals IIb'-:ai/8 "Which is greater. than t.Jib* - all. Hence, we choose a* = a. In figure 3(b) we ha'e b* =x2o. Thegrid P* = (xe P:x c (a, b*J} is an t-refinement of {a, b), since the distance between two adjacent points equals 1ib* - a 11/20 which is smaller than e lib* - a1. Since TR(xo) = TR(x8) = TR(x o) = m, we choose a*= xi as indicated in the figure. 3. Applications The objective of the paptr is to present the, accessible basin boundary refinement -procedure which enables us to obtain accessible-numerical trajectories on the basin boundary. We also prove that this numerical pre-idure works in-ideal cases. While we believe that the hyperb'oiicity hypotheses (stated in section 4) are often satisfied, they are nonetheless, in practice difficult or impossible-to verify. While chaotic attractors are usually not hyperbolic, the sets we look at are fo0attractors. We do observe that frequently we can successfully use the procedure to'obtain pictures of the accessible- points on the basin boundary. In all the examples below, the pictures were-obtained by using the Dynamics Program [Y]. In these pictures, basin(X) is obtained as follows: for a 960 x 544 grid, use-each grid point as initial value and assign to each grid point a colour (respectively, no colour) if its trajectory converges to X (respectively, stays away from X). The set of coloured grid points is in basin(X}, and the non-coloured grid 1{X}. In all. the pictures for which one of the numerical points are outsi,' .plied in order to produce a single numerical trajectory, have procedures has be been obtained by selecting e = 1/30 as default value (see also section 6). 3.1. Hnon map Let the diffeomorphism F acting on the plane be given by F(x, y) = (p - x2 + M. y, x). The map F is equivalent under a change of variables to the Hdnon map -(1-p'X2 +Y, 14'X). For a first example, we choose the parameters p= i.81257970 and g =0.02286430; these parameters are due'to Grassberger and 'Cvitanovid (personal communication). For these parameters attracting cycles-with period 3 and period 5 coexist. Let D, and D2 be closed balls of radius 0.01 centred at one of the points of the attracting period 3 cycle and 5 cycle respectively. We choose the transient region R to be the open set {(x, y): -2 < x < 2, -4 < y < 4)minus the closed balls D1 and D2. Let A and B be the attractors with period 3 and period 5 respectively. The white area in figure 4(a) is basin{A}; the black area is basin(B}. By using the bisection procedure (see-also section 6), we obtain a straddle trajectory (that is, a numerical -Accessible trajectories on 'basin boundaries (a) 1191, ' (b) Rpgme 4. (a) The white area is basin(A) and includes the period 3 attractor. the black area isbasin(B) and includcs the period 5 attractor in thc region -2 <x <2. - 4 < y< 4 of the Hdn6on map with parameter values p =1,812 579 70, ju 0M022 864 30. (b) Straddlc trajcctory using thc bisection procedure for the Hdnon map (p =1. 812579 70. ju-0,0228643) in the transient region ((x~y): -2<x<2, -4<y<4) minus two cloed balls of radius 0,01 centred at a point of each attractor. The three saddle periodic points on the basin boundary that are accessible from basin(A) and the five saddle periodic points on the basin boundary that are accessible from basin(B) are indicated by st~raight and curved arrows respec'tively, trajectory) on the basin boundary consisting of more than, 100 000 points (actually' tiny intervals); the result is presented inmfigure 4(b). By using the accessible basin boundary refinement, procedure we obtain a period 3 saddle when the left point a is chosen in'basin(A), and a period 5 saddle when the left point a~is chosen in basin{B). The accessible period 3',ahd period'S saddles on the chaotic~lsaddle are indicated by arrows in figure 4(b). Therefore, the set of all points 4ccessible-from basin(A} are the stable manifolds ~of the points ofthe period 3 saddle, and all points accessible from'basifi(B}, are -the stable manifolds of the points of the 'period s saddle.- p I For a second example, we select.the values p = 2.66, u = 0.3. The map F has two attractors A and B, Where A .and B denote the attractors infinity and a cycle with period 3 respectively.- The' box ((, .y) :-3 <x <3I -3 <y < 3) contains a chaoticsaddle, and We select 'the transient, region R 0o b,-the open set {(x, y): -3 <x < 3, -3 <y,<3) minus tie-ball of -radius 0.005 centre.. at a point of attractor B. Usiiig ~the bisection procedure results in one numerical tra tcctory, that hasbieen'p'?-sented 'in figure 5. By using the accessible- bisi boundary refinement procedure-we obtamin a period 1 saddle when the left pointa is chosen in basin{A), and a period 3 saddle when the left point az is,chosen in basin{B}.. Thea,accessible period 1 and period 3 saddles-on the chaotic-saddle are indicated by arrows in figure 5. So, the set of all- points accessible from-basin({A} is the stable manifold of the-period 1 saddle, and the set- of 1192 H E Nusse and J A Yorke * 9 Straddle trajectory usingu the /," 'forFigtire the5.Hdnon map (p = 2.66. -0.3)bisection in 'he procedure transient i region ((x, y): -3 < x < 3. -3 < y < 3) minus a closed ball * I of radius 0.005 centred at -a point of. attractor B (the, ,,.,< "'.,.period 3 attractor), The fixed point on the basin boundary ., ... ,,:..-."that-is accessible from basin{A} (where A = o), and the three saddle priodic:points onthe basin boundary that are accessible from basin{B) are indicated by curved and straight arrows respectively. all points accessible from basin(B) are the stable, manifolds of the points of the period 3 saddle. For a third example of this map, we select the parameter values p =1.405, -0.3. The map F has two coexisting attractors, namely, a period 2 cycle (attractor A) and the attractor infinity (attractor B). The box ((x, y): -3 <x <3, -3<y<11) contains a chaotic saddle. Basin(A) is the white area in figure 6(a). (the two points of attractor A are marked by a dot in the figure), and basin{B) is black in figure 6(a). = (a) (b) Flgure 6. (4) The white area is basin{A) and includes the period 2 attractor, the black area is basin{B} (where B oo)in the region ((x, y):-3 <x <3, - 3 <y <11) of the Hinon map with parameter values p = 1.405, pu= -0.3. Attractor A is marked by two dots, and a saddle fixed point in basin{A) is marked by a cross. (b)Straddle trajectory using the bisection procedure for the Hdnon map (p = 1.405, pu= -0.3) in the transient region &(, y):-3 <x <3, - 3 <y <11) minus a closed ball of radius 0.2 centred at a point of attractor A. The three saddle periodic points on the basin boundary that are accessible from basin{A} and the saddle fixed point on the basin boundary that is accessible from basin(B} are indicated by straight and curved arrows respectively. 2 I 1193 Accessible trajectories on basin boundaries We select the transient region R to be the open set {(x, y): -3 <x <3, -3 <y < 11} minus the ball of radius 0.2 cenitred at Apoint of attracto" A. Using the bisection procedure results in one numerical trajectory, that has been presented in figure 6(b). The PIMtrip!e proceduremay resiult in a saddle fixed point that-is in basin{A); this saddle point is marked. by across in figure 6(a). If we select the transient region to be the region R minus a ball of radius 0.2 centred at this saddle fixed point, then applying the PIM triple procedure results a similar numerical trajectory as in figure 6(b). Notice that the ball including the saddle fixed point is in 3 basin(A}. By using the accessible basin boundary refinement- procedure we obtain a period 3 saddle, when the left point:a is chosen in basin(A}, and a period Fsaddle. when the left point a is chosen in basin(B}, The points of the accessible period 3 saddle on the chaotic saddle are indicated-by arrows in figure 6(a). So, the set of all points accessible from -basin(A) are the st'able manifolds of the'points of the period 3 saddle, and the set of allpoints accessible from basin{B} is the stable manifold of the period;1 saddle. Note that the invariant set of points in the transient region consists of at least three basic sets, namely, (1) the period 2-attractor, (2) the saddle fixed point !n basin(A} and (3) the chaotic saddle on the basin boundary. 3.2. Pendulum We consider the differential equation x"(t) + vx'(t) + sin x(t) =f cos(t). UJ I Ipendulum i (a) (b) Figure 7. (a) The white area is basin{A} and the black area is basin(B} (where A =-(-.472 615, 2.037084) and 8 = (-0.478014, -0.608233) are fixed point attractots) in the region {(x, y):-r <x <,r, --<y < 4)of the time-2:r map of the forced x'(t) + 0.2x'(t) + sin x(t) = 2 ces(t). The three saddle periodic points on the basin boundary that are accessible from basin{A) -are indicated by arrows. (b) Two stfaddle trajectories using the PIM triple refinement procedure for the time-2;r map of x'(t) + 0.2x'(t) + sin x(t) = 2 cos(t) in the transient region ((x, y): -ir< x <,, -3 <y < 4) minus two closed-balls of radius 0.05 centred 3t the fixed point attractors A and B, one trajectory in both basin{A} and basin(B}. The two saddle periodic 2 orbits on the stable set that are accessible from the transient set R\S(R) are indicated by arrows. 1194, H E Nusse aid J A Yorke We choose the parameter values v = 0.2 and f =2. For these parameters, the time-2 rmap has two stable fixed points A and B. In; figure 7(a), !basin(A} is coloured white and basin{B} iscoloured black, It was already observed [GOY2] thatthere was -transient behaviour in the basin(A} and basin{B}. We choose the transient region to be the rectangle {(x,y):-jr<x <#, -3<y<4) minus two balls (of radius 0.05) centred at the attractors A and B. By using the PIM triple procedure for two different transient regions, we obtain twonumerical trajectories. The result for the choice of the interval with end points (-3, -3) and (-3,4) isa trajectory lying in basin(A); and the segment from (-3, 4) to (3, -3) results in a numerical trajectory lying in basin{B}. Both trajectories are presented in figure 7(b). By using the accessible PIM triple procedure we obtain period 2 saddles, see also the discussion in section 6. The result for the segment from (-3, -3) to (3, 4) is-a Lperiod 2 saddle on the chaotic saddle in basin(A), and the segment from (-3, 4) to (3, -3) results~in a period 2 saddleon-the chaotic saddle in basin(B). The points of these accessible period 2 saddles on the chaotic saddle are indicated by arrows in -figure 7(b), The set.of all accessible points on the two chaotic saddles are the stable manifolds of the points of these period 2 saddles. By using the accessible basin boundary refinement procedure we obtain two period 3 saddles: one is accessible from-basin{A}, and the other one is accessible from basin{B). The points of the period 3 saddle that is accessible from basin{A} and is on the basin boundary, are indicated by arrows in figure 7(a), The set of all points on the basin boundary that are accessible from basin{A), are the stable manifolds of the points of this period saddle. A similar result as above holds for the points on the basin boundary that are accessible from basin{B}. 3.3. Complex quadratic map We consider the quadratic map in the complex plane given by z,+= z,+ 0:32 + 0.043i. For this system two attractors coexist, namely, a period 11 attractor (attractor A) and the attractor infinity (attractor B). Let D be a clcsed ball of radius 0.05 centred at a point of attractor A. We choose the transient region R to be the open set {(x,y):-l.35<x< 1.35, -1.35<y < 1.35) minus the ball D. The basin boundary straddle trajectory resulting from the bisection procedure is presented in figure 8(a). The accessible basin boundary straddle trajectory resulting from the accessible basin boundary refinement procedure, a trajectory of which all the points are accessible from basin(A) is presented in figure 8(b), and the accessible. basin boundary straddle trajectory of which all the points are accessible from basin{B} is presented in figure 8(c). The choice-of this equation was motivated by the picture of the Julia set in [PRI. The reader should compare our figure 8(a) with figure 25 in [PRI. We would like to point out that the basin boundary of thissystem (the Julia set) is two dimensionally unstable; thus our results are not valid for this example. Accessible tiajectories on'basin boundaries -1195 LI Figure 8. (a) Straddlc trajectory using the bisection procedure for the complex quadratic map + 0.32 + 0.043i in, the transient region ((x, y): -1.35 <x. y < 1.35)minus a closed ball of C/ "fl J J SA (period 11attractor). (b) Straddle trajectory the accessible basin boundary refinement procedure for the complex quadratic map that is from basinA). (c) Straddle trajectory using the accessible basin boundary refinement Cusing 'accessible . , procedure fom-the" complex-quadratic map that is accessible from oasin(B). 4. Results In section 2 we presented the accessible basin boundary refinement procedure for finding a point on the basin boundary in the transient region, which is accessible from basin{A). First, -we formulate a refinement procedure which is a slightly improved veision of the accessible basin boundary refinement procedure. We will describe inductively how to refine our proper straddle pairs. Given a straddle pair {a,,, b,,}, we have a,, is contained in basn{A), and b,, is contained in basin(B}. Given any e/3-refinement P,={x:O< i N(a)} of {a,,,b,}, we of coursehave a,, =Xo<x, <X...<xN(,) = b,..-We choose the next proper straddle pair {a.+t, b.+t} from P. in the following way. (1) Select b,,+t to be the leftmost point of P,, n basin{B}. J.)6 H ENusse aidJ A York (2) Definei,, = min{TR(x):x EP, and x < b,,..}; a% =inax(xi Q.:x <b.+, and TR(x) = m}. (2a)Ifmn < TR(aq) then choose an~1 ao+1 ;-otherwise, (2b) If mn,, TR(ai) then in order to choose a, we write -Q. = {x .[a, faP,b,,),+I= minimum of the set (x Q, :a%,<x < unless this set is empty in which case a,, = a I = max{x e Q :x < bn+ and TR(x) = TR(a +t)}. "ase (i) If Q,, is -not -an &-refinement of (an, bi,+,), then choose a,,+,=an; -wise, Case (ii) -fQnis an s-refinement of {a,, b,, ) then choose a,,.=-. = Remark (1) For the convenience of the reader, if e > 0 is chosen suitably, then )TR(4,)=R(a+). N( that a'+|< b,+ andmrn- TR(a°+,) Qn might-fail to be an s-refinement of (a,,,b,,+,} in that the distance between some r..ir of consecutive points in Qn might be bigger than E -"p([a., b,+,]. (2) Under the hypotheses below it is possible to repeatedly apply the improved !-.:iement procedure above-obtaining a sequence (-(a,, b} },,, that settles down to an accessible point on the-basin boundary. ,an <-an+ <a1' In the descrir-ion of-the refinement procedure above, we assumed that there exists an E> 0 f,,. which every E-refinement -of a straddle pair (an, b,) includes a, proper straddle pair {a,+,, b.+},) such that_[an, an+dj is-in basin{A}, and the letgth of the. straddle segment, [a.+, bn + s at most (I - e/2) times the length of the previous straddle segment [a,, bnk. We will justify the seconcepts. Let the n !old M and the diffeomorphism F be as in -the -introduction. We assume that A and B are two generalized attractors -such that each attractor is contained either in A-or in B. Recall that a subset A of M is hyperbolic ifit is closed and F-invarin' nd the tangent bundle TAM splits into dF-invariant sub-bundles E dFis uniformly contracting and uniformly expanding respectively. -And E" on v, A hyperbolic set A is called saddle-hyperbolic if dim E' - I and dim E" >- 1. In [NY2] we defined a region R to be a-saddle-hyperbolictransient region if R satisfies all the following conditions: (Al) R is a transient region: (A2). hyperbolicity properqynv(R) is a non-empty saddle-hyperbolic set; (A3) boundary property: U(R) n aR is mapped outside the closure f of R; (A4) intersection property: each non-trivial component y of U(R) is an unstable segment, that iF,y intersects Inv(R); note that such a segment y must intersect S(R) transversally. In this paper, we say a transient region R satisfies the basin boundary property if (1) each-point in R\S(R) is contained in eil'er basin{A) or basin{B}, (2) the sets R nbasin{A} and R fl basin{B) are nc -npty, and (3) the R flbasin boundary is oasin boundary into itself). We define a positively invariant (that is, F maps region R to be a basin boundary transient region if R is a saddle-hyperbolic transient region and R satisfies- the basin boundary property. Accesible trajeciorieson basin bodnd4ries : 1197 For a basin boundary transient region R, and- £ >0, the properties _(Al) and -(A2) imply that the escape'.time of almost every pointpoint on an unstable segment .3 is finite. (A result due to Bowen and Ruelle [BRI shows-that -S(R) has-Lebesgue measure zero.) Hence, one may-assume that such azrefinementdoes not intersect -the stable se- S(R). The basin boundary property implies thateach point that Ilim escapes from R under-iteration of~the hap F is either in basin{A) or in basin{B). If R is a basin boundary transient region; then the escape time map T restricted to an unstable segment JT U(R) has the following two properties (see [NY2). (i)-All the points in a chosen segment [a, bkjon J will escape from R if and only if no t-refinement of(a, b }:includes-a PIM triple (that is,:a triple (p, r, q):onJ such that TR(r) > TR(P), TR(r)> Tk(q), and p([p, q]j) < p([a, bk)). (ii) TR is locally constant on an open subset of full measure of J and'if-TR(x) < and x is a point of discontinuity of' TR then lim inf,_, TR(y) = TR(x) and sup,,., Tq (y = TR(x) + 1. We- assume throughout that dim E" = h For, the,sake Of simplicity, we -assume that-d- 2; the moredifficultcase d 3-will-be-discussedin-section 7. From now on, We-will assume that R is a basin boundary transient.regionfor F, and that J cU(R) .denotes an unstable segment. The proof of the proposition below, Will follow immediatelyan the propositions 5.1 and 52. I I:1 Proposition There exists a -finite set-of periodic points P" in-Inv(R) such that (1) each pointin P" is accessible from R\S(R), an&(2) for x E S(R), the point x is accessible from R \S(R) if and only if x E Ws(p) for some p EP". Corollary. Each accessible point on the basin boundary is-in the stable-manifold of some periodic point. Since J is an unstable segment, recall that this implies that both ends of J are in the-boundary of the transient -region-,R. We know by the intersection-assumptionthat J intersects the stable set S(R). Obviously, if (a, b} is a- straddle pair, -then there exist proper straddle pairs in every E-refinement of- (a, b}, for each- e, 0<e <0.5. The next result deals with the convergence of the sequence of nested proper straddle segments*[a,+,, b+,I] c [a,, bn]j on J. A sequence of straddle segments {fan, b.l}j}. 0 on J is called a straddle segment sequence if {a.,,, b,+I} is in -an E-refinement of the straddle pair {an,-b.} for all n. We say {[an, b.]j}n ., is the accessible straddle segment sequence if (an, b,,) is selected using- the accessible basin boundary refinement procedure for all, n. For every E, 0 < e 0.5, each straddle segment sequence {[an, b.n}jlh;,() converges to a point on- the basin boundary. In section 5 we will, show that there exists e > 0 (depending on F and R) such that for every accessible straddle segment sequence {[an, bn,] }, there is an integer N - 0 such that ,f6r every integer n > N the straddle segment [a,, a.+Il, is contained in basin{A). This number e also appears in the result stated below. The main result stated below implies that the accessible basin boundary refinement procedure is valid. Theorem. There exists E>0 (depending on F and R) such that every accessible straddle segment sequence converges to an accessible point on the basin boundary. 1198 -F ENusse andl'A Yorke' 5. Proofs 5.1. Preliminaries, Let the. manifold ,M; the distance p oh M, and'diffeomorphism F be, s before. We assume, that R; is a, basinboundary transient region for the diffeomorphism -F; and that there.are generalized ,attractorsA -and B such :that each ,point that',eventually leaves R is eitherinbasin{A} or in bisin{B}, Recall thatthe non-wandering set-9 (that is,. the set of-all points x in M such thatfor every open neighbourh6od V of x 'there exists n- 1 for which F"(V) n Vis non;empty) can uniquely be decomposed, into a 'finite-collection of disjoint closed invariant subsets and on each of these subsets-F has a dense orbit; these maximaIinvariant subsets of 0? appearing in the decom-osition are, called thebasic sets (see'e.g. [GH -for the definitionsand several prop&, . s of uniformly hyperbolic systems). From now.on,,iet r denote a basic set of'F From the definitin Of Inv(R) it. follows immediately that either r a Inv(R), or r n'inv(R) is empty. Thus, we can decompose Inv(R):-'into finitely many basic.sets. Note that, 'rl Inv(R) is empty' does not imply 'r R is empty', and 'r R is non-empty"does not imply 'i n Inv(R).is non-empty'. Recall that 'for z E9, thestable manifold WS(z) of z is the-set of points'x for which p(F(z), F (x))-- 0 asn--co';-and the unstable manifold Wu(z) of zis theset of points x for which' p(F-(z), F-"(x))--0 as n The local, stable manifold Wfo (z) of z (of ) is the setof points x in W(z)'such that -p(F'(z),'F"(x))<I3 for;all integers.n -0, and the local utnstable manif0ldW W"(Z) of, z is the set of points x in W"(z)such that p(F'(z), F-"(x)) I3 for all n _ 0, where P > 0. When the -stable or, unstable manifold is a curve, we write Wlvc(z) 'and W'-(z), for the .two components of W'(oj)\{z}, where a is either s or u. We call F a trivial basic set if r consists of one periodic orbit, and we call' r a non-trivial basic set if r includes more than one-periodic orbit. Assume that r is non-trivial; we callxF periodic if there exists m EN. such that F' has no dense orbit on F. and'we~callF non-periodic if it is not periodic. We will:,see below that the structure of Inv(R) is essentially controlled by-finite sets of.pedidic points. Recall -that x in Inv(R) is accessible from an open set V if there is a curve y such that y\{x} lies in V. If wechoose V to be the transient set R\S(R),.and if x in Inv(R) is accessible from R\S(R),it isalways possible to choose 'this curve y to be a piece of the unstable manifold WU(x), that is, -, can be chosen to be either Wr(x) or W-(x). Notice if x is accessible from R\S(R) and y W (x), ,then x is not a limit point of W"o(x) nl 9. Similarly, if we choose V to be the open set'R\U(R), and if x in Inv(R)'is-accessible from"R\U(R) it is always possible to choose this curve y to be a piece of the stable manifold W'(x), that is, 7 can be chosen to be either WfZ(x) or'W (x). Applying a result due to Newhouse and Palis [NP], we obtain the following. Proposition5.1. There exists a finite set P of periodic points in Inv(R), P = P" U P', such that each point-in Inv(R) that is accessible from R\S(R) is in W'(p) for some p in P", and each point in Inv(R) that is accessible from R \ U(R) is in W"(p) for some p in P'. Proof. For a proof, see Newhouse and Palis [NP]. 0 Accessible trajectorieson-basin boundaries 1i99' Palis, and, Takens. [PT] have shown that there exist regions in -M,whose boundaries are segments in the stable and unstable manifolds 6f,these finite sets of periodic points P' and P", suchthat the intersection of-the unionof these regions with the saddle basic set r is a Markov partition forT, see Bowen [B] for the notion of Markov partitibn, Proposition,. 2. Assume f.is a non-trivial non-periodic basic set in Inv(R), and let, z E r be fixed,.Let P' and, Pu.be asabove. There exist finitely many disjoint regions Ribeing diffeomorphic images of the square B =[-1, ] x [-I, 1], say R =gI(B), I< i -<N for some N EN, and a connectedsubset fu of W"(z) such that: (I) rn R, is.1 non-empty for all i; (2),raUO R,; (3)F(aR)cU. 1IRi and F-'(a,,Ri)c ..1 9,,R, where aRi= g,(((x, y): xI l J, lYj < 1)) and cj,,R =j(((x,y):IxI --, y =,I)) are connected subsets in the stable set WS(PU n F) and the unstable set Wu(P fl r)respectively; and (4)for every i, :-fnRi -consists of exactly one -component -and a(U n Rt) a ula I 1I i<N. Prbof. For a proof, see Palis and'Takens [PT]. 0 Recall that R is a basin boundary transient region, and F a basic set in Inv(R). From now on, let, the point z e r, the regions Ri, 1 i <N, and the segment Ju c Wu(z) be as in proposition 5.2. There exist a C', stable foliation 5P on a neighbourhood V. of r and a C'" unstable foliation 9'u ownA neighbourhood Vu of r, for some a > 0. Since it is no restriction to assume that every region Ri is contained in V.frl Vur, 1 < i _ N, see [PT], we will do so. Let r: R - WU(z) be- a C3 parametrization, and define a projection ;r:rUi.I R, nlIu by taking in each region Ri the projection along the local stable manifolds into the intersection 1u with that region, I -_i < N. This projection can be extended from r to the union of the-regions R by projecting along the leaves of the foliation 5P. This extension will also be denoted byr. The following result says that for some iterate K, the map F can be viewed as expansive along unstable segments. Proposition 5.3. There exist a positive integer K and a C ' map q' :U., r-I(lr fn Ri)-- R defined by IV(x) = T - o ;r oFK 0 T(x) such that Ip'(x)l > 1, for some a>0. Proof. For a proof, see Palis and Takens [PT]. 0 5.2. Proof of the theorem Let J c U(R) denote an unstable segment. Recall that both end points of J are on the boundary of the basin boundary transient region R, and that J intersects the, stable set S(R). Recall also that if a point x in-R eventually leaves R, then x is either in basin{A} or in basin{B). We define for every integer k > 1: Ck(J) = {x eJ:T(x)>k} Dk(J)= {xe J: TR(x) =k}. 1200, H E Nusse and J A Yorke In particular, Ci(V) =J.,Hence, foi eachinteger k -I we have Ck+tI):is the set of pointsin, Ck() Whose escape time from Ris,at~least k + I hence, :Ck+ 1(J)is the set of points in I that stayjnR under P-k. The p6ints in-, which stayin R under all iterates~willbe denoted by C(J). For ever' k _>'I, we write Dk(J; A)= (x r: P(J):x-e basin{A}} /Dk(3;'B)- (x e Dk(J):x Ebasin{B}}. The 'basin boundary propetty':nowimplies that for every k > 1: Dk(.) = Dk(; A) U Dk(J; B). Notice that basin{A} and~basin{,B} are disjoint open sets, so, that if there are nbasin(A) and b eJ n basin{B}, then there is apoint x in [a, b~J with points a Jfi TR(x) = -. Observe that Q is a component of DA(J) if and only if Q is a component of either Dk(J;A) or Dk(J; B). Forleach k i 1 we-have: Ck(J) = Ck+i() U'Dk() = J=Ck.I1 )U k Ck+,V) U D(J; A)-U O(J; B) k k D(J) = CkI(J) JUD,(J;A)U U D(J; B) that is, J is the union of the set of points CA+I) whose escape time from Ris at least k -L 1 and, the set of points D(J) whose escape time from R isj, and each of those p, ',s is either in basin(A) or in basin{B}, where 1 --j --k. We write D.J) = U DkJ)= U DkV;A) U U DkV; B). k-I k-I "k-I = fnl. 0 C(), and J = C.(J) UD (). Let I- c a component of Ck(J) that includes a point of basin{A) (or basin{B}). The following result then says that for some fixed positive integers (depending only on F and R), C contains a component of UI, D+,(J; A) (or U,- D+,(J; B)). In particular, s does not depend on k. The following lemma (basin boundary combinatorial lemma) is used to prove the 'basin boundary geometric lemma' which follows. Note &- (() Basin boundary combinatorial lemma. Let X denote either A or B. There exists an integer s -_1 such that for every unstable segment J and for each- integer k > 1 and every component C of.Ck(J), the following holds. If C includes a point of basin{X}, then there is an integeri, k:, i< k +s and a component D of D,(; X) such that D a C. Proof. Let U be a neighbcurhood of Inv(R) on which a C"+U stable foliation F exists, for some er > 0. Select the minimal integer v _-1 such that for each basic set F of F" the following holds, either F is a fixed point or r is a non-trivial non-periodic basic set. For each non-periodic basic set 1"of F", let 1"r and the regions Ri(J), 1 -<i - N(F, be as in proposition 5.2, and let Ur be an open neighbourhood of F Ur) consists of N() open such that (1) Uffr)Ri(F) c Ura U, (2) the set xTj(.ln intervals and its closure consists of N(F) disjoint intervals, and (3) the map Pr in proposition 5.3 may be extended to rTj'(lj. l Ur). For each trivial basic set F, let Ur be an open neighbourhood of F in U such that Ur does not intersect UA, for each basic set A in Inv(R)\F. I, Accessible trajectories on basin. boundaries i: 1201 Let, Li, ... , L (r)be the components of rw(I -lUr); these finitely many components are Open intervals in R. Select the minimal integer K(F) - I such that the map q'r: tr((Iu fl Ur)'-* O defined by Tr(X) - o "ro FKr.o tr(X); satisfies 19(x)I> 1, Define the map lr:rrr(X) (I l Ur)- 4 -by -1 o r oF o 'Now we define the-N(F) x N(F) matrix Ar by Ar(i, i) = 10 if Vpr(Lf) =L otherwise for all 1 < i, j - N(F). Since F is a non-trivial non-periodic basic set of.F", the matrix Ar is primitive. Choose the minimal -integer m(r) 1 such that all the entries of the matrix A ' r)are positive. We define the intbger s(F) as follows. If r is a non-trivial non-periodic basic set, then define-s() = m(r). v, and if Fis a-fixed pointvof F" define s(r) = v. Now, let s be the smallest common multiple of {s(F):Fis a basic set of F'). Let re(R) be the number of -basic sets of -F" in lnv(R), and write Inv(R)= 1J1±) Gk We associate -with Inv(R) a directed-,graph G as follows: G, consists of the points FA, 1 < k <m(R), and there exist a path-from Fr, to Fi if there exists a point z eFi such that W"(z) fl W(F) is non-empty. Notice that for each k, 1 k m (R),there exists a path in G, from Fk to itself. Let J be an arbitrarily chosen unstable segment. Select an integer > I such that Cg(J) is-contained in U. Let,& denote the number of-components of Cg(J), that is, Cg(J) = U I C(J). From the definition of the matrices associated with the non-trivial basic-sets, the-directed graph G, associated-with lnv(R), and the choice of the integer s, and'using the techniques in [Null and [Nu2I, we can associate a (0, 1)-matrix Mj with-C,(J), which is defined by * I M I I to if ;,, o Fv(C,(J)) otherwise C,(J) for all 1 - i, j,</R, where ;rj is the projection on J along the stable leaves. We will assume that the C :,'s are-numbered in suchaway that the matrix M. is written in-the normal form, that is, _'M 11 0 ............ 0 ................ M.'1M., where each Mkk is an Nk x Nk matrix which is either irreducible (that is, for each pair (i, j) there exists t EN such that the (i, j)th entry of-the matrix (Mkk)' is positive, l-i, j<Nk) or a 1xI null matrix, l<k -rn and E'..INk=& for some m, <N. This assumption on the C4;i's is no restriction, since for every non-negative square matrix B there is a permutation matrix P such that PBPT has the normal-form (see Berman and Plemmons [BPI). In-particular, each irreducible Mkk is primitive,-and if Nk --2 then Mkk equals Ar for-some non-trivial nonperiodic basic set F in Inv(R), and from the choice of the integer s it follows that all the entries of (Mkk) ' are positive. Let X denote either A or.B. Let integer k --1 be given. Let C be any component CQ(J), and assume that C includes a point of basin{X}. We first assume-that k . The definition of Mj, the choice of s, and the results in [Nul] and [Nu2] yields that S<m 1202 R E-Nise and J.A Yorke there 'exists aninAteger i, -k i < k + s' and a component D of D1(J;X) such that D c:C. This ,result together With the .Lfinitions of Ak(J) and D,(J; X) imply immediately that also for I k - I one has -that C includes, a component of Df(J X),for. some i, k i k + s, Since J -was arbitrarily given; we have shown the following. There exists an integer s 11'such that. for every unstable segment Jand, for, each integer k _ I-and eyery component C of Ck(J), the following holds. If C includes a point of basin(X}, then there is an integer i, -k - i <k +s ,ahd a -component D'of D(J; X)-such that D c C, where X denotes- either A or B. This completes the proof of the -basin boundary combinatorial lemma. 1 From now on, let's be-as in the 'basin boundary combinatorial lemma', and let G = F. ,We now consider the escape time of points under G. For every-point x in R, the escape time TR(x) of x under G is defined by T"(x) = min(n ->-N:G"(x) I R and T,(x) = 6 if G"(x) E R for all n - 1. We say that T'(x) = 0 if x q R. We define for every integer k > 1; c (V)= (xc-J: T'(x) ->k) Da(J; A) = {x e J: Ta(xt) =k and x c basin{A)) D(J; B) ={x EJ.: Ts?(x).= k and x E basin(B}}. Hence, for each integer k --'I we, have C7+,(J)is the set of points in Ck?(J)whose escape time under-G from R is at least k + 1; hence, C+ 1 (J) is~the set-of points in j that stay in R under G' . The points in J which will stay in'4h under all iterates will be denoted by CQ(i). For each k > I we have: Ck(J) = Ck+ 1(J) U DkJ; A) U D (J; B) = c'+k 1 (I) U I'D(J;A) U U'D (J9B) that is, J is the union of the set of points Ck'+1(I) whose escape time under G from R is at least k + 1 and, the set of points D5(JA)-in basin{A}'(espectively, D(J; B) in basin(B}) whose escape time under G from R is j,. where 1 k.'We write DM(J) = U D/(J; A) U U D(J; B) k-1 k-l Note that 'C$(J)'= nk=o Ck (J), and J =-C(J)U D.(I). Lemma 5.4. For every integer k > 1, we have: (1) Dk(J;A) = -s D,(J: A); Dk(J; B) = % D,(J; B); '(2) C(J)= C(J) and DG(J) = D (); (3) each component of D(J) belongs to either basin{A} or basin{B}. Proof. The proof is left to the reader. 0 Note that'the set DWkJ) is the set of points x rJ with finite escape time (that is, T1(x) < w). The following result- says that, if the value of the escape time map TG cha ,7es then it changes in steps of 1. Denote the length of a connected subset L CJ by p(L).- 1203 AcCessiUle traectories:onbasin boundiries T-JumPh properiy. ',F&,-evcf, y. J Mihp([x,,yJ) < s 4 J With T Gx) <* there exists s >0 uch-that each - TR(y)li. Proof. Apply lemma,,5.4 andthe Tjtimp lemma in [NY2]: b0 The following, lemma for, G implies 'that if an unstable segment y has a n C(.y), that intersects basin{X}, ,then there is a point p of CG (r) h basin{X} with escape,time k, and- the- length of the component D of DG(y; X)' including p is at' least 6.p(C). Bain 'boundary'geometric'lemma. Let X-denote either A or B. There exists 6 >0, such, that for every unstable- segment J, and for each integer k ;>1 and every component C of Ck(J), we have: If;C includes 'a point of basin(X},>then there isa component D of D((J; X),such that D c' and p(D)Ip(C) : 6. Proof. From the geometric lemma'II in [NY2] applied to G, there exists 6 > 0 such that for every J in U(R),.andfor every integer k._1l,the following holds: (1), each component of Ckr(J) contains components of Ck(J) anc, C' + (J);:and (2) if C is any component of Cf(J), then' every component D of DG(J)flC satisfies p(D)lp(C) > 6, and every component 'U of C+l(J)n C satisfies P(U)IP(C):, 6. Let X denote either A or B, and let J be any unstable segment. Let integer k > .1 and component C of C(J)'be given. Assume ihatvC includesa point of basin(X}. Applying the basin boundary combinatorial lemma yields that there exists a component D of Dk(J; X) such that D : C. From the geometric lemma II in [NY2], since D is a component of Dk(J), and the definition of 6, we obtain p(D)Ip(C) 6. Since J,k and C are assumed to- be given arbitrarily, we conclude for each unstable segment J,for each integer k - I and every component C of CG(J), if C includes a point of basin{X), then there is a component D of D'(J;X) such that D c C and p(D)Ip(C) : 6. This completes 'the proof of the basin boundary geometric lemma. 0 From now on, we fix 6 as in the basin boundary geometric lemma. Before we prove the theorem, we present a non-intertwining.property for the escape time map as well as an auxiliary observability result for accessible straddle pair sequences. We call a'pair {p, q} a balanced pair if T'(p) = T'(q). Non-intertwining lemma. Let {p, q} be a balanced pair, let P be a 62 -refinement of (p, q}, and assume that TR(xi) _TR(p) for every x in P. If each point of P is in basin{A) then [p, q]j is-contained in basin{A}. Proof. Let {p, q} and P be as in the lemma. Assume that~each point of P is in basin(A}. Write in = min(TR(x) :x c [p, q.,}. The assumptions 'T(x ) >_T"(p) for all xi e P', 'P is a 6-refinement of {p, q)', together with basin boundary geometric lemma yields that'm = T'(p). Hence, [p, q]j is contained in a component of Cg(J). If there exists a component D of D (J) including [p, q],, then D is a component of Dmr(J;A), and we are done. Therefore, from now on, we assume that [p, q], is not contained in a component of Dr(J). This implies that there are at least one 1204: ilk Nuse and J.A kokke comnponent of CI~(J) in the interior of Jp, qj, and-atAkast'Wo comiponents o6f 'D(J) which havea nn-e.mpty ,irtersection'.,withi[p, qjj. Let D' be a component of,.DgJ uh htD0 p J, snnemt.B the basin, boundary geometric lemma,, We 'have p(D)/p((p,,qj,)> &. Since P is, an 8refinemenit Iof {p', q, it follows that P flD is noni-empty. This fact and, the assumption "that, eachlpqiht of P is.in basin{(A} imply thit D is 'a,component- of D1(J; A). This implies that [p, qh h D~J)is-contained in basi{} Let C be any component of C-,G,,J) in'the interior or ['P, qhj. Applying the basin boundary, geometric lemma we get thit'p(C)/p((p,,qj) >6. If C includes a point of basin{B}, then- C includes a component D of- L),' B), and by 'the basin boundary geomeic6lemnma, we have p(D)/p([p )=(P(D)/p(C)) -(p(C)/p([p, qkj))* 62. Hence, if Cincludesa point of' bsin{B} then every.0 2-refin enent of (p, q}',inciudes a point of basif{'B}. Sijnce Pis an "62-refiniement of (p, q}) andY does not coniainva -point of basin(-B}, it follows that C includes no point of basinfB). Since C is arbitrry- we get. that each component, of Cl-jJ that'is in [p, q~J contains no ,point of basin(B); Therdforejp, qj, 0 Cli/, 1 j) is contained .in-bAsin{A}. &ecause of '[p, q] ",((p, q], n D'(J) U'((p, q], 0 ))-the coni..~ision is that [p, qj, is, contained iin basin{A)-- This completes the proof -of the ,nonintertwining lemma. 0 Basin boundary observability lemma. L 63/3-refinemdent of~a straddle pair (ao, bo}, and assume T(x,) > Tr'R(aO) for (;eryxi in P. Let (a(,, bI) be, the straddle pair in P, in which b1 is,selected- as -in-the accessibl e -basin boundary refinement procedure. Let, al be defined as in the improved version of the accessible basin boundary refinement procedure. f'P'isa'-refi ,nement of (a(O, b1}, then r a(), aI') is in basin{A}, and T"(al) = T"(a)) + 1. Proof. Let P, (aO,bi), and all be as in the lemifia, and assume that Pfnl[au,,bJ, is an e-refinement of (a0 , b,}, where E= 6'. Let rn =-ffin{T (x):x iE[a0, b'lj. Let a" and at be defined as in the improved version of the 'accessible basin boundary refinement procedure. The assumptions 'TR(Xj): : TR(aO) for all x, e PF, 'Pn [a1O,bljis an 6-refinement of -(ao, 'b1}', together with the basin boundary geometric lemma yields mn = t(a) Hence, [aO, bill is contained in a component of Cr(J). By definition, we have a0 - ao. We 'show first that [at, a~,iscnandn basin{A). App lying the* T-jump property and the basin boundary-geometric -lemma we~ obtain that there exists a component D of Gf,'+ 1(J; A) such that D is in the ineror'o,( bil,, and p(D)/p([ao, bl,) > 6. Therefore, at exists and T'(at) m + 1. The definition of a' and lemma 5.4 imply that [a(, a+], is contained in aI =T(a)R~a If basin (A). Recall that (a+,, a',) is a balanced pair, that is, and a, are in the same-component of D.G, 1 (J) then [a+, al], is in basin(A}, and we get that [a', af], is in basin{A}. Now assume that a+ and a, are in different components of DMG+ 1 (J). Then, '[at, all], includes at least one component C of lemma we have C%.72(J) in its interior, and by the basin boundary geometric +()p(a, a11j)> 6. This implies that P n [a', a 'j is a 62-refinement of (a+, a"). Applying the non-intertwining lemma yields [a;, a,isnbaiA)adweotn also this case that [a', all, is contained 'in basin{A}. We conclude: basin{A} +1 fa': a'l, and TG(a',) = Tg'O inch. Accessible tfaje~ories onwbasin boundaries I 1205 'Ifa ao, , tewit,.follows, ifiiiediAtely, from,,the concisiioai, aboveihAat [o, a I'D is contained in basin{A}. From now ,on, we asswiie a6ol Recall tha a, 9?}is a balaniced pAir. f no and a? -are in,the same comont ofDG)te~[, ?,i n baisin{A}. f -a6 and a? -ared in different components ofD V~(), then [do a?), includesat least one co~mponent C -of, C.,,(J). Since p((ao,, aJ,)/p([ao, bill)---: p(C)Ip([ao, a?1j,> 6,,and P fl [a0 , AIvis: a 62*refinemnent of (aO, a,)', applying :the non-intertwining, lemmia -we obtain-JaO,-,aj, is in basin(A},, Since, [a?1 al'kj is in basin{A}, the~conclusion-is that [a0, aili is contaidd in basin{(A}. This completes, the proof'of'the, basin boundary observability lemma. 0, Proof-of the th'eorem. Let 6-ba as in the basin boundary geometric lemma, and ,choose t= 63. Let {[aij, b.], ,,.,:be,an -accessible straddle 'segment sequence, that is, (ao, --b,) is a straddle pair an&, a,, bJ)is obtained by the improved version of the -basin ~boiindary refinement procedure forall n :-::,. For-n >_, let P',be an sf3-refinementv of (a,, b,,}), -and let m, be as -in-the improved version of the accessible basin bouindary refinedment, proicedure. By the basin boundary geometric wve obtain m, =,,min{T"(x):xO(a,, b.+1 ,}. We will -show, that there, exists4 an; integer N >-0 such that for every integer n :-:N tfolowingroerties hold. ()Ta)=m,(P)I(a+)-TG(ai)I < 1, and (P3) [a,,, '4+1 is contained- in basin{A}. Notice that we do not claim that ITRG(x) -- TG(a )I - 1 orall x-C- (an a---,whr f N. From the T-jump property and the basin boundary-,geometrid lemma, together with- the -assumption that ([a,, is obtained- using the accessible basin procedure, we have ,that if T~r(a,,) > in,, then T'(q,+ f)= in,,, for each a -:0. This property implies that there exists a minimal integer Ni - 0 such that Tg(x,)_: m = T'(ajy) for each xi E 1%,. Hence, (P1) holds for N, We now show that (P2) and (P3) hold for this integer N. Iaccessible I-lemma I:boundary Case 1. Pv is not an E-refinement of (UN, bN+I1). Then av+I = UN, -so [ay, aN+ I j is contained in basin (A) and T'(xi) --mv+ I = TR'(aN+ 1)for each xi EPV+ 1. Therefore, (P3) holds, while (P2) is obvious since aN = aN'+I. I Case 2 PNv is- an E-refinement of The basin boundary observability lemma imolies (P3) since [aN, aN+ '), is contained in basin (A). It also implies (P2) since T"(xi) -_inf.+1 (UN, bN+,}. T"(av+I) = T (aN) + I for each xi E PNv+I. By induction, one obtains the desired result. This completes the proof of the 1 1~ 6. The numerical procedure and related numerical methods 61. The dynamic problem Now we return to the 'dynamic' problem stated in the introduction, namely, to describe a procedure for finding a numerical trajectory on the basin boundary which is accessible from basin{A}. (Recall that the basin- boundary of basin{A} is the boundary of the closure of basin{A}.) We assume we are given a straight line segment that intersects- the basin boundary transversally and has one end point in 1206 H ENusse and J A Yorke basin (A) and the other end point in basin (B). In the statement of the results, we assume that a straddle pair and! its e-refinement lie in a connected subset of an unstable segment, and that ail unstable segments intersect the basin boundary transversally. However, from our proof of the theorem iifollows that a similar result holds if we replace the unstable segment by a straight line segment so that we assume that. every e-refinement of a straddle pair fa, b} is in the straight line segment [a, b] from a to b, and that *[a, b] intersects the basin boundary transversally. A straight line segment [a, b] straddles the stable manifold of a point P if (a, b] intersects this manifold transversally. In the caseswe study, that is, a e basin{A} and b Ebasin{B), the stable manifold of P will be replaced by a (fractal) basin boundary and (a, b] will straddle a subset of the basin boundary. 'Furthermore, in practice [a, b] will be very short and will be extremely close to the invariant set Inv(R). The numerical procedure goes as follows. (1) Choose (with some experimenting) a straddle pair (a, b} and let I denote the line segment from a to b. (2) Apply the accessible basin boundary refine-ent procedure (that is, refine and choose a new straddle pair (x, y} in / and then replace I by the straight line segment from x toy. Repeat this process until the length of Iris less than some distance a (for example, a = 10-). If the initial a and b are less than a apart, then the pair is not changed, Given any initial straddle pair (a, b), we will write (ao, bo) = ABS 0 ((a, b}), for the straddle pair resulting from step 2. Note that Ilao - boll < a. 'ABS' is an abbreviation of 'accessible basin boundary straddle refinement'. (3) For each integer n ::0, and straddle pair {an, b,) such that Ila,,," b,11 <a, compute the refinement for the image pair {F(a), F(b)), and write {a, 4t, b.+1) = ABSo(F(a,,), F(b.)}. Thus we obtain a sequence ((a,, b,},)).o of straddle pairs. Note that only F(a) and F(b ) and a are relevant to the computation of {a,+,, b,+,) = ABSo((F(an), F(bn)}), since ABS((F(a,,), F(b,))) is a straddie pair in the line segment from F(a) to F(bn). Write I.for the line segment from a,to b,. Since the system is hyperbolic and the matrix of the second parti" 'erivatives D2 F will be bounded on the closure of the region R, there will be a bk -:id on the curvature of the curve F(In), and F() will deviate from the straight line segment L, from F(A) to F(b) by an amount proportional to IL I2 , where ILI denotes the length of L,. We thus obtain a trajectory of tiny straight line segments 1,, and to the precision of the computer (about 10-14) we usually have 1,,+ c= F(I,,), and selecting any point x, from I, perhaps the midpoint, we have that Ix,,+. - F(x,,)l is small, typically of the order of a. Since computers can never be expected to produce true trajectories (except in trivial cases such as fixed points), we may say {x,,.o is a numerical trajectory with precision a. Despite the complexity of the construction, we will refer to x,+, as the 'iterate' of x,,. We call the sequence of intervals (I},,),o an accessible basin boundary straddle trajectory or ABST trajectory, and we call the numerical procedure above that generates the sequence {,,no, the accessible basin boundary straddle method or ABST method. Notice that each interval straddles a piece of the I Accessible trajectories on basin boundaries I i207 basin boundary. After-a few iterates, the sequence {x,},, 0 resembles a subset of the non-wandering points in'R which are accessible from'basin(A). 1n this paper we have shown that our procedure, (the accessible basin boundary refinement procedure) is valid in ideal situations. We find that theaccessible basin boundary straddle. method works well in practice even in less than ideal cases, in particular, cases where hyperbolicity seems to fail. If e is chosen too large, then (a., b.)},;.() would still be a sequence of straddle pairs with a, e basin(A}' and b, ebasin{B), but the sequence would not be accessible and probably would not settle down tea periodic orbit. In practice we find that, in most cases we study, the method appears to work well for E= 1/30. In computing the sequence of straddle pairs (a,, b,) defined by the accessible basin boundary refinement procedure, once case (2c),holds, then it can be shown that every e-refinement of the proper straddle pair (a,b) includes a proper straddle pair. For the examples in this paper we find that the accessible basin boundary straddlc method leads (in all-cases- but- one) to accessible fixed-points or periodic points, in agreement with the fact that all the accessible points for two-dimensional saddle-hyperbolic systems are on the stable manifolds of finitely ma., periodic points. The e'.ceptional case is the example of the complex quadratic map of which the basin boundary is two-dimensionally unstable, and the result due to Newhouse and Palis does not apply in this particular case. 6.2. The accessible set on the basin boundary We have seen above that in many interesting cases our numerical method (accessible basin boundary straddle method) produces a periodic trajectory on the basin boundary that is in Inv(R). If P is a periodic trajectory in Inv(R) that is accessible from basin(A), then all the points op the stable manifold of P are accessible from basin(A). Therefore, we need a numerical method that produces the stable manifold of a periodic point. In [YKYJ a procedure has been presented that can be used for the calculation of stable manifolds of saddle periodic points of the diffeomorphism F. The calculation can be made with a guaranteed accuracy, in particular, it can be used to calculate the pieces of the stable manifolds of the periodic points that we find. As illustration, we present in figure 9 the stable 1Figure 9. The stable manifold of the fixed point of the Hdnon map (with p = 1.405, is = -0.3) that is accessible from basin{B}. .18 FlW ENusse and JA Yorke rmanifoldkof the period- 1 saddle in the'example of the Hdnon map for which the attractor infinity (attractoi A) and a period 2 attractor coexist. This stable manifold of the saddle fixed point constitute the accessible set (accessible from basin{A)' on the basin boundary. 6.3. Related straddle trajectories In this subsection we review briefly 'straddle trajectories' that are obtained by methods which are based on refinement procedures such as the'bisection procedure [BGOYYJ, [GNOY], the PIM triple refinement ptocedure[NY1u, [NY21 and the accessible PIM triple refinement piocedure [NY2]. The methods were used in the applications presented in section 3 and the refinement procedures above are related to the accessible basin boundary refinement procedure. Thesedstraddle methods are numerical methods for obtaifiing&trajectories-on the basin boundary and on chaotic saddles. For clarity of the exposition and in order that this paperis self-contained, we describe these methods; see the references above for details. Straddle methods involve ,s refinement procedure in which 2 points on a curve segment are replaced by two new points. In some cases the points have different roles. Usually each of the refinement procedures takes a pair of points and returns a pair of points; such a returned pair is on the line segment joining the two points ofthe original pair. The dhstance between the two points in the returned pair is smaller than the distance between the points of the original pair. Straddle methods consist of applying the refinement procedure repeatedly until the points in the resulting pair are less than some specified distance a apart, say o = 10- '. If the points in the original pair are already less than a apart, then no refinement is carried out. Next apply the dynamics; that is, apply the map F to each of the two points of the w pair. resulting pair, giving The basic numerica, ,nethod takes a pair (as, b,} which is separatedby at most a distance a, and applies the map F to each of the points of this pair. If the new pair (F(a ), F(b.)} is separated by less than a, then it is denoted (a..,, b,+), and -ocedure is applied repeatedly until a pnir with separation otherwise the refineme at most a is obtained, awu it is called {a,+,, b,,.t}. However, ii. ter to produce the first pair {ao, bo), the method starts by applying the refinement procedure on the given pair (a, b}, whose points are presumably more than a apart. Writing I,,or '[a, b,,] for the line segment from a, to b, and to the precision of the computer we usually have 1,+ a F(,). We call the sequence of tiny straight line segments { a straddle trajectory. 'e a numerical BST method. the 'basin boundary dynamic problem' is to dmethod for finding a trajectory on the basin boundary. The refinement procedure for straddle pairs is particularly simple. Let {a, P} be a straddle pair such at a e basin{A} and P3e basin{B}. We define y to be the midpoint of the straight line segment [ar, /3], that is, y = (a + P)/2. If y e basin{A} then we choose cr* = y, * = P; otherwise, if YE basin(B) then we choose cr' = (r, /3*= y. This refinement procedure is also called the bivcction procedure. The solution to the 'basin boundary dynamic problem' is the straddle trajectory using the bisection procedure. We call the sequence of tiny straight line segments j.I}, o a basin boundary straddle trajectory or BST trajectory, and we call the I! Accessible trajectories on basin boundaries 1209 straddle method above that generates the BST trajectory {i),,, the basin boundary straddle trajectory method or BST meihod. Notice thit each tiny, line segment in a BST tjiiectory straddles the basin boundary. A BST trajectory typically resembles (after a few iterates) a basicset.in the basin boundary. SST method. the 'saddle 'dynamic restraint problem' is to describe a numerical method for finding a trajectory that remains-in a specified transient region for an arbitrarily long period of time. First, we describe the refinement procedure that is involved in the current, straddle methliod, Let fa, b} be a pair such that [a, b] intersects S(R) transversaiiy. The notation (x, y, z) for a triple means that x, y, and z lid, on [a, b] auid y is between x and z, and we assume for convenience that the ordering on [a, b] is such that x <y < z. For each e > 0, an e-refinement ofa triple (x, z, y) is an E-refinement of {x, y) such that it includes z. Let (a, y, P) be a triple on [a, b. We call (a, , f3) an Intet 'orMaximum triple if both TR(y) > TR(a) and'TR(y) > TR(1)); and we call (a', V, ) a PIM triple if (a, ,,fy) is an Interior Maximum triple and lit - all < Ilb -all. Let (a, V, fP) be an Interior Maximum triple, and let P be an, E-refinement of .). The procedure that selects in 'the refinement P any PIM tripld (a, ,*, f) is called a PIM triple (refinement) procedure. The solution to the 'saddle dynamic restraint problem" is the straddle trajectory using the PIM:triple procedure. We call the sequence of tiny straight line segments Y.),..() a saddle straddle trajectory or SST trajectory, and we call the straddle method that generates the SST trajectory 1 the saddle straddle trajectory method or SST method. Notice that each tiny line segment in an SST trajectory straddles a piece ofa (chaotic) saddle. An SST trajectory typically resembles (after a few iterates) a basic set in the chaotic saddle. ASST method. The 'accessible saddle dynamic restraint problem' is to describe anumerical method for finding a trajectoryon the stable set S(R) that is accessible from the transient set R\S(R). The refinement procedure that is involved in the current straddle method is a PIM triple (refinement) procedure in which a PIM triple (a*, y*, P*) is selected from the E-refinement P of.the interior maximum triple (a, c, b) such that [a, a*] is in-the transient set R\S(R) (hence, [a, a*] does not intersect the stable set S(R)). This refinement procedure is called the accessible PIM triple (refinement) procedure. The solution to the, 'accessible saddle dynamic restraint problem' is the straddle trajectory using the accessible PIM triple procedure. We call the straddle trajectory {(I}),,o an accessible saddle straddle trajectory or ASST trajectory, and we call the straddle method that generates the ASST trajectory {If4,.o, the, accessible saddle straddle trajectory method or ASST method. An ASST trajectory typically resembles (after a few iterates) a-subset of the non-wandering points in R which are accessible from the transient set R\S(R). In most cases that we have investigated we find that every 6-refinement of two points {a, b}, when e is chosen to be 1/30, includes several PIM triples. In [NY1], [NY2] we find that the ASST method leads to accessible fixed points or periodic points, Which is in agreement with the fact that all the accessible points for two dimensional hyperbolic systems are on the stable manifolds of finitely many periodic points. In [NY2] we have shown that the two PIM triple procedures are valid in ideal situations (hyperbolic systems). We find SST and ASST methods work well in (a', y, 1210 H E Nusse and JA Yorke practice-even in less than ideal cases. From the examples in [NY1]; we have seen that-the SST method-works quite well for, a-variety of dynamical systems. Most pictures in section 3 for which one of the numerical straddle procedures has been applied in order to obtain a single numerical trajectory, have been obtained by selecting e = 1/30 as default value, and neglecting the first 10 iterates. We chose e to1 be-somewhat smaller (0.01) in the ABST method for the Hdnon map (parameter values p = 2;66, -- 0.3). 1 6.4. Shadowing It is important to- ask if such straddle trajectories obtained by one of the straddle methods (BST method. SST method. ASST method. or ABST method) represent true-trajectories of the system. In other words. does-there exist a true trajectory of the system that shadows (i.e. stays close to)-the numerical trajectory obtained by a straddle method? When a map is sufficiently hyperbolic on the invariant set in question. Bowen [BI obtained a result saying that each noisy trajectory in the an-wandering set can be shadowed by a true trajectory if the perturbation is small: see [BI for the precise statement. Recall that Inv(R)-satisfies the 'no cycle condition' rk(.% if whenever basic sets 17k(1) .... 1) is a sequence of basic sets in Inv(R) for which the stable set of rk(i) has a non-empty intersection with the unstable set of I', +lIfor all I <_i < k(M), then the stable set of rk(1f) does not intersect the unstable set of rk(i). Assuming Inv(R) satisfies the *no cycle condition' and 6 is sufficiently small. we can show that every BST or SST trajectory of a two dimensional uniformly hyperbolic system with a fractal basin boundary or a chaotic-saddle, obtained by the BST method and SST method respectively, can be shadowed by a true trajectory (for as long as the saddle straddle trajectory can be computed). I 7. Concluding remarks 7.1. Higher-dimensionalsystems One of the ingredients in the analysis of the validity of the accessible basin boundary procedure in dimension two, is the existence of a C +" foliation -1y on a neighbourhood of a basic set. The proofs of the basin boundary geometric lemma and the basin boundary combinatorial lemma require the existence of such a stable foliation (see also the proof of geometric lemma II in [NY2], on which the proof of the basin boundary geometric lemma is heavily based). For d = 2, the existence of such a foliation is guaranteed by a result due to De Melo [MI. Unfortunately, the existence of a foliation a' on a neighbourhood of a basic set in higher dimensions is not known, see e.g. [PT]. Let from now on, the dimension d >_3. Let Fbe an Axiom A diffeomorphism, let R be a basin boundary region such tho. dim E" = 1, and assume that for each basic set r in Inv(R) there exists a C' staule foliation T on a neighbourhood of 17, for some a > 0. Then the conclusion of the theorem is again valid. The proof is almost the same; instead of proposition 5.2 one should use the properties of Markov partitions of basic sets; see Bowen [B]. I : Accessible trajectorieson basin boundaries 1211- Z2 Orderof diffrpntiabilityof the diffeothorphism + I We-assumed that the diffemorphism-F is C3. This assumption implied the existence of a C' expanding map, for some cr>0, in proposition 5.3; If F is of class C2 , then it is known that such an expandngmap is C'. We would like to point out, that the H61der exponent cr is only used to obtain (2) in the proof of, the Geometric lemma I in [NY2I; the proof of the basin boundary Geometric lemma depends indirectly on this result. Fortunately, -we can prove Geometric lemma I in [NY2] (in particular the property (2) mentioned above)for the C'-map .9of proposition 5.3 by combining the techniques of the proof of proposition 6 in [Ne] and lemma 5.5 in 2 [Nul]. Thus in fact, it is sufficient to assume F is C to guarantee the main result of the paper. 7.3. An ad hoc numericaltechnique I 3 [GOY1] describes an ad hoc straddle technique for determining accessible periodic saddle points on the basin boundary. In [GOY1] it is.issumed that there are two attractors A and B. The objective in [GOY1] is tofind a saddle periodic point-on the basin boundary that is accessible from, basin(B}. This-method worked-on several test problems but had no rigorous foundation. The objective of this paper is to attack the problem raised in [GOY1] and we find a straddle method (ABST method) which has a rigorous foundation. i 7.4. Examples 3 By using the SST method, in the example of the H6non map with parameter values p = 1.812579 70, It = 0.022 864 30 the resulting SST trajectory gives virtually the same picture as figure 4 (which was generated using the BST method). Also in this case, the ASST trajectory is similar to the ABST trajectory. In the second Hdnon example (p = 2.66, It= 0.3) we choose in the ABST method e = 0.01; the ASST method gives a similar result when E= 1/30 is chosen. 7.5. Smooth or fractal basin boundaries I 3 I IThe accessible basin boundary procedure is valid for smooth as well as fractal basin boundaries. References [ASI Alligood K T and Sauer T 1988 Rotation numbers of periodic orbits in the Hnon map Commun, Math. Phys. 120 105-19 MAYI Alligood K T and Yorke J A 1989 Accessible saddles on fractal basin boundaries Preprint [BGOYYI Battelino P M, Grebogi C, Ott E, Yorke J A and Yorke E D 1988 Multiple coexisting attractors, basin boundaries and basic sets Physica 32D 296-305 (BPI Berman A and Plemmons R J 1979 Nonnegative Matrices in the Mathematical Sciences (New York: Academic) 1~1& I- Nusse andfA Yorke [B] Bowen R 19-75 Equilibrium States and the Ergodic Theory of Anosov biffe6morn';sms 4.ecture Notes-in~Mahematis 470 (Berl in: Springer) [B] -Bowen R and Ruelle D 1975 the irgodk theory of Axiom A flows'lnvent. -atdh. 29 181-202 [GH) Guckenheimer I and H6lmei P 1983 Nonli -nearOscillatiotis, D%1,aiical Systems. and Bifurcations of Vetor'Fields'App'lied Mathematical Sciences 42 (Berlin: Springer) (GNOYJ -Grebogi C, Nusse H E; Ott E and YarkelJ A 1988 Basic sets: sets deteiine the dinriension of basin undaries Dynamical Systems: Proc0. University of Maryland 1986-87 (Lecture Notes in Mathematics 1342) ed J C Alexander (Berlin: Springer) pp 220-50 [GOYi Grebogi C. Ott E and Yorke J 1987 Basin boundary metamorphoses: changes in accessible boundary orbits Physica 24D 243-i52 (GOY2J Grebogi C, Ott E and Yorke J A 1987 Chaos. strange attractors, and fractal basin boundaries in nonlinear dynamics Science 238 632-8 [HI~l Hamimel SM and Jones C K R T 1989 Jumping stable manifolds for, dissipative maps of the plane Physica 35D 87-406 [KGj Kantz H and Grassbergcr P 1985 Repellers. semi-attractors, and long-lived chaotic transients Physica 17D 75-86 ][MI de Melo W 1973 Structural stabilityof diffeomorphisms on two-manifolds Invent. Math. 21 233-46 IMGOYI McDonald SW, Grebogi C, Ott E and Yorke J A 1985 Fractal basin boundaries Phvsica 17D 125-53 [NPJ Newhouse S and Palis J 1973 Hyperbolic nonwandering sets on two-dimensional manifolds Dynamical Systems M M Peixoto (NewYork: Academic) pp 293-301 [Ne] Ncwhouse SE 1979 The abundance of wild hyperbolic sets and non-smooth stable sets for diffeomorphisms Pub!. Math. IIIESSOIOI0-51 e [Null Nusse HE 1987 Asymptotically periodic behaviour in the dynamics of chaotic mappings SIAM J. App!. Math. 47 498-515 [Nu21 Nusse H E 1988 Qualitative analysis of the dynamics and stability properties for Axiom A maps 1. Math. Anal. App!. 136 74-106 [NY1 I Nusse H E and Yorkc J A 1989 A procedure for finding numerical trajectories on chaotic saddles Physica 36D 137-56 [NY21 Nusse H E and Yorke J A 1991 Analysis of a procedure for finding numerical trajectories close to chaotic saddle hyperbolic sets Ergod. Theor, Dynam. Syst. 11189-208, [PR] Peitgen H10 and Richter P H 1986 The Beauty of Fractals (Berlin: Springer) (PTJ Palis J and Takens F 1987 Homoclinic bifurcations and hyperbolic dynamics. 161h C'oldquio Brasileiro Matemdtica.. IMPA 1987 IYI Yorke J A 1989 DYNAMICS. A Program for IBM PC Clones 1989 [YKYJ You Z, Kostelich E J and Yorke J A 1991 Calculating stable and unstable manifolds /ft, J, Bifurcation and C/haos I in press. Volume 156. number 1.2 I I PHYSICS LETTERS A, 3June 1991 Calculating topological entropies of chaotic dynamical systems Qi Chen. Edward Ott'I and Lyman P. Hurd ~ ~Lahoratri for I'Iasmna Re'~earch. L'nm'rviv if*3larvland, ('ol'eve I'aik..111 20 42. Received 13 Fetbruar 19I. ie%ised manuscript rcceiscd 26 March 1I") 1.atccepted lr publiwtion %\prii 1991 Communicated b AXP. Fordy I We present an ciielent kofritihm Ior wik ulai ng iopologit.i~ miopi eN W Ihd% klOOiAli.a "'OUS11 fi.1 ow hd .ipic% it) chaoiic altictors as \Nell a% ciiaoti saodkes, The quantitative characteriz.ation of chaotic processes has proven to be an important issue in nonlinear dynamics. Calculations of' Lyapuno\ exponents and ('ractal dimensions hiase been \er\ useful in this regard. \nother fundamental qujantity is thle topological entropy [ 1,2.1,w\hich characteriz.es the comnplexit\ of'the orbit structure o1'.4 gis en ch namical Stcei. Thle topological entropy i%imsariant untopological conjugac\ of'the dynamical systemIs (iLe. it is preser' ed by Lontinuous and not necessarily differentiable changes of' ariables), Ider The general definition of' topological entropy is computationall\ unwield\, so .alculations inaria- IFor Inential general definitions. an axiomi Ndif'feomnorphism I' (see ref. 131 f'or a definition of the conditions satisfied b\ an axioin \ systemn thle topologikal cntrop\ is thle gross th rate of the numbei ul periodic. points5 13 1. Let P),, be thle number of fi\cd points of' ihle it times iterated map P'". thIus. P,~ LOUfS thle niumbei of points of period it plus thle number of'points " hose period di'ides it. 'flhe topological entropy. W.1 . satisfies IgII lgP It (2) ('haoit *L sstems encountered inapphLations are often not axiom \. Nes ertheless. for non-axiom A stiuations. it is often assuimed that eq, (2 1continues to hold. ELen so. chait sstvims tend to he numeri, all\ un'stable. and this c in make it dif'ficult to obtami I suiffi-ientl\ large number ol periodic orbits to uise in eq. ( 2). Calculations based onl this method iequtire Ingenuit\ and Iiai'e been carried out in a few cases 14.i1. AnAIM* 11proac2h ducto Ne%lmhe and Yonmdi ponentmal gross t rate ol'a A-dimiensional volumle in thle plhine space j 11. F-or tso-(imenslonal mnaps. Nevwbouse uises these testilt to obtain numerical bounds (inthle eiitiopy bk .oiptmin the exponential !;.-o\\ III rat of thle length of a ty pical line segment. Recenils. a more sophismiated technique based onl !;enviatintg paritliuns ol haooic attractors has been proposd. Tis mnichud seems to yield precise estiatc'. on topological ci1uiopies. I luses er. genlerating partitions are usually difficuit to construct 16]. tills note. we introduce a new algoriihmn for cal- I) culating the topological entropy which is particularly Thus for N sufficiently large. %%e hiase thle approximation some aksaitages oser pre\sious methods. It applies Ii= lim '~'~ Also at. Department of Elcttriwai Engineering and Depai,meni of Physics. 48 simple :and efficient. and may in some cases have to chaotic attractors as \\ell as chaotic saddles. Consider anl insertible map of thle plane (. I-') j,'( v. ' ) Choose a compact \olume V. Normally we choose V to contain thle chaotic Invariant set of 11.17-9601/9 'S 0)3.50t 149 1 - Ulse%ivr Science Publishers B. t North-HollandI Volume 156, number I;2 PHYSCS LETTERS A. ,themap. Howe'r,.since:the topolosical entropy of F is bounded below by the entrpy -restricted to any subregion. our'algorithm obta...j lower bounds evenwhen this is not the case. This fact is useful if one does not know a priori bounds on the dynamics. We assume that under the action of the inverse map F-', all points in Vexcept for aset of Lebesgue measure zero (the invariant set and, its unstable rhanifold) eventually escape V.This is true, for example, for area -contracting maps such as the-djissipative H~non map. Consider the intersection of V with its preimages, V.=VnF-'(V)nF- 2(V)c ...n 3 June 1991 behavior of s,, for largqen. Alteriatively, wecan plot log N, Versus n and estimate in(F) as the slope of the fitted curve, (discarding a suitable number of small n values); To obtainr ai estimate of the number of disjoint strips in V,,, let T(x) denote the smallest value of n such that F-n(x) is not in V. We call T(x).the inverse escape time from V. Now consider a line cutting transversely across the stable manifold. Then this line also cuts through all strips in V, for large n, since each strip of V, lies basically along the direction of the stable manifold. Hence, N, is given by the num- F-"(V). For large-n. V, generally consists of disjoint elongated strips i. ing in the direction of the stable manifold of F for the invariant setcontained in V. In the limit n-,oo, V,, is the intersection of the stable manifold with'V. Let us denote the total number of disjoint components in-V, by N, (in the-case of the standard hnrseshoe map, this number is 2"). The theoretical oasis for our algorithm lies in the following model situation (see ref. [ 7 ] ). Let V be a rectangle whose sides are roughly parallel to the stable and unstable directions of the invariant set. If F(V) o)V consists of in horizontal strips and F-I(V)cnV consists of in vertical strips, and Funiormly contracts horizontal strips, and F - 1 uniformly contracts vertical strips, then F restricted to the non-empty invariant set A=no.=.,,, F'(V) is conjugate to the full shift on in symbols which has entropy log in and therefore the map F has entropy at least log m (see re,. 17] for details). Given the region V and the map F,often the above hypotheses are not satisfied, but are satisfied by an iterate, F" and a possibly smaller region V' :V ". Recalling that Nn is the number of disjoint strips in F-"(V) rnV the above argument implies that the en- ber of intervals where T(x)>n in a typical onedimensional line cut. In practice, we count the number of such intervalswhere T(x) >,n for successively larger values of it and calculate the quantity s,, up to a certain level, or until it converges within a given tolerance. Although h obtained in this fashion only gives a lower bound for the topological entropy, for all the systems where comparisons with previous calculations are available, this algorithm appears to yield very sharp lower bounds, We remark that in studying chaotic scattering in two-dimensional Hamiltonian flows, Kovdcs and TO have obtained a similar quantity, Ko, for the Poincar6 map on a surface ofsection, They call Ko the topological entropy of the scattering process [8]. Their method is similar to ours except that we use F - I while they use F (the topological entropy of a map and its inverse are the same). Using F -', however, allows us to obtain the entropy of chaotic attractors (this is not possible using the method of ref. [8], which was designed for chaotic saddles). We first illustrate our algorithm for the Hdnon map, tropy ofF" is at least logNn. Since h(F")=nh(F), X+, we define If the above hypotheses are satisfied by the region V and iterate i, the above estimate forms a rigorous lower bound. In cas ere explicit checking ofthese hypotheses is impr..cal, we examine convergence (4) =x. Set b=0.3, in the parameter range !.4,a<4.0, the invariant set of the Hnon map changes from a strange attractor to a strange saddle, and finally to a full 2-shift (horseshoe). For a sufficiently large, the topological entropy saturates at log 2. It can be shown that the invariant set of the Hdnon map is included in the square max( lxi, liy )<R, where [9] =a-x +by., '+ R= J(l + Ibl + [( I + Ibl ) 2+4a ")2 }. " Recall that the topological entropy of F restricted to V gives a lower bound for the topological entropy ofF restincted to V. This is the region V which we use for calculating the inverse escape time function. For simplicity, we take 49 Volume 156. number 1.2 a vertical one-dimensional line through the- origin .k =0. y=0 and calculate T(.x) at regularly spaced intervals. This is shown in fig. I for a=3.0, where the invariant set is topologically a full 2-shift (horseshoe). There is a natural Cantor set level structure in the inverse escape time function. At level 1.there ae two intervals from which it requires at least two backward iterations to escape the square V; at-level 2. there are four intervals from which it requires at least three backward iterations to escape V. etc. The intersection of these intervals is the intersection of the stable manifold of the invariant set with the vertical axis. I'sing a double-precision algorithm, weare able to calculate the inverse escape time function up to level 20. The algorithm is implemented as follows. Starting from the initial interval Q0given by the intersection of the vertical axis with V, we interpolate q, with auniform grid of N=50 points %ridcalculate the inverse escape time for each point with cutoff time n= 2, We find all the intervals 21's in the grid where the inverse escape time function is greater than I. We then interpolate again each interval Q,with 50 points, calculate the inverse escape time for each point with cutoff time n=3, and find all the subintervals 22's where the inverse escape time function is greater than 2, etc. Assuming each iteration of the Hnon map costs about 10 machine instructions and the topological entropy to be calculated is log 2.the whole calculation up to level 15 then costs approx- 21 I ,HYSICS LETTERS A U6 T~ • ,i 1,K11 I -3 03 -2 71 K0.7 0 1 2 i 3 Fig. 1.The inverse escape time function for the Henon map at a= 3.0. b=0.3 for a verical cut through the ongm =0. j =0. 50 0.91. !0.8? I 3 I imately 32 million machine instructions. On a 10 MIPS workstation. the whole computation takes approximately 3 s.We can achieve better precision by going to higher levels or interpolating more points in the grid. The calculation time typically increases with the level at an exponential rate given by the topological entropy. Usually, level 10calculation ( I million machine instructions, or 0.1 s on a 10 MIPS workstation) yields good estimates on the entropy for chaotic systems. (For instance, for the Htnon attractor ata= 1A b=0.3, level 10 calculation gives .s=0.660. while level 15 givess=0.670. a relative error of less than 2%. Note this value isconsistent with the one obtained in ref. [6].) In all our numerical examples, the logarithms are taken to be base 2. Fig. 2 shows the topological entropy for the H6non map at b=0.3 in the parameter range l.4.<a<3.0. It is calculated with 100 interpolation points at level 15. This figure seem,. to be identical (with better precision) with the one obtained by Biham and Wenzel [41. Note that there are plateau regions where the entropy is constant. This is because for any parameter value where the invariant set is hyperbolic, the topological entropy must be locally constant due to the structural stability of hyperbolic sets. The whole calculation with 260 parameter values takes about 50 min on a 10 MIPS workstation. We also apply our method to open Hamiltonian systems. Generically, the phase space of Hamilto- ----.r -3June1991 0.61.0 1.5 2.0 a 2.5 3.0 3.5 Fig. 2.The topological enirepy for the HWnon map as a function ofaatb=0.3.Thisgraphisobtained usingthe methoddescnbed in the text at level i5with 100 interpolation points. Volume 156. number 1.2 PHYSICS LETTERS A nian systems has mixed, components (101:' regular rotational motions of KAM type, andirregular motions with positive Lyapunov exponents. If the irregular component is noncompact, its only bounded invariant subsets are strange saddles. The topological entropy is related to the escape dynamics from the saddle [ 5]. We wish~to calculate the topological entropies of such systems. One example-is given by the area-preserving sawtooth map on the plane [ 5], Heref./(x) is a sawtooth function, x(6) where (.vI denotes the greatest integer in x. Note that f(x) is discontinuous on the line x=0, therefore the sawtooth map is piecewise linear with constant Jacobian matrix except on the line x=0. The nonlinearity of the map comes from this line of disconti. nuity. For K>0, the map is uniformly hyperbolic except on the discontinuity line, hence, there are no KAM curves in the phase space. The Lyapunov number A is related to the parameter K by J = I+ 0.5 [K+ (K2+ 4K)1121]. L ..er the action of the sawtooth map, almost all initial conditions inside the fundamental region V= (x: xIx0.5) escape to infinity. It can be shown that there is an unstable invariant set r in V [51. Fig, 3 shows this invariant set at A = 2.4. We will calculate the topological entropy for this invariant set as a function of.A. (We.iote that the t6pologicaientropy of the sawtooth map defined on the-plane is different from the topological entropyof.the same map defined ofi the torus. In,the-latter case. the, space is, compact, the chaotic invariant setis'the whole torus. which -contains r as a subset. The- topological en!ropy of the latter is g:. en by the Lyapunov exponent log A, the uniform expansion rate of a line segment.) When A> 3,we can show that the invariant set is a full 2-shift [5], therefore, the topological entropy saturates at log 2 when / > 3. For convenience, we choose the cut at x= -0.5. The topological entropy is shown in fig. 4 for 2< ,10.The solid curve is the entropy obtained by counting the number of periodic points of the map by using the coding scheme of ref. [ 5], the dots are entropies calculated with our algorithm at level 18. The agreement is excellent. When i <2. the convergence for both methods becomes slow, and we find it prohibitive to obtain the entropy value wthout going to a higher precision algorithm. We note that there is no apparent plateau structure in fig, 4,This is because the invariant set is not everywhere hyperbolic in this parameter range. We have also calculated the topological entropy for the corresponding invariant set of the standard map on the plane. The standard map is given by replacing the impulse function in (5) with a sinusoidal function [10]. t.o U, 0.8 0.4t O.6 0.2[ y 3June 1991 0.0 L0.4 .4 - ..... 0 1& I )1 1 -- -0.2 0.02 2.0 .0. 2.4 2.6 2.8 3.0 1 -0.41 -0.6 .0.5 .2 2.2 0 3 0.5 Fig. 3.The unstable invanant set for the sawtooth map at A = 2.4. Fig. 4. The topological entropy as a function of A for the sawtooth map. The solid curve is the entropy obtained by counting the number of n-cycle fixed points, the dots are the level 18 calculations with 100 interpolation points. 51 I Volume-]56. number 1 2 3 I.)- ,PHYSICS LETTERS A (7) -sin(2nt.)/2r. For moderately large values of K, (of -rder I), the I motion ,in the phase space has'both regu lar and irregular components.. However, When the parameiet K is large, the map is almost hyperbolic [ 10]. Therefore. the invariant set contaifned'in the fundamental region V= Kx: Ixi <0.51 is a strange saddle for large K. In fig- 5, we show the topological entropy in the parameter range 5.0 < K< 9.0 calculated using our algorithm atlevel 10 (again logarithms are calculated in base 2), We see that at Kz,8.4, the topological entropy saturates at log 3, indicating the invariant set is topologically a 3-shift. Indeed. this is the typical dynamics of the standard map for large parameter K. 3 June 1991 In conclusion, we have presented.aft efficient algofithm for calculating:the topological' entropy of chaotic dynamical systems. QC wants to thank the Aspen Center for Physics for its hospitality and Rex Skodje for discussions. This work was supported by the Office of Naval Research (Physics), by the Department of Energy (Scientific Computing Staff Office of Energy Research) and by the Defense Advanced Research Projects Agency. References We again note that in the entropy function there are obvious plateau regions where -the topological entropy remains constant, similar to the case of the ( I S,Newhouse. Entropy and volume as measures of orbit complexity, in: Lecture notes in physics, Vol. 278, The physics of phase space (Springer, Berlin, 1986) P.2 . H46non map. (21 P. Walters, An introduction to ergodic theory (Springer Berlin. 1982): N.F.G, Martin and J.W. England. Mathematical theory of entropy (Addison-Wesley, Reading, 1981 ). 2,0 .-4--1.5 I(31 R,Bowen, Am. Math, Soc. 154 (1971 ) 377: A.B. Katok. Publ. Math, IHES 51 (1980) 137. (410. Biham and W.Wenzel, Phys. Rev, Lett. 63 (1989) 819; P. Grassberger, H, Kantz and U. Moenig, J. Phys. A 22 I (1989) 5217. 151 Q. Chen, 1.Dana, J,D. Meiss, N. Murray and I,C. Percival, Physica D 67 (1990) 217: N. Bird and F. Vivaldi, Physica D 30 (1988) 164: 11 1.0' I.C. Percival and F.Vivaldi, Physica D 25 (1987) 105. 0,5 : •161G. 0,0_, 4 (71 6 8 10 K Fig. 5. The topological entropy for the standard map as a func. tion of K. 100 interpolation points at level 10. I U I I3 52 D'Alessandro, P. Grassberger, S. Isola and A, Politi. J. Phys. A 23 (1990) 5285. J.Guckenheimer and P. Holmes. Nonlinear oscillations, dynamical systems and bifurcations of vector fields (Springer, Berlin, 1983), (8 Z. Kovdcs and T. TOI, Phys. Rev. Lett, 64 (1990) 1617. (91 R. Devaney and Z. Nitecki. Commun. Math, Phys. 67 (1979) 137. 110) B.V. Chirikov. Phys. Rep, 52 (1979) 262. I On the Tendency Toward Ergodicity with Increasing Number of Degrees of Freedom in Hamiltonian I| Systems Lyman Hurd Iterated Systems, Corp. 5550A Peach Tree Parkway, Suite 545 I 1 Norcross, GA 30092 and Celso Grebogia and Edward Ott b U ,Laboratory for Plasma Research University of Maryland College Park, Maryland 20742-3511 U ABSTRACT INumerical experiments on a symplectic coupled map system are performed to inves- tigate the tendency for global ergodic behavior of typical Hamiltonian systems as the I number of degrees of freedom N is increased. As N increases, we find that the fraction of phase space volume occupied by invariant tori decreases strongly. Nevertheless, due to observed very long time correlated behavior, a conclusion of effective gross ergodicity cannot be confirmed, even though extremely long numerical runs were employed. 3 1! a. and Department of Mathematics, and Institute for Physical Science and Technology. b. and Department of Physics and Astronomy, and Department of Electrical Engineering. 'The bAsic assumption in' statistical mechanics is thatf of ergodicity over the phase -space hypersurface determined by- the global constants, of the motion (e.g., total-energy,. total angular momentum, etc.). On the other hand, studie, of 'Hamilt0nian systems with few degrees of freedom (e.g., two) typically reveal the presence ofinvriant KAM tori in addition to chaotic orbits; and -the -existence of KAM tori yields motion that is grossly different :from that assumed in statistical' mechanics. A natural supposition reconciling the above contradictory views might be that, As the number of- degrees of freedomis increased, the tendency for global ergodicity increases. By "tendency for global ergo lcity" we meaii that, for systems with many degrees of freedom (the situation of interest in statistical mechanics), the, overwhelming majority of' initial conditions would be ergodic over effectively all of the area of the phase space hypersurface determined by the global constants of the motion. The purpose of this paper is to present numerical experiments which attempt to test this supposition in a specific case. In particular, wt study a symplectic map system (the symplectic condition insures that the dynamics is Hamiltonian). A closely related wcrl is that of Falcioni et al.1 For other previous relevant works on Hamiltonian dynamics in higher number of degree of freedom systems see Kaneko and Bagley, 2 Gyorgyi et al., 3 and the discussion and references in the book by Lichtenberg and Lieberman. 4 The main result of the present paper is that, for the system we study, the fraction of orbits on tori decreases very strongly as the number of degrees of freedom is increased, but there is still no conclusive evidence for effectively complete global ergodicity even over the very long times investigated in our numerical experiments. The latter is due to the extremely long time-scales, insensitive to machine precision, observed in the numerical experiments. The system we studied -derives from the standard map, x/ = x+Y, (1) y = y+ksinx'. In these coordinates the map can be considered as a map of the two-torus T2 , 0 < x < 27r and 0 < y < 27r. Given a positive integer N, consider the space (T2)N thought of as 2n-tuples (xo,y0,xi,.y, ... ,Xv-1,yv-). We define a coupled standard map allowing symmetric 2 bidirectional nearest neighbor interacti6ns , t x, SY _. Xi + yi, (2) = y j+ K sinxx + CK sin(x x.-.) + CKsin(x - where the indices are taken modulo N and xi, yj are taken modulo 2-r. Here C is the I coupling parameter to nearest neighbors. Letting K = k/(2C + 1), Eqs. (2) reduce to Eqs. (1) for N = 1. We call k the nonlinearity parameter. This map is symplectic since it can be obtained from the generating function, n F(xx') = 3 (x 2 =1 -xi) 2 + K cos x + CK cos(x - x+). (3) One checks readily that yj = OF/Oxi, y = -DF/Oz,. The original aim of our numerical experiments was the exploration of the relative Smeasure 3 3 of KAM tori as a function of the number of coupled maps. To this end, we first note that motion on KAM surfaces is quasiperiodic with all Lyapunov exponents zero, while motion not on KAM surfaces typically is chaotic and has at least one positive Lyapunov exponent. Thus we proceed as follows (see also Ref. 1). A cutoff value Cfor an orbit to be considered quasiperiodic was set and the number of initial conditions with largest Lyapunov exponent (LE) less than e counted. The run consisted of taking m initial I conditions uniformly distributed in the 2N-torus and iterating them approximately 106 times along with their tangent vectors to compute their LE's. A cutoff value e = 0.005 for the largest LE was set below which an orbit was considered quasiperiodic, and the ratio of the number of quasiperiodic initial conditions to the total number of initial conditions 3 3 3 3 was returned. When the coupling coefficient is zero, the volume of the KIAM tori decays exponentially with N. In particular, if f denotes the fraction of phase space occupied by KIAM tori for a single standard map, Eq. (1), then the fraction of the phase space (T2 )N for N uncoupled maps for which motion in the 2N variables (Xo, Yo,..., XN.-1, YN.1) is quasiperiodic is fN. When C > 0, the rate of decay was observed to increase dramatically. Results for the parameter values C = 0.5, k = 0.3 are displayed in Table 1. In this table the estimated measure of quasiperiodic (QP) initial conditions (sLcond column) is the fraction of 8192 1 3 ,randomly chosen initial conditions yielding LEs Aless than E. Maps J Estimated Measure of QP Initial Conditions J 106 1.000 0.403 0.048 0.002 0.000 0.000 0.000 1 2 3 4 5 6 7 Iterations 3.25 3.25 3.25 1.25 x 106 x10 6 x10 6 x 106 106 106 Table 1: Fraction of Initial Conditions Yielding Quasiperiodic (QP) Orbits Figures 1 show histograms of the observed distribution of maximum LE's for the 8192 randomly chosen initial conditions for N = ., 3,.... ,7 coupled maps. The case of three maps is presented twice with different numbers of iterations for the same set of data. The observed peaks get sharper but the effect is very slow. In most cases the following phenomena were noted: 1. The number of initial conditions following within the e bound for quasiperiodicity decreases rapidly as the number of maps increases. 2. The ohk peaks grew sharper with repeated iteration-but very slowly. 3. The histograms with more than one peak preservrd those peaks and they individually got sharper. These observati :us might lead one to conjecture that each peak represents a distinct ergodic component with its own maximum LE. We now discuss the behavior of six individual orbits for the N = 3 case, where the orbits are chosen so that their maximum LE's cal:ulated after 3.25 x 106 lay in distinct regions of interest of the histogram in Fig. 1(c). The calculated LE values for these six orbits are indicated by the arrows labeled with the letters (a)-(f) along the axis of Fig. 4 1(c). The pruojection of these orbits onto. the first two components (xo,'yo)'areplotted for 101 iterations in Figs. 2(a)-(f). Distinct orbits appeared to stay constrained in afixed regionof phase space, azpd this was also true when time series of 10' iterations -were plotted. Lyapunov exponents were then computed for each of these orbits-for a much greater period of time (3 x 108 iterations). The results are shown in Fig. 3 where the letters (b)l(f) labeling the curves correspond to the orbits shown in Figs. 2(b)-(f) and the arrows shown along the horizontad axis of Fig. 1(c). The first initial condition, which was presumed quasiperiodic, remained stable during the whole process, and in fact its computed LE reached zero to machine precision. Initial condition (f) also remained at a highly stable value. The remaining four, however, appear to have started to converge. slowly to a new common value. Figures 4 break down the curves in Fig. 3 (plus orbit (a)] giving the cumulative LE and a "local" LE which is calculated in 500,000 iterate bursts. Observe the gi'cat stability of initial conditions (a) and (f). Further studies were conducted for a variety of initial conditions and various behaviors were observed. 1 1.Some initial conditions "tppeared to lead to orbits whose LE's showed a great deal of stability (they remained essentially unchanged over the observed time scale). 2. Some initial conditions showed a high degree of stability at one value of the maximum LE but then "leaked" into a regime with a different LE. 3. Some initial conditions alternated between chaotic behavior and behavior very close to quasiperiodic. Il One effect of these observations was to call into question the reliability of the LE calculations in general. Many of these calculations seemed to be stable for greater than 106 iterates before changing value. Given the relative rarity of these "leaks," it was impractical to assign any numerical value to this diffusion. * 5 The histogram calculationswere performed on a"Connection, Machine, using (0f necessity) single-precision, arithietic,. The orbit calculations were performed, on a-DecStation I 3i00 usifig double precision. To examine ,the effect of machineprecision several ofthe -long-term LE calculations were done at both single and double precision. The observed behavior was qualitatively the same; the observed leakage between regions of different LE occurred in each case (at slightly different iterates). This work was supported by the Office of Naval Research (Physics Branch), by the Department of Energy (Scientific Comp-4ting Staff, Office of Energy Research), and by 3 the Defense Advanced Research Projects Agency. Ii U 6! I I I I I 1 I I REFERENCES ILJM. Fai-cioni, U._ M-arini Bettolo M-arconi, A. Vkiilpiani, Phyvs. Rev- A 44, 2263 1 (1991). We had completed our research at the time of the publication of the paper ofFalcioni et al. Because of the similarity of that work and ours, in this Letter vde shall be somewhat briefer than we otherwise might have been, and-will also emphasize that part 6f our work which is different fromthat of I alcioni et al. 1 12. ]K. Kaneko and R. J. Bagley, Phys. Lett. A 110, 435-(1985). [3. ]G. Gyorgyi, F. A. Ling and G. Schmidt, Phys. Rev. A 40. 5311 (1989). [4. ]A. J. Lichtenberg and M. A. Lieberman, Regular and Stochastic Motion (Springeri I I I I 1t Verlag,Berlin, 1983). 17 FIGURE CAPTIONS 1. Histograms of maximum -LyapUnov. exponents for 8192 initial conditions and N 2,3, ... , 7. - For (a) the -alue in the first histogram bin is about 2700, far off the scale shown. In general, the value in the first -bin is an estimate of the number of quasiperiodic orbits. 2. Projection of six individual orbits for N - 3 onto the plane corresponding to the first two components. 10 iterations are plotted. The calculated inaximum LE's for these orbits are (a) 8 x 10-5 (quasiperiod"c), (b) 0.0166, (c)-0.0676 [corresponding to the lower LE peak in Fig. 1(c)], (d) 0.1170 [corresponding to the higher LE peak in Fig. 1(c)], (e) 0.1191, and (f) 01300. These LE values are indicated along the horizontal axis of Fig. 1(c). 3. Maximum calculated LE as a function of the number of map iterations for the five orbits corresponding to Figs. 2(b)-2(f). 4. Cumulative and "local" maximum LE for the orbits corresponding to Fig. 3. 8 ii I I I I I I I 1 I I I -- I I I I~ ~ I -' I-~~ I~~I I 6 C Q bn Cl z liii I I U I I ~I I I 00 06 C1C C) C) C C)~~~ )4 co ) C ) C0 lx -q - c1 z CD00 4cq 00 A I I I I i I 4 -D 6o 0L I I I 1. .1 I . Ij I I I I I I I I- l~ *1*'*~I 6 0 0 0 I ~1 I _ -~ -~ ~ 0 yR I I .3 S I I I Iz 0 0 Cl 0 0 i-H 0 0 0 0 Cl 06 I I 6 S 0 0 ______ 0 0c/2 bn II z I 0 I I I I I 0 0 0 0 -0 06 biO~ zI f i l I Cl~~ 0 Ic 1 1 1 1 1 1 . (01 0 t4 C t -~l - 0 it 1 3 I 5 I I I 3 - N * N Cr) *~*~ t* * ~*-*j. -~1~ 0 ~ a I 7 -~ .*-~ N I * J I 3 I N N N 0 13: I I _____________ b I * ~ *.*~* I ~- ~.' * . . .,t,%~ . . . .* V * *74~4.... l~*;... .IW.., . . **. L~.~P'* 7...- . .-. 7 .z'~.. ~ ::1 ........................................................ 7 **~ *. . . 0 *~4-? 4 "7 ~ .' ,*, ,.*.~ '7 - -~'*~* *, * .: I"' .d~. 4 '.... 7~74 .~J~*7 * ~ * " .j~..J .. , 7~".*.. I I eKI Cl 0 . 0 0 'I ~ I I 1 A A * ~ - * ~ ~ .. ' (NI P. ~~,.'*** \~ * ~ ... * * 4" I ~ ':~i.I.. ~ ~ * ~ i I'~ .~ *.9 ~.,**..* *~,, I I I * 3 v~. 4 ~ .~.* .~. ~ ~ ~ ~. ~ ~ Cl ~ ~ .~:I I I. I (NI -~ -~ Cr) 0 I; 3 U .a. * b. ~ 0, I m I ~ I * ~ . ;~'.'P. .. ~;. .. *.~ . -. .*az. .'~ ~ . ..A. "*~s. * ~ . f.*. ~ .~- ,-~*. ~ .~ ... r ... ~ . , ... .'4 ~ * I ~ .. , * * . . , .. *~....... ..: * ... -. . * . ,-.' -I .,.*. ................. . ... ~. * *\****.*****~ .-. .................. ' * * I . .* .~ ;.~*A...',. p. ~ .4 '-.4..., F.. r~ Cl *!*~ :7K . I p p I I I Cl I ; I 'I ~ (N~ 0 I. -. I S £ ~ I ~ I -. ~s. I I.-.. - I. - - *1 L I- -'-9 -- ~.; I...* .~ .-31 . ~ CI I.. ..- -c,..-- - I *1*~" ~ 1 ; *1 * ~-: .... ~~--- * I, .;. ~ '.1 5... I..... S. 9.. .. *1 - -*~-** -. 5 F~. -. 9 ***'.S.; - -:~~ e.g.. L. -' .~ .. ~ -.sI ... 9; S -- Cl 5 K555.... V. I .*, .. - . .~ - I 9.... Cl I i Cl Cl -~ -~ 0 .5 I V 0 0 0 0o a I squouodxa~~0 Aon-eSj I II [ 3 uuax 3 ~x ;- c-, (1) 00 I0 CI- xx 3!a oxr- AIn r un-i-l 0 I- 0 C C 0 -4 0 0 sqU~UOdxaf AoundugrJ tuntuix-e- ~--00 0 1 4 1 -pI x C.c) 5 _ AI!d~r _C_ _ ___uox~ _ _ rrax r)iC) 00 00 i-t-~ £ xd I' _ _ _ _ _ _ __ Iq _ _ _ __ _ _ _ _ _C _ _C s~ueauodxq, Aound-egrj tunuiuxr 1 2 i- Ai r /d * ____ Ao _________ n -1r xe 2-m -00 ."~ r~I co 001 Cd~ LO Lo squaodxaAoud-e~j uintux-I I MtETAMORPHOSES: SUDDEN- JUMPS IN 'BASIN BOUNDARIES 1* by Kathleen T-. Alligood Department of Mathematics George Mason University Fa'irfax, VA 22030 and Laura Tedeschini-Lalli Department of Mathematics and Institute for Physical Science and Technology University of Me-yland, Colleae Park, MD 20742 On leave from- Dipartimento di Matematica Universita di Roma "La Sapienza" Rome, Italy 1-00185 * I I and James A. Yorke Department of Mathematics and Institute for Physical Science and Technology University of Maryland College Park, MD 20742 December 1985 I 1I 1) This research was supported in part by grants and contracts from the Defense Advanced Research Projects Agency, The Consiglio Nazionale delle Ricerche (Comitato per le Matematiche), and the Air Force Office of Scientific Research. METAMORPHOSES: SUDDEN JUMPS' IN BASIN BO UNDARIES Dynamical systems in-the plane can have -many,coexisting,,attractors. In order to -be able to predict long-term or asyrhptotic bahavior in such systems, it is important to be able to recognize to which attractor (final state) a given trajectory will tend. The set'of initial condi- tions whose trajectories are asymptotic to a particular attractor is called the basin of attraction of that attractor. In some systems that depend on a parameter, it has been observed that the boundaries of these basins are extremely sensitive to small changes in the parameter. Not only can a boundary jump suddenly but it can also change-from being smooth to being fractal. These changes, called boundary metamorphoses, are studied at length in [GOY). originally stated in IGOv, In this paper, we prove a theorem, which characterized the jumps in basin boundaries. The H6non map phenomenon. We fix f(x,y) = (A-x--Jy,x) J = 0.3 and vary A, provides an example of this resulting in a one-parameter, invertible map of the plane. The Jacobian of area contracting for all We will be looking specifically at the A. f is boundary of the basin of attraction of infinity. is the set of all points (x,y) such that J; hence, respectively. Ifn(x,y)l 00 as A = 1.314 A = 1.314) well This new set of black points has not gradually moved in from the boundary of the white region. certain critical value n + o.) In Fig. lb we see that the basin of infinity contains points which were previously (at within the white region. is (The basin of infinity Figures la and lb show the basin of infinihy in black for and A = 1.320, f A = A* % 1.3145, Rather, beyond a black points suddenly begin appearing deep in the interior of the white region. 2 As A increases, 7£jr Fiue11ato FiueIsostebsnoItrato fifiiyi lc o th S~) eo a =(-2j.) Wefx J=03InFg aAis134 n nFg bAi inraeIo13. Tecag nth ai fifnt lutae baiIondr up the thin bands thicken. This is a discontinuous change in the basin of infinity. In order to understand this phenomena, we must e*amine the dynamical behavior on'the basin boundary. At A - 1.314 (Fig. la)-the boundary is observed numerically to consist cf a saddle fixed point pl, and its stable manifold WS(pl)., (The stable manifold WS(p) fixed point as p is the set of points (x,y) such that n * ®. More generallyi, the stable manifold point Pk of period fnk(xy)' + Pk as k is the set of points n *. fn(x,y) + p WS(pk) (x,y) of a periodic such that Analogously, the unstable manifold of p,, n % . Such sets can be proved to be smooth curves.) is the set of points (x,y) such that unstable manifold of p at as shown irt Fig. 2a. At the critical value A - 1.314 f-nk(xy) + Pk wu(pl) are tangent (Fig. 2b). WU(pk) as One branch of the extends into the white region, A* % 1.3145, after which the basin boundary jumps Into the white region, we find that and of a WS(pl) S. Hammel and C. Jones [HJ] were the first to prove a theorem relating the tangency of Ws(pl) WU(pO) (called a homoclinic tangency) to basin metamorphoses. methods are different from ours, however. and, Their We want to relate these metamorphoses to the saddle periodic orbits which are found near the points of tangency and which we describe below. The complicated dynamical behavior which occurs at homoclinic tangencies has been studied at length in recent years, especially in the papers of Gavrilov and Silnikov CGS], Newhouse [N], and Robinson [R]. Under certain non-degeneracy assumptions, there are horseshoe maps defined on subsets of the plane near a point and WU(pl). Figure 3 shows a rectangle 3 qo of tangency of WS(pl ) B4 and some of its iterates IN 5,§ I IMEIO I'MMI _ _ _ _ _ _ _ _ _ _ (b ) _ I0 M5NIE. Fiur 2 Captio FiueIaad2 hwtesal n ntbemnflso fieUon eo ead attn eay ep ci ey 4,! under f. Notice that in two components. rectangle to Bn Bn f4(B 4 ) isLorseshoe shaped and intersects In fact,. for sufficiently large, there is a near the point of tangency is a horseshoe map. period h B4 .10 such that fn There is necessarily a saddle orbit of Bn fn(B ) (see, for example, CR)). On' -f these saddles will have a n "flipped" unstable ranifold (i.e., Dxfn at this saddle has an eigen- and sadd),e Pn. -1), 5 restricted n in each of the two components of the intersection of value less than I and the other will not. I We label the unflipped This orbit is called a "simple Newhouse periodic orbit" in [TY). The larger n is, the closer Bn will be to qo and Ws(pl). This corresponds to the fact that the length of time (i.e., the number of iterates of f) it takes for a point . move around the fixe 3 point p1 is determined by how close the point is to the stable manifold3 WS(p,). What we see (Fig. 4) is an infinite family of horseshoes, and a (pn} of simple Newhouse saddles (where Pn sequence and is in B-) such that (Pnl- qo. orbit Pn' for some n In the following theorem, as stated in [GOY], the saddle fixed point S corresponds to sion above, and the saddle orbit has period p, in the discus- T corresponds to a simple Newhouse n. The term "first non-degenerate tangency" refers to the following set (H) of hypotheses, (i) Wu(p]) does not intersect WS(p 1 ) for (ii) There exist points such that (iii) near ght), qo qo = fk(po) p0 for some in WU(pl) and 1, at A = A*. k There is a parametrization such that where ho - qo g'(h o ) = 0 and and A < A*. ht, -1 Wu(pl ) near g"(h o ) 10. q. in WS(p1 ) t 1 1, of qo WS(pl) is given by 3 4 V IA I.84-S Iq IP 12s I4 Ul Fiue3 ato Iiue ilsrtsahrehemp hivratsto h Iosso isi_34n f( ) h+I 8n nI nI f* 00e0 n) B qOI Figure 4 Caption3 Figure 4 shows the relative positions of two simple N{ewhouse saddles pnand pnIof periods n and n+1. respectively. i1l 3.Ii Theorem. Consider an invertible map f of the plane depending on a parameter A with a saddle fixed point or periodic orbit S.. We assume that the absolute value of the determinant of the Jacobian of f (or of fn ii the case of a periodic orbit of period than one at every-point of the plane. I :3S 3!I A* as value Assume that f n) -itless hLs a transition A increases where the-stable and -unstable manifolds of have a non-degenerate tahigency and then dross for the first time. Then there will be a periodic saddle stable manifold of in it at S for all T that is in the-closure of the 'A slightly greater than S* but is not A*. We prove the theorem with the aid of the following lemma. 3i Lemma. Let Pk be a simple Newhouse saddle of period described above) near the point unstable manifolds of :3 pl. unstable manifold of stable manifolu of Pn qo k (as of 'tangency of the stable and Then, for n sufficiently large, the crosses -(i.e., intersects transversally) the Pn+1" We postpone the proof of- this lemma due to its technical nature and' '5 proceed to show how the theorem follows. I crosses at a point x, then the forward iterates of any segment of containing x of ii If WU(pn) WU(pn+)) Su(Pn+) _ _ will eventually contain all of _ _ P(p ) in its set of limit points' (see Fig. 5a). WU(p) _ W WS(Pn+1) WU(pn) (the closure Hence Proceeding inductively, we have that _ _ iThis follows from the [GH]. A-lemma. See, for example, the exposition in 5 u n+II qOI Figure 5 Caption Figure 5a indicates that-the closure of in the closure of tangency qo0 is W Wu(Pnl .O Figure 5b indicates that in the-Closure of many simple Newhouse saddles. the unstable is contained the point of' -manifolds of infinity IN 5as I;thinner, 5 (see Fig. 5b), for- every m- W(p) e)wp mn Pt f-act, I thie proof of the lemma will show, thd horseshoe fn(Bn) contains a segment of Wu(Pn) around And- Approach p--. ri.-l* As Wu(pl1 ). tor the horseshoes become ' A slightly -larger than A for in :4tficiently large, the horseshoe together (1)And (ii),.we have (iii) W~p)C W (p) altouh bth riand after tangency-. 1(1 50 )- At to pi saddles A - -A*, and pn 1 Again N~otice that occurs for values of W(m n 'crosses W5 (p1 ) Expression (iii) is eiquivAlent to Wu(pl) Hence Onis in the closure of the portion of the plane bounded by from p1 to 9p Ws(pl) ,is invariant under -from f. The are in the interior of this region, and hence each one is a positive distance from the boundary For every I and Ws(phn4i) much smaller than the values 'ot m for which (see, for example, EGOY]). I3. for4 n sutficienly large. m are taken "sufficiently large" for this argu- ment, the crossing of WUp) I Wd(Bm), 5 W l(pl).., thus- will cut 'across IPutting 3 Fm(Bm). and hence .and A slightly larger than by the Ws(pl) A*, A-lemma. 6 of the basin of infinity. the theorem says that there is a n- such, that P -is in the closr e-of Pn -jump -inthe -boiindary at the -alue of n WS(pn+i) begins. -tee-s ; A - A*. The condition that .(2) Thus: there is.a Ws(p). - n, is suffienetly;,arge here: rbfers to .for which. the sequence of crossings -of WU(p) For the Henonmap with appears to be, 4. (see CGOY]). J =0.3 and A* -and 1.314,, This is supported hy computer evi- dence that for -A slightly greater than- 1.3145, the saddle ;p4 is on the. boundary of th 'basin of infinity. Non-degen~racy has not been proved for the tangency of the (3) map at 'At Henon .-1.314. However, theoretically, almost every such tangency will be6non-degenerat&. (4), The proof of tthe theorem characterizes the boundary after tangency by showing- that ther, are iWfinitely many saddles and their stable manifolds contained in V (p.) The fact that there is a: jump in the boundary is,, of course,, implied by this characterizat'ion. The existence Of such a jump can be demonstrated by A simpler., t6pologi.cal argument. Any path I connecting the left and right sides of Fig. 4) extends through the horse shoe image crosses Bn+j fn(Bn). B (cf,. f fn(B) (as shown in Fig. 4), a portion -of 'f(i) connects the left and right side of BAJ., If, at tangency crosses Br+1 r,. A > A*, some forward iterate of for all Proof of Lemma. r 2 n, (A =A*), fr(B then I U-I) contains r~n will then cross W5 p). qo. -so For Following the construction of ER), [TY] (see also [GH; Sec. 6.6]) we assume the following: (i) DF(p1 ) has eigenvalues 0 < , < I, X > 1, and v and X which satisfy vX <1. (ii) There exists a neighborhood 7 U of p1 in which the map f(x,y) is linear up to smooth changes of coordinates; i.e., f (Here we need an additional non-reconance U. in (x,y) for (Ax, vy) are not integer multiples of each X and v assumption--namely, that = other.) WS(p1 ) such that (O,qo) satisfy V : [p fk(poO) (Yy + x,y) f-n+k(V) For n, let the sense of Smale [S]. near fn (See Fig. 6.) (Notice that fn+kv) n W. Bn = fn*k(v) which is nearest [GH]), that know (see in ) 6 > 0, such that (x,y) E V. all , fk(po±c,O) n stretches Actually, since we Under hyposthesis (H), WS(pl). Bn restricted to is a I.,seshoe map, in Specifically, we use the following facts about such maps: (i) W2,n* Bn The saddle point of in fn which stays in (ii) fn(Bn) and Pn Wl,n* = be the connected Zomponent B may wind around a lot, we let of f-n+k(V) n W O - sufficiently large, n For such W. q W - :O,ae2] x [qo-BE, qo+8E]. (cc2 qo ± Be).) across (x-pc) o, Y, for some positive constants Now let (Oq )2 fk f c > 0 and for some p +E3 x [0,6] 0 -E, 0- WU(pl). there is a rectangular neighbor- Furthermore, (H). and Wu(pl) ard WS(pl) and (O,qO ) = and WU(pl) in (po,O) there exist points Specifically, hood WS(p I ) There is a non-degenerate tangency of (iii) intersect in two components, is contained in Furthermore, W1, Pn n Wl,n and and is the only fixed is the only point in Wl, W1 ,n under all forward and backward iterdtes of The only points which stay in (respectively, backward) iterates of WU(Pn)). 8 fn WI,n are in under all forward WS(Pn) (resp., n fn. (O,q0 ) Figure 6 Captionf Figure 6 illustrates definitions used in the proof of the Lemma. We arg~ue_ that 'the s table-,manifold of thriough Pn, (see' Fig,. !;Yt It, is edasily, seen-that-f( f ~~) ecursively,j le t Then L 'C L1 -and 1 Let Lb ph extends .(verttcaly,), O~e any horftontal segmaeftt in SB . is a parabol-a- which extenlds, through 0 n- w in 1= fn(~l for ,23,.. i a: -seq'uence of nested,,intervals with Lis n,~ I length(L1 Y( 14len~gth(L.) Hene 'Since f m(z0 ) is 16,nW f6r all Is one- point, call, 'it M > 1, t z. must* be, inW(P) This irgUmenit s-hows that Ws(p ) intersects the top and-bottom of and first leaves iterates of fn(Bn) I Bn f-1) thi'ough these sides. extends through the horseshoe first leaving the horseshoe through the "Ifeet"., (See Fig. 7). In order to prove ~that show that the horseshoe B Bn+1 Wu(pn) shows that A similar argument (using Spaaboa fn(Bn) since Xp < 1. (O,%,) we need to crosses through (Ovq 0 ) to n1 to the vertex of the right f Fn(B ), as shown in Fig. 8. It is easily seen by our assumptions dn YvVk(q,+Bc) Ws(Pn+l)e containing WUp) P be the distance from bundry intersects Let Q be the distance from (see Fig. 8). and let Wu (p) f that Q We conclude that - (n+l)+k(p0 .c) and P QC a 9. Y 00 n P - 0 as n n Figure 7 Caption Figure 7 shows parts of the stable and unstable manifolds of the simple Nfewhouse saddle pn. Pni I.I I.~ (On) I I sn+i ~Figure 8 CaPtiofl used in the proof of the Lemma. Figure 8 illustrates definitionls ------ REFERENCES [GH] J. Guckenheimer and P. Holmes, Nonlinear Oscillations, Dynamical Systems, anc Bifurcation of Vector Fields, Berlin, Heidelberg, New York: Springer, 1983. [GOY] C. Grebogi, E. Ott, and J. Yorke, Phys. Rev. Lett. 56 (1986), 1011. CGS] N. Gavrilov and L. Silnikov, "On three-dimensional dynamical systems close to systems with structurally unstable homoclinic curve I," Math. USSR Sbornik 17 (1972),.467-485; and part II, Math. USSR Sbornik 19 (1973), 139-156. [HJI S. Hammel and C. Jones, personal communication; and "A dissipative map of the plane--a model for optical bistability," Doctoral E ssertation by S. Hammel, University of Arizona, 1986. EN) S. Newhouse, "Diffeomorphisms with infinitely many sinks," Topology 13 (1974), 9-18. ER) C. Robinson, "Bifurcation to infinitely many sinks," Commun. Math. Phys. 90 (1983), 433-459. CS] S. Smale, "Differentiable dynamical systems," Bull, Am. Math. Soc. 73 (1967), 747-817. [TY) L. Tedeschini-Lalli and J. Yorke, "How often do simple dynamical processes have infinitely many sinks?" 10 preprint The Analysis of Experimental Data Using Time-Delay Embedding Methods Eric J. Kostelich* James A. Yorke Institute for Physical Science and Technology University of Maryland College Park, Maryland 20742 January 30, 1989 Abstract The time delay embedding method provides a powerful tool for the analysis of experimental data, including a novei method for noise reduction. In addition, we describe how the method allows experimentalists to use many of the same techniques that have been essential for the analysis of nonlinear systems of ordinary differential equations and difference equations. 1 Introduction Numerical computation and computer graphics have been essential tools for investigating the behavior of nonlinear maps and differential equations. The pioneering work of Lorenz [24] was made possible by numerical integration on a computer, allowing him to take nearby pairs of initial conditions and compare the trajectories. H~non [23] discovered the complex dynamics of his celebrated quadratic map with the aid of a programmable calculator. A 'Mailing address: Center for Nonlinear Dynamics, Department of Physics, University of Texas, Austin, Texas 78712 "l1 variety of classical and modem techniques has been exploited to find periodic orbits; their stable and unstable manifolds [20]; basins of attraction [251; fractal dimension [261; and Lyapunov exponents [17, 29, 351. In some cases, numerical methods can establish rigorously the existence of initial conditions whose trajectories have essentially the same intricate structure that one sees on a computer screen [11]. Unfortunately, until now experimentalists have not been able to apply most of these methods to the analysis of experimental data, since they do not in general have explicit equations to model the behavior of their apparatus. In cases where it is possible to find accurate models of the physical system, quantitative predictions about the behavior of actual experiments are possible [22]. However, all that is available in a typical experiment is the time dependent output (e.g. voltage) from one or more probes, which is a function of the dynamics. Until recently, power spectra have been the principal method for analyzing such data. For instance, Fenstermacher et a]. [19] relied heavily on power spectra to detect transitions from periodic to weakly turbulent flow between concentric rotating cylinders. However, Fourier analysis alone is inadequate for describing the dynamics. Other methods have been used to analyze time series output from dynamical systems. For instance, Lorenz [24] used next amplitude maps to describe some features of the dynamics; that is, he plotted z,+, against z, where z,, is the nth relative maximum of the third coordinate of the numerically calculated solution. Such maps are often useful, not only for investigating features of the Lorenz attractor [30], but also for instance in experiments on intermittency in oscillating chemical reactions [28]. In the past several years, the so-called embedding method has come into common use as a way of reconstructing an attractor from a time series of experimental data. In this approach, one supposes that the dynamical behavior is governed by a solution traveling along an attractor' (which is not observable directly). However, one assumes that there is a smooth function which maps points on the attractor to real numbers (which are the experimental measurements). In the embedding method, one generates a set of m-dimensional points whose coordinates are values in the time series separated by a constant delay [9]. For example, when m = 3, the reconstructed attractor is the set of points {X = (Si, si+,, -+2r)} where -ris the time delay. 'Existing numerical methods require the attractor to be low dimensional. 2 Takens [32] has shown that under suitable hypotheses, this procedure yields a set of points which is equivalent to points on the original attractor. The earliest applications of the embedding method may be called static in that the analysis focuses on the geometric properties of the set of points on the reconstructed attractor. For example, phase portraits and Poincar6 sections are used in [41 to help determine the transition between quasiperiodic and chaotic flow in a Couette-Taylor experiment. Another important static method is the estimation of attractor dimension from experimental data. for which there is a large literature [26]. In addition, various information theoretic notions can be used to find good choices of embedding dimension and time delay [21]. Certain recent applications of the embedding method are quite different in nature and can be called dyzamic in that information about the dynamics is stored in the computer for analysis. With each data vector xi, one stores the "next" vector, for example, xi+s for some S > 0. This makes it possible to compute a linear approximation of the dynamics in a neighborhood of x,, assuming that there is a low dimensional dynamical system underlying the data.2 In particular, a linear approximation provides an estimate of the Jacobian of the map at xi [9]. Eckmann and Ruelle [17] use linear maps computed in this way to integrate a set of variational equations and find the 3 positive Lyapunov exponents. In fact, the embedding method provides a powerful set of tools for analyzing the dynamics, the breadth of which may not have been realized by Eckmann and Ruelle. In this paper, we discuss two novel applications that are possible, specifically: * Noise reduction. Since one can approximate the dynamics at each point, it becomes possible to identify and correct inaccuracies in trajectories arising from errors in the original time series. Numerical evidence suggests that the noise reduction procedure described below improves the accuracy of other analyses, such as Lyapunov exponents and dimension calculations. I Simplicial approximations. Linear approximations can be computed at each point on a grid in a neighborhood of the attractor to 2This 3 Wolf material was first presented by D. Ruelle at a Nobel symposium in June 1984? et a]. [35] have proposed a different method in which nearby pairs of points are followed to estimate the largest Lyapunov exponent. 3 form a simplicial approximation of the dynamical system. This can be used to locate unstable periodic orbits near the attractor. We begin with a description of noise red iction in the next section. 2 Noise Reduction The ability to extract information from time varying signals is limited by the presence of noise. Recent experiments to study the transition to turbulence in systems far from equilibrium, like those by Fenstermacher et al. [19], Behringer and Ahlers [1], and Libchaber et al. [161, succeeded largely because of instrumentation that enabled them to quantify and reduce the noise. However, it is often expensive and time consuming to redesign experimental apparatus to improve the signal to noise ratio. In cases where the time series can be viewed as a dynamical system with a low dimensional attractor, the time delay embedding method can be exploited to correct errors in trajectories that result from noise. This is done in two steps once an embedding dimension m and a time delay r have been fixed. In the first step, we consider the motion of an ensemble of points in a small neighborhood of each point on the attractor in order to compute a linear approximation of the dynamics there. In the second step, we use these approximations to consider how well an individual trajectory obeys them. That is, we ask how the observed trajectory can be perturbed slightly to yield a new trajectory that satisfies the linear maps better. The trajectory adjustment is done in such a way that a new time series is output whose dynamics are more consistent with those on the phase space attractor. This approach is fundamentally different from traditional noise reduction methods. Because we consider the motion of points on a phase space attractor, we are using information in the original signal that is not localized in a time or frequency domain. Points which are close in phase space correspond to data which in general are widely and irregularly spaced in time, due to the sensitive dependence on initial conditions on chaotic attractors. In contrast, Kalman [3) and similar filters examine data which are closely spaced in time; Wiener [27] filters operate in the irequency domain. 4 Eckmann-Ruelle linearization 3 I. The discrete sampling of the original signal means that the points on the reconstructed attractor can be treated as iterates of a nonlinear map f whose exact form is unknown. We assume that f is nearly linear in a small neighborhood of each attractor point x and write f(x) Ax + b L(x) for some m x m matrix A and m-vector b. (The matrix A is the Jacobian of f at x.) This approximation, which we call the Eckmann-Ruelle linearization at x, can be computed with least squares methods similar to those described in [9, 17]. Given a reference point xrf, let {xi}!' 1 be a collection of the n points which are closest to Xref. With each point xi we store the next point (i.e., the image of xi), denoted y,. 4 The kth row ak of A and the kth component bk of 6 dre given by the least squares solution of the equation yk = bk + ak. x, (1) where yk is the kth component of y and the dot denotes the dot product. Figure 1 illustrates the idea.5 We mention three difficulties in computing the local linear approximations in the subsections below. 3.1 Il conditioned least squares There is a particular problem when one tries to compute solutions to Eq. 1 with a finite data set of limited accuracy that has not been addressed in previous papers [17, 291. Suppose for example that all the points in a neighborhood of x~f lie nearly along a single line, i.e., the attractor appears one dimensional within the available resolution. Although it is possible to measure the expansion along the unstable manifold at Xref, there are not enough 4The Icompute points xi are points on the attractor which are not consecutive in time. The subscript i merely enumerates all the points on the attractor contained within a small distance e of xr. In this notation, xi and yi are consecutive in time. 5 Farmer and Sidorowich [18] observe that the Eckmann-Ruelle linearization can be used for prediction. Given a reference point xi, find the Eckmann-Ruelle linearization Aix+ bi, xi+= Aix, + bi, and repeat the process to get the predicted trajectory f(x)=Ax+b Figure 1: Schematic diagram for the first stage of the noise reduction method. A collection of points in an e.ball about the reference point Xref is used to find a linear approximation of the dynamics there. 6 I points in other directions to measure the contraction. Hence it is not possible to compute a 2 x 2 Jacobian matrix accurately. Any attempt to do so will result in an estimate of the Jacobian whose elements have large relative errors. This kind of least squares problem is ill conditioned. The ill conditioning can be avoided by changing coordinatei so that the first vector in the new basis points in the unstable direction. 6 A one dimensional approximation of the dynamics is computed using the new coordinates; that is, we approximate the dynamics only along the unstable manifold. We recover the matrix A by changing coordinates back to the original basis. For example, if we are working in the plane and the unstable direction is the line y = x, then we rotate the coordinate axes by 45 degrees. The dynamics are approximated by a one-dimensional linear map computed along the line y = x. Then we rotate back to the original coordinates. (The resulting matrix A has rank 1 in this example.) This approach substantially enhances the robustness of the numerical procedure. 3.2 3 Sbox Finding nearest neighbors A second problem is finding an efficient way to locate all of the points closest to a given reference point. The dynamical embedding method imposes stringent requirements on any nearest-neighbor algorithm. The storage overhead for the corresponding data structures must be small, because there are tens of thousands of attractor points. The algorithm must be fast, since there is one nearest-neighbor problem for each linear map to be computed. We solve this problem by partitioning the phase space into a grid of boxes that is parallel to the coordinate axes. Each coordinate axis is divided into B intervals. (Figure 2 illustrates the grid in two dimensions.) Each point on the attractor is assigned a box number according to its coordinates. For example, a point on the plane whose first coordinate falls in the jth interval (counting from 0) along the x axis and whose second coordinate falls in the kth interval along the y axis is assigned to box number kB + j. The list of numbers is sorted, carrying along a pointer to the original data point. Given a reference point x, its box number is found using the above formula. A binary search in the list of box numbers then locates the address Uf X ef 6This is done by computing the right singular vectors [81 of the n x m matrix whose jth row is xi. 7 I . ..I• m• • • B'-:B B2-B+1 B 2 -B+2 B B+1 B+2 0 1 2 .... I 2B-1 2. ... B-i Figure 2: Box numbering scheme in 2 dimensions. The attractor is normalized to fit in the unit square. The bottom row of boxes rests against the x axis and the leftmost row of boxes against the y axis. and all the other points in the same box number. The search is extended if necessary to adjacent boxes. Only a crude partition is needed for this algorithm to work efficiently (typically we choose B = 40), and the grid is extended only to the first three coordinate axes. When the embeddi. g dimension is larger than three, a preliminary list of nearest neighbors is obtained using only the first three coordinates of each attractor point. The final list is extracted by computing the distances from x,,f to each point in the preliminary list. Although there are circumstances where this algorithm can perform poorly (e.g., when most of the attractor points are concentrated in a handful of boxes), the distribution of points on typical attractors is sufficiently uniform that the running time is very fast. Memory use is also efficient: a set of N attractor points requires 3N storage locations. In contrast, the tree-search algorithm advocated in [181 requires several times more storage (although the lookup time is probably slightly less). Because N 1i05 in typical applications, we believe that the box-grid approach (or some variant) is itae most practical. A survey of other nearest-neighbor algorithms is given in [2]. 3.3 Errors in variables There is a potential difficulty in the use of ordinary least squares to compute the linear maps. In the usual statistical problem of fitting a straight line, one has observations (xi, yi) where xi is known exactly and yi is measured. One assumes that yi = ao + alix + ei, where the ei are independent errors drawn from the same normal distribution. (Analogous assumptions hold 8 3 in the multivariate case.) In the present situation, however, both xi and yi are measured with error. It caii be shown that ordinary least squares produces biased estimates of the parameters ao and a, in this case [15, 10]. In practice this does not seem to be a serious problem, but statistical procedures to handle this situation (the so-c.dlled "errors in variables" methods) may provide an alternative approach to noise reduction. We consider this question in the appendix. 4 Trajectory Adjustment by Minimizing Self Inconsistency The Eckmann-Ruelle linearization procedure described above is computed and the resulting maps are stored for a sequence of reference points along a given trajectory (for the results quoted here, the sequence usually contains 24 Points). We now consider how to perturb this trajectory so that it is more consistent with the dynamics. The objective is to choose a new sequence of points *i to minimize the sum of squares wl - xill + 11*i - L-1(*i-1)l + ilI*+i - Li(*ki)I 2 (2) where L(x,) = Aixi + bi, w is a weighting factor, and the sum runs over all the points along the trajectory. Equation 2 can be solved using least squares. Heuristically, Eq. 2 measures the self-inconsistency of the data, assuming that the linear approximations of the dynamics are accurate. See Fig. 3. We say the new sequence {A} is more self consistent. The trajectory adjustment can be iterated. That is, once a new trajectory ki has been found, one can replace each xi in Eq. 2 by ki and compute a new sequerce {x*}. We place an upper limit on the distance a point can move. Points which seem to require especially large adjustments can be flagged and output unchanged. (This may be necessary if the input time series contains large 7 ,n the results described in this paper, the Eckmann-Ruelle linearization procedure is done using a collection of points within a radius of 1-6% of the each reference point, depending on the embedding dimension, the dimension of the attractor, and the number of attractor points. This results in collections of 50-200 points per ball, which gives reasonably accurate map approximations without making the computer program too slow. The weighting factor w is set to 1. 9 Figure 3: Schematic diagram of the trajectory adjustment procedure. The trajectory defined by the sequence {x,} is perturbed to a new trajectory given by {*i} which is more consistent with the dynamics. In this example the dashed line shows what the perturbed trajectory might look like if the dynamics were approximately horizontal translation to the right. "glitches" or if nonlinearities are significant over small distances in certain regions of the attractor.) When the input is a time series, we modify the above procedure slightly since we require a time series as output. The trajectory adjustment is done so that changes to the coordinates of xi (corresponding to particular time series values) are made consistently for all subsequent points whose coordinates are the same time series values. For example, suppose the time delay is 1 and the embedding dimension is 2. Then trajectories are perturbed so that the second coordinate of the ith point is the same as the first coordinate of the (i + 1)st point. That is, when xi = (.s,si+1) is moved to the point =( we require that the first coordinate of *i+l be ii+1. 10 5 Results using experimental data aof ! 3 U 3 3 '3 We note that the attractor need not be chaotic for this noise reduction procedure to be effective. Fig. 4(a) shows a phase portrait of noisy measurements wavy vortex flow in a Couette-Taylor experiment [12]. This flow is periodic, so the attractor is a limit cycle (widened into a band because of the noise) and the power spectrum consists of one fundam"rental frequency and its harmonics above a noise floor. See Fig. 4(b). Figures 4(c)-(d) show the same data after noise reduction. The noise reduction procedure makes the limit cycle much narrower, and the noise floor in the power spectrum is reduced by almost two orders of magnitude. However, no power is subtracted from any of the fundamental frequencies, and in fact some harmonics are revealed which previously were obscured by the noise. These results are significantly different from those obtained by low pass filteriag. Figure 4(e)-(f) shows the phase portrait and power spectrum when the original data are passed through a 12th-order Butterworth filter with a cutoff frequency of 0.35. Th,. dynamical noise reduction procedure is more effective than low pass filtering since the noise appears to hwave a broad spectrum. However, the method appeaito subtract power from a mode whose fundamental frequency is approximately 0.3 times the Nyquist frequency. We do not know exactly why this occurs. However, this peak corresponds to the rotation frequency of the inner cylinder and may result from a defect in the Couette-Taylor apparatus [31]. We do not consider this to be a serious problem, because the power associated with this mode is several orders of magnitude smaller than that of the wavy vortex flow. We emphasize that our objective is to find a simple dynamical system that is consistent with the data. It is possible for this method to eliminate certain dynamical behavior from an attractor if those dynamics have small amplitude. This situation is most likely to arise when there are not enough data to distinguish such dynamics from random noise. In the present example, the noise reduction procedure reveals the limit cycle behavior quite well. 8 The results obtained by applying the method to chaotic data from the "We have not attempted to find the smallest amplitude at which the noise reduction procedure can distinguish quasiperiodic from periodic flow. I 1 11 (b) 3 2 0 -2 L -3 -4 (C)3 (d) 1 1 -2 0 -2-1 -4 I o-2 -2 0 -2 -4 0 0.1 0.2 0.3 0.4 0,5 Figure 4: Phase portraits and power spectra for measurements of wavy vortex flow in a Couette-Taylor experiment. (a)-(b) Phase portrait and power spectrum before noise reduction is applied: ()-(d) after noise reduction; (e)(f) after a low pass filter is applied to che original data. The vertical axis in (b), (d) and (f) is the base-10 logarithm of the power spectral density; the horizontal axis is in multiples of the Nyquist frequency. 12 U Scesses Couette-Taylor fluid flow experiment described in (4] are shown in Fig. 5. Figure 5(a) shows a two dimensional phase portrait of the raw time series at a Reynolds number R/& = 12.9, which corresponds to weakly chaotic flow [4]. The corresponding phase portrait from the filtered time series is shown in Fig. 5(b). Figs. 5(c)-(d) show the power spectra for the corresponding time series. 9 It is difficult to estimate how much noise is removed from the data in this example on the basis of power spectra. One problem is that the transition from quasiperiodic to weakly chaotic fluid flow is marked by a sudden rise in the noise floor in the power spectrum (cf. Fig. 3 in (4]). Hence one cannot determine how much of the noise floor is due to deterministic chaos and how much results from broadband noise. The noise reduction procedure described here has the effect of reducing the power in the high frequency components of the signal. One question therefore is whether reducing the high-frequency noise corresponds to discovering the true dynamics which have been masked by noise. We believe that the answer is yes, based on those cases where there is an underlying low-dimensional dynamical system. However, in chaotic prosome high-frequency components remain, because they are appropriate to the dynamics. 6 * Numerical Experiments on Noise Reduction One important question is how much noise this method removes from the data. The power spectra above suggest that the method eliminates most of the noise, but it is impossible to give a precise estimate for typical experimental data. However, the H1non map [23] provides a convenient way to quantify the noise reduction, because it can be written as a time delay map of the form (3) + #i-I. XI+1= f(z, xi- 1 ) = 1 -X We use Eq. 3 to generate a time series as follows (with the standard parameter values a = 1.4, P = 0.3). We choose an initial condition and discard the 9The time series consists of 32,768 values, from which an attractor is reconstructed in four dimensions. Linear maps are computed using 50-100 points in each ball. Trajectories are fitted using sequences of 24 points. 13 (a) (b) 2 I, 0 -2 .3 .4 )(d) 2 -2 -3 -4 0 8 16 24 32 Figure 5: Phase portraits and power spectra for measurements of weakly chaotic flow in a Couette-Taylor experiment. (a)-(b) Phase portrait and power spectrum before noise reduction is applied; (c)-(d) after noise reduction. The units for the power spectrum plots are the same as those in [4]. 14 40 :1 3i U 3 *I first 100 iterates. The next 32,768 iterates are stored, and a time series is generated by adding a uniformly distributed random number to each iterate. This simulates a time series with measurementnoise, i.e., a time serics where noise results from errors in measuring the signal, not from perturbations of the dynamics. We measure the improvement in the signal after processing by considering the pointwise errorei = lxzi+ -f(xi, zi-t)I, i.e., the distance between the observed image and the predicted one. Let the mean error be E = (F e /V)I1 2 , the rms value of the pointwise error over all N points on the attractor. We define the noise reduction as R = 1- Efitd/Eno1 , where the mean errors are computed for the adjusted and original noisy time series, respectively. The quantity R is a measure of the self-consistency of the time series. (In other words, R measures how much better on the average the output attractor obeys Eq. 3 as one hops from point to point.) When 1% noise is added to the input as described above, the noise reduction (measured with the actual map) is 79%.1o Nearly identical results are obtained wnen the input contains only 0.1% noise. In addition, noise levels can be redtced almost as much in cases where the noise is added to the dynamics, i.e., where the input is of the form {xi+,: xi+1 = f(xi + i7i, xi-.i + i.i_i), mi, ?1i_ random}. When the program is run on noiseless input, the mean error in the output is 0.025% of the attractor extent, which suggests that errors arising from small nonlinearities are negligible when the input contains enough points. 7 I Simplicial Approximations of Dynamical Systems Recent work has shown that simplicial approximations of dynamical systems can reproduce the behavior of the original system to high accuracy [341. (See also [33] for a bilinear approach.) In particular, the fractal structure of the original attractors and basin boundaries is preserved over many scales. Such approximations can yield significant computational savings, especially when the original system consists of ordinary differential equations. 1The pointwise error is measured using Eq. 3. However, the attractor can Le embedded in more than two dimensions when performing the noise reduction. 3 I I 1 15 aa• • •a This approach can be extended in a natural way to generate simplicial approximations of the dynamics on attractors reconstructed from experimental data. Our objective here is to find an approximate dynamical system in a neighborhood of the attractor as follows. A simplex in an m dimensional space is a triangle with m + 1 vertices. Suppose the map is known at each point on a grid. Then there is a unique way to extend the map linearly to the interior of the simplex S whose vertices are grid points. Given a point P in the interior of S, let {bi}!= be its corresponding barycentriccoordinates (see [34] for an algorithm to compute them). Let f(vi) be the map at the ith vertex. The dynamical system at P is iterated by computing m D(P) = _bif(vi). i=o (4) We apply this method to experimental data by finding a linear approximation of Lae dynamics at each vertex vi with the least squares method described above, using a collection of points in a small ball around vi. The maps are stored and retrieved using a hashing algorithm similar to that described in [34]. This yields a piecewise linear approximation of the aynamics from a set of experimental data which can be analyzed with the methods 11 that previously were available only to theorists. We illustrate the approach using a time series of 32,768 values from the H~non map with a = 1.2, 3 = 0.3 using Eq. 3 and adding 0.1% noise as described above. The original attractor is shown in Fig. 6(a). We take a grid of points which are spaced at 1% intervals (this and subsequent distances are expressed as a fraction of the original attractor extent). The time series is embedded in two dimensions, and a linear approximation of the dynamics is computed at each grid point for which 50 or more attractor points can be collected with a ball of radius 0.03; the set of such grid points is shown in Fig. 6(b). We take an initial condition near the original attractor and show the first 3000 iterates using Eq. 4 in Fig. 6(c). Although some defects are visible, the attractor produced by the approximate dynamical system looks almost identical to the original one. "This approach is leps ambitious than that of Crutchfield and McNamara [7], who attempt to find a single set of nonlinear difference equations that creates the observed attractor. 16 3(a) (b) ...... ... ....... / //N ,r... N -I7'M M-1 ... ...... .......... N 3~ ~~~ ........... .____________ ~~~~~~ ______ ....... ..... ..... MMM .1 3 Figure 6: (a) H~non attractor computed from Eq. 3 with a 1.2, i3=0.3. (b) 1%grid on which linear approximations of the dynamnics are computed from the available attractor points. (c) Attractor produced by the simplicial approximations. 17 period D=2 exact I D=3 1 1.793 1.695 1.757 2 2.178 2.199 2.183 4 4.226 4.329 4.051 6 10.38 10.70 9.626 6 10.38 11.32 12.12 8 25.80 24.88 30.25 8 20.02 20.60 20.38 8 17.70 24.32 21.70 Table 1. The largest eigenvalues of the Jacobian of the periodic orbits located using the simplicial approximation of the H6non attractor. One application of simplicial approximations is the location of periodic saddles and the estimation of the largest eigenvalue of the corresponding Jacobian. That is, if x is a periodic point of period p, then we find the eigenvw' e of DfP(x) of largest modulus, where DfP(x) refers to the matrix of paftia, derivatives of the pth iterate of the map f evaluated at x. Given an initial guess for x, one can apply Newton's method using the maps computed at the grid points and Eq. 4 to locate the'saddle using the simplicial approximations. Likewise, Eq. 3 can be used to locate the corresponding "exact" saddle. Saddle orbits up to period: 8 have been computed in this way. In all cases, the saddle point for the simplicial approximation is within 2% of the corresponding saddle point for the H6non map. Table 1 shows the largest eigenvalues of the saddle orbits. (The columns labeled m = 2 and m = 3 refer to the embedding dimension used to reconstruct the attractor.) In most cases, the relative error is only a few percent, and in no case exceeds 25%. (The largest relative error is for the period 8 saddles, where one finds the eigenvailue of the product of 8 Jacobians computed from the least squares.) This method can be extended to experimental data sets. However, there are relatively stringent requirements on the data that can be handled: the time series must be long enough to trace out many trajectories near the principal unstable saddle orbits, and the noise level must be low. (Presumably noisy data can be preprocessed using the approach described in Section 3.) 18 ii 13 3 The current computer implementation uses a large amount of disk space to store the linear map approximations at the grid points. We have constructed a simplicial approximation for an attractor obtained from a Belousov-Zhabotinskii chemical reaction [6, 28]. The attractor is reconstructed in three dimensions from a set of 32,768 measurements of bromide ion concentration. The phase portrait is shown in 7(a). Linear approximations of the dynamics are computed at each point of a grid consisting of 50 intervals along each coordinate axis for which 50 or more attractor points can be located within an 8% radius of the grid point. This produces a database of 59,550 maps. We observe from graphical evidence that many trajectories approach what appears to be a period 3 saddle in the middle of the attractor. Using initial guesses from some of the trajectories, we apply Newton's method to locate the saddle orbit shown in Fig. 7(b). Moreover, we obtain estimates of the Jacobian DF of the map evaluated ac a point on saddle orbit. The eigenvalues of DF are estimated as A1 = 1.14, A2 = 0.102, and A3 = -1.53. These quantitative results confirm that the * orbit is a saddle since A1 > 0 > A3 . (Note that wie expects A2 = 0 for a how generated from a set of differential equations.) * 8 3 Methods for approximating the dynamics of attractors reconstructed from experimental data provide powerful tools. Most of the same procedures that have been so important for theoretical insight, such as Poincar6 maps, unstable fixed points and their manifolds, basin boundaries, and the like, are now available to experimenters, at least in cases where the dynamics are low dimensional. There is little doubt that these tools will lead to breakthroughs in the understanding of a wide variety of physical systems. However, considerable effort is needed before we learn which kinds of systems will benefit Conclusion most from these types of analyses. Significant improvements in technique will certainly extend the applicability of dynamical embedding methods, for example to higher dimensional attractors. 1 19 (a)) Figure 7: (a) The attractor reconstructed from a time series of bromide ion concentrations in a Belousov-Zhabotinskii chemical reaction. (b) The period 3 saddle orbit. 20 I n I g 3 3 I Appendix In this appendix we outline a possible alternative noise reduction method based on the theory of least squares when all the quantities in the regression are measured with error. In ordinary least squares, the variables in the problem fall into two classes: the independent variables, which are known exactly, and the dependent variables, which are observations assumed to be functions of the independent variables. The dependent variables are subject to random errors that are assumed independent and identically distributed (i.i.d.). On an attractor reconstructed from experimental data, we assume that the mapping which takes points in a sufficiently small ball to their images is approximately linear. However, the locations of all the points are subject to small random errors because of the noise. Hence one cannot describe the points as independent variables and their images as dependent variables. The usual least squ-es method produces a biased estimate of the linear map, and this bias does not decrease if more observations are added [15, 10]. The so-called "errors in variables" least squares methods can be used to handle the latter problem. This approach can be used to obtain both an estimate of the linear map as well as estimates of the "true" values of each of the observations. At first this appears to be an underdetermined problem: from n pairs of observations one wants to compute the parameters of the functional relation between them as well as estimates of the n actual pairs. 12 However, it is possible to solve this problem by making some assumptions about the errors [15, 10]. In our case, we assume that the errors in the location of each point and its image are i.i.d. In particular, we let the covariance matrix of the errors in the variables be the identity matrix. This assumption is valid whenever 13 the noise is independent of the dynamics. We illustrate the procedure for the case where we are given a collection of n points (in Rm ) and their images. Following Jefferys [131, we form a set 12 1n the statistical literature, the problem is said to be unidencified. 13Dynamical noise (i.e., each point is perturbed slightly before iterating) yields a covariance matrix which depends on the point. However, as long as the dynamical noise is small, our assumptions about the covariance matrix of the errors should not compromise the accuracy of the method. 21 of n equations of condition given by fi(xi) = x,+i - Ax - bi = x,+/ - L(x) (5) where xi is the ith point, x,,+i is its observed image, A is an m x m matrix, and b is an m-vector. The goal is to find estimates of L (i.e., A and b), together with perturbations ',, such that fi(xi + i)= (X,+1 + i+O- L(xi + =0 and such that the quadratic form 1' -i 1 v SO0 = (6) is minimized. The superscript t denotes transpose and a is the covariance matrix of the observations (which we assume is the identity matrix here). Th'i minimization problem can be solved using Lagrange multiplies (see [13] and [14] for a numerical algorithm). The solution gives A and b together with estimates xi + ,iof the "true" observations. It can be shown [10] under fairly mild hypotheses that the estimates of L and the observations are the best in the class of linear estimators. One way to approach noise reduction is to extend Eq. 5 to include several iterations of the observed points. Given a collection of points in a ball, together with the next p iterates of each point, the method above is used to find a collection of linear maps L1 , L 2 , ... , Lp approximating the dynamics. The method also finds estimates of the actual observations. In this approach, therefore, the calculation of the maps and the adjustmenc of the trajectories is done in one step. Moreover, each point and its image exactly satisfy a linear relationship. Of course, p cannot be too large, because nonlinear effects eventually will become significant when the dynamics are chaotic. On the other hand, Eq. 5 p-ovides a natural way to include quadratic or other nonlinear terms. We have written a computer program to implement this alternative noise reduction algorithni. So far, the results of this approach have not been as good as those from the method described in the main part of the paper, but further refinement should improve them. 22 Acknowledgments Dan Lathrop provided invaluable assistance in finding periodic orbits in the Hnon and BZ attractors. We thank Bill Jefferys for useful discussions and computer software for the errors in variables least squares problem. Andy Fraser, Randy Tagg and Harry Swinney all provided helpful suggestions. This research is supported by the Applied and Computational Mathematics Program of the Defense Advanced Research Projects Agency (DARPA-ACMP) and by the Department of Energy Office of Basic Energy Sciences. References [1] R. P. Behringer and G. Ahlers, J. Fluid Mech. 125 (1982), 219; G. Ahlers and R. P. Behringer, Phys. Rev. Lett. 40 (1978), 712. (2] J. L. Bentley and J. H. Friedman, ACM Comput. Surv. 11 (1979), 397. [3] See for example S. M. Bozic, Digital and Kalman Filtering (London: Edward Arnold Publishers Ltd., 1979). [4] A. Brandsti.ter and H. L. Swinney, Phys. Rev. A 35 (1987), 2207. (5] M. Casdagli, "Nonlinear Prediction of Chaotic Time Series," preprint (Dec. 1987). (6] K. C. Coffman, Ph.D. thesis, University of Texas at Austin, 1987. [7] J. P. Crutchfield and B. McNamara, Complex Systems 1 (1987), 417. (8] J. J. Dongarra, C. B. Moler, J. R. Bunch, and G. W. Stewart, LINPACK User's Guide (Philadelphia: Society for Industrial and Applied Mathematics, 1979). (9] J.-P. Eckmann and D. Ruelle, Rev. Mod. Phys. 57 (1985), 617. (10] W. A. Fuller, Measurement Error Models (New York: John Wiley -Sons, 1987). [11] S. M. Hammel, J. A. Yorke, and C. Grebogi, J. of Complexity 3 (1987), 136; ibid., Bull. Amer. Math. Soc. 19 (1988), 465. 23 I [12] D. Hirst, Ph.D. dissertation, University of Texas, Dec. 1987. (13] W. H. Jefferys, Astron. J. 85 (1980), 177. [14] W. H. Jefferys, Astron. J. 86 (1981), 149. [15] M. G. Kendall and A. Stuart, The Advanced Theory of Statistics, Vol. 2 (London: Charles Griffin & Company Limited, 1961), p. 375. [16] A. Libchaber, S. Fauve, and C. Laroche, Physica D 7 (1983), 73. (17] J.-P. Eckmann, S. 0. Kamphorst, D. Ruelle and S. Ciliberto, Phys. Rev. A 34 (1986), 4971. (18] J. D. Farmer and J. J. Sidorowich, Phys. Rev. Lett. 59 (1987), 845. [19] P. R. Fenstermacher, H. L. Swinney, and J. P. Gollub, J. Fluid Mech. 94 (1979), 103. [20] W. Franceschini and L. Russo, J. Stat. Phys. 25 (1981), 757. [21] A. Fraser and H. L. Swinney, Phys. Rev. A 34 (1986), 1134. [22] E. G. Gwinn and R. M. Westervelt, Phys. Rev. A 33 (1986), 4143. (23] M. H6non, Comm. Math. Phys. 50 (1976), 69. [24] E. N. Lorenz, J. Atmos. Sci. 20 (1963), 130. [25] S. W. MacDonald, C. Grebogi, E. Ott and J. A. Yorke, Physica D 17 (1985), 125. [26] For example, see the papers in Dimensions and Entropies in Chaotic Systems, ed. by G. Mayer-Kress (Berlin: Springer-Verlag, 1986), and references therein, [27] For example, see L. R. Rabiner and B. Gold, Theory and Application of Digital Signal Processing(Englewood Cliffs, N. J.: Prentice-Hall, 1975). [28] J.-C. Roux, Physica D 7 (1983), 57; J.-C. Roux, R. H. Simoyi, and H. L. Swinney, Physica D 8 (1983), 257. 24 I [29] M. Sano and Y. Sawada, Phys. Rev. Lett. 55 (1985), 1082. [30] C. Sparrow, The Lorenz Equations: Bifurcations, Chaos, and Strange Attractors (New York: Springer-Verlag, 1982). [31] R. Tagg, private communication. [32] F. Takens, in Dynamical Systems and Turbulence, ed. by D. A. Rand and L.-S. Young, Springer Lecture Notes in Mathematics, Vol. 898 (New York: Springer-Verlag, 1980), p. 366. [33] B. H. Tongue, Physica D 28 (1987), 401. [34] F. Varosi, C. Grebogi, and J. A. Yorke, Phys. Lett. A 124 (1987), 59. 1 (35] A. Wolf, J. B. Swift, H. L. Swinney, and J. A. Vastano, Physica D 16 (1985), 285. 2 12 II November 1989 ACCESSIBLE SADDLES ON I' FRACTAL BASIN BOUNDARIES i £by Kathleen T. Alligood1 Department of Mathematics George Mason University Fairfax, VA 22030 5 and James A. Yorke' 2 Department of Mathematics and Institute for Physical Science and Technology University of Maryland College Park, MD 20742 I I I I V 1Research I I. 2 partially funded by a contract Computational Mathematics Program, DARPA. from Research partially funded by a grant from AFOSR. the Applied and ABSTRACT For a homeomorphism of the plane, the basin of attraction of a fixed point attractor is open, connected, and simply-connected, and hence is homeomorphic to an open disk. The basin boundary, however, need not be homeomorphic to a circle. When it is not, it can contain periodic orbits of infinitely many different periods. Certain points on the basin boundary are distinguished by being accessible (by a path) from the interior of the basin. For an orientation-preserving homeomorphism, the accessible boundrv points have a well-defined rotation number. We prove that this rotation number is rational if and only if there are accessible periodic orbits. In particular, if the rotation number is the reduced fraction p/q, then every accessible periodic orbit has minimum period q. In addition, if the periodic orbits are hyperbolic, then every accessible point is on the stable manifold of an accessible periodic point. 1. Introduction and Statement of Main Theorems When a dynamical system has more than one attractor, the boundaries between respective basins of attraction can exhibit very complicated patterns. For invertible maps of the plane, these boundaries can be smooth or fractal, and they can contain infinitely many saddle-type periodic orbits. (By fractal, we mean that the set has non-integer Hausdorff dimension.) Two basins of attraction of the time 2n map of the forced damped pendulum equation are shown in black and white in Figure 1. This picture was constructed by choosing a 960 x 520 grid and, using each grid point as an initial condition, testing where its traject, y goes. The system has two fixed point attractors--one in the white region to which all grid points colored white tend under iteration by the map, and one in the black region to which all grid points colored black tend. The boundary between the black and white basins is fractal, making final state predictability very difficult. In addition, buried within the fractal layers of the boundary are saddle periodic orbits of arbitrarily high periods. Even though the dynamics on the boundary appear to be very complicated, it has been observed (see, for example, [G[)Y I ) that some points on the boundary exhibit regular behavior. p We say that a point )n the boundary of an open set W is accessible from W if there is a path beginning in W such that the path hits. p is the first boundary point which Surprisingly, when the boundary is fractal, most points are not accessible. For the map in Fig. 1, ther: are two points that are saddles of period two (i.e., one period two orbit) which are accessible from the white region, and all other points which are accessible from the white region are on the stable manifold of this periodic orbit. In this paper, we investigate the dynamics of the accessible points on basin boundaries. The paper is strongly motivated by numerical studies that repeatedly conclude there are accessible periodic saddles in the boundary. In fact, we know of no natural case of an area-contracting diffeomorphism having a basin boundary without accessible periodic orbits. We would like to thank J. Mather and H. Nusse for helpful discussions. Throughout this paper, set either in the plane R2 W is a connected, simply-connected open S2 , and or in the sphere F is a homeomorphism (or diffeomorphism, if differentiability is required) of the plane or the sphere. (i.e., F(W) = W). attraction. We assume that W is invariant under Our main examples of such sets will be basins of In particular, the basin of attraction of an attracting fixed point must be such a set. (See Sec. 2.) We assume in addition that Wi is not the entire plane, in which case its boundary more than one point. under F, F, 8W d - a nact: -.. is also an invariant set. Since W 8W is is invariant All connected, simply-connected open sets are homeomorphic to an open disk. On the other hand, the boundary of such a region does not have to be topologically a circle, and examples abound in which the boundary of a basin of attraction is a fractal set. The characterization of a set W I 3 as a topological open disk occurs in the study of the Riemann Mapping Theorem which says that for any such set W there is always a one-to-one analytic map 3, h of an open disk D onto W. The knowledge that the basin is topologically an open disk tells us nothing about the boundary of a basin, and It Is our objective to describe the dynamics on the points in aW that are accessible from W. 3point In the following we say that p is accessible only if it is a of 8W that is accessible from W. Caratheodory [C] investigated the behavior of the map h in the Riemann mapping theorem to see when h could be defined at boundary points of the disk. If r is a (continuous) pph in W which limits on the accessible point p, then h-1 () is a (continuous) path in D limiting on exactly one point r in Si, the boundary of D. We call such points as r trivial circle points; we call all other points on the circle non-trivial circle points. Caratheodory's approach was to construct a compactification of W which is topologically identical to D, the closed disk. (His is not the standard compactification; points in this compactification which correspond to points in the boundary S1 of D are called "prime ends" and are defined precisely in Sec. 5.) We define a map h on points in D and on those points in D that are trivial circle points by hc (x) = h(x) for x in D, and hc (r) = p where p is an accessible point and r is an associated trivial circle point, as defined above. It is clear from the construction that each accessible point is the image of at least one trivial circle point. 3 The map hc is not necessarily one-to-one on trivial circle points. However, once a path r in W (See Sec. 7; in particular, see Fig. 7.) limiting on an accessible point p is specified, then there is exactly one trivial circle point x which is the limit of h-1 (r) We mention two properties of accessible points and the map hc: PROPERTY 1 (DENSITY) The set of accessible points is dense in aW; the set of trivial circle points is dense in S , the boundary of D. PROPERTY 2 (EXISTENCE OF AN INDUCED MAP) There is a map, denoted f and called the induced map, from D to itself such that h cf(x)) = F(h Cx)) c when x is in D or x is a trivial circle point. C If p is an accessible point and r is a path in W ending at p, then F(F) is a path in W ending at F(p). to accessible points. trivial circle points. Hence, accessible points map If follows that f maps trivial circle points to On the set of trivial circle points, f is one-to-one, onto, and order-preserving. Such a map can be uniquely extended to a homeomorphism defined on all of SI. These properties allow us to study the dynamical system on the closed disk, maintaining the dynamics on the accessible points. Since in general aW will include much more than the accessible points, much of aW is lost in this representation. For us, however, the simplification is advantageous since we wish to describe the dynamics 4 on the accessible points. We have important examples in which W is not a basin even though - a dense set of points in W have trajectories tending to an attractor. 'I 3i The following definition allows the inclusion of such examples. We say that aW is unstable In W if there is a neighborhood Bc of aW with the property that the set of points in B leave b B W In n>O.) is dense in such that B xeQ W. (I.e., implies that whose orbits eventually there is a dense set Q in Fn(x) is in W\B for some This definition is easily seen to be satisfied when W is a basin of attraction. It is also satisfied in the very different case where there is a dense orbit in W. 3 Certain types of periodic orbits in S 1 merit particular attention. p 6 S be a periodic point of period k. We say p is Let attracting on at least one side (of SI) that I x * p and lim such The following key theorem is proved in Sec. 5: that for each I x e S1 fnk(x) = p. THEOREM 1.1 (ATTRACTING LEMMA). * if there exists k Assume that aW unstable in W and the fixed points of Fk are isolated. Then each periodic circle point that is attracting on at least one side is a trivial circle point. An orientation preserving homeomorphism of the circle can be classified according to its rotation number--a number p, with I 0 5p <1, which represents the average rotation of points under the map. (A precise definition is given in Sec. 5.) is independent of the choice of point on S I. The rotation number The idea of associating a single rotation number with each orientation preserving Such a homeomorphism of the circle originated with Poincar6. homeomorphism will have a periodic point if and only if its rotation number is rational. It will have a fixed point if and only if its rotation number Is 0. We define the rotation number p(aW,F) of F on the accessible points of aW to be the rotation number of the induced map f on S1 . If W is a connected, simply-connected open set in R , if F Is orientation preserving, and if the closure of W is invariant under F, then W has a rotation number. In particular, if p is an 2 Isolated, attracting fixed point in R , if its basin W is not all of R , and If F is orientation preserving, then aW has a rotation number. (See Sec. 2.) G.D. Birkhoff recognized that the set of accessible points is dense in the boundary of an invariant region and that their dynamics can be characterized by their rotation number. He used this idea in [B] to construct a map of the annulus into itself with an unusual Invariant set J. On one hand, J resembles a closed Jordan curve in that each of its points is on the boundary of both an interior region Sint (conti.ning one boundary circle of the annulus) and an exterior region Sex. tcontaining the other boundary circle). On the other hand, J is "remarkable" in the sense that it contains a dense set of points accessible from Sin t with one rotation number and a dense set accessible from Sex t with a different rotation number. To compare this situation with our hypotheses, notice that such a map has an inverse for which J is unstable (in Sint and in Sext ) and J is the boundary between the points which go outward and those which go inward (under the inverse). Cartwright and Littlewood further developed these ideas in [C-L1], where they prove the existence of and determine the stability of periodic orbits for a certain class of second order differential equations in the plane. More recently, J. Mather" has given purely topological proofs of some of the topological results of Carath~odory in [M1] and has used the theory to study Invariant sets for area-preserving homeomorphisms of the annulus (M2],(M3]. We rely on the proofs in the above references of Cartwright-Littlewood and Mather I for much of the material on prime ends given in Secs. 5, 6, and 7. A general reference for Carath~odory's theory is (C-Lo], Chapter 9. The following argument explains the significance of the Attracting Lemma. Assume that the rotation number of f on S rational (say the reduced fraction p/q). is Then SI will have at least 1 fixed point under fq, (i.e., a periodic point of period q). If a trivial circle point x is not fixed under fq, then its orbit converges to a fixed point r under iterates of fq. is necessarily a trivial circle point. accessible point p on aW. By the Attracting Lemma, r Corresponding to r is an By Property 2, p is fixed under F we have the following result: 7 Thus THEOREM 1.2. Assume that aW is unstable in W and that for each k the fixed points of Fk are isolated. number p(W,F) is p/q (resp., 0). Assume further that the rotation Then there Is an accessible fixed point of Fq (resp., F) on aW. In Sections 6 and 7 we describe the dynamics on the set of accessible points under the hypotheses that p is rational, F is a diffeomorphism, and periodic points in the boundary are hyperbolic. (A periodic point p is hyperbolic if the Jacobian matrix no eigenvalues with absolute value 1.) DF(p) has By the Inverse Function Theorem, a hyperbolic point is isolated from other periodic points of the same period (or smaller period). In the following theorem, which is a special case of Theorem 6.1 in Sec. 6, we assume that W is a basin of attraction: i.e., there exists a compact set K in W such that the "w-limit set" of the orbit of each point x in W is non-empty and is contained in K. (Given a point x, the point z is in the w-limit set of the orbit of x, if there exists a sequence {tn }, with tn --) co, t such that f n(x) -- z.) If the orbit of each point in W is bounded, then there exists a compact set Sec. 2 for definition) [BS]. K' S K which is Liapunov stable (see - 3 I I I I THEOREM 6.1'. Assume that the periodic points of F in aW are hyperbolic and that W is a basin of attraction. If the rotation number p is rational, then every accessible point either is a periodic point or is In the stable manifold of an accessible periodic point. Theorems 1.2 and 6.1' do not mention the minimum period of an accessible periodic orbit. Degeneracies can occur due to the fact that the map hc is not necessarily one-to-one on trivial circle points, so that the period of the accessible points can strictly divide the period of the orbit on S1 . In Sec. 7 we prove that such degeneracies are ruled out for homeomorphisns of the plane, although they can still occur for homeomorphisms of the sphere. following two results. We use the The first, a converse of Theorem 1.2 for planar maps, implies that the period of an accessible periodic point cannot be strictly smaller than the period of a trivial periodic circle point. The second implies that it cannot be strictly larger. PROPOSITION 7.3. Let F be a homeomorphism of the plane R2 . there exists an accessible fixed point on aW, then PROPOSITION 7.4. p(aW,F) If is 0 If p = 0, then every accessible periodic point in aW is a fixed point. 9 COROLLARY 7.5. Let F he a homeomorphism of the plane R2. If Is the reduced fraction p # 0 p/q, then every accessible periodic point in aW has minimum period q. The next corollary (a special case of Cor. 7.6) follows, although not directly, from Prop. 7.3, Prop. 7.4, and Thm. 6.1'. In particular, it remains to be shown that if the orbit of an accessible point converges to a fixed point in aW, then the fixed point is accessible. We point out that this corollary does not mention the rotation number p. COROLLARY 7.6'. Assume the following conditions hold: (1) F is a diffeomorphism of the plane 2; (2) the periodic points of F in aW are hyperbolic; (3) W is a basin of attraction; and (4) either (I) there exists an accessible period point of minin~um period q, or (ii) there exists an accessible point which converges (under fq) to a periodic point of minimum period q. Then every accessible point in aW either is a periodic point of minimum period q or is in the stable manifold of such a periodic point. 10 In Sec. 2 we define a general class of connected, compact attractors and show that attractors in this class have connected, simply-connected basins. In Sec. 3 we study the orientation-reversing case, and in Sec. 4 we apply Theorem 1.2 to a class of chaotic i attractors, viewed as boundaries for the inverse of the map F. Figures 1 through 4 were made using Dynamics [Y]. i i I i i I I i I U I 2. Attractors with Simply-Connected Basins If A is a hyperbolic fixed point, then A has a connected, simply-connected neighborhood which contracts to it under iteration by F. In this case, the entire basin of A (see Sec. 1 for definition) is connected and simply connected. Here we look at a more general class of attractors and show that their basins are connected and simply connected and thus satisfy the hypotheses of Theorems 1.1 and 1.2. (The hypothesis that the boundary aW is unstable in U is trivially satisfied if either the attractor A does not intersect aW or if A has a dense orbit and is not a subset of For a closed set S, let S is the set of points y such that W.) be the c-neighborhood of S; i.e., S min U x - y I < c , where xcS 1111 denotes the Euclidean norm in-R . We say a set A is a regular attractor if A satisfies the following three properties: (2.1) A is compact and connected; (2.2) A is Liapunov stable; i.e., for each neighborhood Y of A there exists c>O such that A c Y, and if x e A then Fn(x) for all nl; (2.3) The basin of A contains an open neighborhood of A. in the following proposition, "area-contracting" means specifically that there exists a number g, where Idet DF(x)I < 9 , for all x in R2. - am. -- •m • . . m•m •• 12 •iI g < 1, such that Y, Il PROPOSITION 2.4. Let F be an area-contracting map of the plane. If A is a regular attractor, then the basin U of A is open, connected, and simply connected. Proof. Let given such that such that Y c U be an open neighborhood of A. Let Fn(A )C Y for all Fn(A)cA n t O. c Yc U, for all c > 0 be Select 6, 0 < 6 < c, nt J. Such c and Sexist, since A is Liapunov stable. Let x e U be given. I' Choose k > 0 such that Fk(x) e A . Since AC is open, there is an open neighborhood VX of x which maps into AC under Fk. Thus each point x in U has an open neighborhood V in U, and U is open. Let x I and x2 in U be given. Choose integers P>O and Q>O such that F (X ) E A8 and FQ(x ) e A Define m = max PQ. Then 1 2 a Fm(X ) and Fm(x ) are in AC . Since A is connected, Ac is connected for each c>O. Hence, A is an open, connected set. Since open, connected Cm sets are path connected, there is a path r in Y connecting Fm(xI) and Fm (x) Thus F-m(F) lies in U and connects x to x Therefore, U is connected. 31 It remains to show that U is simply connected. Suppose that U is not simply connected, and let C be a simple closed curve in U which bounds a region D containing a set S (consisting of one or more points) that is not in U. This implies that the distance between SFn (S) and that A is at least c, for all Fn (A ) c AS, for all n 0. Select a, 0 < a < 6, such n : 0. Since C is compact, there exists 13 an integer rJ(C) c A., J(c) > 0 such that FJ(c)(C) c A for all J a j((c). We conclude that the distance between Fn (S) and Fn(C) is at least c-8 for all . n ? J(c). Therefore, On the other hand, since F is area contracting, the distance between Fn(S) and Fn(C) converges to zero as n 4 w. This contradicts the fact that Therefore, U is simply connected. c-6 > 0. 3. Continuation and Orientation-Reversing Maps Let FA be a homeomorphism of R2 depending on a scalar parameter A. We assume that FA has a fixed point regular attractor 3A., which depends continuously on A, for each A. We define the maximal basin W to be the largest open set having a dense set of -A points that are attracted to A WA; under FA. Let B and let p. be the rotation number of F points in BA' be the boundary of on CA , the accessible For a parametrized homeomorphism on a circle, the rotation number varies continuously with the parameter (see, for example, (D]). Unlike the circle case, however, pA is not necessarily continuous in A. In fact, the boundpry BA can jump discontinuously, even when there is no change in the attractor. It was shown in [HJ] (see also [GOY] and [ATY]) that when the stable and unstable manifolds of an accessible saddle on the boundary become tangent at then cross for A = A., and A > A,, the stable manifold jumps a positive distance c (not dependent on A) into W.. for each A > A,. Figure 2 shows in black the basin of attraction of infinity for three different values of the parameter A in the Henon map FA b(x,y) = (A - x2 -by, x) 3 (3.1) where b is fixed at 0.3. There is a period two attractor in the white region to which the orbits of almost all white points tend. 3 experiments indicate that for Numerical X=1.39 (in Fig. 2a), a period-four saddle orbit and its stable manifold are the only boundary points I I 15 accessible from the white region. There is a tangency of the stable and unstable manifolds of consecutive points in this orbit at z 1.395. X = X. Specifically, if we number the four points in the orbit x1 9....x4 consecutively (in the counter-clockwise direction around the basin boundary), and if we set is a periodic trajectory. xn = Xn(mod 4) for n>4, then {Xn I At A=A, the unstable manifold of x. is tangent to the stable manifold of x + 1. For each A > A,, black points appear in what was the interior of the white region. addition, it has been numerically observed that for each In A > A. (near X,), the set CX of accessible boundary points is composed of a period-three saddle and its stable manifold. Fig. 2bc show in black the basin of attract.-n of infinity at A = 1.4 and A = 1.42,respectively, with the accessible period-three saddle. A numerical investigation of rotation numbers for the orientation-preserving, area-contracting Henon map appears in [AS]. When f is an orientation-reversing homeomorphism, the possible dynamics on accessible orbits are limited. For a connected, simply connected basin of attraction W, an orientation-reversing homeomorphism on W restricts to an orientation reversing-homeomorphism on aW. Again, we study the dynamics on aW through its association with the circle. have fixed points. two. An orientation-reversing homeomorphism f of S1 must It may or may not have periodic points of period Notice, however, that f can have no periodic points of minimum period greater than 2. The map f2 is orientation preserving and has rotation number 0 since it has fixed points. But an orientation-preserving homeomorphism of the circle with rotation 16 number 0 has no periodic orbits of minimum period greater than 1. i 3 Suppose f has a periodic orbit of minimum period k, kL3. a periodic orbit of minimum period k/2, if k is even, or of minimum period k, if k is odd. 3 Then f has Thus f has only periodic points of period one or two. We have the following restatements of Theorems 1.1 and 1.2 for * i orientation-reversing maps: 3 THEOREM 3.2 (ATTRACTING LEMMA). Let F be an orientation-reversing homeomorphism of the plane. i unstable in W. Assume further that the fixed points of F2 i aW are isolated. I Assume that aW is Then each circle point that is fixed under f2 and is attracting on at least one side is a trivial circle point. THEOREM 3.3. Under the hypothesis of Theorem 3.2. there is an accessible fixed point of F2 on aW. I Let F be a one-parameter family of orientation-reversing homeomorphisms. occurs for FV From Theorem 3.3, we observe that if a metamorphosis then BA must jump to different fixed points of F2 I I 17mm m~n Example. The Henon map (3.1) is orientation reversing for b<O. It is easily verified that F ,b can have at most 2 fixed points and at most one periodic orbit of minimum period two. In this situation the possible metamorphoses are severely limited by Theorem 3.3. As long as the period-two orbit and one of the fixed points is in the attractor (and the hypotheses of Thm. 3.2 are satisfied), no metamorphoses will occur. If, however, the basin becomes disconnected, as shown in Fig. 3, then the theorem no longer applies and the boundary can be fractal. Fig. 3 shows in white the basin of a two-piece attractor (which is also plotted in the white region). A metamorphosis has occurred, and there is no longer an accessible fixed point on the boundary. Now the accessible iddle has period six. The existence of periodic orbits in the maximal basin of the attractor but not in the attractor itself is also restricted by Thm. 3.2. Suppose (3.1) has a regular attractor A Properties (2.1)-(2.3)). (i.e., A satisfies If A contains a fixed point and an orbit of period two or if (3.1) is in a parameter range where there is no period-two orbit and A contains a fixed point, then the basin U of A is necessarily bounded by the stable manifold of the (other) saddle fixed point p. (For every choice of parameter values, the orbits of some points in the plane go to infinity; boundary.) thus the basin U has a In particular, under these hypotheses, there are no periodic orbits in the region containing A and bounded by W s(p) except those in A. 18 4. Rotation Numbers for Chaotic Attractors. Here we look at a class plane: an attractor B A is in of non-periodic attractors in the if 8 4 is compact, connected, invariant under F, and contains more than one point. In order to apply Theorem 1.2, we show how to assign a rotation number to an attractor in the class 4, assuming that F is an area-contracting nomeomorphism of the plane. This approach is reminiscent of Birkhoff [B] and also of Cartwright and Littlewood [C-L2] and Levinson [L] who studied attractors in forced 'an der Pol type equations. In looking at the Poincare map of such equations, Cartwright and I 3 3 Littlewood showed that there are invariant annuli which have unequal rotation numbers on the boundary circles and which possess strange attracting sets. Each such attractor is the boundary of the inside contracting and outside contracting parts of the annulus. The existence of different rotation numbers inherited from the boundary I circles was evidence to them of a continuum attractor which was not I homeomorphic to S . Levinson gave a careful analysis of the attracting invariant set of a piecewise-linear version of this map in I [Lni. His work set the stage for the discovery of the horseshoe map by Smale. I I I I See also Levi's analysis of forced van der Pol type equations in(Lil]. Let 2 Z = F u {ic} be the one-point compactification of R F extends to a homeomorphism of Z by setting F({o}) = {co}. 2 Then LEMMA 2 plane R2 . 4.1. Let F be an area-contracting homeomorphism of the If 6 Is in 4, then Z - 6 is connected and simply-connec'ed in Z. Proof. Since 8 is connected, each component of connected in Z. Z - 8 is simply (This simple fact follows most clearly from Alexander Duality with Cech cohomology. See, for example, (Do].) compact, only one component D of R2 8 - Since e is has infinite area (in R2 ) and, given any bound 7),there are only finitely many other components with area larger than n. Let D be a component of H maximum finite area in R2. of D. R2 -6 a homeomorphism. Thus 8 with Since F " is area-expanding and components map onto other components of But F -I also maps D R2 R-2 6, F "1 maps DM ont onto D , contradicting the fact that F" Z -e is is connected and simply connected in Z. 1 Now we can apply Theorem 1.2 to 6, which is the boundary of the open, connected, simply-connected region Z - 6. By looking at F-_ instead of F, it can be shown that G is unstable in follows. in C Let 6 n (Z-G) Z - e, as be an c-neighborhood of 0, and let D be an open set Since F-1 is area-expanding, the area enclosed by the boundary of D becomes unbounded under iteration by F_ easily be shown that almost all points in mapped out of G Z - 6 under F - -1 under iteration of F ; D n E eventually will be hence, G is unstable in Theorem 1.2 provides the following result: 20 It can I 3 3 I 3 3 PROPOSITION 4.2. Let F be an area-contracting homeomorphism of the plane, and let 8 be in the class A of attractors. for each k, the fixed points of Fk are isolated. number p(6,F) Assume that, If the rotation isthe reduced fraction p/q, then there is an accessible fixed point of F on 8. Figure 4 shows an attractor for the Ikeda map with an accessible period 6 orbit. For a typical area-contracting diffeomorphism depending on a parameter A, we conjecture that the rotation number p(A) will vary continuously, except possibly at a discrete set of values of A, and that p(A) will 3 I I I I I I I I of measure 0. le irrational for a non-empty set of A 5. Froof of the Attracting Lemma. Let F be an orientation-preserving homeomorphism of Z = R2 u {}, the 1-point compactification of the plane. A simple arc Q in W with end points q1 and q2, 1q q2 0 on aW and no other points on aW is called a crosscut of W. Each crosscut divides W into 2 subdomains, since W is simply connected. Let {Qn} be a sequence of pairwise disjoint crosscuts such that Qn separates Qn+ from Qn-l' Then there is a corresponding sequence {V n } of subdomains of W such that Vn contains Qn+ except for its endpoints. See Figure 5. The VI D V2 D V 3 D ... is called a chain. sequence V, = {V')} If V ={V n} and are two chains, we say V divides V' if for each i, there is n a j such that V' 9 V the other. We say V and V' are equivalent if each divides Under this relation, an equivalence class of chains is called an end. A chain V is called prime if any chain which divides it is equivalent to it. A prime end is the equivalence class of a prime chain. only if For the unit disk D in R2 a chain nvn {Vn } is prime if and n is a single point (necessarily on the boundary S') general, if there exists a sequence In Qn } of cross-cuts defining an end V such that {Q n} converges to a point in aW, then V is prime (see, for example, [MIl). Let (Vn} be a representative chain in a prime end V. Since each Vn is connected and W is compact in'Z, n vn is a connected, compact, non-empty subset of Z. Thus it is either a single point or a continuum. We call I(V) = nV nEIn 22 the impression of the end V. The I impression of V is independent of the defining chain in V. (However, I two prime ends can have the same impression. * that there are two prime ends corresponding to non-trivial circle 3 points and that beth have impressions that equal aW.) In [C], Carathdodory presents an example of a domain for which the impression of each prime end is a continuum; I In Fig. 1, it appears i.e., none is a single point. A point p in I(V) is called a principal point of V if there exists a sequence {Qn of crosscuts (defining a chain in V) such that {Q } converges to p, i.e., p is the only limit point of this sequence. The set of all such points is called the principal set of V. Finally, we say a point r in aW is accessible from W if there is an embedding n of I 3 I (0,1] into W such that these definitions. In Fig. 6, wp illustrate The following lemmas appear, for example, in [Mi] (as Theorem 17.1 and Corollary 15, resp.): LEMMA 5.1. I limt_>o+q(t) = p. The principal set of V has only one point e if and only if e is accessible from W. LEMMA 5.2. The principal set of V is compact, connected, and 3 non-empty. I 3 Now we describe a topology on the set of prime ends. open set in W. We say an end V is contained in U (i.e., Let U be an Ve 'U) if there exists a chain {V } in V all of whose elements are subsets of U. n Let W* = W u 0, where 0 is the set of prime ends of W. W is open if and only if U1 n W is open (in W) and 23 A set U* in "O = {V: V contains a chain all of whose elements lie in W}. With this topology, a sequence {E n) n of prime ends "converges" to a prime end E, represented by {V n}, if for every m, there exists such that En V for every n > N(m). N(m) e N We call WO together with this topology the Prime end compactification of W. Central to the theory of boundary sets is the following theorem of Carath6odory (see, for example, [C-Lo]): THEOREM 5.3 (Caratheodory). Let W be a connected, simply connected open set. Assume that aW has more than one point. Then W* is homeomorphic to a closed disk, where points in W correspond to points in the interior of the disk, azv- the prime ends correspond to points in S1, the boundary of the disk. Furthermore, if F is a continuous map on Z with W Invariant under F, then there is map FO on W* so that F* = F on W. With this theorem, we are able to learn about the dynamics of F on the boundary of W by studying the corresponding dynamics o" Si, the boundary of D. Prime ends "map" to prime ends under F; induces a map F' on W'. Let T hence F be a homeomorphism from W* to D, the closure of D. Then the circle S I is invariant under the induced homeomorphism f = ToF'of circle is classical. of D. The study of homeomorphisms of the Here we mention briefly some facts about these maps which are needed in the arguments that follow. A reference for this material is (D]. 2 24 ! Poincar6 showed that associated with each orientation preserving homeomorphism I of the circle is a "rotation" number, an asymptotic measure of the rotation of points on the circle under iteration by 1. 3 I In order to define this number, it is convenient first to consider a "lift" of T. A map G of R is called a lift of 7 if noG = yon, where n is the covering map from R to S I 3 I * 3 I i.e., ir(x) = exp(27rix). Let G(x) = limn) GCny)/n, for x in SI and y in R such that n(y) = x. (The value of independent of the choice of y.) We define the rotation number r of I to be the unique number In (0,1) such that pG(x) is pG (x) - r is an integer. This value is well-defined: FACT C1. The value r = r(j) is independent of both x and the particular lift G of 1. The dynamics of I are, to a large part, described by the rotation number r(T): I 3 3 FACT C2. A map z of the circle has points of minimum period q if and only If r(j) is an irreducible fraction of the form p/q, for some positive integer p. The map 1 has fixed points if and only if r(4) = 0. 25 Thus, if z has periodic points, they must all have the same period. FACT C3. If x has a periodic point of period n, then every point on the circle is either a fixed point of yn or is asymptotic to a fixed point under iterates of yn In the following set of definitions, we describe various notions of stability for periodic points and periodic prime ends. We often mention only fixed points, but the definitions and lemmas which follow carry over to periodic points by considering the appropriate iterate of f: a periodic point of period n is a fixed point of fn A fixed point p on S1 is called: (1) attracting on one side If all nearby points on one side of p converge to it under Iteration by f; (2) repelling on one side if all nearby points on one side of p converge to it under iteration by f-1. The analogous definitions hold on the space of prime ends if the word "point" is replaced by the term "prime end", and if "f" is replaced by "FO".) By Fact C3, an isolated fixed point p on SI is either attracting or repelling on each side. If p is attracting (resp., repelling) on one side, then by Carath~odory's Theorem, the associated prime end P is attracting (resp., repelling) on one side. A prime end P fixed under F* is called weakly contains a chain {Vn } such that 26 F() I ' table from W if 91 for every i. The following lemma follows easily from the definition of aW being unstable I I I in W (see Sec. 1): LEMMA 5.4. weakly stable from W. 3 I 3 I If aW is unstable In W, then no fixed prime end is The following three lemmas are important In relating fixed points of F on aW to fixed points of f on S1. Although there is a fixed prime end corresponding to each fixed point on the circle, It is not the case that a prime end which is fixed under FO necessarily contains a point which is a fixed point of F. Lemma 5.5 appears in [C-Li]. l LEMMA 5.5 (Cartwright-Littlewood). SF', Let ? be a fixed prime end of and let {Q I} be a chain of cross cuts converging to a point q (necessarily a principal point) of P. If, for every I, F(Qi ) has at least one point In common with QV then q Is a fixed point of F. LEMMA 5.6. If aW is unstable in W and if a fixed prime end P is attracting on one side, then all principal points of P are fixed under F. Proof. Suppose P is attracting on one side. principal point of P. In Let z be a By Lemma 5.2, there exists a sequence 27 Q} of cross-cuts converging to z. Let {Vn } be the chain defined by these crosscuts. By throwing out elements of the chain where necessary, we can assume that either F(Q) A QI * o, for all i, or that F(QI ) is disjoint from QV for all i. In the former case, z is fixed, by Lemma 5.5. Suppose that F(QI ) is disjoint from QV for all i. Then (OF)(Q I ) is disjoint from r(QI), for all i, and T(M) = p is attracting on one side. Let a- on S I be the end point of T(QI ) which is on that side of p. Then for I sufficiently large, fn ( I)--+p, as Since r(QI ) and (CoF)(QI ) are disjoint, we then have that But then F(VI) C Vi. for all i, contradicting (roF)(QI) C r(V). n--w. Lemma 5.4. Thus z is fixed under F. Proof of Theorem 1.1 (Attracting Lemma). a Suppose that x is a periodic circle point of period n and that x is attracting on one side. Then the corresponding prime end P is fixed under (F*)n and attracting on one side. fixed under Fn. connected. By Lemma 5.6 all principal points of P are By Lemma 5.2 the set of principal points is Since fixed points of Fn are isolated, there can be only one principal point, say p. By Lemma 5.1 the point p is accessible. For a given curve r in W limiting on p, the corresponding curve h-(r) (by definition) limits on a trivial point r in Si. 28 6. Hyperbolicity In this section we describe the dynamics on the set of accessible points under the hypotheses that F is a diffeomorphism of either the plane or the sphere and that periodic points in the boundary are In addition, we either assume that W is a basin of hyperbolic. attraction (see Sec. I for definition) or we add a condition on the map F at w. there exists We say that c is repelling in W r2 > 0 such that if Ixi< rI , if, for each then r1 > 0, IFn(x)l < r2 for all x in 1 and n a 0. THEOREM 6.1. Assume that the periodic points of F in aW are hyperbolic, and that either aW is unstable in W and w (i) W is a basin of attraction, or is repelling in W . (ii) If the rotation number p is rational, then every.accessible point either is a periodic point or is in the stable manifold of an accessible periodic point. The following lemmas are used in the proof of Theorem 6.1. each, the hypotheses of Theorem 6.1 are assumed. For Let S be a (finite) periodic saddle of F in aW, and let Ws (resp., Wu ) represent either branch of the stable (resp., unstable) manifold of S, excluding S. LEMMA 6.2. If8W Intersects Ws , then Ws and W are disjoint. 29 Proof. disjoint. If W is a basin of attraction, then clearly W s and W are Suppose therefore that aW is unstable in W, that repelling in W, and that both 8W and W intersect WS . Let Q crosscut in wnws, three components. and let Q2= F(Q1 ). Then W- One component meets both Q1 and w is be a j {Q1 U Q2 ) has %2 . Let D 1 be the component that meets only Q1, and let D2 be the component that meets only Q2 . Then Since v D2 = F(DI). is repelling in W exists a compact set K such that and W is invariant under F, there F(K nW) and an open neighborhood of S is in K. exists a sequence {D n } of open sufficiently large Dn nK is contained in K 0 w Iterating D 1 forward, there ets in W intersecting Ws such that {D n } approaches Wu (locally), as n-w. Given c > 0, choose j u. that Dn intersects K and there is no c-disk in for all n > j. (This is possible since K includes an open neighborhood of S and there are only a finite number of c-disks inside K.) Then for n > J, every point in D n n K boundary, contradicting the hypothesis that LEMMA 6.3. is within c of the aw is unstable in W. * If p c S' is a trivial fixed point, then it corresponds to an accessible fixed point S in the boundary aW. a repeller, then so is p. 30 I If S is I I 3 Proof. Corresponding to p Is an accessible point S in 8W. The point S is necessarily a fixed point since accessible points map to accessible points and S is the only accessible point corresponding to the prime end p. Suppose that S is a repeller. 3 g 3 Since the boundary 8W is connected and more than one point, each circle centered at S of sufficiently small radius must intersect aW. Let 7 be an "accessing" path in 11 which limits on S (corresponding to a path in the disk which limits on p), and let {QnI be a sequence of crosscuts converging to S such that (1) Qn is an arc of a circle of radius 1/(n+N) for some fixed integer N t 1, and (2) 1 intersects Qn an odd number of times, for each n. As I 3 described in Sec. 5, since the sequence (i.e., {Q n} the point S ), it defines a prime end. converges to one point Since this prime end has accessible point S with accessing path 7, it is represented by p on S1 By the construction, p is a repeller on S I 3 We say that two iccessing paths can be homotoped to z 0 and via a continuous family of paths that remains in W, all having the same endpoint S, (i.e., I 3 3 3 continuous family g(0) = S, for all 1 are equivalent if 0 gt:I --> W such that if there exists a g0 (I) = lot g1 ) = T, and t c I). Notice that if S has two non-equivalent accessing paths, then it corresponds to (at least) two different circle points under hc . In the next two lemmas, we assume the following additional hypotheses: (1) S is an accessible fixed point saddle; and (2) S has an associated trivial circle p which is attracting on at least one side, (i.e., lim there exists a point z c S1 , z ; p, such that fn Cz) = p). For c small, let M C be the union of the segments of the stable and unstable manifolds that connect S to the boundary of BC (S), the c-ball around S. We can assume that c is small enough that the segments of the stable and unstable manifolds in MC intersect only at S. LEMMA 6.4. Let 1 be an accessing curve to S. T)-n 7 is equivalent to an accessing curve that does not intersect MC . Proof. Suppose that I is not equivalent to an accessing curve that does not intersect MC . Since W is open, it must be the case that I intersects at least two components of and the boundary aW intersect Ws BC (S) - Mc n M.or and that both z both intersect Wu case in which both intersect Ws is ruled out by Lemma 6.2. that both intersect W . Let uW{Q n} n M. The Suppose be a sequence of crosscuts converging to S such that Qn is a closed interval on Wu and Qn intersects 7 an odd number of times, for each n. (Since the endpoints of Qn are the only points of Qn on the boundary aW, we can assume in fact that Qn intersects 7 only once.) Q} n The prime end determined by is represented by p on the circle. 32 In- this case, p must be a repeller, a contradiction. In the following, let c > 0 and let 7 be an accessing path to S such that there is a unique component of i. Call this component QC Lemma 6.4.) B C(S) MC - (The existence of Q that intersects is guaranteed by Since S is hyperbolic, we can further assume that B (S) is a neighborhood in which F is smoothly conjugate to a linear map, that S is the origin, and that QC is an (open) quadrant in R2. LEMMA 6.5. The romponent Q , as defined above, contains no points of the boundary aW. Proof. Suppose that Q contains a point of aW. Let ea, aeR, be a family of ("hyperbolic-like") invariant curves in Q boundary is connected, there is a connected component of Since the 8W n Q. containing S and a point b of e , for e sufficiently close to S. a a a Assume ea is sufficiently close to S that I extends from S to a point ga on ea. is above). Assume ga is below ba on ea (the argument is similar if it Assume further that F(g a ) is above b . (Otherwise take a higher iterate.) Then F(g a ) is between ba and F(b on a) e Since 7 and f(T) are both accessing curves to S (and they I- correspond to curves in the disk limiting on the same circle point), ga and F(g) can be joined by a curve contained entirely in W so that 33 the resulting loop g is null-homotopic in W. This is a contradiction since either b or F(b ) is contained in (. a Proof of Theorem 6.1. We assume that the rotation number is 0. (If the rotation number is p/q the proof.) with p*O , then replace F by Fq in Let x be an accessible point in aW which is not a fixed Corresponding to x is a trivial circle point z. By Lemma 6.3, point. z is not a fixed point. fixed point p on S1. point. Then the forward orbit of z converges to a By the Attracting Lemma, p is a trivial circle Corresponding to p is an accessible point S in aW. 6.3, q is a fixed point. aW * a By Lemma Since either W is a basin of attraction or is unstable in W, S cannot be an attractor, and again by Lemma 6.3, S is not a repeller. Thus S is a saddle, and the hypotheses of Lemmas 6.4 and 6.5 are satisfied by S, since p is attracting on one side. By Lemmas 6.4 and 6.5, there is at least one component Q BC (S) - MC which is in W and contains no boundary points. are boundary points in another component of of If there B (S) - MC, then they are in connected components of aW which intersect both invariant manifolds bounding that component. If exactly one component is free of boundary branch W s points and is in W, then there are accessible points on one of the stable manifold and one branch Wu of the unstable manifold. By Lemma 6.2, each point on this branch of Ws is an accessible boundary point. Thus points on one branch of the stable manifold of S are in one-to-one correspondence with points of S1 on one side of p. Let g and g2 refer to the segments on either side of p 34 consisting of points on the circle between p and the closest fixed points on either side. Let (If p is the only fixed point, then 9, = 92' be the segment which corresponds to W s . Necessarily, 9 l is part {Q n} of the stable set of p. Let be a sequence of crosscuts converging to S such that one endpoint of Qn is on Wu and one is on W for each n. Since accessible boundary points on W F- to S, given a point y in WU fn aW converge under (necessarily accessible) and n>O, all but a finite number of points in the forward orbit of y under F will be in Vn' n, the closure of the domain determined by Qn and S. In this case p, which corresponds to the prime end determined by is repelling on 02" Qn , Since the forward orbit of z converges to p, z must be on gi, and thus x is in the stable -inifold of S. * The argument given in the previous paragraph holds in all cases in which a sequence {Q n} of crosscuts in W converging to S (i.e., a sequence which defines the prime end represented by p) has the I property that one endpoint of Qn is in W n 3 u and one is in W s , for all 0. The case in which there are exactly three components of B (S) - M in W this case. If the crosscuts do not have this property, then there are which are free of boundary points also reduces to necessarily exactly two or exactly four components in W. In these cases, both endpoints of a crosscut are in one or the other branch of the stable manifold of S. 3 (Since the fixed point p is attracting from at least one side on S I, the case in which only the unstable manifold of S intersects the boundary is ruled out by an argument similar to I that in the proof of Lemma 6.3.) In this case, p is necessarily attracting on the circle, and points on both g I 35 and 02 are in one-to-one correspondence with points in the stable manifold of S. Thus x is in the stable manifold of S. x The following corollaries follow from the proof of Theorem 6.1. The first extends Theorem 1.1 (the Attracting Lemma) to all points of S1 not just periodic points. The second shows that the map hc , the accessible-point extension of the Riemann map h (described in Sec. 1), is continuous on stable manifolds of periodic points of S I (up to and including the periodic point). For a trivial circle point r, we let r denote the corresponding accessible point in aW. We assume the hypotheses of Theorem 6.1. COROLLARY 6.6. Assume that p is rational. If a point r in S' is not a periodic point, then r is a trivial circle point. COROLLARY 6.7. Let p in S' be a periodic point of f, and let {r n } be a sequence of points in SI converging to a point r. If rn is in the stable manifold of p, for each n, then the corresponding sequence {r } of accessible points in aW converges to r in 8W. 36 36I I ;I 7. Minimum Periods of Accessible Periodic Orbits Unfortunately, although a rational rotation number p/q implies Il that f has a periodic orbit of minimum period q on SI, we cannot claim that F has a periodic point of minimum period q. See, for example, the boundary depicted in Fig. 7, where p(CW,F) is 1/3, and F has an *I accessible fixed point on the boundary but no period three orbit. Recall that hc is the accessible-point extension of the Riemann map h. If the rotation number p of f is rational (say p/q), but not 0, then trivial circle points which are periodic (necessarily of minimum period q) can map by hc to periodic points in the plane of smaller minimum period. This situation is illustrated in Fig. 7, where all points in one orbit on the circle coalesce to a fixed point Surprisingly, Cartwright and Littlewood [C-Li] showed on the sphere. that this type of example is the only possible one when accessible points coalesce: THEOREM 7.1 (Cartwright-Littlewood). If p # 0, then 8W contains at most one accessible fixed point. It is easily seen that this theorem rules out coalescing to an orbit of minimum period strictly between 1 and q. Suppose that a trivial periodic orbit of minimum period q on the circle maps (under hc) to a periodic orbit of minimum period k on 37 W, where k#1 and kaq. Then k = q/r for some divisor r of q (r*1), and Fk has k accessible fixed points on aW. But the rotation number of the induced map fk on I the circle is non-zero, contradicting the theorem. The situation illustrated in Fig. 7 can be largely overcome by using Theorem 6.1 and assuming that the accessible periodic points are saddles. PROPOSITION rotation number 7.2. p * 0 Assume the hypotheses of Theorem 6. 1. is the reduced fraction p/q, where If the q 0 2, then every accessible periodic saddle in aW has minimum period q. I Proof. Suppose there exists an accessible orbit of period k on k aW, where 1<k<q. Then Fk has at least k fixed points, but the I rotation number of the induced circle map fk Is non-zero, contradicting Theorem 7.1. fixed point saddle z on aW. Hence we assume there is an accessible Given a path r in W limiting on z, let y be the trivial circle point which is the limit point of h -1 (r). Either y is a periodic point of period q, or the forward orbit of y under fq converges to a periodic point r. By Theorem 1.1, r is a trivial circle point. By Corollary 6.4, the trivial circle point r corresponds to the accessible point z (i.e., hc (r) = z ), as do each of the q points I r = rI , r2 ..... rq 38 in the orbit of r. U Let 0 be the center of the disk which S 711-...7q Then 3 3 r be line segments joining 0 to bounds and let rl,.. .rq, respectively. h(I11J.... h(q) are paths in W, all of which limit on z. Let and r be adjacent points on the circle. Since r1 ,..., rq represent distinct prime ends on S1 , the closed loop formed by h(v ), and Therefore, by Lemma 6.5, manifold of ) z necessarily contains boundary points in its interior. These boundary points are connected to If q h is 3 or z q 4, z within the loop. can be at most 4. then at least one branch of the stable is in W, and r is necessarily attracting on at least one side of the circle under fq --as is each of the q points in i the orbit of z. Each of these stable sets must correspond to a branch of the stable manifold of 3 I 3 r in the disk connecting a point in the stable set of a path a point in the stable set of TI z. On the other hand, there exists or xj r ri to which crosses one of the segments exactly once and intersects none of the other segments. Hence, in W, h(f) crosses h(iI) (or h( )) exactly once and intersects none of the other "accessing" paths, a contradiction for q >2. i For a map of the sphere, two types of degeneracies are possible when p = 1/2, even with the hypothesis that accessible orbits are hyperbolic saddles. In Fig. 8a, I p = 1/2 These possibilities are illustrated in Fig. 8. and there is an accessible fixed point saddle p I on aW. In Fig. 8b, aW Is a line segment. The basin W complement of B) is simply connected on the sphere. p = 1/2 (the In this case, I and there is an accessible fixed point saddle p and an accessible saddle orbit {r,r 2 } of period two. I I The situation is greatly simplified when we look at homeomorphisms of the plane. We use the following converse of I Theorem 1.2 for planar maps: PROPOSITION 7.3. '.et F be a homeomorphism of the plane R2. there exists an acce!aile fixed point in aW, then If p = 0. 1 1 I Suppose Proof. and let p c S1 0 0. Let 2 N > I M-I (f(p), f (p)]...., [f' Let I be an accessible fixed point, be a corresponding trivial circle point. p e 0, we can choose extension h x such that the intervals N (p), f (p)] of the Riemann map, and 6 cover S I . Since [p, f(p)], By Property 2 of the hc(fi(p)) = x, for be paths beginning at a point 0 i z 1. in D and r be the closed loop ending at p formed by h(z), h(6), h(O), and x. Choose a preferred direction, and f(p), respectively. Let clockwise or counterclockwise, so that the accessible boundary points corresponding to trivial circle points between p r. Let G be r together with its interior. Since the accessible 40 . and f(p) are in 1 I IN points are dense in compact set aW, the entire boundary is contained in the K = U f (G) .. The complemeint of D is contained i=1 entirely in W or entirely in R2\W. since W is simply connected. But then of 3 The former case is ruled out h(7) c W is in the boundary K, a contradiction. Now assume that p is the reduced fraction p/q. Assume further that there is an accessible periodic orbit of minimum period in 3 aW Since (in the plane). Fr r, F induces the map has a fixed point, the rotation number of Prop. 7,3. I The iterate Thus all periodic points c.-* which Implies that q divides r. in S1 f fr f on S is 0, by are fixed points of The next proposition shows that q must equal r. I PROPOSITION I 7.4. If p = 0, then every accessible periodic point in aW isa fixed point of F. I Proof. Assume there is an accessible periodic point period q, q > 1. Let I p and let D, let 7 hc, p is not a fixed point. be the line segment from be the line segment from I I of be a trivial circle point corresponding to x. By Property 2 of the map be the center of x 41 0 to 0 Let to 0 p, f i(p), for each i, Then 1 S i S q+1. form a closed curve Since of r which, except for p, ii contained in W. p = 0, f(p) the circle and is between By Prope rty 2 of the map is inside on one side of heCf(p)) F hc (fq+l(p)) I (hc(p)) = hc (fq ()). I is i,% r, a M contradiction. COROLLARY 7.5. n q c o and Thus one r and the other is outside. F(h C(p)) = Fx) =Fq+1() But only one of p # 0 fq(p) p and fq+ 1 (p) is between them on the other side. Ac(?1) and hc(q+1 hc(f(p)) p, and they both contain h (Xq) and h(z Let 1 be a homeomorphism of the plane R2 If is the reduced fraction p/q, then every accessible periodic point in 8W has minimum period q. Proof. Suppose there is an accessible periodic point with period r. By the discussion following Prop. 7.3, q divides r. It follows from Prop. 7.4 that since the rotation number of fq (on S1 ) is 0, q must equal r. I The final corollary puts together the previous results with the assumption of hyperbolocity to obtain a statement that does not mention the rotation number p: I 42 I COROLLARY 7.6. Assume the following set of hypotheses: (1) F is a diffeomorphism of the plane R2; 3 (2) the periodic points of F in aW are hyperbolic; either (I) W is a basin of attraction; i(3) or (ii) aW is unstable in W, and w is repelling in W; 3 and (4) either (i) there exists an accessible periodic point of minimum period q, or (ii) there exists an accessible point which converges (under fq) to a periodic point of minimum period q. I 3 Then every accessible point in W either is a periodic point of minimum period q or is in the stable manifold of such a periodic point. I I 3 We need to prove that if an accessible point Proof. fq under show that z, then to a periodic point p z exposition, we assume that q is 1 and that I 3 3 Otherwise, replace F with F of the stable manifold of and gI 'be paths from y U .) By hypothesis, z. Let to z x y converges First we (For ease of is a fixed point. x s is on one branch W be a point in W, and let and from y to go F(x), respectively. By Lemma 6.2, W and Ws are disjoint. on any assumption about p.) I is accessible. Assume otherwise. is rational. x unmn mu nuum (This lemma does not depend Therefore, there must be accessible 43m points in the region between x and F(x). V bounded by g0 , In fact, since and the portion of W s g, is irrational, there must p be points in the orbit of the accessible point But then W s must enter V. 91, 91 p or W5 , x in the region The only way Ws can enter V is through all of which are impossible. (In particular, are in W, which by Lemma 6.2 does not intersect Ws.) ir rational, and by Theorem 6.1, an accessible periodic point, namely Since z V. g g0, and Therefore, is in the stable manifold of x z. has minimum period q and p is rational, by Cor. 7.5, eve-y accessible periodic point has minimum period q; the result follows from Theorem 6.1. I I II 1 44I U I 3 References [AS] K. Alligood and T. Sauer, "Rotation numbers of periodic orbits in the H~non map", Commun. Math. Phys. 120 (1988), 105-119. [ATY] K. Alligood, L. Tedeschlni-Lall, and J. Yorke, "Metamorphoses: sudden jumps in basin boundaries", preprint. [B] G.D. Birkhoff, "Sur quelques courbes fermees remarquables", Bull. Soc. Math. France 60 (1932), 1-26. [BSI N.P. Bhatia and G.P. Szeg, I 3 Stability Theory of Dynamical Systems, Springer-Verlag, Heidelberg, 1970. [C] C. Caratheodory, "Uber die Begrenzung einfach zusammenhangender Gebiete", Math. Ann. 73 (1913), 323-370. [C-L1] M.L. Cartwright and J.E. Littlewood, "Some fixed point theorems", Ann. Math. 54 (1951), 1-37. [C-L21 "On non-linear differential equations of the secorA order: I. Th- equation y"-k(1-y 2 )y'+y = bAkcos(At+m), k large", J. London Math. Soc. 20 (1945), 180-189. I 3 __ (C-Lo] E.F. Collingwood and A.J. Lohwater, Theory of Cluster Sets, Cambridge Tracts in Mathematics and Mathematical Physics, No. 56, Cambridge Univ. Press, 1966. [D] R.L. Devaney, An Introduction to Chaotic Dynamical Systems, Benjamin/Cummings Publishing Co., Menlo Park, 1986. [Do] A. Dold, Lectures on Algebraic Topology, Springer-Verlag, Heidelberg, 1972. I 3 [GOYa] C. Grebogi, E. Ott, and J. Yorke, "Basin boundary metamorphoses: chanees in accessible boundary orbits", Physica 24D (1987), 243-262. [GOYb] , "Critical exponent of chaotic transients in nonlinear dynamical systems", Phys. Rev. Lett. 57 (1986), 1284-1287. 3 3 [HJI S. Hammel and C. Jones, "Jumping stable manifolds for dissipative maps of the plane", Physica 35D (1989), 87-106. [Lil M. Levi, Qualitative Analysis of the Periodically Forced Relaxation Oscillations, Mem. AMS 214, 1981. [Ln] N. Levinson, "A second order differential equation with singular solutions", Annals of Math. 50, no. 1 (1947), 127-153. 45 [Ml] J. Mather, "Topological proofs of some purely topological consequences of Caratheodory's theory of prime ends", Th.M. Rassias, G.M. Rassias, eds., Selected Studies, North-Holland (1982), 225-255. [M2j J. Mather, "Area preserving twist homeomorphisms of the annulus", Comment. Math. Helvetici 54 (1979), 397-404. [M3 , "Invariant subsets for area-preserving homeomorphisms of surfaces", Advances in Math. Suppl. Studies, Vol. 7B (1981). _ [Y] J.A. Yorke, "Dynamics, a Program for IBM-PC Clones", 1987. (Available from J. A. Yorke.) 46 I I I 6 * :'. - " "w.l . 1.2. -a. I • f I -. -2 -3.-2- .1 1 •O1• .3 .. FIGURE 1. Two basins of attraction of the time 2Tr map of the forced damped pendulum equation 0" + .10' + sine = 2cost are shown in black and white. The black and white regions are connected on the cylinder. !A +1 FIGURE 2. A portion of the basin of infinity of the Hnon map (3.1) is shown in black for b fixed at 0.3 and each of three values of the parameter X. The x and y values shown are in the rectangle L-2,21 x [-2,11] In (a) at X = 1.39, the set of accessible points consists of a period-four saddle and its stable manifold. Crosses show a period-three saddle to which the boundary jumps at a boundary metamorphosis at X z 1.395. In (b) and (c) at = and X = 1.42, respectively, set of accessible points consists1.40 of this period-three saddle and the its stable manifold. I - I I I U I I I I I I I I I I .:..* .. - . - I I I I -, *~l(~) I I I I I 1 I 1 I I I 1 I I I I I I U I I I I I I U II I m FIGURE 3. A portion of the basin of infinity of the orientationreversing H6non map (3.1) is shown in black. There is a twopiece attractor whose basin is not connected, and the basin boundary is fractal. 4.A caoti oftheIked ma FIGUE atractr 09(xsnT ycoT)I (97 -sinT, +0.9xcos f~x~) isshow. Tere sa T.4 x+y2)-6.0(1.0+ wher accesibe (1rbi on peiod6 hettratoI F's x FIGURE 5. Sequences of crosscuts and subdomains defining a prime end are illustrates. Ii I /Qn _ _ _ _ _ _ _ _ _ A _ _ B I (b) __ _ __ A _ __ P _I I'BI (c) FIGURE 6. Each figure represents an open, simply connected setI (the interior of the rectangle minus the line segments). In each case, segment AB is the impression of a prime end. In (a), each point of AB is a principal point, and there are no accessible points in AB. In (b), segment CD is the principal set of AB, and there are no accessible points in AB. In (c), P is the only principal point and the only accessible point of AB. I U I, I p=f (PI 1P I 3 ~P3f 2 (pl) I P=f (PI) II (a) I i I I I I (hi FIGURE 7. In (a) the rotation number on the boundary circle is 1/3. The circle maps to the boundary in (b), however the boundary in (b) does not have an accessible periodic point of period 3, but rather has an accessible fixed point. This example is realizable on the sphere, not in the plane. Ft, 7 o I I I I ' / cb) / I ,r'/l / FIUR S.Totpso|eeea so teshri hc h iiu periods of accessible saddles do not equal the periods of orbits of then associated circle map, are shown. By Prop. 7.2, the rotation numbers in each case is 1/2. in (a), the boundary DW has a fixced point saddle p. in (b), W is the complement in the sphere of the line seIi-nt 31 from rI to rg; thus 3W is the boundary of a simply connected set v on the sphere. Agai, the rotation number is 1/2 ad there is an accessible mixed point saddle p and an accessible saddle orbit r,rl 2 of period two.s of h I 5 Figure 1 Two basins of attraction of the time 21r map of the forced damped pendulum equation )" + .16' + sinO = 2cost are shown in black and white. The black and white regions are connected on the cylinder. 1 Figure 2 A portion of the basin of infinity of the H~non map (3.1) Is shown in black for b fixed at 0.3 and each of three values of the parameter A. The x and y values shown are in the rectangle [-2,21 x (-2,11]. In (a) at A = 1.39, the set of accessible points consists of a period-four saddle and its, stable manifold. Crosses show a period-three saddle to which the boundary jumps at a boundary U metamorphosis at A = 1.395. In (b) and (c) at X = 1.40 and A = 1.42 , respectively, the set of accessible points consists of this perpod-three saddle and its stable manifold. I I Figure 3 A portion of the basin of infinity of the orientation-reversing H6non map (3.1) is shown in black. There is a two-piece attractor whose basin is not connected, and the basin boundary is fractal. Figure 4 A chaotic attractor of the Ikeda map f(x,y) = (.97 + 0.9(xcosT - ysinT), 0.9(xsin'r + ycosr)), where 2 period 6 orbit on the attractor. I 2 r = 0.4 - 6.0/(1.0 + x + y2), is shown. There is an accessible Figure 5 Sequences of crosscuts and subdomains defining a prime end are illustrated. Figure 6 Each figure represeits an open, simply connecteu set (the interior of the rectangle minus the line segments). In each case, segment AB is the impression of a prime end. In (a), each point of AB is a principal point, and there are no accessible points in AB. In (b), segment CD is the principal set of AB, and there are no accessible points In AS. In (c), P is the only principal point and the only accessible point of AB. Figure 7 In (a) the rotation number on the boundary circle is 1/3. The circle maps to the boundary in (b), however the boundary in (b) does not have an accessible periodic point of period 3, but rather has an accessible fixed point. This example is realizable on the sphere, not in the plane. Figure 8 Two types of degeneracies on the sphere, in which the minimum periods of accessible saddles do not equal the periods of orbits of the associated circle map, are shown. By Prop. .7.2, the rotation number in each case is 1/2. In (a), the boundn-y aW has a fixed point saddle p. In (b), W is the complement in the sphere of the line segment aW from r1 to r2 ; thus aW is the boundary of a simply connected set W on the sphere. Again, the rotation nnber is 1/2 and there is an accessible fixed point saddle p and an accessible saddle orbit {r,r 2 } of period two. I| WHEN CANTOR SETS INTERSECT THICKLY 3 Brian R. Hunt'. Ittai Kan2. and James A. Yorke July 3, 1991 Abstract The thickness of a Cantor set on the real line is a measurement of its "size". Thickness conditions have been used to guarantee that the intersection of two Cantor sets is nonempty. We present sharp conditions on the thickness of two Cantor sets which imisly that their intersection contains a Cantor set of positive thickness. 1 - Introducticn Newhouse defined [5] a nonnegative quantity called the "thickness" of a Cantor set in order to formulate conditions which will guarantee that two Cantor sets intersect. (All Cantor sets considered in this paper lie in R 1 .) These conditions have been used [5, 6, 7, 8, 9] in the study of two-dimensional dynamical systems to deduce the existence of tangencies between stable and unstable manifolds whose one-dimensional cross sections are Cantor sets. I Thickness may be thought of as a measure of how large a Cantor set is relative to the intervals in its complement. Henceforth, these intervals will be referred to as gaps; the two unbounded intervals in the complement are each included in our use of the term gap. Newhouse's result [5, 7, 8] is that two Cantor sets must intersect if the product of 3 3 their thicknesses is at least one, and neither set lies in a gap of the other. When this latter condition is satisfied, the sets are said to be interleaved. In [10], Williams observed the surpising fact that two interleaved Cantor sets can have thicknesses well above one and still only intersect in a single point. 'Code R44, Naval Surface Warfare Center, Silver Spring, MD 20903-5000 of Mathematical Sciences, George Mason University, Fairfax, VA 22030 3Institute for Physical Science and Technology and Department of Mathematics, University of Maryland, College Park, MD 20742 The first author was supported by the ONT Postdoctoral Fellowship Program administered by ASEE, by ONR, and by the NSWC Independent Research Program. All three authors were partially supported by Applied and Computational Mathematics Program of DARPA. 2 Department Sthe One might hope that under sufficiently strong -'L3 -G, G3 GR R3 L2 G2 , R2 Figure 1: Constructing a Cantor set thickness conditions, the intersection would be a Cantor set. However, the intersection of two arbitrarily thick interleaved Cantor sets can contain isolated points, so Williams posed the question of what conditions on the thicknesses of two interleaved Cantor sets will guarantee that their intersection contains another Cantor set. Williams obtained such a condition, though" it is not sharp. In this paper we obtain the sharp condition. More precisely, we find a curve in (T, 'r2 )-space such that if the ordered pair (T 1 , "2 ) of thicknesses of two interleaved Cantor sets lies above the curve, their intersection contains a Cantor set, but if the pair of thicknesses lies below the curv,_ here exist examples for which the intersection is a single point. Kraft [2] has independently arrived at this condition. We further show that if the thickness pair lies above the curve, the intersection must contain a Cantor set of positive thickness. This is the only result that addresses in terms of thickness how large the intersection of two Cantor sets must be. There are well known probabilistic results concerning the Hausdorff dimensions of intersections of Cantor sets (c.f. [1, 3, 4]). One may think of a Cantor set as being constructed by starting with a closed interval and successively removing open gaps in order of decreasing length. Williams' formulation of the thickness of a Cantor set may then be thought of as follows. Each gap G, is removed from a closed interval I,, leaving behind closed intervals L, the left piece of I, - G,, and R, oil the right (see Figure 1.) Let p, be the ratio of the length of the smaller of L, and R" to the length of G,. The thickness of the set is the infimum of p, over all n. We consider as an example the "middle-third" Cantor set, constructed as follows. Start with "he'closed interval [0,1], and remove the open interval (1/3, 2/3), the middle third of the original interval. Then from each of the two remaining intervals, remove their middle thirds; repeat this process infinitely often. Each gap G,, is the same length as the adjacent intervals L,, and R,, so p,, = 1 for all n. Thus the thickness of the middle-third Cantor set is one. There is a connection between the thickness of a Cantor set and its fractal dimension, which depends in part on how the ratios p,, are distributed as n -* o. However, two largc gaps close together make the thickness of a set very small, while its dimension can still be large. It was shown in [7] that the Hausdorff dimension of a Cantor set with thickness r is bounded below by log 2/ log(2 + 1/r). This lower bound is sharp for the middle-third Cantor set (whose dimension is log 2/log 3.) We offer here a new formulation of the definition of thickness which we state for all 2 SI: ib, I a, l C2 2: b2 Figure 2: Non-intersecting interleaved sets compact sets, not just Cantor sets. (The results in this and previous papers are found to be valid for all compact sets.) We define non-degenerate intervals to have infinite thickness, while singletons are defined to have thickness zero. In fact, any set containing an isolated point will be seen to have thickness zero. To define the thickness of a compact set S which is not an interval, we consider a type of subset of S obtained by intersecting S with a closed interval. We call such an intersection P a chunk of S if P is a proper subset of S and has a positive distance from S - P, the complement of P in S. (Notice that for P to be a chunk both P and S - P must be closed and nonempty.) We then define the thickness of S to be the infimum over all chunks P of the ratio between the diameter of P and the distance from P to S - P. In the case of the middle-third Cantor set, the given ratio can be shown to be smallest when the chunk P is obtained by intersecting S with an interval L" or R, in which case the ratio is one. in Section 2 we will show that our new definition is eq"'valent to the old one for all Cantor sets. The reason thickness is an appropriate quantity for determining when one can guarantee that two compact sets intersect is illustrated by considering an example where each of the two sets is a union of two disjoint intervals. For i = 1,2 let 5, consist of closed intervals of lengths ai and b, with a, < b,, separated by a distance ci. Then each Si has only two chunks, and is found to have thickness a,/c,. If the product of the thicknesses ala 2 /cIc 2 is at least one, then either a, > c2 or a 2 >_c1 (or both); assume a, >_c2 . Then since bi >_a1, neither interval of S, can lie in the gap of S2; hence if the two sets are interleaved, they must intersect. If on the other hand ala 2 /clc 2 < 1, then with an affine map we can scale the sets so that a, < c2 and a 2 <c, and position them so that the component of S, with length a, lies inside the gap of S2 , and vice versa. The two sets are then interleaved, but they do not intersect (see Figure 2). This example could of course be made to involve Cantor sets by constructing very thick Cantor sets in each chunk of each S,. An important point which is apparent in the above example is that the union of two sets can have a smaller thickness than either of the original sets. In other words, adding points to a set can decrease its thickness. By the same token, one may be able to increase the thickness of a set by removing appropriate subsets. This observation is useful in the following way. No matter how thick two interleaved compact sets are, their intersection may have thickness zero because it may contain isolated points, or arbitrarily small chunks which are relatively isolated from the rest of the intersection. Nonetheless we are able to show that 3 6 5 = f(71) 4C g( 2) 'W'7 t 1 3 2 B " = /77 A 0 0 1 2 3 4 5 6 7l-- Figure 3: The intersection of two interleaved compact sets with thicknesses T1 and 72 can be empty for (7"1 , r2 ) in region A, must be nonempty but can be a single point in region B, and must contain a set of positive thickness in region C. if the original sets are thick enough, then by throwing out the relatively isolated parts of their intersection we can obtain a set of positive thickness in the intersection. To define the set C of thickness pairs (r 1 , 72 ) for which a Cantor set of intersection can be guaranteed, we make use of the functions T2 + 3r + 1 r2 fjr) (27 + T3 g(T)= Let C be the set of pairs (71, r- 72) 1)2 for which one of the following sets of conditions holds: - 2, f(r2), and r2 > g(rl) (1.1) r2 > f(r 1 ), and 'r > g(r2) (1.2) 71 > or r 2 >_r,- (see Figure 3.) Our main result is as follows. 4 Theorem 1 There is a function (r, -r2) which is positive in region C such that for all interleaved compact sets S1,$ 2 C R with T(S1 ) >!ri and r(S2) >_r2 , there is a set S C SfnS2 with thickness at least p(r, r 2 ). Notice that a compact set with positive thickness can have no isolated points, and thus must either be a Cantor set or contain an interval; either way it contains a Cantor set. We remark that (ri, r2) is in C if both thicknesses are greater than V2 + 1. This is the critical value Williams found for the case of interleaved Cantor sets with the same thickness. Also, no matter how small one thickness is, the other thickness can be chosen large enough so that the pair lies in C. Our results and the results of Newhouse are summarized in Figure 3. In Section 2 we give a proof of Newhouse's result, which will illustrate some of the methods to be used later. Then we present for all pairs (r,,r2) not in C an example of interleaved compact sets with thicknesses 71 and 72 whose intersection is a single point (except when (r-, r2 ) is on the boundary of C, in which case our example gives a countable intersection.) This example shows that Theorgm 1 is sharp in that its conclusion cannot hold for any larger set of thickness pairs (r1 , r2). In Section 3 we prove Theorem 1, and in Section 4 we discuss some further properties of S1 n S2 . The positive thickness set S E S n S2 constructed in Section 3 need not be dense in S, n 52 ; however we find that there are subsets with thickness at least p(rl, r7 2 ) arbitrarily near any accumulation point of S, n S 2. In additioa, we find bounds on the diameter of S which allow us to obtain thickness conditions that imply thac the intersection of three Cantor sets is nonempty. 2 Preliminaries Let us define precisely the concepts and notation we will use. Definition 1 We say two sets S1, S2 C R are interleaved if each set intersects the interior of the convex hull of the other set (that is, neither set is contained in the closure of a gap of the other set.) We define the distance between two nonempty sets S1, 32 to be d(Si,S 2) = inf{Ix - yjx E Si, y ES 2}, and write S2 - S1 for the intersection of 52 with the complement of S1. We say that a set S is a chunk of a set S,, and write S1 0( S2 ,if S1 is the intersection of a closed interval with S2, is a proper subset of S 2 , and d(S 1 , S 2 - S1) > 0. Notice that a closed set S has a chunk if and only if it is not connected. We denote the diameter of a ret S (the length of its convex hull) by IS1. Lk Gk Rk L, G. R. Figure 4: Chunks and gaps of a Cantor set (k < n) Definition 2 Given a compact set S C IR, we define the thickness of S to be 'r(S) = inf IPI P=S d(P,S - P) (2.1) prozided S has a chunk. Otherwise, we let r(S) = 0 if S is empty or consists of a single point, and r(S) = oo if S is an interval with positive length. The following simple proposition demonstrates that Definition 2 agrees with Williams' definition of thickness for Cantor sets [10]. Proposition 2 Let S be a Cantor set, and define the ratios p,, as in the introduction. Then the quantity r(S) given by (2.1) is equal to the infimum of p,n over all n. Proof The intervals L,, and R, defined in the introduction are the convex hulls of chunks A, = L, nl S and B,, = Pn n S of S. Since the gap Gn is not larger than any previously removed gap Gk, k < n, it follows that d(A,,,S - A) = d(B,,,S - B,) = IG,, (see Figure 4.) Thus for all n, m i'L I IR,,I ( IAI IBnI (S). = i-,['IG,,I, =mI \d(A,-S--A)' d(B,,S-Bn) ) Next, if P is a chunk of S, it must be bordered on each side by a gap of S; let Gn be the smaller of these two gaps. Then IG,,I = d(P,S - P) and IPI > min(IL,,, IR,I). Therefore r(S) inf IPI ?osd(, S - P) - nn which completes the proof I We now prove Newhouse's result in a way that will motivate our later examples and methods. Proposition 3 If S1 and S2 are interleaved compact sets with r(S1).r(S 2) _ 1, then S, n S 2 ts not empty. 6 P3 P1 PO 1 S2-H P2 XO 3 X2 P I-H X4 Figure 5: The points x,, and chunks P,, Proof Let S1 and S2 be as above, and let xo=max infx, xES2 infx) \XESI the greater of the leftmost points of S and S2 . Assume without loss of generality that xo E S2 . We will show that S1 n S2 is nonempty by looking for the leftmost point of this set. Let x, be the leftmost point of S which is at least as great as x0 . Since S and S2 are interleaved, x, must exist (otherwise S would lie entirely to the left of S2; see Figure 5.) Next, let X2 be the leftmost point of S 2 greater than or equal to xj. Once again the interleav:ng assumption implies that X2 exists, for otherwise S2 would lie inside a gap of S. We similarly define X3, X4, ... ; if each of these points can be shown to exist, we claim to be done. Then {x,} will be a nondecreasing sequence which is bounded above (since 51 and S 2 are bouided), so it approaches a limit. This limit must belong to both S and S2 since these sets are closed and the odd numbered terms of {,} belong to S1, the even ones to S. If at any step x,, exists and equals X,-1, then Xn+l,Xn+2, .. .will also equal X,,_, and we will have found a point in S1 nl S2. Henceforth we assume £o < x, < ... as long as they are defined. We know at least that x0, £1, and X2 exist, so there is a chunk PO of S2 which lies in [xo, xj), whose diameter is thus less than x, - xo, and whose distance from the rest of S 2 is greater than £2 - X1 (see again Figure 5.) Then X1X X2 - > x1 > (S2). (2.2) d(Po, S2 - Po) Let P be the largest chunk of S, which lies in [X1, X2). If X3 did not exist, in other words if all points in S1 were less than X2, then S - P1 would lie to the left of P1 , and the distance between these sets would be greater than x, - xo. But then using (2.2) and r(S).T(S2) > 1 we woul t. have d(P,S, - P) x, - o (S2 ) contradicting the definition of the thickness of S1. Thus X3 exists, and similarly to (2.2) we obtain > X2-X> X3 - X2 (S). (2.3) d(P 1 , S1 - P ) Likewise (2.3) car be used to show the existence of by induction. I 7 £4, and so forth. The proof is completed One could similarly find the rightmost point in $n S 2 , but as Williams observed it may coincide with the leftmost point, even if both thicknesses are significantly greater than 1. We next present an example which will give a single point of intersection for thickness pairs ('r-, r2 ) not in the closure .f region C, and a countable intersection for (ri, r2 ) on the boundary of C. In our example both sets are countable unions of closed intervals, but they could be replaced by Cantor sets with the same thicknesses by constructing a very thick Cantor set in each of the closed intervals. Let r be a positive constant, and define the intervals A0 = [r2 +3r+ 1,(2T + 1)21, Bo = [r,r2 + 3r + 1], An = 2 Bn = 2r +1 r +Y Ao, Bo, where multiplication of a set by a scalar means the set obtained by multiplying each element of the original set by the given scalar. Let S,= (0A) U {, B(C U{O . S2 = Notice that Bn is the closure of the interval between An and An+ 2 for all n, and An is the closure of the interval between B.-2 and Bn for n > 2. Thus S, n 82 is countable, containing only the point 0 and endpoints of the intervals An and Bn. Furthermore, the intersection could be reduced to only the point 0 by shrinking the intervals which make up one of the sets by a factor arbitrarily close to one. Let us compute the thicknesses of the sets S and 82. Observe that IA, I d(B-._. 2 ,B,) = (2Ti- IBnI d(Am,An+ 2 ) = (2r ) i r(3r + 1), ) (37+ 1). The intervals An are ordered from left to right A 1 , A 3, As,.. , A 4, A2 , Ao, so any chunk P of S which does not contain 0 must be a finite union of consecutive even or odd numbered A,. Let An be the interval in P with the largest index; then IA,,I 1PI d(P,S, - P) - d(A,,,An,+2 ) 8 with equality holding when P = A,. On the other hand, if a chunk P of S 1 contains zero, let n be the larger index of the leftmost and rightmost Ak in P. Then P must contain A,,- 1, and since P is not all of S, n > 2, so (r/(2r + 1))n-l(3r + 1)(2r + 1) IPI > IA, U A.-I.I (r/(2r + 1))n- 2 (3r + 1) d(P, S1 - P) - d(An,A n- 2 ) Therefore the thickness of S is T. Similarly, if P is a chunk of S 2 , then for an appropriately chosen B,, either _ (2r+ 1)2 IBI IPI g(r). 73 d(B,, Bn+2 ) d(P,S2 - P) or - PI > d(P, S 2 - P) - Bn UB,- II d(B.,Bn_2 ) (r/(2-r + 1))n-1((3-r + 1)/(27 + 1))(72- + 3-r + 1) (7r/(27r + 1))n 72 2 ,(37 + 1) + 3-r + 1 2 7 ff(r). Thus r(S 2 ) = min(f(,r),g(-r)). As we pointed out before, by reducing the thickness of S2 by an arbitrarily small amount we can shrink the intersection of S, and S2 to a single point. Let 7r1 denote the thickness of the set S1 , and let r2 be the thickness of S2 . Then up to a change of indices, the above construction demonstrates that a single point of intersection can be obtained when either 71 < min(f(T2 ),g(r 2 )) (2.4) r2 < min(f(71 ),g(7.1 )). (2.5) or Also, if either (2.4) or (2.5) is an equality instead, the intersection can be countable. (Kraft [2] has analyzed this borderline case and determined when the intersection can be finite.) Therefore we can only hope to guarantee an uncountable intersection if 7ri > min(f(T2 ),g(-r2 )) (2.6) -r2 > min(f(7r1 ),g(Tij)). (2.7) and One may check that g(7) > f(7) > v' + 1 for r < v + 1 and g(7r) < f(r) < v2 + 1 for r2, and to r > vf2 + 1. Therefore (2.6) and (2.7) are equivalent to (1.1) in the case r" 1 >__ (1.2) when r2 > r.. 9 Case 1 s .<------------------.. 1. = Io S2:77- Case 2 J< SI:: <--...... -- ... S:<----1J*J . .......... -----> 11 Figure 6: Cases in the construction of I. and J. 3 Proof of Main Result We now prove Theorem 1 by constructing a set S with positive thickness in S1 n S 2. Proof of Theorem 1 Let S, and S 2 be interleaved compact sets with r(SI) >_-rand r(S 2 ) >_r2 for some (rl, -r2) in region C of Figure 3. Let the gaps of S be lo, I1, 12, ... , with Io and I unbounded, lo to the left of I,,and !1f2 ? 1131 >_ "'.For S2 we define JoJ 1J/ , 2, ... similarly. We refer to the intervals I, and J,, collectively as the "original gaps". Our goal is to construct the complement of S as a union of disjoint open intervals Ko, Yi, K 2 , ... with Ko and K, unbounded, and with every original gap contained in some K, (whence S C SnfS 2.) To get a lower bound on the thickness of S,observe that every chunk P of S is bordered on each side by a gap of S,with at least one of the bordering gaps being bounded. Pick a chunk P, and say P is bordered by Km and K, with m > n and m > 2. Then IPI d(P,S- P) = d(K, I) min(lI,,I 1'.I) - d(K, ,') IIimI The theorem will therefore be proven when we show for some t(rl, r2) > 0 that whenever m>n andm >2, d(KmKn)> ( 2) (3.1) We begin by finding a pair of original gaps I. and J.between which S will lie; that is, I. and J.will be contained in Ko and K 1.The properties we desire of I. and J.are that they are a positive distance apart, that all gaps of S with an endpoint between the closures of I. and J. are bounded and no larger than I., and likewise (in comparison to J.) for gaps of S2 between I. and J.. We will show later that once I. and J. have been determined, the diameter of S can be bounded below by a constant depending on r, and r 2 times the distance between I. and J.. Assume without loss of generality that Jo C lo. If I C J, 1 (Case 1 of Figure 6), then 1. = Io and J. = J1 have the above properties; they must be separated by a positive distance 10 1: J----> F---I S2: H------H -------I S: < ....................................... gfo Figure 7: The construction of Ko I II since S and $2 are interleaved. If J, C 1 (Case 2 of Figure 6), let J. be the largest gap of S2 with an endpoint between Io and 1, and let I. be whichever of Io and 1 is farthest from J.. At least one of Io and 1 must be a positive distance from J. since S and $2 are interleaved. Next, let t be a positive constant whose precise value will be chosen later; for now we assume that t < (rir2 - 1)/(r, +r 2 +2) < min(rl, r2). Assume without loss of generality that I. lies to the left of J.. We begin constructing K0 by requiring that it contain I.. We then require that K0 contain the rightmost bounded J, with d(I., J,) < tIJ, I (we will verify that there is a rightmost gap satisfying this condition when we later examine our construction in more detail.) If there does not exist such a J, that is not already contained il I., we stop the construction and let "0 = I.. Otherwise, we further require that No contain the rightmost bounded I, that is within t times its length of the previously added J,,. Again, if this requirement does not extend Ko any farther rightward, we stop the construction. If' not, we then add to Ko the rightmost J which is within t times its length of Im and is at most as large as J, (see Figure 7.) If a next step is necessary, we consider gaps of S1 which are no larger than 1,, and so forth. We may have to continue this process infinitely often, but if so we must converge to a right endpoint for K0 , sinc3 there is no w,.y this construction can extend past the rightmost point in Sl U 52. We define K, similarly, starting with the requirement that K contain J. and extending K1 to the 1cft if necessary in the same way we constructed Ko. Next, to construct K 2 we first require that it contain the largest original gap (choose any one in case of a tie) not contained in K0 U K1 (if no such gap exists, we leave K2 undefined and let S be the complement of Ko U K.) Then we extend it it on both the left and right in the same manner as before, but considering only gaps that are at most as large as the one we started with, to obtain the endpoints of K2. \Ve next start with the largest original gap not contained in Ko U K1 U K 2 , proceeding similarly to define K3, and so forth. Any given original gap must eventually be contained in some K, because there can be only finitely many original gaps that are as large or larger than the given one. We do not yet know that the KN are disjoint from each other; this .will follow when we prove (3.1), though. Let us now examine our construction more closely. Define 1(1) and r(I) to be respectively !1 the left and right endpoints of an interval I. For a given K,, let Go be the gap we started with in its construction, which for n > 2 must be the largest original gap it contains (or at least tied for the largest.) For simplicity we assume here that Go is a gap of S1 . Consider the collection E of all Jn with IJI - IGol, r(Jn) > r(G,), and d(Go,J) < tlJ,,I. We claim that the members of E (if any) are increasing in size from left to right. If J, J" E E with Jm to the left of J,, then since r(J.) < r(J,,), it follows that d(J, Jn) < d(Go,.J) < tlJi.. Since t < r 2 and d(J,,,, J) > r 2 min(IJm[, IJI), it must then be the case that IJI > IJ l Thus if E is not empty, it must have a rightmost member, which we call G, (notice that G, is also the largest member of E.) If E is empty, we let G1 be empty, but in order to facilitate future formalism, we define IG,I = 0 and r(GI) = r(Go). One must keep in mind this degenerate case in verifying the assertions and formulas that follow. We likewise define G 2 to be tht rightmost gap of $ which is at most as large as Go and lies within t times its length of G1 ; again if no such gap exists with r(G2 ) > r(GI) we say that IG21 = 0 and r(G 2 ) = r(GI). Next, to define G3 we consider only gaps which are at most as large as G 1,for G 4 we look only at gaps no larger than G 2 ,and so forth. Define G-1 ,G- 2, ... similarly to be the leftmost (and largest) gaps added to K, at each stage of the process of extending If,, leftward. Then we may think of the open interval K, as being defined by l(I,,) = lim l(Gm), r(K,,) = lim r(Gm). Each limit exists becausc; it is the limit of a bounded monotonic seq.aence. In the above construction, the even-numbered G. are gaps of S, and the odd-numbered ones are gaps of S2, but if Go had been a gap of S2 it would be the other way around. In any case. Go is the largest even-numbered G and either G 1 or G- 1 is the largest oddnumbered one. Also, the even-numbered G,, decrease monotonically in size as one moves either rightward or leftward from the largest, and the same statement holds for the oddnumbered Gm. We call a given Gm either a "1-gap" or "2-gap" of K,, according to whether it is a gap of S1 or S2 . Notice that not all original gaps contained in K,, are 1-gaps or 2-gaps, only those that have been given a label G in the construction of K,,. When we refer henceforth to left-to-right ordering or adjacency among the 1-gaps and 2-gaps of a given K,,, it is with respect to the ordering ... , G- 2 ,G-1,Go,G 1,G 2,. (Thus, for instance, 1-gaps can only be adjacent to 2-gaps and vice-versa.) The following lemma will be used in bounding both the numerator and denominator of the left side of (3.1). It establishes for all m > 0 a bound on how far K,, can extend to the right of Gm in terms of how far Gm+, extends past G, and similarly for m < 0 on the left. 12 I Lemma 4 Assume t < (-r, 1)/(TI + 2- I T-2 + 2). Let 0 1 (T T - t)(r 2 +1) 2 (Ti -t)(T 2 - t) - (1+ t) = and ( = - t)( + 1) t)( _t)_(I+ t)2 Let C be a 1-gap of K, which is at least as large as all 1-gaps of K" to its right. Let H be the next 2-gap of K,. to the right of G. Then r(K,.) - r(G) _ u2(r(H)- r(G)). l* The same statement with "1" and "2" interchanged holds, as do the corresponding results for left endpoints. Proof Let I be the next 1-gap of K,. to the right of G. Then since IlIII III <-IGI, < d(G. I) <_d(H, I) + r(H) - r(G) _ till + r(H) - r(G). which, because t < T1 , implies that III is bounded above by (r(H) -r(G))/(Ti - t). Hence 1 r(I) - r(H) (1 + t) 111 11 + d(H,I) _< 1+ t (r(H) (t - r(G)). Likewise the next rightward 2-gap of g, extends at most ((1 + t)/(r beyond I, and by induction I 2 (3.2) - t))(r(I) - r(H)) r(K,.) - r(G) = r(H) - r(G) + r(I) - r(H) + ... +-... ) (r(H) - r(G)) < 1+ -- +_±t_ I= u 2 (r(H) - r(G)). The geometric series converges, and the denominator of I 02 is positive, because of our as- sumption that t < (7rr 2 - 1)/(r1 + r 2 + 2). I The next lemma builds on Lemma 4 to obtain a positive lower bound on the distance between a given Km and K,,, provided we can find a 1-gap of Km and a 2-gap of K which are respectively larger than all 1-gaps and 2-gaps between them. The proof is difficult and will be handled later. Lemma 5 There exists a function 0,(r 1 , 72) that is positive whenever (r1 , 72) is in region C and t is sufficiently small, and for which the following statement holds. For m 0 n, let G be a 1-gap of K, and H be a 2-gap of K,,. If all 1-gaps of K,, or K, with at least one endpoint between the closures of G and H are bounded and at most as large as G, and all similarly situated 2-gaps are bounded and at most as large as H, then d(K,,Km) v/t,(r, 72)d(G,H). 13 Case I Case2 Case 3{ S2: = HI- SI: H S --i G S:H I --..-... -H lqH ---iH H ----------.. H J H F---------=_ i Figure 8: Cses in the proof of (3.1) Recall that to construct Ko and K1 , we chose I. and J. to satisfy the above hypotheses. Thus we now know that K0 and I1 are disjoint and separated by a positive dista,,ce (which is at least Ot(ri, -. ) times the distance between I. and J..) Now suppose 0 < n < m and m > 2; we will prove (3.1) by finding a G and H which satisfy the hypotheses of Lemma 5. Assume without loss of generality that K, lies to the left of K,,. Let I be the largest original gap in KIn; say I is a 1-gap. If all 1-gaps of K are smaller than I (Case 1 of Figure 8), let H be the largest original gap in K,,. Since m > n, K,, was constructed before K,, so H must be at least as large as I, and thus is a 2-gap. Let G = I; then G and H satisfy the hypotheses of Lemma 5. Also, d(G, H) > tIG1, since otherwise G would have been included in the construction of K,.. If on the other hand there are 1-gaps of K,, which are at least as large as I (Cases 2 and 3 of Figure 8), let J be the closest such gap to I. Consider all 2-gaps of Km or K,, to the left of J; let K be the largest such 2-gap (any one will do in case of a tie.) Notice that K must be adjacent to I or J. If K is in K,, (Case 2), let G = I and H = K; then G and H satisfy the hypotheses of Lemma 5, and d(G, H) > tIGJ because G was not included in K.. Otherwise (Case 3), let G = K and H = J, and reverse the indices "1" and "2". Once again, G and H satisfy the hypotheses of Lemma 5 and d(G, H) > tIGI. Notice also that in all cases, G is the largest 1-gap of K", and H is at least as large as all 2-gaps of Kn. We now estimate how large K, can be. Let I and J be the 2-gaps of K, adjacent to G on its left and right, respectively. Since I is at most as large as H, T211 <_d(I, H) :_d(I, G) + IGI + d(G,H) _ tIlI + IGI + d(G,H), 14 or in other words III < - 1 -- (IGI + d(G, H)). (3.3) t The same bound holds also for J, so by Lemma 4, SIK.I = IGI + l(G) - (1g.) + r(Km) - r(G) < IGI + o2(l(G) - l(I)) + a 2(r(J) - r(G)) IGI + a2(1 + t)(1I1 + IJI) 1+t <_IGI+ 2U(-r t)(IGI + d(G, 1I)) (+ 2 02 1+t (1 +1+d(G,H) (,r2 - t ) t t 1 2+ 1)d(GH)3 (r,_t)(r2 - t) + (1 + t)(1 2 (2r + t) ) t((rl - t)(r 2 - t) - If on the other hand G is a 2-gap and H is a 1-gap, we 'btain the same bound as (3.4), but with the indices "l" and "2" interchanged. Then in either case, IKm I_< (rI - t)(r 2 - t) + (1 + t) 2 (2 max(ri, r2 ) + 1) d(G, t((T1 - t)( 2 - t) H). + t)2 ) -: Finally, by Lemma 5, d(Km,K) IKI ( (1 + t) 2 )t(r,r)2 ) t((r - t)(r 2 - t) - t)(r 2 - t) + (1 + t) 2 (2max(rl,r2) + 1) (3.5) The right side of (3.5) is positive as long as t is between 0 and (7r12 - 1)/(r + r2 + 2), and 72) > 0, and goes to zero when t approaches any of these borderline values. Therefore the right side of (3.5) attains a maximum value, call it ;(71, 72), at some allowable value of t, say t.. We thus carry out the construction of S with t = t.; then (3.1) holds, and the 3)t(r, proof is complete. I Let (71,r2) = t. (r1 ,r 2 ); then t.((r t.)( t.) (I + t.)2)0(rl, r2) 1 2 (r - t.)('r 2 - t.) + (1 + t.) 2 (2 max(ri, r) + 1) - - - Remark We will see in the proof of Lemma 5 that 0'(r1 ,r 2 ), and hence ;(-r,72), must be very small when (7l,r2) is near the boundary of region C. flowever, if both T1 and r2 are large and t is small compared with the two thicknesse5, it is not hard to check that O.t(rl,r2) is close to one. Then if 71,72 > 1, one finds that t. is of order N/min(Tl, r), whence o(ri, r 2 ) is of order V/min(ri, 72) also. Thus when the thicknesses of S, and S2 are large, the lower bound we obtain on the thickness of S is reasonably large. 15 --- I S2-:.- HH [----- Ho 2 Figure 9: The gaps G'i and Hi We now prove our main technical lemma. Proof of Lemma 5 Let G be a 1-gap of Km and H be a 2-gap of I, satisfying the hypothesis. We assume without loss of generality that rl _ r2 ; then by (1.1) the condition (r1 ,r 2 ) E C implies + 1 2 2+ 2 1 (3.6) and 72 > g(r 1 ) = (2ri + (3.7) 1)2 If d(G, H) = 0, the inequality to be proven is trivial. Otherwise, let us normalize d(G, H) to be one, and assume G lies to the left of H. Let Go = G and H0 = H. Let G1 be the 1-gap of K, adjacent to Ho on its left, and let H1 be the 2-gap of Km adjacent t, Go on its right. Let G2 be the adjacent 1-gap of Km rightward from H1 , and likewise define H2, G 3 ,1I3 , .. (see Figure 9.) For i > 0 let Xi - l(Gi+1 ) { I(Hi) i even ={ r(H'+) - r(G,) i even i odd. r(Gi+) - r(Hi) i odd and il(Gi) - l(Hi+1 ) Let Ri = d(Gi, H,); then R0 = 1 and R,+1 = max(R, - xi - yi, 0) for i > 0. Let R, be the limit as i goes to infinity of Ri. Then d(K,,,K,) = R,, so we wish to show that there is a positive lower bound on R, which depends only on r1, r7 2, and t. In the same way as we obtained (3.2) it follows that for all i, and a dy y Xi+i < 1-+l-tt i (3.8) i+1< 1 + Iti (3.9) T2- t Furthermore, by Lemma 4 we have that Yi + Xi+1 + i+2+" 16 O2Yi and Xi + Yi+i + Xi+2 +... crlX. Thus, for each i, 3R. 3 R xi-,yi - x+ 1 -Yi+i - - > Ri - oixi- 2yi. (3.10) We will show that for some i, the right side of (3.10) is positive. Next let us obtain upper bounds on xo and Yo. We know that Xo = l(Ho) - l(Gi) < jGII + d(G 1 , Ho) :_ (1 + t)1G 11, (3.11) and by hypothesis 1G11 _ IGoI, so xo = I(Ho) - r(Go) - (l(Ga) - r(Go))= 1 - d(Go, G) <_ 1 - ril GI. (3.12) Eliminating 1G1I from these inequalities yields Xo < 1+ t (3.13) Yo : 1+ t (3.14) Similarly, We can obtain similar bounds on x; and y, for i > 1, but the bounds are complicated by the fact that we do not know in general that 1G 1+j1 _<GI (or 114+ 1 _ IH I). The analogues of (3.11) and (3.12) are thus x,. < (1+ t)lG,+l I and xi _ Ri - ri min(IGil, IGj+j 1). If IjG+ 1I < (3.15) eGvi, then as in (3.13) it follows that 3i < 'r, 1++1t+ t Ri. (3.16) If IG,+jI > lGjj, then by (3.15), ?-TIGl ij xi < Rix - -rjlGd R- y l7 i.(3.17) (3.17) If (3.16) fails, then using (3.17) together with the negation of (3.16), one finds that x,- 1 is bounded above by the right side of (3.16). Thus regardless of the relative lengths of G, and Gi+i", min(xi, xi-1) _< 1 + t Ri. 17 (3.18) for i > 1. Likewise, regardless of the relative lengths of Hi and Hi+,, we have for all i > 1 that 1 + t R.(3.19) min(yi, yi-1.) < 7 2 + 1+t Let ai = xi/Ri and bi = yi/Ri provided Ri > 0; then Ri1= inax(1 ai - bi, 0)Ri. - Thus ai+l and bi+l are defined as long as 1 - ai- bi > 0. For j = 1, 2 let (3.20) 1+= Ai 1~+1t' Yj-7j- t The conditions (3.13), (3.14), (3.18), and (3.19) can then be written a0 < A,, bo :5 A2, and(39)beom Alsomi (3.8 codtin 1~+-1 - ab - and 1 1 Weso observeon that8) and 3. - oa2b- 0,(323 aedfie9tlas)sln because a ,<A thn T1 T - 1 r 1 a(-biA (1 + 1)( 0, -aa 2 + +3.1) -A 2 -)(+) r+ t +1)( > 0 t) 2 + t+1) adb 2 (since t < (rlr 2 - 1)/(r 1 + -r2 + 2).) Also, as long as a, < A1 , by (3.22) we have Yz2AI 1-A, - bi* Let Sh(b) = 1- L2 A1 - b The equation h(b) = b has two solutions, -4P2A1 _A (1- 1 - A, 2 and if the roots are real, then h(b) A2 and (3.24) 1- auAl - a2b-> 0. (3,25) Let us delay the verification of this claim until the end of the proof. Choose b. > b... with 1 - a1 ,1 - o 2b. > 0. Now b0 _<A2 < b+, and as long as ai _< . continues to hold, bi+, :_h(bi) < bi for bi E (b., b+). Then eventually bi _<b., and furthermore since b - h(b) must have a positive minimum value on [b., A2] (if b.> A2 then bo < b. already) there is a maximum number N (depending only on rl, r 2 , and t) of iterations it can take before bi :5 b.. We therefore have shown that if ai < A1 for i < N, then bi < b. for some i < N, and hence (3.26) 1- alai - a 2 bi > !-iA - a2 b. > 0. If on the other hand a,+, > A1 for some i < V, then let i be the smallest index for which this occurs. We claim that then (3.23) holds for i. By the results of the previous paragraph. bi < bi-, < ... < bo _<A2. Also, by (3.21), ai _<Al(1 - ai - bi), or in other words ai :_i-1T1 1-bi). Then 1- 1a - a 2 bi >_1 - l bi. 0.2 Now when t = 0, C7 1 Al I1+A, (71 + 1)r2 r-7 2 -1I (T Tr(7 2 +10 +2)('r1T2-l1) rl(rlr 2 - 1) + 2r-r2 + 2r2 (,r 1 + 2)(7-1r 2 > 0, 19 - 1) and thus for t sufficiently small it remains positive. Then since bi -,la, - AIx ( rA A Tb I_ 012ba I'bT iT (.-, 1# 1 + A,) A 2)L7 a22 When t A2 , (3.27) l 0, by (3.6) i- o 2A 2 = ( 1+1)7 + 2 (r2 + 1)(ri 7 - 1) 7 1 122 - 2r 2 - 1 (r 2 + )(r1 >2 2- (r 2 .)(rT T2 - 7yrT while alAl (A2)i- + r2 = - 1) + 3r 2 + 1 - 2r2 - 1 2 r 1 (r2 2 1)- 1 + 1)_rr2 TA, (r 1 + 2)(rT, 2 -r -) 71 + 2 rr - 1 so .'.e right side of (3.27) is positive for t = 0. It therefore remains positive for t sufficient: small. To summarize, we have shown that if t is sufficiently small, then for some i < N, either (3.26) or (3.27) holds. The right side of each of these equations is positive and depends only on rl,'r2 , and t. Furthermore, ai < AI and by _<A2 for j _<i, so by (3.20), Rj+1 > (1 - Al - A2 )tj, and hence Ri _ (1 - \ 1 - A2 ) . Then by (3.10), Roo > Ri(1 - olai - a bi) >_(1 - A1 - A2 )(1 where 1 -ala, - a2bi), - a2b, is in turn bounded below by the lesser of the right sides of (3.26) and (3.27). We have therefore shown for t sufficiently small how to obtain a positive lower bound on RO, which depends only on rl, r2 , and t; we let 4t(r., r2 ) be this lower bound. It remains for us to verify (3.24) and (3.25). We again show they are true for t = 0, whence they hold for t sufficiently small by continuity. When t = 0, , 1 /(T, 1 + 1) ± Iri/(r '-,2 ± r1 2 + 1)2 - 4/((T 2 b± 2 2 - 4(r, + )7-2 2(rl + 1)r2 Now by (3.6), 712722 - 4(rl + 1)r2 = 71(7722 - 20 4r2 ) - 4r2 + 1)-r2 ) (3.28) > -+I) - 472 l(22r2 + 1)(r22 - (r22 +37-2 = 3 (722 72 - - 2+ 1) - 4r23 (3.29) 1)2 722 Thus bd. are real md distinct (and the same must then hold for t sufficiently small.) Next by (3.28), 2 + 722 - 4T25(1/'r + 1/7i2) 22(I + 1/ri) from which we see that b+ is increasing as a function of ri. Thus b+is greater than the value it would take on if (3.6) were an equality, which owing to (3.29) means (7.22 + 372 + 1)/72 + 22 -2 r2 11/r2 2+ 2(2732 + 3 22 + 1)/r2 U> 722+32+1-(721 - ) 1 2(,2 + 1)(272 + 1) ,r2 + I A2.0 - Hence (3.24) holds for t = 0, and consequently for t sufficiently small. When t = 0, (3.25) can be written b- < 1)2,r2(71 -+ 2r 01 Or 2,r2 (3.30) 1 The right side of (3.30) is an increasing function of 72, and since b. - V1 4714(+1)/r2 2(71 + 1) b- is a decreasing function of 2. Then by (3.7), b ~ b7-1 - Vri2 - 47i,3 (Til + 1)/(2r, + 1)2 2(r, + 1) Ti((2ri + 1) - V(27ri + 1)2 - 4(r,2 + 7r) 2(71 + 1)(2r, + 1) (1r + 1)(2( + I m white w2h72 - 2r - 1 (-r, + 1)27- i) + (2 " + 1)2//1 -(2r1 (T-1 + 21 + 1) 1)2(2--1 + 1)2/713 +1(ri + 1)2 (2,r + 1) 2 7T (+ )(2r, + 1) Thus (3.25) holds for t = 0, and for t sufficiently small. The proof of Lemma 5 is now complete. 4 I Intersecting Three or More Cantor Sets In proving Theorem 1, we chose a subset S of S1 nS 2 in order to guarantee positive thickness. In this section we demonstrate that positive thickness sets are in some sense generic in S, nS 2 . We also explain how Theorem 1 is useful in finding conditions under which three or more Cantor sets must have a nonempty intersection. The set S we constructed in Section 3 need not be dense in S fn S2 nor even in the non-isolated points of S n S2. However, there are subsets of S, n S 2 with thickness at least v('i, r2) near i.ay .ccumulation point. To see this, let {q,} be a sequence of distinct points in S, n S2 which converge to a point q. It is not hard to s: aw that within any neighborhood N of q there are compact subsets T, C S, and T2 C 52, each of which contains all but finitely many q,, with r(T) 2! r(SI) and r(T 2 ) ? -r(S2). Notice that any two compact sets which intersect in three or moe points must be interleaved. Thus T, and T2 are interleaved, and by Theorem 1 their intersection contains a set with thickness at least w(ri, -r2). We conclude that arbitrarily near any non-isolated point of S, n S2 there are subsets of Si n S2 which have thickness at least p(r1 , 72). In addition to showing that there are many subsets of S, n S2 with positive thickness, it is possible to obtain a lower bound on the diameter of the positive thickness subset S of Si n S2. If the two sets Si and S 2 are interlea.,d in such a way that neither is contained in the convex hull of the other, then by the discussion following the statement of Lemma 5, the diameter of S is at least O(ri, 7r2) times the length of overlap between the convex hulls of S and S2. Since the thickness of S is at least V(,rl, r2), we immediately have the following result. Corollary 6 Let S, and S2 be two interleaved compact sets whose thicknesses (r1,r2) lie in region C and for which the intersection Q of their convex hulls contains neither S, nor S 2. If S3 is a compact set with largest bounded gap G such that (i) the hull of S3 contains Q, (ii) IGI < O(ri, 72)IQI (iii) T(S 3 )p(7 1 ,7 2 ) > 1 • • m 99 I then s1fn 2 fn s3 is nonempty. 3 3 3 1 3 35] 3 We note that if instead of condition (iii) we required the pair r(S 3) and V(, rj2) to lie in C, then S, n '2n Ss would contain a set of thickness at least v(r(S3),w(rl, r 2)). Thus one can inductively find thickness conditions guaranteeing the nonempty intersection of any finite (or even countably infinite) number of compact sets, although the analogue of the interleaving condition gets more complicated. sufficiently far from the boundary of region C, then as discussed in the remark If ('ri, r2 ) is preceding the proof of Lemma 5 it is not hard to obtain explicit lower bounds on w(ri,1 r 2 ) and V)(r(, r2). In particular, for rT and r2 large we found that V(r ,r 2) is at least of order /min(rw,r), and 01(ri, r2) is approximately one. We thank the referee for a thorough reading of our paper and many helpful comments. References [1] K. J. Falconer, The Geometry of Fractal Sets, (Cambridge University Press, 1985). [2] R. Kraft, "Intersections of thick Cantor sets," Mem. Amer. Math. Soc. (to appear). [3] J. M. Marstrand, "Some fundamental geometrical properties of plane sets of fractional dimensions," Proc. London Math. Soc. (3) 4 (1954), 257-302. [4] P. Mattila, "Hausdorff dimension and capacities of intersections of sets in n-space," Acta Math. 152 (1984), 77-105. S. Newhouse, "Nondensity of axiom A(a) on 14 (1970), 191-202. 2, Proc. A.M.S. Symp. in Pure Math. [6] S. Newhouse, "Diffeomorphisms with infinitely many sinks," Topology 13 (1974), 9-18. [7] S. Newhouse, "The abundance of wild hyperbolic sets and non-smooth stable sets for diffeomorphisms," Publ. Math. IHES 50 (1979), 101-151. [8] S. Newhouse, "Lectures on Dynamical Systems," Progress in Math. 8 (Boston: Birkhauser, 19S0), 1-114. 1 3 I Robinson, "Bifurcation to infinitely many sinks," Comm. Math. Phys. 90 (1983), [9] C. 433-459. [10] R. F. Williams, "How big is the intersection of two thick Cantor sets?", to appear in M. Brown, ed., Contemporary Mathematics, Proceedings of the 1989 Joint Summer Research Conference on Continua and Dynamics. 23 N I I U BORDER-COLLISION BIFURCATIONS INCLUDING I FOR ''PERIOD TWO TO PERIOD THREE' PIECEWISE SMOOTH SYSTEMS by Helena E. Nussealb James A. Yorkeac I 3 I December 1990 I I U Research in part supported by the Department of Energy (Scientific Computing Staff Office of Energy Research), and by DARPA/ONR. I a. University of Maryland, Institute for Physical Science and Technology, College Park, MD 20742, U.S.A. b. Rijksuniversiteit Groningen, F.E.W., Vakgroep Econometrie, 3 Postbus 800, NL-9700 AV Groningen, The Netherlands c. University of Marylanc., Department of Mathematics, College Park, MD 20742, U.S.A. ABSTRACT. We examine bifurcation phenomena for maps that are piecewise smooth and depend continuously on a parameter p. In the simplest case there is a surface F in phase space along which the map has no derivative (or has two one-sided derivatives). r is the border of two regions in which the map is smooth. As the parameter p is varied, a fixed point E may collide with the border F, and we may assume that this collision occurs at p = 0. A variety of bifurcations occur frequently in such situations, but never or almost never occur in smooth systems. In particular E may cross the border and so will exists for p < 0 and for p > 0 but may be a saddle in one case, say p < 0, and may be a repellor for p , Q, For p < 0 there can be a stable period two orbit which shrinks to the point E0 as p - 0, and for p > 0 there may be a stable period 3 orbit which similarly shrinks to E0 as M -+ 0. Hence one observes the following stable periodic orbits: a stable period 2 orbit collapses to a point and is reborn as a stable period 3 orbit. '.e also see analogously "stable period 2 to stable period p orhtr bifurcations", with p = 5, 11, 52, or period 2 to quasi-perlo,:u or even to a chaotic attractor. We believe this phenomenon ili: bp seen in many applications. 1. INTRODUCTION Certain bifurcation phenomena have been reported repeatedly in numerous studies of low dimensional dynamical systems, that depend on one parameter. The rather familiar bifurcation phenomena describing the evolution of attractors as a parameter is varied include the saddle node bifurcation, the period doubling (or halving) bifurcation, and the Hopf bifurcation. In the literature 1 dealing with bifurcation theory, it is frequently assumed that the map corresponding to the dynamical System is differentiable; see I for example [GH], [K], [RI, or (S]. To remind the reader so that we may draw contrasts, the well known bifurcation diagram of the quadratic map Q (x)= p- x is given in Figure 1 (1 < p < 1.5). All the computer assisted pictures were made by using the DYNAMICS program [Y]. FIGURE 1 We say a map is smooth if the map has a continuous derivative. A region is a closed, connected subset in phase space. We examine I continuous maps which are piecewise smooth. We restrict attention to those which are smooth on two regions of the plane with the border between these regions being a smooth curve. From now on we assume that there is a smooth curve r which separates the plane into two regions denoted by R A and R phase space R . We say, a map F from th- to itself is piecewise-smooth if (1) F is continuous, and (2) F is smooth on both the regions R\ and R.. Note that on the border F between the regions, the mappings musr is assumed to be continuous. A special case tnat be equal since F we shall refer to frequently is the following prototype example, piecewise linear map into which other generic piecewise linear maps in the plane can be transformed by changes in coordinates. Let u and w be vectors in the plane. Let x and y be the ph~tse space coordinates and p is a scalar parameter. Let P be the map defined by 3 P (x,y) xu + JxJw + (y + p)(l,O) and we investigate trajectories (Xn+lYn+I : P (xnYn) ' The regions R and R are the left and right half plane separated by 3 A B 2 t Figure 1. Bifurcation diagram of the quadratic map Q (x) - x. The parameter p (plotted horizontally) varies from 1 to 1.5, and x is plotted vertically, -1 x . 2. F, the Y-axis. To illustrate the "period two to period three" border-collision bifurcation phenomenon, consider tle one-parameter family of maps fP (-m < P < I ) from the plane to itself, defined by f (x,y) = (-l.4x + y, -O.1x) + M(1,0) if x & 0 (-3x + y, -4x) + p(1,0) if x z 0 Notice that the map is smooth in each of the half planes x - 0 and x ; 0, and the Y-axis is the border which is a smooth curve. Note that to write f in the form of P, let u = (-2.2,-2.05), and w (-0.8,-1.951. The bifurcation diagram exhibiting the 'period two to period three" bifurcation, is presented in Figure 2 (-0.1 < P < 0.2). All the bifurcation diagrams in this paper show a projection of the attractor, projecting (x,y) onto the X-axis, which is II plotted vertically; the horizontal coordinate is p. FIGURE 2 The purpose of this paper is to study the occirrence of suc.h a I 3 new bifurcation phenomenon for continuous, piecewise smooth maps. These systems include, for example, two-dimensional continuous, piecewise-linear maps. In (HNS] the dynamics of a simple economic model was studied, and a "period three to period two" bifurcation was observed numerically, and was established rigorously in (HNJ for a degenerate piecewise-linear situation. The "border-collision bifurcation" phenomena is a much richer class of bifurcation phenomena than just a "period two to period three" bifurcation and occur for generic piecewise smooth maps. We present phenomena that occurs when the nature of an unstable fixed point of a piecewise smooth map is changed while the fixed point collides with the 1 3 Figure 2. Bifurcation diagram exhibiting the "period two to period three" bifurcation of the map f P(x,y) = (-1.4x + y + p, -O.Ix) if x :50, and = (-3x + y + p, -4x) if x > 0. The parameter p (plotted horizontally) varies from -0.1 to 0.2, and the coordinate x is plotted vertically, -1 4 x -, 1. 3 border between two regions in which the map is smooth. Since the fixed point is unstable before and after collision, it is not 5 5 shown in the bifurcation diagram in Figure 2. While we consider maps in the plane, higher dimensional analogues exist. We know of no phenomena that can occur only in higher dimensional cases. There is also no difficulty in changing the notation to that there are more than 2 regions on which the map is smooth. We could also 3 allow f to depend on p, but coordinates could be chosen so that it remains fixed, so our case in practice includes moving boundaries. With moving boundaries the map would be piecewise smooth in p. We say, a fixed point E I I 3 is a border crossing fixed)Point if it crosses the border r between two regions in which the map is smooth. We will assume that the crossing occurs at p = 0. The fixed point E called a flip saddle if tne eigenvalues X and P of the is Jacobian matrix DF (E) if X < -1 < u < 1. Assume that there exists a one-parameter family of piecewise smooth maps and assume that there is a border crossing fixed point (or periodic point) I 3 3 Ep the case when E , we emphasize crosses the border r it changes from being a flip saddle to a repellor with complex eigenvalues. The above example has this behavior. In Section 2, we discuss why the border-collision bifurcation phenomenon occurs when the nature of an unstable equilibrium I I 3 3 changes when it crosses the border of two regions. To be somewhat more specific, assume that a border crossing fixed point (or periodic point) E of a one-parameter family of piecewise smooth maps changes from being a flip saddle to a repellor with complex eigenvalues when it crosses the border r. Then at p = 0, a bordercollision-bifurcation occurs at t 'is fixed point E 4 3 on the border. In Section 3, we mainly deal with two piecewise smooth systems of the plane, one piecewise linear and one piecewise nonlinear. The first system is the map P (derived in Section 2) that correspon-ls with a generic piecewise smooth nonlinear map, and the other system is based on the Henon map. For the piecewise linear map P we present several examples including "period 2 to period p" (p = 5, 11, and 52), "period 2 to quasi-periodic" and "period 2 to chaotic" bifurcation. We also present an example of a bordercollision bifurcation for the map P in which no attractors but chaotic saddles are involved. The system of the plane involving the Henon map at the left side and a linear map at the right side of the border, different border-collision bifurcations are observed. We present a variety of examples. Although we we do not have an exhaustive list of types of border-collision bifurcation of one-parameter families of maps under consideration, we point out that several other types of bifurcation occur. We believe this phenomenon will be seen in many applications. In Section 4 we prove that for certain one-parameter fami.l.-s of piecewise smooth maps exhibit a "period 2 to period 3" bortercollision bifurcation. This phenomenon persists under small perturbations of the involved maps. In Section 5, we discuss the state of the art, and pose several questions which remain unresolved. This paper does not give a complete theory, but can be considered as initiating a bifurcation theory of piecewise smooth maps. 5 2 THE BORDER-COLLISION BIFURCATION PHENOMENON In the bifurcation theory for maps, attention is focused on differentiable maps when one or more eigenvalues of a fixed point (or periodic point) cross the unit circle. When this occurs, the 3I nature of the fixed point changes. For example, a fixed point attractor becomes a saddle (possibly a flip saddle) or a repellor. For border crossing fixed points, the Jacobian matrix of the fixed point generally changes discontinuously, and the fixed point can for example change from being a repellor to a saddle as p crosses zero. Let F(.,P) F be a one-parameter family of piecewise smooth maps from the phase space R2 to itself, depending smoothly on the I parameter p, and where p varies in a certain interval on the real line. Let E 3 denote a fixed point of F defined on -c 0. For a general approach (which is given below) we need the concept of the "orbit index" of a periodic orbit [MYI. The orbit index is a number associated with a periodic orbit, arvi 'his number is useful in understanding the patterns of bifurcations the orbit undergoes. We say an orbit of period p is typical if its 3 Jacobian matrix exists (that is, the Jacobian matrix of the p-th iterate of the map at a point of the orbit) and neither +1 nor -l is an eigenvalue (of the Jacobian matrix). For typical orbits, the I orbit index is -1, 0, or +1. The orbit index is a bifurcation invariant in the sense that if one examines the periodic orbits 3 that collapse to the fixed point E as p -. 0, and adds the orbit indexes of the periodic orbits that exist just before a bifurcation, then that sum equals the corresponding sum just after I that bifurcation. Suppose a typical periodic orbit PO of a map F 1 6 has (minimum) period p. The orbit index of that orbit depends on the eigenvalues of the Jacobian matrix A of the map Fp at one of the points in PO. Now we Oefine the orbit index I of PO. Let m be the number of real eigenvalues of Ap smaller than -1, and let n be the number of real eigenvalues of A p greater than +1. The orbit index IPO of PO is defined by IPO = 0 if m is odd; IPO = -1 if m is even and n is odd; IPO = +1 if both m and n are even. If the orbit index = -1, then the orbit is called a regular saddle. The typical orbits with orbit index +1 in the plane are repellors and attractors and fixed points with non-real eigenvalues. The def'nition of orbit index is technical when a p;.int of the orbit lies on the boundary and so does not have a Jacobian matrix, and the definition is unnecessary since we consider orbits for p # 0. For a moment, assume that E R is in the interior of the region (or the region RB), and write X and u for the eigenvalues of DF (E). If neither of the two eigenvalues X and v is on the unit circle, then the fixed point E index 0) if X < -1 < v < 1; -1) if -1 < L I and juj > 1; and E both E is called a flip saddle (and has is an attractor (and has index +1) if IAI < 1. Note that E has orbit index +1 if the eigenvalues are not real. Hence, a typical fixed point is a flip saddle, a regular saddle, a repellor or an attractor. Similarly, the nature of periodic points is defined. Now we are able to provide a definition of the notion "border- 7 J collision bifurcation". Let the regions RA and RB, the map F the fixed point (periodic point) E and be as above. Assume there exists a number c > 0 such that (1) E0 iz cn the border of the two regions RA and RB, (2) for -E < p < 0 the fixed point E is in the region RA , and its index is IA , and (3) for 0 < p < e the fixed point E is in the region RB , and its index is B- If A and IB are different, then (as stated below) some bifurcation must occur at E0 , since the orbit index of E is changing from IA to IB 1 while the parameter p is increasing from -e to +c. We say a periodic orbit PO is an isolated border crossing orbit if (1) PO includes a point that is a border crossing fixed point under some iterate of the map, and (2) the orbit PO is isolated in phase space when p = 0, that is, in the plane there exist neighborhood U of the orbit PO such that PO is the only periodic orbit in U when p = 0. From the topological degree theory as described in (MY] (see also [AYYI for the two dimensional case), the following "Border-Collision Bifurcation" result follows after some minor modifications. BORDER-COLLISION BIFURCATION THEOREM. For each two-dimensional piecewise smooth map and depending smoothly on a parameter p, if the index of an isolated border crossing orbit changes as M crosses 0, then at p = 0 a bifurcation occurs at this point, a bifurcation involving at least one additional 'periodic orbit. This result says that additional fixed points or periodic points must bifurcate from E0 at g = 0. These bifurcating orbit6 need not to be stable. An example of the preservation of orbit 8 index occurs with a period doubling bifurcation. If for p < 0 there is an attracting fixed point (and no other entering orbits), the total index is +1. Then for p > 0 we can have a flip saddle (orbit index 0) and a period 2 attractor (orbit index +1). Hence, the sum of the orbit indices before and after P = 0 is + 1. Note that the two points of the period 2 orbit are collectively assigned +1, not individually, since that orbit has index +I. Since this bifurcation occurs while the fixed point (or periodic point) collides with the border of the regions RA and RB1 we call it a border-collision bifurcation. In other words, a bordercollision bifurcation is a bifurcation at a fixed point (or periodic point) on the border of two regions when the orbit index of the fixed point (or periodic point) Lefore the collision with the border is different from the orbit index of the fixed point after the collision. We derive the map P that was introduced in Section 1, froin nonlinear piecewise smooth maps. We assume coordinates are chosen so that the curve r is a straight line. Let z denote any vector in the plane, and write F (z) = F(z;u), and write z0 = E04 From the is piecewise smooth, we have that on each of the assumption F regions RA and RB F(z;p) = F(zo;0) + DZF(z 0 ,O)(z-z0 ) + D F(z0,O)p + H.O.T. where H.O.T. stands for Higher Order Terms. Hence, there exist matrices MA and MB and vectors vA and v B such that if z is in the region RA then F(z;p) = F(z0 ;0) + MA(z-z 0 ) + vA M + H.O.T. and if z is in the region RB then = F(z 0 ;0) + MB(Z-z 0 ) + VBP + H.O.T. F(z;i) Let e I be the unit vector tangent to F at z0 . The assumption F is piecewise smooth and depends smoothly on M implies MAeI = MBe 1 = e 2 and vA = VB = v. Choose coordinates so that z 0 = 0, so F(z0 ,0) = 0. Assume that e2 is independent of el, so we may use e l and e2 as basis vectors. We let e1 and e2 be the basis vectors of the plane. We assume that e2 is independent of v and that v is not parallel with e I-e2 We claim that by change of variables and by . rescaling M we may assume that v = e . Write v = (VxVy). We now assume that v * 0. We can make vy = 0 after a change of vaIls andv x variables, an y. rescaling of p. If v is not 0 then *e 1 =Iby can change variables, setting y = y - VyM (where x is unchanged), and the new vector v for the (x,y) system will have its second coordinate 0. By rescaling p, that is, by introducing p can change the system so that the new vector v is (1,0), , we may write MA . the parameter. Therefore, pv x we when o is [c and v = (1,0). Since all these assumptions are generic, we sa" tnh prototype piecewise linear form of F F(z;p) F(z;p) =+ U [ b 0 + p(1,0) for p small is defined ov if z is in the region RAs p(1,0) if z is in the region RB To write the prototype piecewise linear form of F II. of the map P , let u 2 b+d and w in the form a c b-d We observe the following fact. Assume that the fixed point E a flip saddle (orbit index 0) in region RA and a repellor with complex eigenvalues (orbit index +1) in region RB. If there ex.ists 10 s a stable periodic orbit with period 2 in RA that converges to E0 when u approaches 0, then the total degree in RA is +1. Hence, if there exists a stable periodic orbit in RB that converges tco E 0 when P goes to 0, then there must exist a regular saddle periodic orbit of the same period (orbit index -1) in RB that converges to E0 when p goes to 0, since the total orbit index is a bifurcation invariant. Consequently, for the family of maps f in the Section 1 exhibiting a "period two to period three" bifurcation in figure 2, there must also exist a regular saddle periodic orbit with period 3. PERIOD TWO TO PERIOD THREE BORDER-COLLISION BIFURCATION THEOREM. Let F be a one-parameter family of liecewise smooth ma-i which has a prototype piecewise linear form at M = 0, and assume that (1) a < -1, c < -1, d < -1; (2) c 2 + 4d < 0; and (3) 0 < a(ac + d) < 1. Then, there exists c > 0 such that if IbI < e, then the family F has a "period two to period three" border-collision bifurca~ton at (0,0). We point out that the border-collision bifurcations persist under small perturbations. The proof follows of the Theorem from the result obtained in Section 4. The geometrical proof given in Section 4, might give insight why other bifurcations (for example, period 5 to period 2 bifurcation) may occur in piecewise smooth systems. Presumably, the method of proof only works if one of the two maps involved has a small Jacobian. Hence, when the piecewise smooth map consists of maps that all have Jacobian bounded (far) away from zero, new techniques have to be developed to obtain 11 I I rigorous border-collision bifurcation results. 3. A VARIETY OF BORDER-COLLISION BIFURCATIONS. In this Section we present a variety of numerical examples exhibiting a border-collision bifurcation. The first series of examples is from the piecewise linear map and the second series is based on the H~non map. We will present examples showing I that in a border-collision bifurcation not only attracting periodic orbits are involved, but also chaotic saddles may play a role. Therefore, in order to describe the qualitatively different border-collision bifurcations in a consistent manner, we refer to the invariant sets that are involved in the border-collision I bifurcation. A chaotic saddle is a compact, invariant set that is not an attractor which contains a chaotic trajectory. If an attractor A of a map F is an attracting periodic orbit with period p, then we call A a period D attractor, and we say instead of "period two to period three" bifurcation a bifurcation from a I * period 2 attractor to a period 3 attractor. The bifurcation diagrams below show the long term behavior of the coordinate x for p between -0.1 and 0.2. The diagrams have been constructed as follows. For the minimum value -0.1 of p, and initial value (0,0), calculate the first 200 points (transient I I time 200) of the orbit and plot the next 1000 points of the orbit. Increase M slightly, say by 0.001, take for the initial value the last point which was plotted, calculate 200 points of this orbit and plot the next 1000 points. Increase p again, and continue increasing until p achieves the maximum value 0.2. Hence, once the I I12 orbit is close to an attractor, as the parameter is increased, this attractor is "followed" as long as it exists. In the diagrams, the x-coordinate is plotted vertically, and the parameter p is plotted horizontally. Define the map GL T-d2 from the plane to itself to be the prototype piecewise linear form of F , that is, GL (x,y) = (ax + y, bx) + p(1,0) if x S 0 GL (x,y) = (cx + y, dx) + p(1,0) if x 2 0 Rezall that the map GLP is equivalent to the map P , since to a+c Ib+d , -u = write the map GL in the form of the map P , let P and w . We present a few numerical examples for this a-c Ib-d-- map GL exhibiting a border-collision bifurcation. In all these examples, the fixed point is a flip saddle for p < 0 and a repellor with complex eigenvalues for p > 0. EXAMPLE 1. The presumably simplest border-collision bifurcation is from a period 2 attractor to a period 3 attractor presented in Figure 1. We present parameter values for which the map GL shows a bifurcation from a period 2 attractor to a period p attractor for a variety of period p. For a = -1.25, b = -.035, c = -2, d = -1.75, the bifurcation diagram in Figure 3a exhibits a bifurcation from a period 2 attra:tor to a period 5 attractor. For a = -1.25, b = -0.0435, c = -2, d = -2.175, the bifurcation diagram in Figure 3b exhibits a bifurcation from a period 2 attractor to a period 11 attractor. For a = -1.25, b = -0.03943, c =-2, d = -1.9715, the bifurcation diagram in Figure 3c exhibits a bifurcation from a period 2 attractor to a period 52 attractor. 13 II 'I• • • • • • • • •w• • m••m Figur"e 3a. II Bifurcation diagram of GL Ij(x,y) =(-1.25x + y + Mj,-0.035x) if x Ii I = (-2x exhibits at p. Ill + y + Ml, -1.75x) = if 0, and x a 0 0 a border-collision bifurcation from a period 2 attractor to a period 5 attractor. The parameter M (plotted horizontally) varies from -0.1 to 0.2, and the coordinate x is ploted vertcly -0. O5x_03 N/ N/ N/ N/ /N N Figure 3b. Bifurcation diagram of GL 11(x,y) =(-1.25x + y + p1,-0.0435x) if x .50, and =(-2x + y + 11, -2.175x) if x a 0 exhibits at p0 0 a border-collision bifurcat'ion from a period 2 attractor to a period 11 attractor. The parameter p (plotted horizontally) varies from -0.1 to 0.2, and the coordinate x is plotted vertically, -0.3 _ x ; 0.3. Figure 3c. GLBifurcation diagram of ,y)(~.2~+ y + It -0,03943x) (-x+ y + M, -1-9715x) eXhibits at if p0 =0 a if x ,and a 0 attactr t aPerod border..lli. 2 atrcor n bifurcation 1 a tra tot y o a pr from a Period i s fo 52 a t c o The Parame ter hPlo td verti a ries fr g (Plotted _ 0.1 to 0.2) and the coordinate s P ed l tv r ic l y -0.3 'S x Is.3 0.3 For other choices for a, b, c, and d we have found bifurcations from a period 2 attractor to a period p attractor, where p = 6, 7, 8, 9, 10, 11, 13, 19, 21, 23, 29, 31, 37, 41, etc. EXAMPLE 2. The simplest border-collision bifurcation in which chaotic attractors are involved is presumably the bifurcation from a period 2 attractor to a (1-piece) chaotic attractor. Frequently, the border-collision bifurcation from a period 2 attractor to a p-piece chaotic attractor is observed. For a = -1.25, b = -0.042, c = -2, and d = -2.1, the bifurcation diagram in Figure 4a exhibits a bifurcation from a period 2 attractor to a 1-piece chaotic attractor. For a = -1.36, b = -0.12, c = -2, and d = -2, the kifurcation diagram in Figure 4b seems to exhibit a bifurcation from a period 2 attractor to a 12-piece chaotic attractor, but using the phase space it turns out that the bifurcation is from a period 2 attractor to a 18-piece chaotic attractor. We have observed many other values of p, the map GL shows a bifurcation from period 2 attractor to p-piece chaotic attractor. For the selection a = -1.25, b = -0.03865, c = -2, and d = -1.9325, we obtain a bifurcation diagram similar to figure 4a, but in this case the border-collision bifurcation is from a period 2 attractor to a what appears to be quasi-periodic attractor. EXAMPLE 3. A border-collision bifurcation in which chaotic saddles (rather than attractors) are involved, will not be exhibited by bifurcation diagrams. Therefore, some other numerical method is needed to detect these sets. We use the Saddle Straddle I I I I I I 4a. I I a t I c o t o° a - i.°c h o i a t a c o . T h a r m t r p l t e Figure 4a. I I I r m Bifurcation diagram of GL (x~y) =(,-l.25x + y + p, -0.042x) if x 0, and (-2x + y + p, -2.1x) if x a 0 exhibits at 0:0 a border-collision bifurcation from a period 2 attractor to a 1-piece chaotic attractor. The parameter Ns (plotted horizontally) varies from -0.1 to 0.2; the coordinate x is plotted vertically, -0.3 < x < 0.3. .Iwmm w ww mmm~ mw~ wm nm m .~m w, w um mwmwm ~m m Figure 4b. Bifurcation diagram of GL 11(x,y) (-1.36x + y + g, -0.12x) if' x s,0, and y + M, -2x) if x a 0 exhibits at go=0 a border-collision b-ifurcation from a period2 attractor to a 18-piece chaotic attractor. The parameter u (plotted horizontally) varies from -0.1 to 0.2; the coordinate -(-2x + is plotted vertically, -0.3 -e X :5 0.3. Trajectory (SST) method introduced in [NY] to detect such sets. We select a = -1.25, b = 0.18, c = 2, and d = -3. For p = -0.05 the invariant set (obtained by the SST method) is presented in Figure 5a, and the invariant set for p = 0.05 is in Figure 5b. Presumably, it is correct to say that the border-collision bifurcation is a bifurcation from a chaotic saddle to another chaotic saddle. Now we present a few examples based on the Henon map. In fact, in these examples we have a moving border. Define the map H from the plane to itself by H(x,y) and define the map L (-vs (A - x + By, x) < p< m) from the plane to itself by L (x,y) = (A + Cx + By - (p+C)p, Dx + (1-D)p) The regions R A and R B are the half planes to the left and the right of the straight line x = p. The map we are investigatin-, defined being the Henon map on RA and the "linear" map L Define the one-parameter family of maps F on from the plane to itself by if x FH(x,y) (<yP I L P(x,y) if x a P Notice that the map is smooth in each of the half planes x a p and x a p, and the line x = p is the border which is a smooth curve. Since the map F is continuous, it is a piecewise smooth map. Nol, that for this family F border-collision bifurcations occur presumably for values p0 different from zero. 15 rho -0.0500000000 I~ 'I 'I' Figure 5a. Chaotic saddle of GL (x,y) (-1.25x + y + p, O.18x) if x s 0, and -0.05. (2x + y + p, -3x) if x a 0 when p and the The coordinate x (-0.2 . x < 0.1) is plotted horizontally, coordinate y ( -0.25 < y - 0.02) is plotted vertically. I rho 0.0500000000 I •~A ,4 ,.. / I I AV 7! - I I I 3 I 3 Figure 5b. Chaotic saddle of GL (x,y) (-1.25x + y + 11, 0.18x) if x 0, and = 0.05. (2x + y + M, -3x) if x a 0 when P horizontally, and the The coordinate x (-1 5 x - 0.6) is plotted coordinate y ( -1.8 . y : 0.2) is plotted vertically. EXAMPLE 4. Simple border-collision bifurcations are bifurcations from a period p attractor to a period q attractor. For A = 1.4, B = 0.3, C = 0.9, and D = -5, the 5ifurcation diagram in Figure 6a exhibits a bifurcation from a period 3 attractor to a period 4 attractor, where p (plotted horizontally) varies from 0.89 to 0.87. In the region RA the fixed point is a flip saddle and in the region RB the fixed point is a repellor. The border-collision bifurcation occurs at g = go s 0.884. For p > go (the side of the period 3 attractor which has orbit index +1) the fixed point is a flip saddle (orbit index 0) and we find no other periodic orbits on this side of the bifurcation. For g ' 0 (the side of the period 4 attractor which has orbit index +1) the fixed point is a repellor (orbit index +1); there also exists a period 4 regular saddle (orbit index -1). The regular saddle also shrinks to a point (the fixed point) as g -+ 0o. Hence, the orbit index is +1 on both sides of g0. For A = 1.4, B = 0.3, C = 1, and D = -5, the bifurcation diagram in Figure 6b exhibits a bifurcation from a period 6 attractor to a period 4 attractor, where g (plotted horizontally) varies from 1.05 to 0.8. In the figure one might first notice a bifurcation from a 6-piece chaotic attractor to a period 4 attractor, but closer examination gives the above mentioned bifurcation from a period 6 attractor to a period 4 attractor. Similarly as above, the periodic orbits involved in the border-collision bifurcation that occurs at g = g 0 0.884 are the following. For M > g0 there is period 6 attractor and the fixed point is a flip saddle, and for p < g0 the fixed point is a repellor and there is a period 4 attractor a period 4 regular 16 I.% Figure 6a. Bifurcation diagram of F (x,y) = (1.4 - x 2 + 0.3y, x) if x & , and = (1.4 + 0.9x + 0.3y - (g+0.9)p, -5x + 6p) if x a p, 0.884 a border-collision bifurcation from a period 3 attractor to a period 4 attractor. The parameter m (plotted horizontally) varies from 0.89 to 0.87; the coordinate x exhibits at 0 is plotted vertically, 0.6 x : 1.2. / d Figure 6b. Bifurcation diagram of (1.4 - x 2 + 0.3y, F (x,y) (1.4 + x + 0.3y - x) if x s P, and (p+l)p, -5x + 6g) if x a g, bifurcation from a exhibits at g0 A 0.884 a border-collision The parameter g period 6 attractor to a period 4 attractor. to 0.8; the coordinate "z (plotted horizontally) varies from 1.05 is plotted vertically, -0.5 S x s 2. saddle. Hence, the orbit index is +1 on both sides of M0 . EXAMPLE 5. In this example we present two cases of a border-collision bifurcation from a period p attractor to a q-piece chaotic attractor. For A = 1.4, B = 0.3, C = 1.1, and D = -5, the bifurcation diagram in Figure 7a exhibits a bifurcation from a 1-piece chaotic attractor to a period 4 attractor, where p (plotted horizontally) varies from 1.05 to 0.8. The border-collision bifurcation occurs at g= go A 0.885. For g > g0 (the side with the chaotic attractor) we do not know the (total) orbit index since the chaotic attractor contains a lot of periodic orbits. For g > M0 the fixed point is a flip saddle (orbit index 0). For p < g0 (the side of the period 4 attractor which has orbit index +1) the fixed point is a repellor (orbit index +1) there also exists a period 4 regular saddle (orbit index -1). point as p -+ p. The regular saddle also shrinks to the fi.ei Hence, presumably we have a border-collisiou bifurcation from a period 4 attractor to a 1-piece chaotic attractor. For A = 1.4, B = 0.3, C = 1.5, and D = -4, the bifurcation diagram in Figure 7b exhibits a bifurcation from a 8-piece chatLc attractor to a period 5 attractor, where g (plotted horizontally) varies from 0.91 to 0.86. The border-collision bifurcation occurs at p = g ss 0.884. For g > go (the side of the 8-piece chaotic attractor) we do not know the (total) orbit index since the chaotic attractor contains a lot of periodic orbits, and the fixed point is a flip saddle (orbit index 0). For g < g0 (the side of the period 5 attractor which has orbit index +1) the fixed point 17 Figure 7a. Bifurcation diagram of F (x,y) = (1.4 - x 2 + 0.3y, x) = (1.4 + 1.Ix + 0. 3 y - if x j, and (M+I.1)M, -5x + 6p) if ; p, exhibits at g0 s 0.885 a border-collision bifurcation from a 1-piece chaotic attractor to a period 4 attractor. The parameter g (plotted horizontally) varies from 1.05 to 0.8; the coordinate x is plotted vertically, -0.5 x . 2. I ...... ... ............. ........ Fiue7. daga fIdi Iiucto ,2"()3,-info =ki 4 I .c4 3 Iu F pyk . c rh o i k%~1Otted horizO I ~ otted p ee cta odr cl i i nt ra tr h aIj t va ie nta , 2Y 2-i il ve2. l from ~~ ~ J I I I I F ( - +,I I I I I Figure 8.1 Bifurcation diagram of + 0.3y, IF(x,y) = (1.4 - x x) if x H: and , + 1.2x + 0.3y - (p+1.2)p, -4x + 5p) if x bifurcation from a exhibits at go % 0.884 a border-collision chaotic attractor. The 1-piece chaotic attractor to a 1-piece varies from 0.95 to 0.85; the parameter g (plotted horizontally) 1.6. 0.4 s x coordinate x is plotted vertically, (1.4 u n~ um nm nn um 'm m unw ~ mnm i~mnmmnnunum uwm m um wl~ I is a repellor (orbit index +1); regular saddle (orbit index -1). 3 Ito there also exists a period 5 The regular saddle also shrinks the fixed point as u -+go In the figure one might first notice a bifurcation from a 5-piece chaotic attractor to a period 5 attractor, but closer examination in the phase space gives the above mentioned bifurcation from a 8-piece chaotic attractor to a period 5 attractor. Hence, presumably we have a border-collision 3 bifurcation from a period 5 attractor to a 8-piece chaotic attractor. IEXAMPLE 6. Border-collision bifurcation from a p-plece chaoti attractor to a q-piece chaotic attractor. We present just one 3 example, namely p = a = 1. For A = 1.4, B = 0.3, C = 1.2, and D = -4, the bifurcation diagram in Figure 8 exhibits a bifurcation from a 1-piece chaotic attractor to a 1-piece chaotic attractor, where pa (plotted horizontally) varies from 0.95 to 0.85. The border-collision bifurcation occurs at y = go s 0.884 and we only can say that rk both sides infinitely many periodic orbits are involved in the border-collision bifurcation, since the attractors are chaotic. Hence, presumably we have a border-collision bifurcation from 1 1-piece chaotic attractor to a 1-piece chaotic attractor. EXAMPLE 7. In this example we show that coexisting attractors of different nature can be involved on the same side of a border-collision bifurcation. For A = 1.4, B = 0.3, C = 1.4, and D = -4, the bifurcation diagram in Figure 9a exhibits a bifurcation from a 5-piece chaot v 18 Figure a Bifurcation diagram of F11(x,y) =(1.4 x2+ 0.3y, x) if x I~ u, and =(1.4 + 1.4x + 0.3y - (p'+l. 4 )p, -4x + 5pj) if x a exhibits at ju 0 s~ 0.884 a border-coll~ision bifurcation from a 5 -piece chaotic attractor to a 1-piece chaotic attractor. The parameter p~ (plotted horizontally) varies from 0.87 to 0.895; the coordinate x is plotted vertically, 0.3 :5 x :5 1.6. - I I I I I I diagra Bifurcatio of I 2 + 0.3y, : 11.4 - ,. x) if x - , and (11. 4 + I.4x + 0. 3y - (1 +. -1)M, -4x + 51j) i f x a exhibits at go ; 0.884 a border-collision bifurcation from a F9(x,y) I 5-piece chaotic attT.,actor to a period 4 attractor. The parameter P (plotted horizontally) varies from 0.874 to 0.895; the coordinate x is plotted vertically, 0.3 -s x e, 1.6. attr-actor to a 1-piece chaotic attractor, where p (plotted horizontally) varies from 0.87 to 0.895. On both sides of the collision-bifurcation. which occurs at p0 s 0.884, there are infinitely many unstable periodic orbits involved, since the attractors are chaotic. Due to the projection of the picture onto one phase space coordinate the bifurcation diagram seems to show a 2-piece chaotic attractor, but again in phase space one has clearly a 5-piece chaotic attractor. For the same parameter values, the bifurcation diagram in Figure 9b exhibits a bifurcation from a 5-piece chaotic attractor to a period 4 attractor, where p (plotted horizontally) varies from 0.874 to 0.895. Hence, we may have a border-collision bifurcation from a 5-piece chaotic attractor to a coexisting 1-piece chaotic attractor and a period 4 attractor. EXAMPLE 8. Now we consider an example in which the rurve fr is the straight line y = -x + p. In this example we have a moving border. Let the map H from the plane to itself be defined as above, that is, H(x,y) = (A - x2 + By, x), and define the map G (- < p< ) from the plane to itself by G (x,y) = (A - PC - x 2 + Cx + (B+C)y, (B+D)x - Dy -pD) The regions RA and RB are the half planes to the left and the right of the curve F . The map we are investigating is defined being the Henon map on R and the "linear" map G the one-parameter family of maps F Define from the plane to itself by ( = H(x,y if x S -y + P G (xy) if x a -y + 19 on R Uf Ii I~mN lI NI INl l l ml Figure 10a. Bifurcation diagram of (114 - x 2 + 0.3y, F l(x,y) = (1.4 x a -y + p, x) if x -y + 0.5p - x 2 + 0.5.x + 0.2y, exhibits at p0 s + p, ana -1.3x + Y + .)if 1.015 a border-collision bifurcation from a period 4 attractor to a strange chaotic attractor. The parameter p (plotted horizontally) varies from 1.2 to 1; the coordinate x is plotted vertically, -2 s x s 2. Figure 10b. F The chaotic strange attractor _ 2 (. Ixy 4 0.3y, x) if -x+ (1.4 + 0. 5 11 - x + 0.5x x S -y + 0. 2 + 1p, and y, -1.3x t. y + p) x ;-I -Y + m, where p if =1. The coordinate x (-2 s x s 2) is plotted horizontally, and the coordinate y ( -2 zs y :5 2) is plotted vertically. - 3 Notice that the map F is a piecewise smooth map. We present an example for which the map F 5 5 has a the border-collision bifurcation from a period 4 attractor to a chaotic strange attractor. For A = 1.4, B = -0.3, C = 0.5, and D = -1, the bifurcation diagram in Figure 10a exhibits a bifurcation from a period 4 attractor to a chaotic strange attractor, where p (plotted horizontally) varies from 1.2 to 1. The border-collision I 3 3 1 bifurcation occurs at p = po 0 1.015. The chaotic strange attractor for p = 1 is given in Figure lOb. Hence, we may have a border-collision bifurcation from a period 4 attractor to a chaotic strange attractor. 4. "PERIOD TWO TO PERIOD THREE" BORDER-COLLISION BIFURCATION In this Section we explain why "period two to period three" 3 I border-collision bifurcations occur for two-dimensional piecewise smooth maps. Let a, b, c, and d denote real numbers. Define the one-parameter family GL '4 GL (x,y) = from the plane to itself, by (ax + y, GL (x,y) = (cx + y, + p(1,0) if x ! 0 dx) +P(1,0) if x a 0 bx) where p is in an open interval I including zero. Recall that this family GL 3 Let F is equivalent with the piecewise linear map P . be a one-parameter family of piecewise smooth maps which has a prototype piecewise linear form at p = 0, and assume that (Al) 3 3 * -a > 1, -c > 1, -d > 1; (A2) c 2 + 4d < 0; (A3) 0 <a(ac + d) < 1. We want to show that there exists e > 0 such that if IbI < c, then the family F has a "period two to period three" border-collision 20 bifurcation at (0,0). First, we show that for b = 0, the family GL has a border-collision bifurcation from a period 2 attractor to a period 3 attractor. We write C for the set of all one-parameter families of maps GL defined above such that b = 0. PROPOSITION. At M = 0, every family GL in C has a "period two to period three" border-collision bifurcation at (0,0). PROOF OF THE THEOREM. Assume that the Proposition has been proved. Apply the Proposition and it follows immediately from perturbation results. The geometrical proof of the Proposi.ion (given below) -ight give insight why other bifurcations (for example, period 5 to period 2 bifurcation) may occur in piecewise smooth systems. Presumably, the method of proof only works if one of the two maps involved has a zero Jacobian. We first show that a border-collision bifurcation occurs at p = 0, and we present an example to give an idea of the proof. Let GL 1 be in C. The fixed point E 0) if = ( 0 and 1 of F d -'lc--ii l-c-d. is given by E if .i 0. In the notation of Section 2, define the matrices MA and MB by MA [ 1]' MB = [c 11. The eigenvalues of MA are 0 and a, so if p < 0 then the fixed point E particular, E is unstable since -a > 1. In is a flip saddle if M < 0. The eigenvalues of MB are 0.5c ± 0.5 /c + 4d and are complex, since c 2 + 4d < 0. For p > 0 the fixed point E is unstable (repelling), since the M 21 3 product -d of the eigenvalues of MB exceeds 1. The nature of the fixed point E is changing from being a flip saddle (in region RA which is the left half plane) to a ripellor with complex eigenvalues (in region RB) when the parameter p is varied from say -0.1 to 0.1. We conclude that a border-collision bifurcation occurs at 0 when M is continuously varied from some negative j value to a positive value, since the orbit index of E changes from 0 to +1. For simplicity of the explanation of this bordercollision bifurcation phenomenon, we offer the following example. 3 EXAMPLE. Consider the one-parameter family g from the plane to itself, defined by l,,y) = (-4.x + y, 0) + je.(1,0) if x g(x,y) = (-2x + y, ---x) + 1.(1,0) 0 if x z 0 The bifurcation diagram exhibiting the "period two to period Sthree"bifurcation, family of maps g I 3 is similar to the diagram in figure 1. The is in the class C, so it is an example for %,hich the result above applies. The idea why a "period two tc period three" border-collision bifurcation occurs for the family i, s the following. For P < 0, write W for the interval [-.,-.P,) on the o h f- 3 p a p X-axis. We have (1) the image g (p) of each point p on the X-axis but not in W is in W , and (2) each point p in W is mapped to a point p* on the X-axis after two iterates, so g/ (p) = p*. In figure 11, the graph of the corresponding return map G on W whicn is defined by G(x) = g 2(x,O), is given. To be more specific, G(x) 25 16 fa 1 T p for x - 0 and G(x) 22 1 1 for x a 0. Gtz) PS Pa Figure bYg0 i defined The maP g 9 reunmp A G defined by thde on fied POI~ n + PI 2 . 25 r th. o ino val 2 k%~teirtra 5 G has Pi . h maP ~h u s a l mafia h o~P a stable ie thean tal 3 FIGURE 11 4 ±.M < 0 and P The map G has two fixed points Pu 2 -p . > The fixed point pu is unstable since the slope of G in Pu is -5, and the fixed point p5 is stable since the slope of G at p The properties (1) 4" I4 4.I < 3 I . 4 M < 0, (2) G has slope -25 at x for 1 x < 0, (3) G has slope -for 0, imply that g points P1 (-*. I < Pu is x > 0, and (4) 1 = -1.p G(0) > has a period 2 attractor consisting of the two N , 0) and P2 gg(P 1 ) = (-7-P, the norms of both these points converge to zero as p goes to zero, that is, both lHP 211-* 0 as p -+ 0. In other words, 1 11-o 0 and 11P period 2 attractor shrinks to a point as p goes to zero; the this point to which the period 2 attractor converges is the fixed point of g at p = 0. For p > 0, each point p on the X-axis 3is mapped to a point p* on the X-axis after three iterates, so g1 3 (p) p*. The graph of the corresponding return map H, defined by H(x) 3 3 (x,0), is given in figure 12. In particular, H(x) for x < 0, H(x) I-.p 113 =--.x - 3 1 .p for 0 e x L -'p, - 3 and H(x) 1 '21 for x a FIGURE 12 * The map H has an unstable fixed point Pu fixed points qs I 4 -7 p < 14 4.p > 0 and two stable 0 and ps = -- 'p > 0. Furthermore, for ail x with x <pu we have lim Hn(x) = qs, and for all x with x > p, .. p . The properties (1) H has slope between 1).%nd have lim Hn(x) 5465 S 1 for x < 0, (2) H has slope bigger than I for 0 < x H(0) has slope between 0 and 1 for x > 1.p, and (4) .iP () f - imply g has a period 3 attractor 1 89 1 and 14 I4 0), and S 14(--., ( -. P., 0), S2 consisting of the points S= 1 23 Pig~~~12. the ret.. the X- .x and23 d f)z(2 rap 'R dei e d by 'I +( The map q has ~n4two stable t'jxe (-12-X -2 6 5x if aps 3fl (xoo un fsta l e (x L) fixed points 4Iblethe five S P 0 [ap 0 an p P 4' -- sit 0,n 4t r7*9> S19 19, -- 49. -4-) Notice that the norms of all three points converge to zero as p goes to zero, that is, all three 11S11 -+ 0, 11S 211- 0, and U1S 3111 0 as p -* 0. In other words, the period 3 attractor * shrinks to a point as p goes to zero; this point to which the period 3 attractor converges is the fixed point of g at p = 0. 4 The point (-7.p, 0) is a point of a period 3 orbit which is a reguLar saddle of the map g Conclusion: at p = 0, there is a "period two to period three" END OF THE EXAMPLE. border-coilision bifurcation. ! PROOF OF THE PROPOSITION. Let GL 3 3 be a one-parameter family in the class C, where p is in some interval I. We write pO = (xOyo) for an initial condition and p that is, pn = GL n('), . value (0,0), we write A0 = (Xny) for its n-th iterate, each p. For the particular initial (0,0), A = GL(A 0 A2 = GL(A ), A3 GL (A ), and A 4 = GL (A3 ). For each initial value p0 = (X0 ,y0 ) we observe the following I fact. If x 0 0 then yl = 0, and if x 0 > 0 then yl = dx 0 < 0. Hence, it is sufficient to consider initial values in the lower half plane. Hence, from now on, we assume that yo - 0. ( L.P, Assume first, p < 0. Recall that the fixed point E 0) is unstable, and is a flip saddle, since -a > I. Assume that p 0 I = (x0 ,y0 ) is any initial value with y 0 . 0. Then, Yl = 0, and if x 0 > 0, then x 1 = cx 0 + y 0 + p < if x 0 , 0 then 0 and so Y2 = 0. Therefore, it is sufficient to consider points on the X-axis, and 3 we will do so. £ Consider the initial value p 0 = (0,0) = A 0 . Computation of the 24 first four iterates of A 0 yields A 1 = (M,O), A 2 = ((a+l)p, A 3 = ((c(a+l) + i)p, 0), d(a+i)p), and A 4 = ((a+l)(ac + d + 1)p, 0). The assumptions 0 < a(ac + d) < 1 and -a > I imply -1 % ac + d < 0 yielding 0 < x 4 < x 2 . From -1 < ac + d < 0, and the assumptions, -a > 1, and -c > 1 follows that c(a + 1) > 0 and d < c; therefore Ix3 1 > 1y3 1. Hence, A 1 is on the X-axis to the left of A 0 , A 3 is under and to the left of A1 , and both A 2 and A 4 are on the X-axis to the right of A0 and A 4 is between A0 and A 2 . First we consider the image of the X-axis. Let p 0 = (x0 9Y0 ) be any point on the X-axis. The image of the right half of the X-axis with end point A 0 is the half line through A 3 with end point A[ = GL (A0), since p1 = (cx0 + p, dx 0 ) for x 0 > 0. The image of the left half of the X-axis with end point A 0 is the half line on the X-axis to the right of A = with end point A I, since p1 0) for x 0 - 0. (ax0 + p, 2 Define Q = (--P,O) = (xQ)0) and R a ( 0) d)' (I-a)(ac + d - (XR'O). The point Q is mapped to A 0 iterating GL once, that is, GL (Q) A, and Q is on the X-axis between A and E since A P = 0 (p,O), E 1.a 1 , 0) and -a > 1. The point R is on the X-axis to the right of A 0 , and R is mapped to E iterating GL twice, that is, GL 2(R) = E . Let P 0 = (x0 0) be any point. Straightforward computation gives the following. If x 0 > 0 (that is, P 0 is on the X-axis to the (1+a)p, 0), then p1 = ([ac + d]x 0 + so p 2 is on the X-axis. If x 0 = 0 (that is, p0 = right of A 0 ) then p1 = (cx0 + p, dx0 ) and P 2 = (p, 0) and P 2 = ((l+a)p, 0), 1 A0 ) so p. is on the X-axis to the right of A 0 . If ---. 5 x 0 < 0 (that is, p 0 is on the X-axis between Q and A 0 ) then p1 = (ax 0 + p, 0) and P 2 = (a(ax0 + p) + p, 25 I 1 0), so P2 is on the X-axis. If x 0 < -!-A (that is, P0 is to the left of Q) then p1 = (ax0 + p, 0) and p 2 = (c(ax 0 + g) + p, d(ax0 W) and P 3 + = ([a c + adlx 0 + 'ad + a + d + 1).P, 0), and so P 3 is on the X-axis while p 2 is not. Summarizing, for each point P0 2 on the X-axis to the right of Q we have P 2 = GL is on the (p0) X-axis. Therefore, we have a return map on the interval consisting of the points on the X-axis to the right of Q. Let G denote the return map of GL on [Q,cD), so G(x) = GL 2(x,0) for each x z xQ* The above results imply G(x) = a x + (1+a)g for I. " I x < 0, and G(x) = (ac+d)x + (1+a)p for x a 0. The graph of Q. is similar to 1 - and The map G has two fixed points, namely figure 11. a + 1 Ps 1 - ac - d ' and Pu 1 0 < ps. The fixed > 1, and the point pu is unstable since the slope of G in Pu is a fixed point ps is stable since the slope of G at ps is ac + d for which -1 < ac + d < 0. Furthermore, for all x with Pu < x < xR we have iim Gn(x) = Ps. The properties (1) xQ < slope a2 > i f XQ < x < 0, (3) G has slope -1 > 0, and (4) G(0) > 0, imply that GL consisting of the points P1 (a + I)d c - d + I " ) ' 1 - ac - ac - d (1 -L- < 0, (2) G has 0 for: < ac + d has a period 2 attractor aac 1 d ' 0) and P 2 GL(P Notice that the norms of both these points converge to zero as p goes to zero, that is, both 1 II11 -+ 0 and 11P 2 11- 0 as p -* 0. Hence, the period 2 attractor shrinks to a point as p goes to zero; this point to which the period 2 attractor converges is the fixed point of GL at p = 0. Now assume p = 0. Assume p 0 = (x0 ,y 0 ) is any initial value with YO < 0, then x 0 & 0 implies y 1 = 0, and x 0 > 0 implies x I cx 0 yielding y 2 = 0. Hence, it is sufficient to consider points on the X-axis. Let P0 = (x0 '0) be given. If x 0 < 0 then p, = (ax0 ,0) £26 which is on the positive X-axis. If x 0 = 0 then P, = (ax0 '0) and so p 0 is the fixed point of GL = and P 2 ((ac+d)x 0 , 0). If x 0 > 0, then P1 = (cx0 ' dx 0 ), Consequently the point A 0 = (0,0) is a globally stable fixed point of GL0 , since -1 < ac + d < 0. d 1 Now assume M > 0. The fixed point E P -c-d~ J i 1--~ unstable with complex eigenvalues since it was assumed -d > I and c 2 + 4d < 0. Assume p0 = Then x 0 - 0 implies y = p Y1 0 and so y 2 = dx 0 < -y < (c < 0; 0 (x0 ,y0 ) is any initial value with y0 0, and if x 0 a -. If 0 < x 0. hence, < -p then x then x = if x 1 s 0 then Y2 = 1 + P = + Y0 = cx 0 =cx 0 + y 0 + + p and 2 c x 0 + cy 0 + cp + dx 0 +p 0 (and so 0 cX 1 + < = 0. Therefore, it is sufficient to consider points on the X-axis. Let pO then p 1 = = (x0 ,y0 ) = GL (P0 q0 = (w0 O0) = (x0 ,O) be any point on the X-axis. If x 0 e 0 (axO + P, 0) = (xlYl), so x1 > 0. Every point such that w 0 < X0 s 0 satisfies q 1 = GLP(qo) = (aw) + P, 0) = (w1 ,zI), so w I > X1 > 0. The conclusion is that poLflLs on the X-axis to the left of A 0 = (0,0) are mapped monotonically, into the X-axis to the right of (p,O). Let P 0 ((1+c)p, = do), (0,0). A simple computation shows p1 P3 = ((ac + a + d + i)p, 0), = A P., = and p 4 = (ax3 + P, 0). Notice x 3 < 0, hence x 4 > P = x1 . Recall that P0 = A 2, P 3 = A 3 , and P 4 = (p, 0), A 0, p, = A,, p2 The conclusion is that A 0 , A1 , A 3 , and A4 are on the X-axis, and A 3 is to the left of A 0 , and both A, and A 4 are to the right of A 0 with A 1 between A 0 and A 4 . Let p0 = (x0 '0) be any point on the X-axis for which x 0 Then p1 = (cxo + P, dxo). Notice that if x 0 = d1B) B d p. W B2 GL (B0) - . , 0), B 1 Write B 0 = ---. Yl > 0. then x 0 and h(B), and and GL (BI) d (0,-i.m), B 2 B3 =GL (B2 ). Then B1 ([a(1--) + 1I], 0). Notice that B at which the line segment [AI,A d ((I-I)M, 0), and B= denotes the point on the Y-axis 21 intersects the Y-axis, and that B 2 is a point on the X-axis to the left of A . The assumptions -a 0 > 1 and 0 < a(ac + d) < 1 imply ac + d < 0 and we obtain that the a point A 3 = ((ac + a + d + l)M, 0) is on the X-axis to the left of B2. The image of the half line (A1 ,a) through A 2 under the map GL is the kinked half line (A2 ,B2 ] U [B2 ,cD) through A The image of this kinked half line is on the X-axis. In particular, the image of the half line (B2 , ) through A 3 is [B3 D) on the X-axis to the I right of A BB31 [A3 ,B 3 ]. (P,0), and the image of the line segment (A2,B2 1A 31 Let p0 2 (x0 ,0) be any point on the X-axis. Straightforward computation shows the following. If x 0 ; - .p (that is, p 0 the right of B 0 ) then p 1 P 2 n d = (cx0 + p, dx , p 2 = ([ac + d]x dx 0 ), P2 = 2 = 0 c *0 + d~x0 + (c+1)4, cdx 0 + dp), (c+1)P < -cp - il P3 3 is on the X-axis between A 0 and B 0 ) then p1 (Cc2 d + c1 c d + P = (l--)p c is on the X-axis. If x 0 < 0 then p 1 (c+l)p, adx 0 + dp), and p3 = = 0 Hence, + both --.p (that (cx0 + u and since (c 2+d)x + 0 < 0, we have + ad + cd]x 0 + [ac + a + d + 1}p, ((ac is to 0 c0 (a+l)p, 0,, and p 3 = (a[ac+dlx 0 + [a(a+l) + 1I.P, 0). P2 and are on the X-axis for x z -. P. If 0 : x is, p0 is = 0), so the point p 3 (ax 0 + P, 0), (atac + dx 0 P 2 = (acx 0 + + (ac + a + d + i]p, 0), so the point P3 is on the X-axis. The conclusion is that for each point p 0 = (x0 ,0) on the X-axis, the third iterate of p 0 is also on the X-axis, that is, GL 3(p 0 ) = (x3 , 0). of GL Hence, a return map exists on the X-axis. We call this return map H, so H(x) 28 GL 3(x,O). The above results imply H(x) = (a 2c + ad)x + (ac H(x) = (ac H(x) = (a 2 2 + a + d + 1)'p for x < O, + ad + cd)x + (ac + a + d + 1)'p for 0 < x c + ad)x + (a2 + a + 1).p for x 0 z _l.p. 1c 1 -.. p, and The graph of H is similar to figure 12. The map H has three fixed points, namely ac +- a ~c~)* + d + I -(ac + a + + add -+ 1) c-ac+d) I1 p, and p5 S+ a(ac+d) > 0. The fixed point p > 2 ac < 0 is unstable since the slope + ad + cdof H in Pu is bigger than 1, and the two fixed points qs and ps is stable since the slope a2 c + ad of H at both qs and PS is between 0 and 1. Furthermore, for all x with x < Pu we have im Hn (x) = qs, and for all x with x > Pu we have lir Hn (x) = P -0 ns The properties (1) H has slope between 0 and 1 for x < 0, (2) H has slope bigger than I for 0 < x < -. p, (3) H has slope between 0 and 1 for x > -L'p, and (4) H(O) < 0 and H(-!.P) > imply that GL consisting of the points p has a period 3 attractor 2 = ( ac + a + d + 1 - a(ac+d) S Sa = ([c 3 -P, +a+ 1 1 - a(ac+d) + a $ "' I , d , + a+ 0), a a+ +a+ -a(ac+d) 1 and aand Notice that the norms of all three points converge to zero as p goes to zero, that is, all three IISlIl 1 0, IS2 I -+ 0, and IIS3i1 -+ 0 as p -+ 0. Hence, the period 3 attractor shrinks to a point as p goes to zero; this point to which the period 3 attractor converges is the fixed point of GL at M = 0. The point (pu, 0) is a point of a period 3 orbit which is a regular saddle of the map GL . We conclude: at M = 0, there is a "period two to period three" border-collision bifurcation. This completes the proof of the Proposition. 29 I 5. DISCUSSION AND CONCLUDING REMARKS. We have presented bifurcation phenomena, which we call "border-collision bifurcations". These bifurcations occur when the nature of a fixed point (or periodic point) of a piecewise smooth system changes when it collides with the border of two regions. An interesting case occurs when the fixed point changes from being a 5 3 flip saddle to a repellor with complex eigenvalues~at the parameter value where it collides with the border of two regions. We have presented a variety of examples based on the piecewise linear map P and the Henon map. In particular, we have shown the occurrence of a "period two to period three" border-collision I bifurcation for maps in the class C. We point out that the border-collision bifurcation can be 3 expected to occur in many piecewise smooth models. In particular, the "period two to period three" bifurcation phenomenon can be expected to occur in many linear models with constraints. Assume for the piecewise linear map P 5 I 3 5 that the fixed point E is a flip saddle in the left half plane and a repellor with complex eigenvalues in the right half plane. QUESTION 1. Does there exist a classification of the bordercollision bifurcations for P in the case where a period 2 attractor converges to the fixed point (0,0) when p goes to 0? QUESTION 2. More generally, is is possible to give a classification of the border- ollision bifurcations for the piecewise linear map P ? QUESTION 3. When the plane is subdivided in N regions, where N 330 is at least 3, do there exist border-collision bifurcations that do not occur when there are only 2 regions, and in particular bifurcations that persist despite small perturbations? REFERENCES [AYY] K.T. Alligood, E.D. Yorke and J.A. Yorke (1987). Why period-doubling cascades occur: period orbit creation followed by stability shedding. Physica 28D, 197-205. (GH) J. Guckenheimer and P. Holmes (1983). Nonlinear Oscillations,. Dynamical Systems, and Bifurcations of Vector Fields. Applied Mathematical Sciences 42, Springer-Verlag, New-York, etc. [GOPY] C. Grebogi, E. Ott, S. Pelikan, and J.A. Yorke (1984). Strange attractors that are not chaotic", Physica D 11, (HN] C.H. Hommes and H.E. Nusse (1990). 261-268. "Period three to perioi two" bifurcation for piecewise linear models. Submitted f'or publication (HNSI C.H. Hommes, H.E. Nusse, and A. Simonovits (1990). Hicksian cycles and chaos in a socialist economy. Research memorandum 382, Institute of Economic Research, University of Groningen [K] H.B. Keller (1987). Numerical methods in bifurcation problems. Springer-Verlag, Berlin, etc.I (MY] J. Mallet-Paret and J.A. Yorke (1982). Snakes: oriented families of periodic orbits, their sources, sinks and continuation I J. Differential equations 43, 419-450 [NY] H.E. Nusse and J.A. Yorke (1989). A procedure for finding 31 I g numerical trajectories on chaotic saddles. Physica 36D, 137-156. [R] D. Ruelle (1989). Elements of Differentiable Dynamics and Bifurcation Theory. Academic Press, Inc., San Diego and London [SI R. Seydel (1988). Front Equilibrium to Chaos. Practical Bifurcation and Stability Analysis. Elsevier Science Publ. Co., Inc. New York, Amsterdam and London [Y] J.A. Yorke (1990). DYNAMICS: An interactive program for IBM PC clones. 32

Log In

Numerical Procedures for Analyzing Dynamical Processes

Related papers

Related papers

Related topics