Abstract
The power spectral density (PSD) of any time-dependent stochastic process Xt is a meaningful feature of its spectral content. In its text-book definition, the PSD is the Fourier transform of the covariance function of Xt over an infinitely large observation time T, that is, it is defined as an ensemble-averaged property taken in the limit . A legitimate question is what information on the PSD can be reliably obtained from single-trajectory experiments, if one goes beyond the standard definition and analyzes the PSD of a single trajectory recorded for a finite observation time T. In quest for this answer, for a d-dimensional Brownian motion (BM) we calculate the probability density function of a single-trajectory PSD for arbitrary frequency f, finite observation time T and arbitrary number k of projections of the trajectory on different axes. We show analytically that the scaling exponent for the frequency-dependence of the PSD specific to an ensemble of BM trajectories can be already obtained from a single trajectory, while the numerical amplitude in the relation between the ensemble-averaged and single-trajectory PSDs is a fluctuating property which varies from realization to realization. The distribution of this amplitude is calculated exactly and is discussed in detail. Our results are confirmed by numerical simulations and single-particle tracking experiments, with remarkably good agreement. In addition we consider a truncated Wiener representation of BM, and the case of a discrete-time lattice random walk. We highlight some differences in the behavior of a single-trajectory PSD for BM and for the two latter situations. The framework developed herein will allow for meaningful physical analysis of experimental stochastic trajectories.
Export citation and abstract BibTeX RIS
Original content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
A perspective has been written on this article.
1. Introduction
The power spectral density (PSD) of a stochastic process Xt, which is formally defined as (see, e.g., [1])
provides important insights into the spectral content of Xt. Here and in what follows, the symbol denotes averaging with respect to all possible realizations of the process, i.e., the expectation. Often the PSD as defined in equation (1) has the form , where A is an amplitude and β, (typically, one has 0 < β ≤ 2 [2]), is the exponent characteristic of the statistical properties of Xt. In experiments and numerical modeling, the PSD is determined using a periodogram estimate for a wide variety of systems in physics, biophysics, geology etc. A by far non-exhaustive list of systems exhibiting 1/fβ spectra includes electrical signals in vacuum tubes, semiconductor devices, metal films and other condensed matter systems [3–5], quantum dots [6–8], nano-electrodes [9] and two-dimensional graphene layers with a widely tunable concentration of carriers [10]. The PSD has been analyzed, as well, for the trajectories of tracers in artificial crowded fluids [11], for active micro-rheology of colloidal suspensions [12], Kardar–Parisi–Zhang interface fluctuations [13], for sequences of earthquakes [14], weather data [15–17], biological evolution [18], human cognition [19], network traffic [20] and even for the loudness of music recordings (see, e.g., [21, 22]).
On the theoretical side, the PSDs showing the 1/fβ dependences have been calculated analytically for diverse situations, including, e.g., the dynamics in chaotic Hamiltonian systems [23], periodically-driven bistable systems [24], fluctuations of a phase-separating interface [25], several diffusive and non-diffusive transport processes (see, e.g., [8, 26–29]), the running maximum of Brownian motion (BM) [30], diffusion in presence of strong quenched disorder [31–34] and the electric-field-driven ion transport through nanometer-scale membrane pores [35]. The PSD of BM in an optical trap has been scrutinized in [36, 37], which permitted the calibration of optical tweezers, making them a powerful tool for force spectroscopy, local viscometry, and other applications (see, e.g. [38, 39]).
The systematic measurements of single particle trajectories started with Perrin more than a century ago [40]. Just a few years later Nordlund [41] developed impressive experimental techniques, followed by a line of further refinements up to Kappler's measurements [42], to record sufficiently long trajectories to enable quantitative analysis on the basis of individual trajectories—without the need of prior ensemble averaging. Nowadays, with the advent of modern microscopy and supercomputing, scientists routinely measure long trajectories Xt of submicron tracer particles or even single molecules [11, 39, 43–47].
In parallel to this experimental progress, there was a general shift of interest towards understanding statistical properties of individual realizations of stochastic processes. In particular, a conceptually important question often raised within this context is if one can reliably extract information about the ensemble-averaged properties of random processes from single-trajectory data [46]. Considerable theoretical progress has been achieved, for instance, in finding the way to get the ensemble-averaged diffusion coefficient from a single Brownian trajectory, which task amounts to seeking properly defined functionals of the trajectory which possess an ergodic property (see, e.g., [43–46, 48–56] for a general overview).
One of such functionals used in single-trajectory analysis is the time-averaged mean squared displacement (MSD) (see, e.g., [39, 43–47])
where the bar denotes time averaging and the lag time τ acts as the size of the analysis window sliding over the time series Xt. For finite observation time T, the time-averaged MSD varies randomly for different trajectories even under identical physical conditions. For both normal and anomalous diffusion this variation is mainly seen as an amplitude scatter of at given lag times, which remains remarkably constant (apart of relatively small local fluctuations) as function of τ [57–60].
For the time-averaged MSD the fluctuations are often quantified in terms of the so-called ergodicity breaking parameter , where the dimensionless variable for a given τ measures the deviations from the trajectory-average [57, 61], with N being the number of observed trajectories. Clearly, with γ being the coefficient of variation of the distribution .
For BM, the fluctuations decrease with growing observation time such that [44, 50, 57], which permits to deduce the diffusion coefficient specific to an ensemble of trajectories from just a single, sufficiently long, trajectory. A similar decay of EB to zero for large T is observed for fractional BM [62] and scaled BM [63]. However, for anomalous diffusion with scale-free distributions of waiting times [61] and processes with systematic, spatially varying diffusion coefficient [64] the inequality persists even in the limit , which reveals ageing in the sense that the properties of the system, such as the effective diffusivity, are perpetually changing during the measurement and depend on T [46, 57, 65]. In these systems, the fluctuations as measured by EB do not decay to zero with increasing T, and thus time averages of physical observables remain random quantities, albeit with well defined distributions (see, e.g. [50, 57, 61]). Many other facets of the time-averaged MSD in equation (2) and of the parameter EB were exhaustively studied to date for a variety of diffusive processes, providing a solid mathematical framework for the analysis of individual trajectories [39, 57, 62, 66–69].
Single-trajectory PSDs have been studied in several cases. In particular, the PSD of the loudness of musical recordings was discussed in [21, 22] and the spectra of temperature data were presented in [16, 17]. We note that, of course, for these two examples averaging over 'an ensemble' does not make any sense since the latter simply does not exist, likewise, e.g., the case of the financial markets data for which the single-trajectory MSD has been recently analyzed [70]. Further on, power spectra of individual time-series were examined for a two-state stochastic model describing blinking quantum dots [26] and also for single-particle tracking experiments with tracers in artificially crowded fluids [11]. It was shown that, surprisingly enough, the estimation of the exponent characterizing the power spectrum of an ensemble from the single-trajectory PSD is rather robust. Next, in [71, 72] the power spectra of the velocities of independent motile amoeba (see also [47, 73]) were analyzed, revealing a robust high-frequency asymptotic 1/f2 form, persisting for all recorded individual trajectories.
Clearly, these numerical observations raise several challenging questions. Indeed, could it be possible that the exponent characterizing the frequency-dependence of the standard PSD, defined as an ensemble-average property, can be observed already on a single-trajectory level? If true, does it hold for any stochastic transport process or just for some particular examples? Moreover, in which range of frequencies can such a behavior be observed? What are the distributions of the amplitudes entering the relation between a single-trajectory PSD and its ensemble-average counterpart and how broad are they? Evidently, numerical analyzes may give a certain degree of understanding of some particular features, but a deep insight can be obtained only in conjunction with a full mathematical solution. Unlike the single-trajectory MSD, for which a deep knowledge has been already acquired via an exact analytical analysis, a similar analysis of the statistical properties of a single-trajectory PSD is lacking at present, although its ensemble-averaged counterpart is widely used as an important quantifier of different properties of random trajectories in diverse areas of engineering, physics and chemistry.
In this paper, going beyond the text-book definition (1), we concentrate on the question what information can be reliably obtained if one defines the PSD of a single, finite-time realization of Xt. We here focus on the paradigmatic process of BM. This choice is two-fold: first, BM is ubiquitous in nature which renders this analysis particularly important. Second, it permits us to obtain an exact mathematical solution of the problem: we calculate exactly the moment-generating function and the full probability density function of the single-trajectory PSD and its moments of arbitrary order in the most general case of arbitrary frequency, arbitrary (finite) observation time and arbitrary number of the projections of a d-dimensional BM onto the coordinate axes. This furnishes yet another example of a time-averaged functional of BM, whose moment-generating function and full probability distribution can be calculated exactly (see, e.g., [74]).
Capitalizing on these results, we observe that for a sufficiently large T (and frequency f bounded away from zero) for any realization of the process a single-trajectory PSD is proportional to its first moment (1), and the latter embodies the full dependence on the frequency and on the diffusion coefficient specific to an ensemble of the trajectories. This means that the frequency-dependence can be deduced from a single trajectory. In other words, there is no need to perform averaging over an ensemble of trajectories—one long trajectory suffices. However, the proportionality factor, connecting a single-trajectory PSD and its ensemble-averaged counterpart—a numerical amplitude—is random and varies from realization to realization. Due to this fact, one cannot infer the value of the ensemble-averaged diffusion coefficient from the amplitude of a single-trajectory PSD. The distribution function of this amplitude is calculated exactly here and its effective width is quantified using standard criteria.
As a proof of concept, we revisit our predictions for a continuous-time BM—an idealized process with infinitesimal increments—resorting to a numerical analysis based on Monte Carlo simulations of discrete-time random walks, and also using experimental single-trajectory data for the diffusive motion of micron-sized polystyrene beads in a flow cell. We demonstrate that our theoretical prediction for the relation connecting a single-trajectory PSD and its ensemble-averaged counterpart is corroborated by numerical and experimental results. Additionally we show that the predicted distribution of the amplitude is consistent with numerical results, which means that the framework developed here is justified and allows for a meaningful analysis of experimental trajectories.
Pursuing this issue further, we address several general questions emerging in connection with a comparison of our analytical predictions against numerical simulations and experimental data. To this end, we first consider Wiener's representation of BM in form of an infinite Fourier series with random coefficients, whose truncated version is often used in numerical simulations. We show that the distribution of the single-trajectory PSD obtained from Wiener's representation in which just N terms are kept, instead of an infinite number, has exactly the same form as the one obtained for the continuous-time BM when fT is within the interval (0, πN). Outside of this interval, the probability density function of the truncated PSD converges to a different form.
We examine the case when a trajectory of a continuous-time BM is recorded at some discrete time moments, so that it is represented by a set of M points. The single-trajectory PSD, i.e., the periodogram, becomes a periodic function of fT with the prime period equal to 2πM. We analyze several aspects of this discrete-time problem: we study how large M should be taken at a fixed observation time T so that we may recover the results obtained for the continuous-time BM. We analyze the limiting forms of the distribution of a single-trajectory periodogram and show, in particular, that for fT kept fixed and , the distribution converges to the form obtained for a continuous-time BM. On the contrary, when fT is left arbitrary so that it may assume any value within the prime period, that is, , the distribution of a single-trajectory periodogram converges to a different limiting form as . Our analysis demonstrates that when fT belongs to a certain interval within the prime period, a single-trajectory periodogram equals, up to a random numerical amplitude, the ensemble-averaged periodogram, and the latter embodies the full dependence on f and T. Therefore, similarly to the continuous-time case, the correct spectrum can be obtained already from a single trajectory. These new results are the foundation for the use of single trajectory PSD in the quantitative analysis of single or few recorded particle trajectories, complementing the widely used concept of the single trajectory MSD.
The paper is outlined as follows: in section 2 we introduce our basic notations. In section 3 we first derive explicit expressions for the variance of a single-trajectory PSD and the corresponding coefficient of variation of the probability density function, and present exact results for the moment-generating function of a single-trajectory PSD, its full probability density function and moments of arbitrary order (section 3.1). Section 3.2 is devoted to the relation connecting a single-trajectory PSD and its ensemble-averaged counterpart, while section 3.3 discusses fluctuations of the amplitude in this relation. Next, in section 4 we analyze the probability density function of a single-trajectory PSD obtained by truncating Wiener's representation of a continuous-time BM. Section 5 presents an analogous analysis for the case when a continuous-time trajectory is recorded at discrete time moments. In section 6 we conclude with a brief recapitulation of our results and outline some perspectives for further research. Additional details are relegated to appendix A, in which we present exact results for the distributions in the special case f = 0, and to appendix B, where we discuss several cases in which the full distribution of a single-trajectory periodogram can be evaluated exactly in the discrete-time settings.
2. Brownian motion: definitions and notations
Let Xt, t ∈ (0, T) denote a Brownian trajectory in a d-dimensional continuum and with j = 1, 2, ..., d stand for the projection of on the axis xj. The projections are statistically independent of each other and (likewise the BM Xt itself) each projection is a Gaussian process with zero mean and variance
where D is the diffusion coefficient of . In a random walk sense, , where is the variance of the step length and is the mean time interval between consecutive steps. Hence, D contains the dimension d of the unprojected motion.
In text-book notations, the PSD of each of the projections is defined by equation (1), that is,
where (taking into account that is real-valued) the T-dependent function is given explicitly by
being the covariance of . For BM, one then finds the standard result [1]
such that
Hence, for BM the standard PSD , defined as an average over an ensemble of trajectories, is described by a power-law with characteristic exponent β = 2 and an amplitude which is linearly proportional to the diffusion coefficient D.
Going beyond the text-book definition in equation (5), we now define the PSD of a single component :
and another property of interest here, the partial PSD of the trajectory
in which we take into account the contributions of k, (k = 1, 2, ..., d), components of a d-dimensional BM. Clearly, for k = 1 the definitions (8) and (9) coincide. The PSDs and are f- and T-parameterized random variables: the first moment of a single-trajectory PSD (8) is given by the standard result (6), while the first moment of the partial PSD (9), due to statistical independence of the components, is given by expression (6) multiplied by k. Our goal is to evaluate exactly the full probability density function for .
3. Brownian motion: results
To get an idea of how representative of the actual behavior of a single-trajectory PSD the result (6) is, we first look at the variance of a single-component single-trajectory PSD,
Expression (10) permits us to determine the corresponding coefficient of variation
of the yet unknown distribution of a single-component single-trajectory PSD.
In figure 1 we depict γS, which is a function of the product exclusively. We observe that γS approaches for fixed T and , and tends to the asymptotic value (thin horizontal line in figure 1) when at any fixed f > 0. Overall, for any value of the coefficient of variation appears to be greater than unity, meaning that the standard deviation is greater than the average, , which signals that the parental distribution is effectively 'broad', despite the fact that it evidently has well-defined moments of arbitrary order. As a consequence, the average described by equation (6) may indeed not be representative of the actual behavior of a single-trajectory PSD. This fully validates our quest for the distribution .
3.1. Brownian motion: moment-generating-function, distribution and its moments
Our first goal is to calculate the moment-generating function of in equation (9), defined formally as the following Laplace transform
To calculate Φλ exactly, it appears convenient to use Wiener's representation of a given Brownian path in the form of a Fourier series with random coefficients
where are independent, identically-distributed random variables with the distribution
The corresponding single-trajectory partial PSD in equation (9) can be formally rewritten as
where the functions and are given explicitly by
and
Inserting expression (15) into (12) and using the identity
we rewrite equation (12) in the factorized form
where the subscript ζ in the averaging operator signifies that averaging over all paths of the component is replaced by an equivalent operation, the averaging over all possible values of . Performing this averaging, as well as the integrations over dx and dy, we get
We focus next on the infinite sums entering equation (19). They can be calculated exactly, and expressed via the first and the second moments of a single-trajectory PSD,
and
where the average and the variance (as well as the corresponding coefficient of variation) of the single-component single-trajectory PSD are defined in equations (6), (10), and (11), respectively.
Consequently we realize that the moment-generating function in equation (19) can be cast into a more compact and physically meaningful form, which involves only the first moment and the variance (through the coefficient γS of variation) of the single-trajectory PSD,
This expression holds for any value of k, f and T.
Performing next the inverse Laplace transform of the function defined by equation (21), we arrive at the following expression for the desired probability density function of the single-trajectory partial PSD defined in equation (9), which also (as the result in equation (21)) holds for arbitrary f, arbitrary T and arbitrary number k of the projections of the trajectory onto the coordinate axes,
Here, is the modified Bessel function of the 1st kind and Γ(...) is the Gamma-function.
Before we proceed further, two remarks are in order. First, we note that the distribution (22) is the Bessel function distribution that has been used in mathematical statistics years ago as an example of a distribution with heavier than Gaussian tails (see, e.g., [75, 76]). Second, as already mentioned, for f = 0 the coefficient of variation is exactly equal to and hence, the coefficient in front of the term quadratic in λ in expression (21) vanishes. In this particular case the distribution (22) simplifies to the χ2-distribution with k degrees of freedom, which is presented in appendix A.
Expression (22) permits us to straightforwardly calculate the moments of the partial single-trajectory PSD of arbitrary, not necessarily integer order ,
where is the Gauss hypergeometric function. Note that the moments of order Q > 2 for an arbitrary k are all expressed through the first and the second (via γS) moments of the single-component single-trajectory PSD only, since the parental Gaussian process is entirely defined by its first two moments.
The distribution in equation (22) and the formula for the moments of order Q, equation (23), ensure that the single-trajectory PSD has the form
where is the first moment of this random variable, i.e., a deterministic function of f and T which sets the scale of variation of , and is a dimensionless random amplitude, whose moments of order Q are defined by the expression in the right-hand-side of equation (23). The relation in equation (24) has important conceptual consequences on which we will elaborate below.
3.2. Brownian motion: single-trajectory PSD
One infers from figure 1 that upon an increase of the oscillatory terms in γS fade out, and γS saturates at the value . Let us define ωl as the value of when the amplitude of the oscillatory terms in γS equals , where ε > 0 is a small fixed number. Given that the amplitude of the oscillatory terms is a monotonically decreasing function of one has that for > ωl the coefficient of variation . Next, the decay law of the amplitude of oscillations can be readily derived from equations (6) and (10) to give that, in the leading in ε order, .
Then, for , up to terms proportional to ε, which can be made arbitrarily small, we see that the moments of the random amplitude in equation (24) are given by
meaning that in this limit becomes just a real number that is independent of f and T. This implies, in turn, that with any necessary accuracy, prescribed by the choice of ε, and for any realization of a trajectory ,
where the symbol O(ε) signifies that the omitted terms, (stemming from the oscillatory terms in , and hence, in ), have an amplitude smaller than ε.
Therefore, we arrive at the following conclusion, which is the main conceptual result of our analysis: for continuous-time BM and , the frequency-dependence of the PSD of any single trajectory is the same as that of the ensemble-averaged PSD with any desired accuracy, set by ε. Consequently, in response to the title question of our work, we conclude that for BM the 1/f2-dependence of the power spectrum can be deduced already from a single, sufficiently long trajectory, without any necessity to perform an additional averaging over an ensemble of such trajectories. In section 5 (see figure 7) below we show that this prediction made for BM—a somewhat idealized stochastic process with infinitesimal increments—holds indeed for discrete-time random walks and single-trajectory experiments, in which a BM trajectory is recorded at discrete instants of time and for a finite observation time.
Lastly, we note that since is linearly proportional to the diffusion coefficient D and the latter appears to be multiplied by a random numerical amplitude, one cannot infer the ensemble-averaged diffusion coefficient from a single-trajectory PSD. The error in estimating D from a single trajectory is given precisely by the deviation of this amplitude from its averaged value. Below we discuss the fluctuations of this numerical amplitude.
3.3. Fluctuations of the amplitude
The exact limiting probability density function of the amplitude in equation (26) follows from equation (22) and reads
In figure 2 we depict this distribution for k = 1, 2 and 3 (curves from left to right). We observe that the very shape of the distribution depends on the number of components which are taken into account in order to evaluate the partial PSD. For k = 1 (that is, for BM in one-dimensional systems, or in two or three-dimensions but when only one of the components is being tracked), the distribution is a monotonically decreasing function with the maximal value at . In contrast, for k = 2 and k = 3, is a bell-shaped function with a left power-law tail and an exponential right tail.
Download figure:
Standard image High-resolution imageIn figure 2 we also present a comparison of our analytical predictions in equation (27) against the results of numerical simulations. We use two methods to produce numerically the distributions of defined in equation (24) for the range of frequencies where is almost constant (see sections 4 and 5 for more details). The first method hinges on Wiener's representation of a BM in equation (13), which is truncated at the upper summation limit at N = 3.2 × 106 (see section 4 below for more details). Crosses (×) in figure 2 depict the corresponding results obtained via averaging over 105 trajectories generated using such a representation of BM. Further, we use a discrete-time representation of a BM—lattice random walks with unit spacing and stepping events at each tick of the clock—which are produced by Monte Carlos simulations. We set the observation time T = 1 and generate trajectories with steps to get a single-trajectory periodogram. A thorough discussion of the domain of frequencies in which the periodogram yields the behavior specific to a continuous-time BM are presented below in section 5. Pluses (+) depict the results obtained numerically for random walks averaged over 105 realizations of the process. Overall, we observe an excellent agreement between our analytical predictions, derived for BM—a continuous-time process with infinitesimal increments, and the numerical results. This signifies that the framework developed here is completely justified and can be used for a meaningful interpretation of stochastic trajectories obtained in single-trajectory experiments.
Moreover, the moments of the distribution in equation (27) are given by equation (25). We find that the average value is , as it should be, and the variance so that the distribution broadens with increasing k. The coefficient of variation of the k-dependent distribution in equation (27) is equal to meaning that fluctuations become progressively less important for increasing k. We also remark that the most probable values for the cases k = 2 and k = 3 are and , respectively, and are well below their average values.
Lastly, we quantify the effective broadness of the distribution in equation (27), focusing on its heterogeneity index
where and are the amplitudes drawn from two different independent realizations of . Such a diagnostic tool has been proposed in [77–79] in order to quantify fluctuations in the first passage phenomena in bounded domains, compare also with the discussion in [80, 81]. In our context, shows the likeliness of the event that two values of the amplitudes and will be equal to each other. With the distribution (27) of the random variable the distribution of the heterogeneity index can be calculated via its integral representation [77, 78]
Performing the integrals in relation (29), we find the following exact expressions for the distribution of the heterogeneity index: for k = 1 we have
where is the complete elliptic integral of the second kind; for k = 2 the distribution has the form
while for k = 3 it follows that
with
Note that the third-order derivative with respect to p in relation (32) can be taken explicitly producing, however, a rather cumbersome expression in terms of the complete elliptic integrals. For the sake of compactness, we nonetheless prefer the current notation.
The probability density functions in equations (30)–(32) are depicted in figure 3. We observe that for k = 1 the event in which two values of deduced from two different independent trajectories are equal to each other, (that is, when and ), is the most unlikely, since it corresponds to the minimum of the distribution in equation (30). Therefore, for the case k = 1 the most probable outcome is that two values of obtained for two different realizations of BM will be very different from each other. For k = 2 and k = 3, corresponds to the most probable event but still the distributions in equations (31) and (32) appear to be rather broad so that a pronounced realization-to-realization variation of the amplitude is expected.
Download figure:
Standard image High-resolution image4. Truncated Wiener's representation
In simulations of a continuous-time BM one often uses Wiener's representation (13), truncating it at the upper summation limit at some integer N,
The sample paths of such a partial sum process are known to converge to the sample paths of the BM at the rate . Focusing on a single-component single-trajectory PSD in equation (8) (generalization to the d-dimensional case and a partial PSD in equation (9) is straightforward) we have the following estimate for ,
Below we examine how accurately reproduces . We first consider the first moment of the truncated series and its variance, which are given by
and
In figure 4 we present a comparison of the averaged single-component single-trajectory PSD and of its counterpart obtained from the truncated Wiener's series (34), as well as of the corresponding coefficients of variation of the two probability density functions. We observe a perfect agreement between the results obtained from the complete series and the truncated ones. Moreover, we see that and exhibit a uniform convergence to and γS, respectively, for . This implies that keeping just N = 40 terms in the truncated Wiener's series permits us to describe reliably well the behavior of the pertinent properties over more than two decades of variation of f T. Extending this interval up to three decades will require keeping N ≈ 320 terms, for four decades N ≈ 3200 terms, and so on.
Download figure:
Standard image High-resolution imageWe finally focus on the moment-generating function of , and find
We note that has exactly the same form as the moment-generating function (21) (with k = 1) evaluated for a complete Wiener's series and hence, has the distribution of exactly the same form as the one in equation (22) with the only difference that the first moment and the variance have to be replaced by their counterparts obtained via truncation of the Wiener's representation at some level N. Given that for these properties are identical (see figure 4), it means that in this interval of variation of f T equation (37) coincides with (21), and the probability density function of coincides with result (22). Outside of this interval, i.e., for , the coefficient of variation of the truncated PSD jumps from to meaning that the distribution of becomes the -distribution (see appendix A), which is evidently a behavior.
5. Discrete sets of data
In experiments or in Monte Carlo simulations of BM, the particle position is recorded at some discrete time moments such that one stores a given trajectory as a finite set of data. Here we analyze how the features unveiled in the previous sections will change, if instead of continuous-time BM we rather use a picture based on a discrete-time random walk.
Suppose that the time interval (0, T) is divided into M equally-sized subintervals Δ = T/M, such that the particle position is recorded at time moments , . We use the convention that at t = 0 the particle starts at the origin and focus on the behavior of a single-component PSD—the extension of our analysis over the general case of k components is straightforward but results in rather cumbersome expressions.
As a first step, we convert the integrals in equation (8) into the corresponding sums to get a periodogram
where we now denote a single-trajectory PSD as RM(f) to emphasize that it is a different mathematical object as compared to the PSD in equation (8). Since we now have only a finite amount of points instead of a continuum, RT(f) in equation (38) will be a periodic function of fT so that it may only approximate the behavior of the PSD for continuous-time BM in some range of frequencies at a given observation time. Further on, we use the scaling property of BM to rewrite the latter expression as
where Bm is now a trajectory of a standard lattice random walk with unit spacing and stepping at each clock tick m, m = 0, 1, 2, ..., M. Next, we write
where sj = ±1 are independent increments, and s0 ≡ 0. Then, expression (39) becomes
where
and
Expression (41) is the discrete-time analog of the PSD in equation (15) obtained for the continuous-time BM.
5.1. Discrete-time case: mean and variance of a single-trajectory periodogram
At this point, it may be expedient to first look at the ensemble-averaged single-trajectory periodogram in equation (41) and at its variance, and to compare them against their continuous-time counterparts. Averaging the expression in equation (41) and its squared value over all possible realizations of the increments sj, we have
and
Note that these somewhat lengthy expressions (43) and (44) are exact, and valid for any M, Δ and f.
To illustrate the behavior of , and of the corresponding coefficient of variation of the distribution of a single-trajectory periodogram RM(f), we first depict in figure 5 these parameters as functions of fT for a rather small value of M, M = 10. We observe that they are periodic functions of fT with the prime period 2πM, (such that figure 1 presents a zoom of just one-half of the prime period) and exhibit peaks in the center, that is, at fT = πM, and at the end-points of the prime period, that is, for fT = 0 and fT = 2πM. More specifically, we see that in the middle of the prime period
and
leading to
For fixed T and , and both tend to zero, but their ratio γR stays finite and tends to . Next, at the end-points of the prime period and are given by
and
such that they both depend on the observation time T and diverge when . In this case γR obeys
and also stays finite as .
Download figure:
Standard image High-resolution imageConsider next the behavior of and for fT = πM/2, that is, at one-quarter of the prime period, and also for fT = 2πM/3, that is, at one-third of the prime period. Here, we have
and
producing
and
which imply that here, as well, γR is given by equation (47c). Therefore, we have that for the last two situations the first moment and the variance of a single-trajectory periodogram both tend to zero when (that is, when at a fixed T), which is similar to the behavior at one-half of the prime period. In contrast, here γR tends to , which is the asymptotic coefficient of variation of the distribution of a single-trajectory PSD for the continuous-time BM.
In figure 6 we present a comparison between the averaged PSD (6) for the continuous-time BM and its discrete-time counterpart (43). We observe a nearly perfect agreement between these two representations for a wide range of variation of . We see that the periodicity of starts to matter in a noticeable way only at for M = 103, at for M = 104 and only at fT ≈ 2 × 104 for M = 105, respectively. Below we will quantify the onset of the deviation between equations (43) and (6) more accurately.
Download figure:
Standard image High-resolution imageNext, we come back to figure 1 in which we compare the coefficient of variation of the distribution of a single-trajectory PSD for the case of the continuous-time BM, and an analogous coefficient γR of variation of the distribution in the discrete-time case, calculated using our results (43) and (44) for M = 104 and M = 105. Here we observe that the agreement is even better and extends over a fairly large range of values of fT, until γR starts to show a strong oscillatory behavior close to the middle of the prime period (see also figure 5). Overall, it appears that γR can be set equal to with an arbitrary accuracy dependent on the choice of ε (see section 3.2) on a bounded interval and, by symmetry, for .
Therefore, in response to the title question of this work we conclude that one can indeed observe the behavior specific to the continuous-time BM (that is, the 1/f2 dependence of the averaged PSD and, by virtue of equation (26), of a single-trajectory PSD) in the discrete-time settings. However, the spectrum must be analyzed on a finite interval (ωl, ωr) of variation of fT, which is bounded from below by ωl (see section 3.2)—ensuring that γR becomes equal (with any necessary accuracy) to and hence, does not contribute to the frequency-dependence of a single trajectory PSD—and by ωr from above, when μR(f, T) starts to deviate from μS(f, T). Extending the arguments presented in section 3.2, we define ωr as the value of fT at which . To calculate ωr, we note that the ratio of two ensemble-averaged PSDs can be very accurately approximated by
so that we find that ωr = 2αM, where α obeys . For small ε, the parameter α is given, to leading order in ε, by . Recalling the definition of ωl (see section 3.2), we thus see that the 1/f2-dependence can be observed once , which means that the number M of recorded positions of each trajectory has to obey .
In figure 7 we plot the logarithm of a single-trajectory periodogram versus the logarithm of f T for five individual, randomly chosen trajectories of discrete-time lattice random walks and also five trajectories of a BM generated by the truncated Wiener's series (see section 4). For random walks, (which make a step of unit length at each tick of the clock, so that D = 1/2), we set the observation time T = 1 and have discrete points within the unit time interval. For such a choice of parameters, we set the accuracy parameter ε = 0.01 to get ωl = 40 and . We observe that, indeed, for all these five trajectories and , (that is, for f varying over more than five decades), the relation holds. Next, for a BM generated by the truncated Wiener's representation we set T = 1, D = 0.01 and take N = 3.2 × 106. Here, as well, for all five trajectories we observe a perfect agreement with our prediction in equation (26) for , , that is, for f spanning over more than five decades.
Download figure:
Standard image High-resolution imageLastly, we plot in figure 8 (upper panel) a logarithm of a single-trajectory periodogram versus the logarithm of f, for five trajectories of polystyrene beads in experiments performed within a flow cell (see below for more details). Each trajectory represents a one-dimensional projection of the bead motion, which is a BM with the diffusion coefficient , as estimated from the bead's squared displacement averaged over 150 trajectories. Note that the value of D inferred from the averaged PSD appears to be pretty close, . Again, we observe that the periodograms of the bead motion agree very well with our prediction in equation (26), and clearly follow the 1/f2-dependence for each individual trajectory. We stress again the similarly to the behavior of the MSD (2) [57, 58] to the amplitude scatter of a single-trajectory PSD as function of f: apart from some small fluctuations, it is remarkably constant.
Download figure:
Standard image High-resolution imageIn the lower panel in figure 8, using a set of 150 individual experimental trajectories, we construct the distribution of the amplitude A in the 1/f2-dependence and compare it against our prediction in equation (27). The distributions of both one- and two-dimensional motions are shown, i.e., k = 1 (projection along a line) and k = 2 (projection along the imaging plane), respectively. Specifically, the distributions of the amplitudes are constructed by computing A = f2S(f)/(4 D) for 350 frequencies in the range 50 ≤ fT ≤ 400. We note that the agreement with equation (27) is quite impressive for both k = 1 and k = 2, with the size of the error bars being less than the size of the symbols.
In our experimental setup, we used a flow cell made up of a cover slip and a plastic slide. Two holes were drilled in the plastic slide to form the inlet and outlet of the chamber. Then the plastic slide and cover slip were attached using double-sided tape and sealed with nail polish, which was left to cure for 24 h. 1.2 μm polystyrene beads (SVP-10-5, Spherotech, Lake Forest, IL) were diluted in phosphate-buffered saline (PBS) with 0.05% Tween 20. The sample was agitated, centrifuged and resuspended in PBS with 1% bovine serum albumin (BSA) and 0.05% Tween 20. Subsequently the bead suspension was introduced in the flow cell chamber and the holes were sealed with nail polish, followed by immediate imaging. The beads were imaged in an inverted microscope equipped with a 40× objective (Olympus PlanApo, N.A. 0.95) and a sCMOS camera (Andor Zyla 4.2) operated at 100 frames per second. Each recorded sequence consisted of 4096 frames. Bead tracking in the plane was performed in LabView using a cross-correlation based tracking algorithm [82]. An image of a polystyrene bead is shown in the inset of figure 8. Several rings are observed in the image because the bead is intentionally not in focus, in order to increase tracking accuracy.
We close this subsection with the following remark. Estimating the diffusion coefficient or the temporal evolution of the MSD, one often slices a long trajectory into many smaller ones, which permits to create some statistical average. In this case, it seems to be appropriate for a stationary process because one here estimates either a constant (diffusion coefficient) or deals with a monotonically growing function of time. For the PSD the situation is much more subtle, because in the discrete-time case it is a periodic function of the parameter ω, with an oscillatory behavior within each prime period. Therefore, in this case one needs to fit the PSD only on a particular interval of variation of ω described below our equation (49). While slicing the trajectories permits one to create some statistical ensemble here, as well, this occurs at the expense of shrinking the interval [ωl, ωr], and it is not clear a priori if such a procedure can be beneficial or detrimental to the analysis. This question will be studied in detail elsewhere.
5.2. Discrete-time case: moment-generating function and distribution of a single-trajectory periodogram
The expression in equation (41) permits us to write formally the moment-generating function and the distribution of a single-trajectory periodogram as
and
where the summation extends over all possible realizations of the sequence of the increments sj. These expressions are exact, but they are of a little use, except for the (uninteresting) case when M is sufficiently small such that they can be written down in an explicit form by enumerations of all possible sequences of sj.
We notice next that the weighted sums
and
appearing in equations (50a) and (50b) can be considered as two coupled discrete-time random walks with variable, generally irrational and incommensurate step-length. From this observation, some general conclusions can be reached about the expressions in equations (50a) and (50b): (i) for arbitrarily large M, both WM and VM are bounded from above by their maximal displacements, meaning that has a bounded support and there is an upper cut-off value Rmax(M) above which is identically zero. At the same time (apart from some special cases, for instance, when fΔ/π is a rational number, see appendices A and B) the trajectories WM and VM, for whatever large M, will never visit the origin again which signifies that . (ii) Moreover, we may expect that for an arbitrarily large M there is some gap Rmin(M) below which is identically zero too, that is, WM and VM do not visit simultaneously some vicinity of the origin. These two specific features of the discrete-time settings make a significant difference as compared to the truncated Wiener's representation, in which analogous sums and are not bounded, and may, in principle, be equal to zero, since the increments ζn are continuous random variables with the support ().
We proceed with the derivation of closed-form expressions for the moment-generating function and for the distribution of a single-trajectory periodogram. Using the integral identity in equation (17), we cast equations (50a) and (50b) into a form which permits to perform the averaging very directly. This leads to
and
where J0(...) is the Bessel function of the 1st kind. Expressions (52a) and (52b) are formally exact and hold for arbitrary M, f and Δ. We note that there is a set of magic frequencies f such that fΔ = πq with q a rational number, when aj and bj become periodic functions of j and the kernel can be written down explicitly in form of a finite series of cosines. Then, the integrations in equations (52a) and (52b) can be performed exactly resulting in exact closed-form expressions for and . Several examples of such calculations are presented in appendices A and B. However, for arbitrary f, analytical calculation of the integrals in results (52a) and (52b) is certainly beyond reach and one has to resort to some approximations.
5.3. Discrete-time case in the limit M ≫ 1: limiting forms of the moment-generating function and of the distribution of a single-trajectory periodogram
Meaningful approximations of expressions (52a) and (52b) can be obtained in the large-M limit. One has, however, to distinguish between the case when fT is fixed and , and the case when but fT scales with M and may take any value within the interval (0, 2 πM).
5.3.1. Fixed fT and
For fixed T and not too large λ, the parameter is large when M ≫ 1, so that the exponential function in equation (52a) is small everywhere except for the vicinity of the origin. Moreover, the integrals in equations (52a) and (52b) look very similar to the ones appearing in the theory of Pearson's random walks with a variable step length (see, e.g., [83]) and involve a kernel . It is well known that in the limit the behavior of such a kernel is determined overwhelmingly by its behavior in the vicinity of x = 0 and y = 0, so that one has
The accuracy of this approximation is checked in figure 9 in which we depict a density plot of the difference κ(x, y) of and of the exponential function in the right-hand-side of (53). Already for M = 102 the difference κ(x, y) is zero almost everywhere and in the regions where it deviates from zero it is nonetheless numerically very small.
Download figure:
Standard image High-resolution imageNext, we have that for fT fixed and M ≫1 the ensemble-averaged periodogram and its variance admit the following representations
and
where the omitted terms have a bounded amplitude for any value of fT and decay with the growth of M in proportion to the second inverse power of M.
Inserting expression (53) into equations (52a) and (52b) and performing the integrations, we find that in the limit M ≫ 1, for fixed f T and bounded λ, the moment-generating function and the distribution of a single trajectory periodogram are given up to terms of order , by our previous results (21) and (22).
5.3.2. Arbitrary fT ∈ (0, 2 πM) and
We now relax the condition that fT is fixed, and suppose that it can attain any value within the prime period 2πM. In other words, we consider the situation in which f T scales with M such that the expansions (54a) and (54b) are invalid, and we can no longer discard the correction terms. In this case, we find that the moment-generating function of a single-trajectory periodogram is given by
Differentiating this expression once and twice, and then setting λ = 0, we observe that (55) correctly reproduces the first moment of the periodogram, but yields an incorrect expression for the variance, meaning that in this limit the approximation (53) is insufficient (although it works fairly well in the previous case with fT kept fixed). In other words, in order to obtain a moment-generating function which reproduces correctly first two moments of a single-trajectory periodogram, one has to go beyond this approximation. For Pearson's random walks with j-dependent step-lengths aj and bj this turns out to be quite a complicated problem which we are not in the position to solve here.
Conversely, on intuitive grounds, we may conjecture such a form of the moment-generating function which reproduces the first and the second moment correctly. For , this is given by
that is, has precisely the same form as the moment-generating function in the continuous-time case, equation (21), but with the first two moments replaced by the analogous properties of a single-trajectory periodogram. Evidently, a generalization to the case of a k-component periodogram amounts to merely replacing 1/2 by k/2.
In turn, the form (56) implies that the distribution of a single-trajectory periodogram is given by equation (22) with replaced by and replaced by its discrete-time counterpart , that is,
We are unable to prove expression (56) (and hence, equation (57)) in the general case of arbitrary f. However, as we have already remarked, there are plenty of cases in which the moment-generation function and the distribution in equations (52a) and (52b) can be calculated exactly. In appendix B we present such calculations for several particular values of the frequency, such that , fΔ = π/2, fΔ = 2 π and fΔ = π, and show that the exact discrete-time results do indeed converge to the asymptotic forms (56) and (57). Therefore, we have all grounds to believe that expression (57) is correct.
Lastly, we note that equation (57) permits us to make a substantial generalization of the single-trajectory relation (26). We see that for (in which domain with any necessary accuracy set by the choice of ε), a single-trajectory periodogram obeys
implying that the spectrum of a single-trajectory periodogram should be the same as for the ensemble-averaged periodogram, while only the amplitude A will be a fluctuating property. The limiting distribution of the amplitude is given by equation (27).
6. Conclusions and discussion
We studied here in detail the statistical properties of the PSD of a single trajectory of a d-dimensional BM. We calculated exactly the moment-generating function, the full probability density function and the moments of arbitrary, not necessarily integer order of such a PSD in the most general case of arbitrary frequency f, arbitrary (not necessarily infinite) observation time T and arbitrary number of projections of a given trajectory onto the coordinate axes. We showed that for a sufficiently large T (and the frequency f > 0) a single-trajectory PSD for any realization of the process is proportional to its first moment, which embodies the full dependence on the frequency and on the diffusion coefficient. This implies that the correct frequency-dependence specific to an ensemble of trajectories can be deduced already from a single trajectory and solely the numerical proportionality factor in this relation is random, and varies from realization to realization. Due to this fact, one cannot infer the precise value of the ensemble-averaged diffusion coefficient from a single-trajectory PSD since the ensemble-averaged PSD is linearly proportional to the diffusion coefficient which thus appears to be multiplied by a random amplitude. The distribution function of this amplitude was also calculated exactly here and its effective width was quantified using standard criteria.
Moreover, we addressed several questions emerging in connection with the numerical and experimental verification of our analytical predictions for the continuous-time BM. To this end, we first considered Wiener's representation of BM in form of an infinite Fourier series with random coefficients, whose truncated version is often used in numerical simulations. We showed that the distribution of the single-trajectory PSD obtained from Wiener's series in which just N terms are kept, instead of an infinite number, has exactly the same form as the one obtained for the continuous-time BM when fT is within the interval (0, πN). Outside this interval, the probability density function of the truncated PSD converges to a different form.
Next, we examined the case when a trajectory of continuous-time BM is recorded at some discrete time moments, so that the whole trajectory is represented by a set of M points and the PSD, called a periodogram, becomes a periodic function of the product fT with the prime period equal to 2πM. We analyzed several aspects of this discrete-time problem. Namely, we studied how big M should be taken at a fixed observation time T so that we may recover the results obtained for the continuous-time BM. Apart of that, we studied the limiting forms of the distribution of a single-trajectory periodogram and showed, in particular, that for fT kept fixed and , the latter converges to the form obtained for the continuous-time BM. In contrast, when fT is left arbitrary so that it may assume any value within the prime period, that is, fT ∈ (0, 2πM), the limiting distribution of a single-trajectory periodogram converges to a different form as . Our analysis revealed the remarkable observation that for fT belonging to a certain interval within the prime period, a single-trajectory periodogram equals, up to a random numerical amplitude, the ensemble-averaged periodogram, and the latter embodies the full dependence on f and T. Therefore, similarly to the continuous-time case, the correct spectrum can be obtained already from a single trajectory.
To check our analytical predictions we performed numerical analyzes, using a truncated Wiener's representation and also Monte Carlo simulations of discrete-time random walks, as well as an experimental analysis of BM of polystyrene beads in aqueous solution. We confirmed our prediction of the relation connecting a single-trajectory PSD and its ensemble averaged counter-part and demonstrated that the former indeed exhibits a well-defined 1/f2-dependence, specific to a standard, ensemble-averaged PSD. Furthermore, our numerical analysis confirmed the form of the distribution function of the amplitude in this relation.
The theoretical analysis for the case of BM presented here can be extended in several directions. In particular, one may inquire about analogous distributions for anomalous diffusion processes [57], as exemplified, for instance, by superdiffusive Lévy motion or subdiffusive continuous-time random walks with a broad distribution of the waiting times. Another important example of transport processes is given by a wide and experimentally relevant class of anomalous diffusion processes called fractional BM, for which the generalization of our analysis is relatively straightforward. Subdiffusive fractional BM is related to the overdamped, generalized Langevin equation with a power-law memory kernel typical for viscoelastic systems [84]. However, superdiffusive fractional BM was identified for active transport in biological cells [85]. We also mention the very active field of diffusion with varying diffusion coefficients. These can be systematically varying in space [64], time [86, 87], or be randomly varying in time [88–90] or space [91–93]. Lastly, a challenging field of further research is to search for such periodic functions of the frequency in equation (1), in place of the exponential function, for which a single-trajectory PSD will possess an ergodic property so that its variance will tend to zero in the limit .
Acknowledgments
DK, RM and GO wish to thank for warm hospitality the International Center for Advanced Studies in Buenos Aires, where this work was initiated. DK acknowledges a partial support from the NSF Grant No. 1401432. EM acknowledges funding from the European Research Council (ERC) under the European Union Horizon 2020 research and innovation program (grant agreement No. 694925).
Appendix A.: Moment-generating function and the distribution of the PSD for f = 0
We focus in this appendix on the moment-generating function and on the probability density function of a single-trajectory PSD for the continuous-time BM and of the discrete-time single-trajectory periodogram in the special case f = 0.
Consider first the case of BM. The first moment and the variance of the PSD for f = 0 follow from equations (6) and (10) and are given explicitly by
and
so that the coefficient of variation is . Hence, in this case the coefficient in front of the quadratic term in equation (21) vanishes and the moment-generating function obeys
Taking the inverse Laplace transform, we then find that in this case the distribution is given by
that is, it is the χ2-distribution with k degrees of freedom. Note that in contrast to the distribution (22) with f bounded away from zero, the one in equation (A4) does not attain a limiting form as the observation time . For any fixed S > 0 one has
In the discrete-time case, for f = 0, the first moment of the periodogram, the variance and the variation coefficient γR are given by equations (46a)–(46c), respectively. We notice that they converge, as , to their continuous-time counterparts in equations (A1) and (A2), and , meaning that in this limit the moment-generating function and the distribution of the periodogram converge to the forms in equations (A3) and (A4).
For f = 0 in the discrete-time case, the moment-generating function and the distribution of the periodogram can be calculated exactly for arbitrary M, which permits us to estimate the rate of convergence. We constrain our analysis to the case when M is divisible by 4. We impose such a constraint just for a simplicity of exposition, exact solutions can be also obtained for other values of M resulting, however, in rather cumbersome expressions without providing any additional insight.
We notice that for f = 0 the coefficients bj in equation (42b) are identically equal to zero, which implies that VM ≡ 0, (51b), while the coefficients aj in equation (42a) are explicitly given by . Consequently, the sum appearing in equations (50a) and (50b) can be thought of as a 'random walk' with a linearly 'shrinking' time-dependent step length (see, e.g., [94, 95] for other examples of such random walks). In this case, the product of cosines in equations (52a) and (52b) can be explicitly written down as
where the coefficient qM(j) is defined by
with being the q-Pochhammer symbol [96]. In other terms, qM(j) is the numerical coefficient before the term qj in the polynomial . For , is simply the number of all possible partitions of an integer j into distinct parts. Note also that qM(j) is symmetric around m = 0 such that , .
Inserting next expression (A6) into equations (52a) and (52b) and performing the integrations, we arrive at the following exact results for the moment-generating function and the distribution of a single-trajectory periodogram RM(f) for f = 0:
and
Note that R has a discrete support on the interval with . Note, as well, that the presence of a delta-function at R = 0 is the consequence of the choice of M; for M not divisible by 4 the coefficient in front of δ(R) equals zero because the 'random walk' WM never visits the origin.
In figure A1, upper panel, we present a comparison of the expressions for the moment-generating functions in equation (A3) for the continuous-time case and in equation (A8) for the discrete-time case, for different densities of the discrete-time points for different values of T. We observe that for T = 1 and M = 12 (corresponding to Δ = 1/12) the agreement between the continuous- and the discrete-time results is fairly good up to λ = 103. For T = 10 and M = 36, such that on average 3.6 points are recorded for a unit of time, the discrete-time result agrees with the continuous-time one up to . Increasing the number of points up to M = 60, so that we have now 6 (instead of 3.6) points per unit of time, a perfect agreement between equations (A3) and (A8) extends up to λ = 103. Lastly, for the observation time T = 20 with 60 recorded points (which corresponds to just 3 points per unit of time T) we have a perfect agreement between the discrete- and continuous-time predictions for values of λ up to λ = 3 × 102 and a significant departure for λ exceeding these values. In contrast, for T = 20 and M = 102, such that we have 5 points for each unit of time, a perfect agreement is observed for the whole range of variation of λ. We thus conclude that, in fact, once we seek to achieve a good agreement between the continuous- and discrete-time expressions for the moment-generation function, we do not need a very high precision in approximating the continuous-time trajectory by a discrete-time one.
Download figure:
Standard image High-resolution imageNext, in the lower panel of figure A1 we compare the probability density functions obtained for the continuous-time and the discrete-time cases. Since the distribution in the discrete-time case is a finite sum of delta-functions (see equation (A9)) it is appropriate to compare not the distributions themselves, but rather their integrated forms, the cumulative distribution functions , obtained by integrating the forms in equations (A4) and (A9) over dR from 0 to R. From figure A1 (lower panel) we conclude that here too just 16 recorded points per unit of time appear to be enough to get a fairly good agreement between the distributions in the continuous- and the discrete-time cases.
Appendix B.: Several exact results for the moment-generating function and the distribution of a single-trajectory periodogram
In this appendix we present several stray examples for which the moment-generating function in equation (50a) and the distribution of a single-trajectory periodogram in equation (50b) can be calculated exactly. Our aim here is to demonstrate that in all these particular cases the exact moment-generating function and the distribution converge to the forms (56) and (57) as , which validates our conjecture. These particular choices of the frequency within the prime period are: fΔ = π, fΔ = π/2, fΔ = 2 π/3 and the endpoint of the prime period, fΔ = 2 π. We note that for all choices of the frequency the first moment μR(f, T) and the variance σR(f, T) converge as to different values, as compared to their continuous-time counterparts. This means, in turn, that for fT within the prime period these are the forms in equations (56) and (57) which describe the moment-generating function and the distribution of the periodogram in the limit , but not the continuous-time results derived in section 3.
B.1. The case fΔ = π
We start with the simplest case, where the convergence of the moment-generating function (50a) and of the distribution of a single-trajectory periodogram (50b) to the limiting forms in equations (56) and (57) can be established analytically. We will again assume, for simplicity, that M is divisible by 4. We hasten to remark that such a constraint is not crucial—an exact solution can be obtained in the general case of an arbitrary M, but the resulting expressions will be more cumbersome.
For fΔ = π, the coefficients bj in equation (42b) all vanish, such that the 'random walk' VM (51b) stays at the origin. The coefficients aj in equation (42a) are periodic with period 2; that being, , and are given by when M is divisible by 4. Consequently, the random walk WM in equation (51a) steps at even time moments and pauses at the odd ones. In this case, the first moment and the variance of a single-trajectory periodogram are, respectively, given by equations (45a) and (45b), while the coefficient of variation obeys equation (45c) and tends to as . This means that the limiting form of the distribution of a single trajectory periodogram for such a choice of f should be the χ2-distribution. We show below that this is indeed the case.
For M divisible by 4, the kernel in (52a) is given by
where is the binomial coefficient. Substituting equation (B1) into equation (52a) and integrating term by term, we find that for fΔ = π the moment-generating function is given by the following sum of exponential functions:
and hence, the distribution of a single-trajectory periodogram is given explicitly as a sum of delta-functions
Note that again the delta-peak at R = 0 is a consequence of the specific choice of M. For M not divisible by 4 such a peak is absent, precisely as it happens for a standard random walk, for which the parity of the number M of steps matters.
In the limit it is legitimate to replace the summation operation by an integration and use
to get from expression (B2) the following limiting expression for the moment-generating function
Recalling that we see then that for the right-hand-side of the latter equation converges to
which is precisely our conjectured expression (56) corresponding to . The distribution is then obtained upon inverting the Laplace transform to get
which is the χ2-distribution conjectured in equation (57). The convergence of equation (B2)–(B6), and also of (B3)–(B7) for different values of M is demonstrated in figure B1.
Download figure:
Standard image High-resolution imageB.2. The case
Consider the case when fT is at one-quarter of the prime period. In this case the coefficients aj and bj are periodic functions of j with period 4, i.e., and . Within the period, their values are given by
such that random walk WM (51a) pauses at first two 'time' moments and steps on two consecutive ones. The random walk VM (51b) pauses at the first time moment, then makes two steps and pauses again.
Now, we again assume that M is divisible by 4. Then, the kernel in (52a) reduces to
such that the contributions dependent on x and on y appear in the factorized form. Inserting the latter expansion into equation (52a) and integrating term by term, we get
where is defined by
The function is identically equal to zero for odd n. For even n, n = 2k, we write
Further on, for even m, m = 2q, the latter sum can be written down as a finite series of cosines
such that is given explicitly by
For odd m, m = 2q + 1, we have
which gives
Equations (B10), (B14) and (B16) define completely the moment-generating function of a single-trajectory periodogram in the case f = π/(2Δ). The distribution function follows straightforwardly by expanding in a series of exponentials and taking the inverse Laplace transform.
In figure B2 we plot our exact result for the moment-generating function in equation (B10) and the corresponding result for the distribution function of a single-trajectory periodogram, together with the conjectured expressions (56) and (57) with and γR defined by equations (47a) and (47c). We observe a very convincing convergence of the exact results, upon an increase of M, to the conjectured forms.
Download figure:
Standard image High-resolution imageB.3. The case fΔ = 2π/3
In this case the coefficients aj (42a) and bj (42b) are periodic functions of j with period 3, and are given by
within their period.
We assume now that M is divisible by 3. In this case the kernel
Next, we use
which gives, in combination with (B18), the following representation of the kernel
Inserting the latter expansion into equation (52a) and integrating term by term, we obtain the following exact result for the moment-generating function of a single-trajectory periodogram
where is defined in equations (B14) and (B16). The moment-generating function is a finite sum of exponentials and the corresponding distribution is a sum of delta-functions with coefficients which can be readily deduced from equation (B21). In figure B3 we demonstrate that Φλ in equation (B21), and also the cumulative distribution function of the distribution deduced from equation (B21), do indeed converge to the conjectured forms (56) and (57) with and , equations (48a) and (48b).
Download figure:
Standard image High-resolution imageB.4. The case fΔ = 2π
We finally consider the special case when fT is at the endpoint of the prime period. In this case, and , such that we are led to the special case (f = 0) studied already in appendix A. Convergence of the discrete-time moment-generating function in equation (A8) and the cumulative distribution function of the distribution in equation (A9) to the conjectured forms in equations (56) and (57) is evident.