1 Introduction
In this note, which is an extended version of [30 ] , we consider the well-studied problem of parameter estimation for stochastic differential equations (SDEs) from continuous-time observations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , t ∈ [ 0 , T ] 𝑡 0 𝑇 t\in[0,T] italic_t ∈ [ 0 , italic_T ] [26 ] . It is well-known that the corresponding maximum likelihood estimator does not depend continuously on the observations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , t ∈ [ 0 , T ] 𝑡 0 𝑇 t\in[0,T] italic_t ∈ [ 0 , italic_T ] , which can result in a systematic estimation bias [28 , 15 ] . In other words, the maximum likelihood estimator is not robust with respect to perturbations in the observations. Here, we revisit this problem from the perspective of online (time-continuous) parameter estimation [7 , 12 ] using the popular ensemble Kalman filter (EnKF) and its continuous-time ensemble Kalman–Bucy filter (EnKBF) formulations [16 , 11 , 27 ] . As for the corresponding maximum likelihood approaches, the EnKBF does not depend continuously on the incoming observations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , t ≥ 0 𝑡 0 t\geq 0 italic_t ≥ 0 , with respect to the uniform norm topology on the space of continuous functions. This fact has been first investigated in [10 ] using rough path theory [17 ] . In particular, as already demonstrated for the related maximum likelihood estimator in [15 ] , rough path theory allows one to specify an appropriately generalised topology which leads to a continuous dependence of the EnKBF estimators on the observations. Here we expand the analysis of [10 ] to a frequentist analysis of the EnKBF in the spirit of [31 ] , where the primary focus is on the expected behaviour of the EnKBF estimators over all admissible observation paths. One recovers that the discontinuous dependence of the EnKBF estimators on the driving observations results in a systematic bias from a frequentist perspective. This is also a well known fact for SDEs driven by multiplicative noise [24 ] .
The proposed frequentist perspective naturally enables the study of known bias correction methods, such as subsampling the data [28 ] , a recently proposed data filtering approach [2 ] , as well as novel de-biasing approaches [10 ] in the context of the EnKBF.
In order to facilitate a rather elementary mathematical analysis, we consider only the very much simplified problem of parameter estimation for linear SDEs. This restriction allows us to avoid certain technicalities from rough path theory and enables a rather straightforward application of the numerical rough path approach put forward in [14 ] . As a result we are able to demonstrate that the popular approach of subsampling the data [3 , 28 , 6 ] can be well justified from a frequentist perspective. The frequentist perspective also suggests a rather natural approach to the estimation of the required correction term in the case an EnKBF is implemented without subsampling.
We end this introductory paragraph with a reference to [1 ] , which includes a broad survey on alternative estimation techniques. We also point to [10 ] for an in-depth discussion of rough path theory in connection to filtering and parameter estimation.
The remainder of this paper is structured as follows. The problem setting and the EnKBF are introduced in the subsequent Section
2 . The frequentist perspective and its implications on the specific implementations of an EnKBF in the context of low and high frequency data assimilation are laid out in Section 3 . The importance of these considerations becomes transparent when applying the EnKBF to perturbed data in Section 4 . Here again, we restrict attention to a rather simple model setting taken from [18 ] and also used in [10 ] . As a result we build a clear connection between subsampling and the necessity for a correction term in the case high frequency data is assimilated directly.
We also provide a discussion of the data filtering approach [2 ] in the context of our simply model system. A brief numerical demonstration is provided in Section 5 , which is followed by a concluding remark in Section 6 .
2 Ensemble Kalman parameter estimation
We consider the SDE parameter estimation problem
d X t = f ( X t , θ ) d t + γ 1 / 2 d W t d subscript 𝑋 𝑡 𝑓 subscript 𝑋 𝑡 𝜃 d 𝑡 superscript 𝛾 1 2 d subscript 𝑊 𝑡 {\rm d}X_{t}=f(X_{t},\theta){\rm d}t+\gamma^{1/2}{\rm d}W_{t} roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_θ ) roman_d italic_t + italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT roman_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
(1)
subject to observations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , t ∈ [ 0 , T ] 𝑡 0 𝑇 t\in[0,T] italic_t ∈ [ 0 , italic_T ] , which arise from the reference system
d X t † = f † ( X t † ) d t + γ 1 / 2 d W t † , d superscript subscript 𝑋 𝑡 † superscript 𝑓 † superscript subscript 𝑋 𝑡 † d 𝑡 superscript 𝛾 1 2 d subscript superscript 𝑊 † 𝑡 {\rm d}X_{t}^{\dagger}=f^{\dagger}(X_{t}^{\dagger}){\rm d}t+\gamma^{1/2}{\rm d%
}W^{\dagger}_{t}, roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = italic_f start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) roman_d italic_t + italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT roman_d italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ,
(2)
where the unknown drift function f † ( x ) superscript 𝑓 † 𝑥 f^{\dagger}(x) italic_f start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( italic_x ) typically satisfies f † ( x ) = f ( x , θ † ) superscript 𝑓 † 𝑥 𝑓 𝑥 superscript 𝜃 † f^{\dagger}(x)=f(x,\theta^{\dagger}) italic_f start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( italic_x ) = italic_f ( italic_x , italic_θ start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) and θ † superscript 𝜃 † \theta^{\dagger} italic_θ start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT
denotes the true parameter value. Here we assume for simplicity that the unknown parameter is scalar-valued and that the state
variable is d 𝑑 d italic_d -dimensional with d ≥ 1 𝑑 1 d\geq 1 italic_d ≥ 1 . Furthermore, W t subscript 𝑊 𝑡 W_{t} italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and W t † superscript subscript 𝑊 𝑡 † W_{t}^{\dagger} italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT denote independent standard d 𝑑 d italic_d -dimensional Brownian motions and γ > 0 𝛾 0 \gamma>0 italic_γ > 0 is the (known) diffusion constant.
Following the Bayesian paradigm, we treat the unknown parameter as a random variable Θ Θ \Theta roman_Θ . Furthermore, we apply a sequential approach and update Θ Θ \Theta roman_Θ with the incoming data X t † subscript superscript 𝑋 † 𝑡 X^{\dagger}_{t} italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT as a function of time. Hence we introduce the
random variable Θ t subscript Θ 𝑡 \Theta_{t} roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT which obeys the Bayesian posterior distribution given all observations X τ † superscript subscript 𝑋 𝜏 † X_{\tau}^{\dagger} italic_X start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , τ ∈ [ 0 , t ] 𝜏 0 𝑡 \tau\in[0,t] italic_τ ∈ [ 0 , italic_t ] , up to time t > 0 𝑡 0 t>0 italic_t > 0 . Furthermore, instead of exactly solving the time-continuous Bayesian inference problem as specified by the associated Kushner–Stratonovitch equation [7 , 27 ] , we define the time evolution of Θ t subscript Θ 𝑡 \Theta_{t} roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
by an application of the (deterministic) ensemble Kalman–Bucy filter (EnKBF) mean-field equations
[11 , 27 ] , which take the form
d Θ t & = γ − 1 π t [ ( θ − π t [ θ ] ) ⊗ f ( X t † , θ ) ] d I t , d I t = d X t † − 1 2 ( f ( X t † , Θ t ) + π t [ f ( X t † , θ ) ] ) d t , formulae-sequence d subscript Θ 𝑡 & superscript 𝛾 1 subscript 𝜋 𝑡 delimited-[] tensor-product 𝜃 subscript 𝜋 𝑡 delimited-[] 𝜃 𝑓 superscript subscript 𝑋 𝑡 † 𝜃 d subscript 𝐼 𝑡 d subscript 𝐼 𝑡 d superscript subscript 𝑋 𝑡 † 1 2 𝑓 superscript subscript 𝑋 𝑡 † subscript Θ 𝑡 subscript 𝜋 𝑡 delimited-[] 𝑓 subscript superscript 𝑋 † 𝑡 𝜃 d 𝑡 {\rm d}\Theta_{t}&=\gamma^{-1}\pi_{t}\left[(\theta-\pi_{t}[\theta])\otimes f(X%
_{t}^{\dagger},\theta)\right]{\rm d}I_{t},\\
{\rm d}I_{t}={\rm d}X_{t}^{\dagger}-\frac{1}{2}\left(f(X_{t}^{\dagger},\Theta_%
{t})+\pi_{t}[f(X^{\dagger}_{t},\theta)]\right){\rm d}t, roman_d roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT & = italic_γ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ ( italic_θ - italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ italic_θ ] ) ⊗ italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , italic_θ ) ] roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ italic_f ( italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_θ ) ] ) roman_d italic_t ,
where π t subscript 𝜋 𝑡 \pi_{t} italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT denotes the probability density function (PDF) of Θ t subscript Θ 𝑡 \Theta_{t} roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and π t [ g ] subscript 𝜋 𝑡 delimited-[] 𝑔 \pi_{t}[g] italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ italic_g ] the associated expectation value of a function g ( θ ) 𝑔 𝜃 g(\theta) italic_g ( italic_θ ) . The column vector I t subscript 𝐼 𝑡 I_{t} italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , defined by (2 b), is called the innovation,
while the row vector
K t ( π t ) = γ − 1 π t [ ( θ − π t [ θ ] ) ⊗ f ( X t † , θ ) ] , subscript 𝐾 𝑡 subscript 𝜋 𝑡 superscript 𝛾 1 subscript 𝜋 𝑡 delimited-[] tensor-product 𝜃 subscript 𝜋 𝑡 delimited-[] 𝜃 𝑓 superscript subscript 𝑋 𝑡 † 𝜃 K_{t}(\pi_{t})=\gamma^{-1}\pi_{t}\left[(\theta-\pi_{t}[\theta])\otimes f(X_{t}%
^{\dagger},\theta)\right], italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = italic_γ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ ( italic_θ - italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ italic_θ ] ) ⊗ italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , italic_θ ) ] ,
(4)
premultiplying the innovation in (2 a) is called the gain. Here the notation a ⊗ b = a b T tensor-product 𝑎 𝑏 𝑎 superscript 𝑏 T a\otimes b=ab^{\rm T} italic_a ⊗ italic_b = italic_a italic_b start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , where a , b 𝑎 𝑏
a,b italic_a , italic_b can be any two column vectors, has been used. The initial condition Θ 0 ∼ π 0 similar-to subscript Θ 0 subscript 𝜋 0 \Theta_{0}\sim\pi_{0} roman_Θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∼ italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is provided by the prior PDF of the unknown parameter.
A Monte-Carlo implementation of the mean-field equations (2 ) leads to the interacting particle system
d Θ t ( i ) & = γ − 1 π t M [ ( θ − π t M [ θ ] ) ⊗ f ( X t † , θ ) ] d I t ( i ) , d I t ( i ) = d X t † − 1 2 ( f ( X t † , Θ t ( i ) ) + π t M [ f ( X t † , θ ) ] ) d t , formulae-sequence d superscript subscript Θ 𝑡 𝑖 & superscript 𝛾 1 superscript subscript 𝜋 𝑡 𝑀 delimited-[] tensor-product 𝜃 superscript subscript 𝜋 𝑡 𝑀 delimited-[] 𝜃 𝑓 superscript subscript 𝑋 𝑡 † 𝜃 d superscript subscript 𝐼 𝑡 𝑖 d superscript subscript 𝐼 𝑡 𝑖 d superscript subscript 𝑋 𝑡 † 1 2 𝑓 superscript subscript 𝑋 𝑡 † superscript subscript Θ 𝑡 𝑖 superscript subscript 𝜋 𝑡 𝑀 delimited-[] 𝑓 subscript superscript 𝑋 † 𝑡 𝜃 d 𝑡 {\rm d}\Theta_{t}^{(i)}&=\gamma^{-1}\pi_{t}^{M}\left[(\theta-\pi_{t}^{M}[%
\theta])\otimes f(X_{t}^{\dagger},\theta)\right]{\rm d}I_{t}^{(i)},\\
{\rm d}I_{t}^{(i)}={\rm d}X_{t}^{\dagger}-\frac{1}{2}\left(f(X_{t}^{\dagger},%
\Theta_{t}^{(i)})+\pi_{t}^{M}[f(X^{\dagger}_{t},\theta)]\right){\rm d}t, roman_d roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT & = italic_γ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT [ ( italic_θ - italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT [ italic_θ ] ) ⊗ italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , italic_θ ) ] roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT = roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ) + italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT [ italic_f ( italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_θ ) ] ) roman_d italic_t ,
i = 1 , … , M 𝑖 1 … 𝑀
i=1,\ldots,M italic_i = 1 , … , italic_M , where expectations are now taken with respect to the empirical measure. That is,
π t M [ g ] = 1 M ∑ i = 1 M g ( Θ t ( i ) ) superscript subscript 𝜋 𝑡 𝑀 delimited-[] 𝑔 1 𝑀 superscript subscript 𝑖 1 𝑀 𝑔 superscript subscript Θ 𝑡 𝑖 \pi_{t}^{M}[g]=\frac{1}{M}\sum_{i=1}^{M}g(\Theta_{t}^{(i)}) italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT [ italic_g ] = divide start_ARG 1 end_ARG start_ARG italic_M end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_g ( roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT )
(6)
for given function g ( θ ) 𝑔 𝜃 g(\theta) italic_g ( italic_θ ) , and all Monte-Carlo samples are driven by the same (fixed) observations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT .
The initial samples Θ 0 ( i ) superscript subscript Θ 0 𝑖 \Theta_{0}^{(i)} roman_Θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , i = 1 , … , M 𝑖 1 … 𝑀
i=1,\ldots,M italic_i = 1 , … , italic_M , are drawn identically and independently from the prior distribution
π 0 subscript 𝜋 0 \pi_{0} italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .
We note in passing that there is also a stochastic variant of the innovation process [27 ] defined by
d I t = d X t † − f ( X t † , Θ t ) d t − γ 1 / 2 d W t , d subscript 𝐼 𝑡 d superscript subscript 𝑋 𝑡 † 𝑓 superscript subscript 𝑋 𝑡 † subscript Θ 𝑡 d 𝑡 superscript 𝛾 1 2 d subscript 𝑊 𝑡 {\rm d}I_{t}={\rm d}X_{t}^{\dagger}-f(X_{t}^{\dagger},\Theta_{t}){\rm d}t-%
\gamma^{1/2}{\rm d}W_{t}, roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) roman_d italic_t - italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT roman_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ,
(7)
which leads to the Monte-Carlo approximation
d I t ( i ) = d X t † − f ( X t † , Θ t ( i ) ) d t − γ 1 / 2 d W t ( i ) d superscript subscript 𝐼 𝑡 𝑖 d superscript subscript 𝑋 𝑡 † 𝑓 superscript subscript 𝑋 𝑡 † superscript subscript Θ 𝑡 𝑖 d 𝑡 superscript 𝛾 1 2 d superscript subscript 𝑊 𝑡 𝑖 {\rm d}I_{t}^{(i)}={\rm d}X_{t}^{\dagger}-f(X_{t}^{\dagger},\Theta_{t}^{(i)}){%
\rm d}t-\gamma^{1/2}{\rm d}W_{t}^{(i)} roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT = roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ) roman_d italic_t - italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT roman_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT
(8)
of the innovation in (2 ).
We now state a numerical implementation with step-size Δ t > 0 Δ 𝑡 0 \Delta t>0 roman_Δ italic_t > 0 and denote the resulting
numerical approximations at t n = n Δ t subscript 𝑡 𝑛 𝑛 Δ 𝑡 t_{n}=n\Delta t italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_n roman_Δ italic_t by Θ n ∼ π n similar-to subscript Θ 𝑛 subscript 𝜋 𝑛 \Theta_{n}\sim\pi_{n} roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∼ italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , n ≥ 1 𝑛 1 n\geq 1 italic_n ≥ 1 . While a standard Euler–Maruyama approximation could be applied, the following stable discrete-time mean-field formulation of the
EnKBF
Θ n + 1 = Θ n + K n { ( X t n + 1 † − X t n † ) − 1 2 ( f ( X t n † , Θ n ) + π n [ f ( X t n † , θ ) ] ) Δ t } subscript Θ 𝑛 1 subscript Θ 𝑛 subscript 𝐾 𝑛 superscript subscript 𝑋 subscript 𝑡 𝑛 1 † superscript subscript 𝑋 subscript 𝑡 𝑛 † 1 2 𝑓 superscript subscript 𝑋 subscript 𝑡 𝑛 † subscript Θ 𝑛 subscript 𝜋 𝑛 delimited-[] 𝑓 subscript superscript 𝑋 † subscript 𝑡 𝑛 𝜃 Δ 𝑡 \Theta_{n+1}=\Theta_{n}+K_{n}\left\{(X_{t_{n+1}}^{\dagger}-X_{t_{n}}^{\dagger}%
)-\frac{1}{2}\left(f(X_{t_{n}}^{\dagger},\Theta_{n})+\pi_{n}[f(X^{\dagger}_{t_%
{n}},\theta)]\right)\Delta t\right\} roman_Θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT = roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT { ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_f ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) + italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ italic_f ( italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_θ ) ] ) roman_Δ italic_t }
(10)
is inspired by [4 ] with Kalman gain
K n & = π n [ ( θ − π n [ θ ] ) ⊗ f ( X t n † , θ ) ] × ( γ + Δ t π n [ ( f ( X t n † , θ ) − π n [ f ( X t n † , θ ) ] ) ⊗ f ( X t n † , θ ) ] ) − 1 . K_{n}&=\pi_{n}\left[(\theta-\pi_{n}[\theta])\otimes f(X_{t_{n}}^{\dagger},%
\theta)\right]\times\\
\quad\left(\gamma+\Delta t\pi_{n}\left[\left(f(X_{t_{n}}^{\dagger},\theta)-\pi%
_{n}[f(X_{t_{n}}^{\dagger},\theta)]\right)\otimes f(X_{t_{n}}^{\dagger},\theta%
)\right]\right)^{-1}. italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT & = italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ ( italic_θ - italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ italic_θ ] ) ⊗ italic_f ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , italic_θ ) ] × ( italic_γ + roman_Δ italic_t italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ ( italic_f ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , italic_θ ) - italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ italic_f ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , italic_θ ) ] ) ⊗ italic_f ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , italic_θ ) ] ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
It is straightforward to combine this time discretisation with the Monte-Carlo approximation (2 ) in order to
obtain a complete numerical implementation of the EnKBF.
The EnKBF provides only an approximate solution to the Bayesian inference problem for general nonlinear f ( x , θ ) 𝑓 𝑥 𝜃 f(x,\theta) italic_f ( italic_x , italic_θ ) .
However, it becomes exact in the mean-field limit for affine drift functions f ( x , θ ) = θ A x + B x + c 𝑓 𝑥 𝜃 𝜃 𝐴 𝑥 𝐵 𝑥 𝑐 f(x,\theta)=\theta Ax+Bx+c italic_f ( italic_x , italic_θ ) = italic_θ italic_A italic_x + italic_B italic_x + italic_c .
Example 2.3 .
Consider the stochastic partial differential equation
∂ t u = − U ∂ y u + ρ ∂ y 2 u + 𝒲 ˙ subscript 𝑡 𝑢 𝑈 subscript 𝑦 𝑢 𝜌 superscript subscript 𝑦 2 𝑢 ˙ 𝒲 \partial_{t}u=-U\partial_{y}u+\rho\partial_{y}^{2}u+\dot{\mathcal{W}} ∂ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_u = - italic_U ∂ start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT italic_u + italic_ρ ∂ start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_u + over˙ start_ARG caligraphic_W end_ARG
(13)
over a periodic spatial domain y ∈ [ 0 , L ) 𝑦 0 𝐿 y\in[0,L) italic_y ∈ [ 0 , italic_L ) , where 𝒲 ( t , y ) 𝒲 𝑡 𝑦 \mathcal{W}(t,y) caligraphic_W ( italic_t , italic_y ) denotes space-time white noise, U ∈ ℝ 𝑈 ℝ U\in\mathbb{R} italic_U ∈ blackboard_R , and ρ > 0 𝜌 0 \rho>0 italic_ρ > 0 are given parameters. A standard finite-difference discretisation in space with d 𝑑 d italic_d grid points and mesh-size Δ y normal-Δ 𝑦 \Delta y roman_Δ italic_y leads to a linear system of SDEs of the form
d 𝐮 t = − ( U D + ρ D D T ) 𝐮 t d t + Δ y − 1 / 2 d W t , d subscript 𝐮 𝑡 𝑈 𝐷 𝜌 𝐷 superscript 𝐷 T subscript 𝐮 𝑡 d 𝑡 Δ superscript 𝑦 1 2 d subscript 𝑊 𝑡 {\rm d}{\bf u}_{t}=-(UD+\rho DD^{\rm T}){\bf u}_{t}{\rm d}t+\Delta y^{-1/2}{%
\rm d}W_{t}, roman_d bold_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = - ( italic_U italic_D + italic_ρ italic_D italic_D start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ) bold_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_d italic_t + roman_Δ italic_y start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT roman_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ,
(14)
where 𝐮 t ∈ ℝ d subscript 𝐮 𝑡 superscript ℝ 𝑑 {\bf u}_{t}\in\mathbb{R}^{d} bold_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT denotes the vector of grid approximations at time t 𝑡 t italic_t , D ∈ ℝ d × d 𝐷 superscript ℝ 𝑑 𝑑 D\in\mathbb{R}^{d\times d} italic_D ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d end_POSTSUPERSCRIPT
a finite difference approximation of the spatial derivative ∂ y subscript 𝑦 \partial_{y} ∂ start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT , and W t subscript 𝑊 𝑡 W_{t} italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT the standard d 𝑑 d italic_d -dimensional Brownian motion. We can now set X t = 𝐮 t subscript 𝑋 𝑡 subscript 𝐮 𝑡 X_{t}={\bf u}_{t} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = bold_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , γ = Δ y − 1 𝛾 normal-Δ superscript 𝑦 1 \gamma=\Delta y^{-1} italic_γ = roman_Δ italic_y start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT and identify either θ = U 𝜃 𝑈 \theta=U italic_θ = italic_U or θ = ρ 𝜃 𝜌 \theta=\rho italic_θ = italic_ρ as the unknown parameter in order to obtain an SDE of the form (1 ).
In this note, we further simplify our given inference problem to the case
f ( x , θ ) = θ A x , 𝑓 𝑥 𝜃 𝜃 𝐴 𝑥 f(x,\theta)=\theta Ax\,, italic_f ( italic_x , italic_θ ) = italic_θ italic_A italic_x ,
(15)
where A ∈ ℝ d × d 𝐴 superscript ℝ 𝑑 𝑑 A\in\mathbb{R}^{d\times d} italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d end_POSTSUPERSCRIPT is a normal matrix with eigenvalues in the left half plane. That is σ ( A ) ⊂ ℂ − 𝜎 𝐴 subscript ℂ \sigma(A)\subset\mathbb{C}_{-} italic_σ ( italic_A ) ⊂ blackboard_C start_POSTSUBSCRIPT - end_POSTSUBSCRIPT . The reference parameter value is set to θ † = 1 superscript 𝜃 † 1 \theta^{\dagger}=1 italic_θ start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = 1 . Hence the SDE (2 ) possesses a Gaussian invariant measure with mean zero and covariance matrix
C = − γ ( A + A T ) − 1 . 𝐶 𝛾 superscript 𝐴 superscript 𝐴 T 1 C=-\gamma(A+A^{\rm T})^{-1}. italic_C = - italic_γ ( italic_A + italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
(16)
We assume from now on that the observations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT are realisations of (2 ) with initial condition X 0 † ∼ N ( 0 , C ) similar-to superscript subscript 𝑋 0 † N 0 𝐶 X_{0}^{\dagger}\sim{\rm N}(0,C) italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ∼ roman_N ( 0 , italic_C ) .
Under these assumptions, the EnKBF (2 ) simplifies drastically, and we obtain
d Θ t & = σ t γ ( A X t † ) T d I t , d I t = d X t † − 1 2 ( Θ t + π t [ θ ] ) A X t † d t , formulae-sequence d subscript Θ 𝑡 & subscript 𝜎 𝑡 𝛾 superscript 𝐴 subscript superscript 𝑋 † 𝑡 T d subscript 𝐼 𝑡 d subscript 𝐼 𝑡 d superscript subscript 𝑋 𝑡 † 1 2 subscript Θ 𝑡 subscript 𝜋 𝑡 delimited-[] 𝜃 𝐴 superscript subscript 𝑋 𝑡 † d 𝑡 {\rm d}\Theta_{t}&=\frac{\sigma_{t}}{\gamma}(AX^{\dagger}_{t})^{\rm T}{\rm d}I%
_{t},\\
{\rm d}I_{t}={\rm d}X_{t}^{\dagger}-\frac{1}{2}\left(\Theta_{t}+\pi_{t}[\theta%
]\right)AX_{t}^{\dagger}{\rm d}t, roman_d roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT & = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ italic_θ ] ) italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT roman_d italic_t ,
with variance
σ t = π t [ ( θ − π t [ θ ] ) 2 ] . subscript 𝜎 𝑡 subscript 𝜋 𝑡 delimited-[] superscript 𝜃 subscript 𝜋 𝑡 delimited-[] 𝜃 2 \sigma_{t}=\pi_{t}\left[(\theta-\pi_{t}[\theta])^{2}\right]. italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ ( italic_θ - italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ italic_θ ] ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] .
(18)
Similarly, the discrete-time mean-field EnKBF (10 ) reduces to
Θ n + 1 = Θ n + K n { ( X t n + 1 † − X t n † ) − 1 2 ( Θ n + π n [ θ ] ) A X t n † Δ t } subscript Θ 𝑛 1 subscript Θ 𝑛 subscript 𝐾 𝑛 superscript subscript 𝑋 subscript 𝑡 𝑛 1 † superscript subscript 𝑋 subscript 𝑡 𝑛 † 1 2 subscript Θ 𝑛 subscript 𝜋 𝑛 delimited-[] 𝜃 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † Δ 𝑡 \Theta_{n+1}=\Theta_{n}+K_{n}\left\{(X_{t_{n+1}}^{\dagger}-X_{t_{n}}^{\dagger}%
)-\frac{1}{2}\left(\Theta_{n}+\pi_{n}[\theta]\right)AX_{t_{n}}^{\dagger}\Delta
t\right\} roman_Θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT = roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT { ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ italic_θ ] ) italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT roman_Δ italic_t }
(20)
with Kalman gain
K n = σ n ( A X t n † ) T ( γ + Δ t σ n ( A X t n † ) T A X t n † ) − 1 . subscript 𝐾 𝑛 subscript 𝜎 𝑛 superscript 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † T superscript 𝛾 Δ 𝑡 subscript 𝜎 𝑛 superscript 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † T 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † 1 K_{n}=\sigma_{n}(AX_{t_{n}}^{\dagger})^{\rm T}\left(\gamma+\Delta t\sigma_{n}(%
AX_{t_{n}}^{\dagger})^{\rm T}AX_{t_{n}}^{\dagger}\right)^{-1}\,. italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ( italic_γ + roman_Δ italic_t italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
(21)
Furthermore, since X t † ∼ N ( 0 , C ) similar-to superscript subscript 𝑋 𝑡 † N 0 𝐶 X_{t}^{\dagger}\sim{\rm N}(0,C) italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ∼ roman_N ( 0 , italic_C ) ,
( A X t † ) T A X t † = ( A T A ) : ( X t † ⊗ X t † ) ≈ ( A T A ) : C : superscript 𝐴 superscript subscript 𝑋 𝑡 † T 𝐴 superscript subscript 𝑋 𝑡 † superscript 𝐴 T 𝐴 tensor-product superscript subscript 𝑋 𝑡 † superscript subscript 𝑋 𝑡 † superscript 𝐴 T 𝐴 : 𝐶 (AX_{t}^{\dagger})^{\rm T}AX_{t}^{\dagger}=(A^{\rm T}A):(X_{t}^{\dagger}%
\otimes X_{t}^{\dagger})\approx(A^{\rm T}A):C ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) ≈ ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C
(22)
for d ≫ 1 much-greater-than 𝑑 1 d\gg 1 italic_d ≫ 1 , and we may simplify the Kalman gain to
K n = σ n ( A X t n † ) T ( γ + Δ t σ n ( A T A ) : C ) − 1 . K_{n}=\sigma_{n}\,(AX_{t_{n}}^{\dagger})^{\rm T}\left(\gamma+\Delta t\sigma_{n%
}\,(A^{\rm T}A):C\right)^{-1}. italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ( italic_γ + roman_Δ italic_t italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
(23)
Here we have used the notation A : B = tr ( A T B ) : 𝐴 𝐵 tr superscript 𝐴 T 𝐵 A:B=\mbox{tr}(A^{\rm T}B) italic_A : italic_B = tr ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_B ) to denote the Frobenius inner product of two matrices A , B ∈ ℝ d × d 𝐴 𝐵
superscript ℝ 𝑑 𝑑 A,B\in\mathbb{R}^{d\times d} italic_A , italic_B ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d end_POSTSUPERSCRIPT . The approximation (22 ) becomes exact in the limit d → ∞ → 𝑑 d\to\infty italic_d → ∞ , which we will frequently assume in the following section. Please note that
K n = σ n γ ( A X t n † ) T + 𝒪 ( Δ t ) subscript 𝐾 𝑛 subscript 𝜎 𝑛 𝛾 superscript 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † T 𝒪 Δ 𝑡 K_{n}=\frac{\sigma_{n}}{\gamma}\,(AX_{t_{n}}^{\dagger})^{\rm T}+\mathcal{O}(%
\Delta t) italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT + caligraphic_O ( roman_Δ italic_t )
(24)
under the stated assumptions.
Alternatively, if one wishes to explicitly utilise the availability of continuous-time data X t † subscript superscript 𝑋 † 𝑡 X^{\dagger}_{t} italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , one could apply the following variant of (20 ):
Θ n + 1 = Θ n + σ n γ ∫ t n t n + 1 ( A X t † ) T d X t † − 1 2 K n A X t n † ( Θ n + π n [ θ ] ) Δ t , subscript Θ 𝑛 1 subscript Θ 𝑛 subscript 𝜎 𝑛 𝛾 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑋 𝑡 † T differential-d superscript subscript 𝑋 𝑡 † 1 2 subscript 𝐾 𝑛 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † subscript Θ 𝑛 subscript 𝜋 𝑛 delimited-[] 𝜃 Δ 𝑡 \Theta_{n+1}=\Theta_{n}+\frac{\sigma_{n}}{\gamma}\int_{t_{n}}^{t_{n+1}}(AX_{t}%
^{\dagger})^{\rm T}{\rm d}X_{t}^{\dagger}-\frac{1}{2}K_{n}AX_{t_{n}}^{\dagger}%
\left(\Theta_{n}+\pi_{n}[\theta]\right)\Delta t, roman_Θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT = roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT [ italic_θ ] ) roman_Δ italic_t ,
(30)
and following the Itô/Euler–Maruyama approximation (12 ), discretise the integral with a small inner step-size Δ τ = Δ t / L Δ 𝜏 Δ 𝑡 𝐿 \Delta\tau=\Delta t/L roman_Δ italic_τ = roman_Δ italic_t / italic_L , L ≫ 1 much-greater-than 𝐿 1 L\gg 1 italic_L ≫ 1 ; that is,
∫ t n t n + 1 ( A X t † ) T d X t † ≈ ∑ l = 0 L − 1 ( A X τ l † ) T ( X τ l + 1 † − X τ l † ) superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑋 𝑡 † T differential-d superscript subscript 𝑋 𝑡 † superscript subscript 𝑙 0 𝐿 1 superscript 𝐴 superscript subscript 𝑋 subscript 𝜏 𝑙 † T superscript subscript 𝑋 subscript 𝜏 𝑙 1 † superscript subscript 𝑋 subscript 𝜏 𝑙 † \int_{t_{n}}^{t_{n+1}}(AX_{t}^{\dagger})^{\rm T}{\rm d}X_{t}^{\dagger}\approx%
\sum_{l=0}^{L-1}(AX_{\tau_{l}}^{\dagger})^{\rm T}(X_{\tau_{l+1}}^{\dagger}-X_{%
\tau_{l}}^{\dagger}) ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ≈ ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L - 1 end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT )
(31)
with τ l = t n + l Δ τ subscript 𝜏 𝑙 subscript 𝑡 𝑛 𝑙 Δ 𝜏 \tau_{l}=t_{n}+l\Delta\tau italic_τ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_l roman_Δ italic_τ . We note that
∑ l = 0 L − 1 ( A X τ l † ) T ( X τ l + 1 † − X τ l † ) & = ( A X t n † ) T ( X t n + 1 † − X t n † ) + A T : ( ∑ l = 0 L − 1 ( X τ l † − X t n † ) ⊗ ( X τ l + 1 † − X τ l † ) ) , : superscript subscript 𝑙 0 𝐿 1 superscript 𝐴 superscript subscript 𝑋 subscript 𝜏 𝑙 † T superscript subscript 𝑋 subscript 𝜏 𝑙 1 † superscript subscript 𝑋 subscript 𝜏 𝑙 † & superscript 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † T superscript subscript 𝑋 subscript 𝑡 𝑛 1 † superscript subscript 𝑋 subscript 𝑡 𝑛 † superscript 𝐴 T superscript subscript 𝑙 0 𝐿 1 tensor-product superscript subscript 𝑋 subscript 𝜏 𝑙 † superscript subscript 𝑋 subscript 𝑡 𝑛 † superscript subscript 𝑋 subscript 𝜏 𝑙 1 † superscript subscript 𝑋 subscript 𝜏 𝑙 † \sum_{l=0}^{L-1}(AX_{\tau_{l}}^{\dagger})^{\rm T}(X_{\tau_{l+1}}^{\dagger}-X_{%
\tau_{l}}^{\dagger})&=(AX_{t_{n}}^{\dagger})^{\rm T}(X_{t_{n+1}}^{\dagger}-X_{%
t_{n}}^{\dagger})\,\,+\\
\,\,A^{\rm T}:\left(\sum_{l=0}^{L-1}(X_{\tau_{l}}^{\dagger}-X_{t_{n}}^{\dagger%
})\otimes(X_{\tau_{l+1}}^{\dagger}-X_{\tau_{l}}^{\dagger})\right), ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L - 1 end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) & = ( italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) + italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : ( ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L - 1 end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) ⊗ ( italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) ) ,
which is at the heart of rough path analysis [14 ] and which we utilise in the following section.
3 Frequentist analysis
It is well-known that the second-order contribution in (2 ) leads to a discontinuous dependence of the integral
on the observed X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT in the uniform norm topology on the space of continuous functions. Rough path theory fixes this problem by defining appropriately extended topologies and has been extended to the EnKBF in [10 ] . In this section, we complement the path-wise analysis from [10 ] by an analysis of the impact of second-order contribution on the EnKBF (2 ) from a frequentist perspective, which analyses the behaviour of EnKBF over all possible observations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT subject to (2 ). In other words, one switches from a strong solution concept to a weak one. While we assume that the observations satisfy (2 ), throughout this section, we will analyse the impact of a perturbed observation process on the EnKBF in Section 4 .
We first derive evolution equations for the conditional mean and variance under the assumption that Θ 0 subscript Θ 0 \Theta_{0} roman_Θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is Gaussian distributed with given prior mean m prior subscript 𝑚 prior m_{\rm prior} italic_m start_POSTSUBSCRIPT roman_prior end_POSTSUBSCRIPT and variance σ prior subscript 𝜎 prior \sigma_{\rm prior} italic_σ start_POSTSUBSCRIPT roman_prior end_POSTSUBSCRIPT . It follows directly from (2 ) that the conditional mean μ t = π t [ θ ] subscript 𝜇 𝑡 subscript 𝜋 𝑡 delimited-[] 𝜃 \mu_{t}=\pi_{t}[\theta] italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ italic_θ ] , that is the mean of Θ t subscript Θ 𝑡 \Theta_{t} roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , satisfies the SDE
d μ t = σ t γ ( ( A X t † ) T d X t † − μ t ( A T A ) : ( X t † ⊗ X t † ) d t ) , {\rm d}\mu_{t}=\frac{\sigma_{t}}{\gamma}\left((AX_{t}^{\dagger})^{\rm T}{\rm d%
}X^{\dagger}_{t}-\mu_{t}\,(A^{\rm T}A):(X_{t}^{\dagger}\otimes X_{t}^{\dagger}%
)\,{\rm d}t\right), roman_d italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) roman_d italic_t ) ,
(33)
which simplifies to
d μ t = σ t γ ( ( A X t † ) T d X t † − μ t ( A T A ) : C d t ) , {\rm d}\mu_{t}=\frac{\sigma_{t}}{\gamma}\left((AX_{t}^{\dagger})^{\rm T}{\rm d%
}X^{\dagger}_{t}-\mu_{t}\,(A^{\rm T}A):C\,{\rm d}t\right), roman_d italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C roman_d italic_t ) ,
(34)
under the approximation (22 ). The initial condition is μ 0 = m prior subscript 𝜇 0 subscript 𝑚 prior \mu_{0}=m_{\rm prior} italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_m start_POSTSUBSCRIPT roman_prior end_POSTSUBSCRIPT .
The evolution equation for the conditional variance, that is the variance of Θ t subscript Θ 𝑡 \Theta_{t} roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , is given by
d d t σ t = − σ t 2 γ ( A T A ) : ( X t † ⊗ X t † ) : d d 𝑡 subscript 𝜎 𝑡 superscript subscript 𝜎 𝑡 2 𝛾 superscript 𝐴 T 𝐴 tensor-product superscript subscript 𝑋 𝑡 † superscript subscript 𝑋 𝑡 † \frac{\rm d}{{\rm d}t}\sigma_{t}=-\frac{\sigma_{t}^{2}}{\gamma}\,(A^{\rm T}A):%
(X_{t}^{\dagger}\otimes X_{t}^{\dagger}) divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = - divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT )
(35)
with initial condition σ 0 = σ prior subscript 𝜎 0 subscript 𝜎 prior \sigma_{0}=\sigma_{\rm prior} italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT roman_prior end_POSTSUBSCRIPT and which again reduces to
d d t σ t = − σ t 2 γ ( A T A ) : C : d d 𝑡 subscript 𝜎 𝑡 superscript subscript 𝜎 𝑡 2 𝛾 superscript 𝐴 T 𝐴 𝐶 \frac{\rm d}{{\rm d}t}\sigma_{t}=-\frac{\sigma_{t}^{2}}{\gamma}\,(A^{\rm T}A):C divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = - divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C
(36)
under the approximation (22 ).
We now perform a frequentist analysis of the estimator μ t subscript 𝜇 𝑡 \mu_{t} italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT defined by (34 ) and (36 ), that is,
we perform a weak analysis of the SDE (34 ) in terms of the first two moments of μ t subscript 𝜇 𝑡 \mu_{t} italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [31 ] . In the first step, we take the expectation of (34 ) over all realisations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT of the SDE (2 ), which we denote by
m t := 𝔼 † [ μ t ] . assign subscript 𝑚 𝑡 superscript 𝔼 † delimited-[] subscript 𝜇 𝑡 m_{t}:=\mathbb{E}^{\dagger}[\mu_{t}]. italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT := blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] .
(37)
The associated evolution equation is given by
d d t m t = σ t γ ( A T A ) : 𝔼 † [ X t † ⊗ X t † ] − σ t γ ( A T A ) : C m t , : d d 𝑡 subscript 𝑚 𝑡 subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴 superscript 𝔼 † delimited-[] tensor-product superscript subscript 𝑋 𝑡 † superscript subscript 𝑋 𝑡 † subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴 : 𝐶 subscript 𝑚 𝑡 \frac{\rm d}{{\rm d}t}m_{t}=\frac{\sigma_{t}}{\gamma}\,(A^{\rm T}A):\mathbb{E}%
^{\dagger}\left[X_{t}^{\dagger}\otimes X_{t}^{\dagger}\right]-\frac{\sigma_{t}%
}{\gamma}\,(A^{\rm T}A):C\,m_{t}, divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ] - divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ,
(38)
which reduces to
d d t m t = σ t γ ( A T A ) : C ( 1 − m t ) = σ t ( A T A ) : ( A + A T ) − 1 ( 1 − m t ) . : d d 𝑡 subscript 𝑚 𝑡 subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴 𝐶 1 subscript 𝑚 𝑡 subscript 𝜎 𝑡 superscript 𝐴 T 𝐴 : superscript 𝐴 superscript 𝐴 T 1 1 subscript 𝑚 𝑡 \frac{\rm d}{{\rm d}t}m_{t}=\frac{\sigma_{t}}{\gamma}\,(A^{\rm T}A):C\,(1-m_{t%
})=\sigma_{t}\,(A^{\rm T}A):(A+A^{\rm T})^{-1}\,(1-m_{t}). divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C ( 1 - italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_A + italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( 1 - italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) .
(39)
In the second step, we also look at the frequentist variance
p t := 𝔼 † [ ( μ t − m t ) 2 ] . assign subscript 𝑝 𝑡 superscript 𝔼 † delimited-[] superscript subscript 𝜇 𝑡 subscript 𝑚 𝑡 2 p_{t}:=\mathbb{E}^{\dagger}[(\mu_{t}-m_{t})^{2}]. italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT := blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ ( italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] .
(40)
Using
d ( μ t − m t ) & = σ t γ { ( A T A ) : ( X t † ⊗ X t † − C ) d t + γ 1 / 2 ( A X t † ) T d W t † } − σ t γ ( A T A ) : C ( μ t − m t ) d t , : d subscript 𝜇 𝑡 subscript 𝑚 𝑡 & limit-from subscript 𝜎 𝑡 𝛾 conditional-set superscript 𝐴 T 𝐴 tensor-product superscript subscript 𝑋 𝑡 † superscript subscript 𝑋 𝑡 † 𝐶 d 𝑡 superscript 𝛾 1 2 superscript 𝐴 superscript subscript 𝑋 𝑡 † T d subscript superscript 𝑊 † 𝑡 subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴
𝐶 subscript 𝜇 𝑡 subscript 𝑚 𝑡 d 𝑡 {\rm d}(\mu_{t}-m_{t})&=\frac{\sigma_{t}}{\gamma}\left\{(A^{\rm T}A):\left(X_{%
t}^{\dagger}\otimes X_{t}^{\dagger}-C\right){\rm d}t+\gamma^{1/2}(AX_{t}^{%
\dagger})^{\rm T}{\rm d}W^{\dagger}_{t}\right\}\,\,-\\
\qquad\qquad\frac{\sigma_{t}}{\gamma}(A^{\rm T}A):C\,(\mu_{t}-m_{t}){\rm d}t, roman_d ( italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) & = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG { ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_C ) roman_d italic_t + italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } - divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C ( italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) roman_d italic_t ,
we obtain
d d t p t & = − σ t γ ( A T A ) : C ( 2 p t − σ t ) + 2 σ t γ ( A T A ) : 𝔼 † [ ( X t † ⊗ X t † − C ) ( μ t − m t ) ] , : d d 𝑡 subscript 𝑝 𝑡 & subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴 limit-from 𝐶 2 subscript 𝑝 𝑡 subscript 𝜎 𝑡 2 subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴
: superscript 𝔼 † delimited-[] tensor-product superscript subscript 𝑋 𝑡 † superscript subscript 𝑋 𝑡 † 𝐶 subscript 𝜇 𝑡 subscript 𝑚 𝑡 \frac{\rm d}{{\rm d}t}p_{t}&=-\frac{\sigma_{t}}{\gamma}\,(A^{\rm T}A):C\left(2%
p_{t}-\sigma_{t}\right)\,\,+\\
\qquad\qquad\frac{2\sigma_{t}}{\gamma}\,(A^{\rm T}A):\mathbb{E}^{\dagger}\left%
[(X_{t}^{\dagger}\otimes X_{t}^{\dagger}-C)\,(\mu_{t}-m_{t})\right], divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT & = - divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C ( 2 italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + divide start_ARG 2 italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_C ) ( italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ] ,
which we simplify to
d d t p t = σ t γ ( A T A ) : C ( σ t − 2 p t ) = σ t ( A T A ) : ( A + A T ) − 1 ( σ t − 2 p t ) : d d 𝑡 subscript 𝑝 𝑡 subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴 𝐶 subscript 𝜎 𝑡 2 subscript 𝑝 𝑡 subscript 𝜎 𝑡 superscript 𝐴 T 𝐴 : superscript 𝐴 superscript 𝐴 T 1 subscript 𝜎 𝑡 2 subscript 𝑝 𝑡 \frac{\rm d}{{\rm d}t}p_{t}=\frac{\sigma_{t}}{\gamma}\,(A^{\rm T}A):C\left(%
\sigma_{t}-2p_{t}\right)=\sigma_{t}\,(A^{\rm T}A):(A+A^{\rm T})^{-1}\left(%
\sigma_{t}-2p_{t}\right) divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C ( italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - 2 italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_A + italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - 2 italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT )
(43)
under the approximation (22 ). The initial conditions are m 0 = m prior subscript 𝑚 0 subscript 𝑚 prior m_{0}=m_{\rm prior} italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_m start_POSTSUBSCRIPT roman_prior end_POSTSUBSCRIPT and p 0 = 0 subscript 𝑝 0 0 p_{0}=0 italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 , respectively. We note that the differential equations (36 ) and (43 ) are explicitly solvable. For example, it holds that
σ t = σ 0 1 + ( A T A ) : ( A T + A ) − 1 σ 0 t subscript 𝜎 𝑡 subscript 𝜎 0 : 1 superscript 𝐴 T 𝐴 superscript superscript 𝐴 T 𝐴 1 subscript 𝜎 0 𝑡 \sigma_{t}=\frac{\sigma_{0}}{1+(A^{\rm T}A):(A^{\rm T}+A)^{-1}\,\sigma_{0}t} italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 1 + ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT + italic_A ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_t end_ARG
(44)
and one finds that σ t ∼ 1 / ( ( A T A ) : ( A T + A ) − 1 t ) \sigma_{t}\sim 1/((A^{\rm T}A):(A^{\rm T}+A)^{-1}\,t) italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∼ 1 / ( ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT + italic_A ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_t ) for t ≫ 1 much-greater-than 𝑡 1 t\gg 1 italic_t ≫ 1 . It can also be shown that p t ≤ σ t subscript 𝑝 𝑡 subscript 𝜎 𝑡 p_{t}\leq\sigma_{t} italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≤ italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT for all t ≥ 0 𝑡 0 t\geq 0 italic_t ≥ 0 . Furthermore, this analysis suggests that the learning rate in the stochastic gradient descent formulation (2.4 ) should be chosen as
α t = min { α ¯ , 1 ( A T A ) : ( A T + A ) − 1 t } , subscript 𝛼 𝑡 ¯ 𝛼 1 : superscript 𝐴 T 𝐴 superscript superscript 𝐴 T 𝐴 1 𝑡 \alpha_{t}=\min\left\{\bar{\alpha},\frac{1}{(A^{\rm T}A):(A^{\rm T}+A)^{-1}\,t%
}\right\}, italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_min { over¯ start_ARG italic_α end_ARG , divide start_ARG 1 end_ARG start_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT + italic_A ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_t end_ARG } ,
(45)
where α ¯ > 0 ¯ 𝛼 0 \bar{\alpha}>0 over¯ start_ARG italic_α end_ARG > 0 denotes an initial learning rate; for example α ¯ = σ 0 ¯ 𝛼 subscript 𝜎 0 \bar{\alpha}=\sigma_{0} over¯ start_ARG italic_α end_ARG = italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .
We finally conduct a formal analysis of the ensemble Kalman filter time-stepping (20 ) and demonstrate that the method is first-order accurate with regard to the implied frequentist mean m t subscript 𝑚 𝑡 m_{t} italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT . We recall (24 )
and conclude from (20 ) that the implied update on the variance σ n subscript 𝜎 𝑛 \sigma_{n} italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT satisfies
σ n + 1 = σ n − σ n 2 γ ( A T A ) : C Δ t + 𝒪 ( Δ t 2 ) , : subscript 𝜎 𝑛 1 subscript 𝜎 𝑛 superscript subscript 𝜎 𝑛 2 𝛾 superscript 𝐴 T 𝐴 𝐶 Δ 𝑡 𝒪 Δ superscript 𝑡 2 \sigma_{n+1}=\sigma_{n}-\frac{\sigma_{n}^{2}}{\gamma}\,(A^{\rm T}A):C\Delta t+%
\mathcal{O}(\Delta t^{2}), italic_σ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - divide start_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C roman_Δ italic_t + caligraphic_O ( roman_Δ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,
(46)
which provides a first-order approximation to (36 ).
We next analyse the evolution equation (34 ) for the conditional mean μ t subscript 𝜇 𝑡 \mu_{t} italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and its numerical approximation
μ n + 1 = μ n + K n { ( X t n + 1 † − X t n † ) − μ n A X t n † Δ t } subscript 𝜇 𝑛 1 subscript 𝜇 𝑛 subscript 𝐾 𝑛 superscript subscript 𝑋 subscript 𝑡 𝑛 1 † superscript subscript 𝑋 subscript 𝑡 𝑛 † subscript 𝜇 𝑛 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † Δ 𝑡 \mu_{n+1}=\mu_{n}+K_{n}\left\{(X_{t_{n+1}}^{\dagger}-X_{t_{n}}^{\dagger})-\mu_%
{n}AX_{t_{n}}^{\dagger}\Delta t\right\} italic_μ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT { ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) - italic_μ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT roman_Δ italic_t }
(47)
arising from (20 ). Here we follow [14 ] in order to analyse the impact of the data X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT on the estimator. An in-depth theoretical treatment can be found in [10 ] .
Comparing (47 ) to (34 ) and utilising (24 ), we find that the key quantity of
interest is
J t n , t n + 1 † := ∫ t n t n + 1 ( A X t † ) T d X t † , assign subscript superscript 𝐽 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑋 𝑡 † T differential-d superscript subscript 𝑋 𝑡 † J^{\dagger}_{t_{n},t_{n+1}}:=\int_{t_{n}}^{t_{n+1}}(AX_{t}^{\dagger})^{\rm T}{%
\rm d}X_{t}^{\dagger}, italic_J start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT := ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ,
(48)
which we can rewrite as
J t n , t n + 1 † = A T : ( X t n † ⊗ X t n , t n + 1 † ) + A T : 𝕏 t n , t n + 1 † . : subscript superscript 𝐽 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript 𝐴 T tensor-product subscript superscript 𝑋 † subscript 𝑡 𝑛 subscript superscript 𝑋 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript 𝐴 T : superscript subscript 𝕏 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
† J^{\dagger}_{t_{n},t_{n+1}}=A^{\rm T}:(X^{\dagger}_{t_{n}}\otimes X^{\dagger}_%
{t_{n},t_{n+1}})+A^{\rm T}:\mathbb{X}_{t_{n},t_{n+1}}^{\dagger}\,. italic_J start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : ( italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) + italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : blackboard_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT .
(49)
Here, motivated by (2 ) and following standard rough path notation, we have used
X t n , t n + 1 † := X t n + 1 † − X t n † assign subscript superscript 𝑋 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript subscript 𝑋 subscript 𝑡 𝑛 1 † superscript subscript 𝑋 subscript 𝑡 𝑛 † X^{\dagger}_{t_{n},t_{n+1}}:=X_{t_{n+1}}^{\dagger}-X_{t_{n}}^{\dagger} italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT := italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT
(50)
and the second-order iterated Itô integral
𝕏 t n , t n + 1 † := ∫ t n t n + 1 ( X t † − X t n † ) ⊗ d X t † . assign superscript subscript 𝕏 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
† superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 tensor-product subscript superscript 𝑋 † 𝑡 subscript superscript 𝑋 † subscript 𝑡 𝑛 differential-d superscript subscript 𝑋 𝑡 † \mathbb{X}_{t_{n},t_{n+1}}^{\dagger}:=\int_{t_{n}}^{t_{n+1}}(X^{\dagger}_{t}-X%
^{\dagger}_{t_{n}})\otimes{\rm d}X_{t}^{\dagger}. blackboard_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT := ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ⊗ roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT .
(51)
The difference between the integral (48 ) and its corresponding approximation in (47 ) is provided by
A T : 𝕏 t n , t n + 1 † : superscript 𝐴 T superscript subscript 𝕏 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
† A^{\rm T}:\mathbb{X}_{t_{n},t_{n+1}}^{\dagger} italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : blackboard_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT plus higher-order terms arising from (24 ). The iterated integral 𝕏 t n , t n + 1 † subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
\mathbb{X}^{\dagger}_{t_{n},t_{n+1}} blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT becomes a random variable from the frequentist perspective. Taking note of (2 ), we find that the drift, f ( x ) = A x 𝑓 𝑥 𝐴 𝑥 f(x)=Ax italic_f ( italic_x ) = italic_A italic_x , contributes with terms of order 𝒪 ( Δ t 2 ) 𝒪 Δ superscript 𝑡 2 \mathcal{O}(\Delta t^{2}) caligraphic_O ( roman_Δ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) to 𝕏 t n , t n + 1 † subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
\mathbb{X}^{\dagger}_{t_{n},t_{n+1}} blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and the expected value of 𝕏 t n , t n + 1 † subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
\mathbb{X}^{\dagger}_{t_{n},t_{n+1}} blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT therefore satisfies
𝔼 † [ 𝕏 t n , t n + 1 † ] = 𝒪 ( Δ t 2 ) , superscript 𝔼 † delimited-[] subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
𝒪 Δ superscript 𝑡 2 \mathbb{E}^{\dagger}[\mathbb{X}^{\dagger}_{t_{n},t_{n+1}}]=\mathcal{O}(\Delta t%
^{2}), blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] = caligraphic_O ( roman_Δ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,
(52)
since 𝔼 † [ W t n , τ † ] = 0 superscript 𝔼 † delimited-[] subscript superscript 𝑊 † subscript 𝑡 𝑛 𝜏
0 \mathbb{E}^{\dagger}[W^{\dagger}_{t_{n},\tau}]=0 blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_τ end_POSTSUBSCRIPT ] = 0 for τ > t n 𝜏 subscript 𝑡 𝑛 \tau>t_{n} italic_τ > italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , and
𝔼 † [ 𝕎 t n , t n + 1 † ] = 1 2 𝔼 † [ W t n , t n + 1 † ⊗ W t n , t n + 1 † − [ W t n † , W t n , t n + 1 † ] ] − Δ t 2 I = 0 , superscript 𝔼 † delimited-[] superscript subscript 𝕎 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
† 1 2 superscript 𝔼 † delimited-[] tensor-product subscript superscript 𝑊 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript superscript 𝑊 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript superscript 𝑊 † subscript 𝑡 𝑛 subscript superscript 𝑊 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
Δ 𝑡 2 𝐼 0 \mathbb{E}^{\dagger}[\mathbb{W}_{t_{n},t_{n+1}}^{\dagger}]=\frac{1}{2}\mathbb{%
E}^{\dagger}[W^{\dagger}_{t_{n},t_{n+1}}\otimes W^{\dagger}_{t_{n},t_{n+1}}-[W%
^{\dagger}_{t_{n}},W^{\dagger}_{t_{n},t_{n+1}}]]-\frac{\Delta t}{2}I=0, blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ blackboard_W start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ] = divide start_ARG 1 end_ARG start_ARG 2 end_ARG blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - [ italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] ] - divide start_ARG roman_Δ italic_t end_ARG start_ARG 2 end_ARG italic_I = 0 ,
(53)
where we have introduced the commutator
[ W t n † , W t n , t n + 1 † ] := W t n † ⊗ W t n , t n + 1 † − W t n , t n + 1 † ⊗ W t n † . assign superscript subscript 𝑊 subscript 𝑡 𝑛 † superscript subscript 𝑊 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
† tensor-product subscript superscript 𝑊 † subscript 𝑡 𝑛 subscript superscript 𝑊 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
tensor-product subscript superscript 𝑊 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript subscript 𝑊 subscript 𝑡 𝑛 † [W_{t_{n}}^{\dagger},W_{t_{n},t_{n+1}}^{\dagger}]:=W^{\dagger}_{t_{n}}\otimes W%
^{\dagger}_{t_{n},t_{n+1}}-W^{\dagger}_{t_{n},t_{n+1}}\otimes W_{t_{n}}^{%
\dagger}. [ italic_W start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , italic_W start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ] := italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_W start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT .
(54)
Hence we find that, while (47 ) is not a first-order (strong) approximation of the SDE (34 ), the approximation becomes first-order in m t subscript 𝑚 𝑡 m_{t} italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT when averaged over realisations X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT of the SDE (2 ). More precisely, one obtains
𝔼 † [ J t n , t n + 1 † ] = ( A T A ) : C Δ t + 𝒪 ( Δ t 2 ) . : superscript 𝔼 † delimited-[] subscript superscript 𝐽 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript 𝐴 T 𝐴 𝐶 Δ 𝑡 𝒪 Δ superscript 𝑡 2 \mathbb{E}^{\dagger}[J^{\dagger}_{t_{n},t_{n+1}}]=(A^{\rm T}A):C\Delta t+%
\mathcal{O}(\Delta t^{2}). blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_J start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] = ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C roman_Δ italic_t + caligraphic_O ( roman_Δ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .
(55)
We note that the modified scheme (30 ) leads to the same time evolution in the variance σ n subscript 𝜎 𝑛 \sigma_{n} italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT while the update in μ n subscript 𝜇 𝑛 \mu_{n} italic_μ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is changed to
μ n + 1 = μ n + σ n γ ∫ t n t n + 1 ( A X t † ) T d X t † − K n A X t n † μ n Δ t . subscript 𝜇 𝑛 1 subscript 𝜇 𝑛 subscript 𝜎 𝑛 𝛾 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑋 𝑡 † T differential-d superscript subscript 𝑋 𝑡 † subscript 𝐾 𝑛 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 † subscript 𝜇 𝑛 Δ 𝑡 \mu_{n+1}=\mu_{n}+\frac{\sigma_{n}}{\gamma}\int_{t_{n}}^{t_{n+1}}(AX_{t}^{%
\dagger})^{\rm T}{\rm d}X_{t}^{\dagger}-K_{n}AX_{t_{n}}^{\dagger}\mu_{n}\Delta
t. italic_μ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT roman_Δ italic_t .
(56)
This modification results in a more accurate evolution in the conditional mean μ n subscript 𝜇 𝑛 \mu_{n} italic_μ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , but because of (52 )
it does not impact to leading order the evolution of the underlying frequentist mean, m n = 𝔼 † [ μ n ] subscript 𝑚 𝑛 superscript 𝔼 † delimited-[] subscript 𝜇 𝑛 m_{n}=\mathbb{E}^{\dagger}[\mu_{n}] italic_m start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_μ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] .
We summarise our findings in the following proposition.
Proposition 3.1 .
The discrete-time EnKBF implementations (20 ) and (30 ) both provide first-order approximations
to the time evolution of the frequentist mean, m t subscript 𝑚 𝑡 m_{t} italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , and the frequentist variance, p t subscript 𝑝 𝑡 p_{t} italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT . In other words, both methods converge weakly with order one.
We also note that the frequentist uncertainty is essentially data-independent and depends only on the time window [ 0 , T ] 0 𝑇 [0,T] [ 0 , italic_T ] over which the data gets observed. Hence, for fixed observation interval [ 0 , T ] 0 𝑇 [0,T] [ 0 , italic_T ] , it makes sense to choose the step-size Δ t Δ 𝑡 \Delta t roman_Δ italic_t such that the discretisation error (bias) remains on the same order of magnitude as p T 1 / 2 ≈ σ T 1 / 2 superscript subscript 𝑝 𝑇 1 2 superscript subscript 𝜎 𝑇 1 2 p_{T}^{1/2}\approx\sigma_{T}^{1/2} italic_p start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ≈ italic_σ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT . Selecting a much smaller step-size would not significantly reduce the frequentist estimation error in the conditional estimator μ T subscript 𝜇 𝑇 \mu_{T} italic_μ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT .
4 Multi-scale data
We now have all the material in place to study the dependency of the EnKBF estimator on a set of observations
X t ( ϵ ) superscript subscript 𝑋 𝑡 italic-ϵ X_{t}^{(\epsilon)} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT , ϵ > 0 italic-ϵ 0 \epsilon>0 italic_ϵ > 0 , which approach the theoretical X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT with respect to the uniform norm topology
on the space of continuous functions as ϵ → 0 → italic-ϵ 0 \epsilon\to 0 italic_ϵ → 0 . Since the second-order contribution in (2 ), that is
(51 ), does not depend continuously on such perturbations, we demonstrate in this section that a systematic
bias arises in the EnKBF. Furthermore, we show how the bias can be eliminated either via subsampling the data,
which effectively amounts to ignoring these second-order contributions, or via an appropriate correction term, which ensures a continuous dependence on observations X t ( ϵ ) superscript subscript 𝑋 𝑡 italic-ϵ X_{t}^{(\epsilon)} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT with respect to the uniform norm topology. More specifically, we investigate the impact of a possible discrepancy between the SDE model (1 ), for which we aim to estimate the parameter θ 𝜃 \theta italic_θ , and the data generating SDE (2 ). We therefore replace (2 ) by the the following two-scale SDE [18 ] :
d X t ( ϵ ) & = A X t ( ϵ ) d t + γ 1 / 2 ϵ M P t ( ϵ ) d t , d P t ( ϵ ) = − 1 ϵ M P t ( ϵ ) d t + d W t † , formulae-sequence d subscript superscript 𝑋 italic-ϵ 𝑡 & 𝐴 subscript superscript 𝑋 italic-ϵ 𝑡 d 𝑡 superscript 𝛾 1 2 italic-ϵ 𝑀 subscript superscript 𝑃 italic-ϵ 𝑡 d 𝑡 d subscript superscript 𝑃 italic-ϵ 𝑡 1 italic-ϵ 𝑀 subscript superscript 𝑃 italic-ϵ 𝑡 d 𝑡 d subscript superscript 𝑊 † 𝑡 {\rm d}X^{(\epsilon)}_{t}&=AX^{(\epsilon)}_{t}\,{\rm d}t+\frac{\gamma^{1/2}}{%
\epsilon}MP^{(\epsilon)}_{t}\,{\rm d}t,\\
{\rm d}P^{(\epsilon)}_{t}=-\frac{1}{\epsilon}MP^{(\epsilon)}_{t}\,{\rm d}t+{%
\rm d}W^{\dagger}_{t}, roman_d italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT & = italic_A italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_d italic_t + divide start_ARG italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_ϵ end_ARG italic_M italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_d italic_t , roman_d italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = - divide start_ARG 1 end_ARG start_ARG italic_ϵ end_ARG italic_M italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_d italic_t + roman_d italic_W start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ,
where
M = ( 1 β − β 1 ) , 𝑀 1 𝛽 𝛽 1 M=\left(\begin{array}[]{cc}1&\beta\\
-\beta&1\end{array}\right), italic_M = ( start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL italic_β end_CELL end_ROW start_ROW start_CELL - italic_β end_CELL start_CELL 1 end_CELL end_ROW end_ARRAY ) ,
(59)
β = 2 𝛽 2 \beta=2 italic_β = 2 and ϵ = 0.01 italic-ϵ 0.01 \epsilon=0.01 italic_ϵ = 0.01 . The dimension of state space is d = 2 𝑑 2 d=2 italic_d = 2 throughout this section. While we restrict here to the simple two-scale model (4 ), similar scenarios can arise from deterministic fast-slow systems [25 , 8 ] .
The associated EnKBF mean-field equations in the parameter Θ t subscript Θ 𝑡 \Theta_{t} roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , which we now denote by Θ t ( ϵ ) superscript subscript Θ 𝑡 italic-ϵ \Theta_{t}^{(\epsilon)} roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT in order to explicitly record its dependence on the scale parameter ϵ ≪ 1 much-less-than italic-ϵ 1 \epsilon\ll 1 italic_ϵ ≪ 1 , become
d Θ t ( ϵ ) & = σ t ( ϵ ) γ ( A X t ( ϵ ) ) T d I t ( ϵ ) , d I t ( ϵ ) = d X t ( ϵ ) − 1 2 ( Θ t ( ϵ ) + π t ( ϵ ) [ θ ] ) A X t ( ϵ ) d t , formulae-sequence d superscript subscript Θ 𝑡 italic-ϵ & superscript subscript 𝜎 𝑡 italic-ϵ 𝛾 superscript 𝐴 subscript superscript 𝑋 italic-ϵ 𝑡 T d superscript subscript 𝐼 𝑡 italic-ϵ d superscript subscript 𝐼 𝑡 italic-ϵ d superscript subscript 𝑋 𝑡 italic-ϵ 1 2 superscript subscript Θ 𝑡 italic-ϵ superscript subscript 𝜋 𝑡 italic-ϵ delimited-[] 𝜃 𝐴 superscript subscript 𝑋 𝑡 italic-ϵ d 𝑡 {\rm d}\Theta_{t}^{(\epsilon)}&=\frac{\sigma_{t}^{(\epsilon)}}{\gamma}(AX^{(%
\epsilon)}_{t})^{\rm T}{\rm d}I_{t}^{(\epsilon)},\\
{\rm d}I_{t}^{(\epsilon)}={\rm d}X_{t}^{(\epsilon)}-\frac{1}{2}\left(\Theta_{t%
}^{(\epsilon)}+\pi_{t}^{(\epsilon)}[\theta]\right)AX_{t}^{(\epsilon)}{\rm d}t, roman_d roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT & = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT , roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT [ italic_θ ] ) italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT roman_d italic_t ,
with variance
σ t ( ϵ ) = π t ( ϵ ) [ ( θ − π t ( ϵ ) [ θ ] ) 2 ] superscript subscript 𝜎 𝑡 italic-ϵ superscript subscript 𝜋 𝑡 italic-ϵ delimited-[] superscript 𝜃 superscript subscript 𝜋 𝑡 italic-ϵ delimited-[] 𝜃 2 \sigma_{t}^{(\epsilon)}=\pi_{t}^{(\epsilon)}\left[(\theta-\pi_{t}^{(\epsilon)}%
[\theta])^{2}\right] italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT [ ( italic_θ - italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT [ italic_θ ] ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ]
(61)
and Θ t ϵ ∼ π t ( ϵ ) similar-to subscript superscript Θ italic-ϵ 𝑡 superscript subscript 𝜋 𝑡 italic-ϵ \Theta^{\epsilon}_{t}\sim\pi_{t}^{(\epsilon)} roman_Θ start_POSTSUPERSCRIPT italic_ϵ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∼ italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT . The discrete-time mean-field EnKBF (20 ) turns into
Θ n + 1 ( ϵ ) = Θ n ( ϵ ) + K n ( ϵ ) { ( X t n + 1 ( ϵ ) − X t n ( ϵ ) ) − 1 2 ( Θ n ( ϵ ) + π n ( ϵ ) [ θ ] ) A X t n ( ϵ ) Δ t } superscript subscript Θ 𝑛 1 italic-ϵ superscript subscript Θ 𝑛 italic-ϵ superscript subscript 𝐾 𝑛 italic-ϵ superscript subscript 𝑋 subscript 𝑡 𝑛 1 italic-ϵ superscript subscript 𝑋 subscript 𝑡 𝑛 italic-ϵ 1 2 superscript subscript Θ 𝑛 italic-ϵ superscript subscript 𝜋 𝑛 italic-ϵ delimited-[] 𝜃 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 italic-ϵ Δ 𝑡 \Theta_{n+1}^{(\epsilon)}=\Theta_{n}^{(\epsilon)}+K_{n}^{(\epsilon)}\left\{%
\left(X_{t_{n+1}}^{(\epsilon)}-X_{t_{n}}^{(\epsilon)}\right)-\frac{1}{2}\left(%
\Theta_{n}^{(\epsilon)}+\pi_{n}^{(\epsilon)}[\theta]\right)AX_{t_{n}}^{(%
\epsilon)}\Delta t\right\} roman_Θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT { ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT [ italic_θ ] ) italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT roman_Δ italic_t }
(62)
with Kalman gain
K n ( ϵ ) = σ n ( ϵ ) ( A X t n ( ϵ ) ) T ( γ + Δ t σ n ( ϵ ) ( A X t n ( ϵ ) ) T A X t n ( ϵ ) ) − 1 . superscript subscript 𝐾 𝑛 italic-ϵ superscript subscript 𝜎 𝑛 italic-ϵ superscript 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 italic-ϵ T superscript 𝛾 Δ 𝑡 superscript subscript 𝜎 𝑛 italic-ϵ superscript 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 italic-ϵ T 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 italic-ϵ 1 K_{n}^{(\epsilon)}=\sigma_{n}^{(\epsilon)}(AX_{t_{n}}^{(\epsilon)})^{\rm T}%
\left(\gamma+\Delta t\sigma_{n}^{(\epsilon)}(AX_{t_{n}}^{(\epsilon)})^{\rm T}%
AX_{t_{n}}^{(\epsilon)}\right)^{-1}\,. italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ( italic_γ + roman_Δ italic_t italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
(63)
We also consider the appropriately modified scheme (30 ):
Θ n + 1 ( ϵ ) = Θ n ( ϵ ) + σ n ( ϵ ) γ ∫ t n t n + 1 ( A X t ( ϵ ) ) T d X t ( ϵ ) − 1 2 K n ( ϵ ) A X t n ( ϵ ) ( Θ n ( ϵ ) + π n ( ϵ ) [ θ ] ) Δ t . superscript subscript Θ 𝑛 1 italic-ϵ superscript subscript Θ 𝑛 italic-ϵ superscript subscript 𝜎 𝑛 italic-ϵ 𝛾 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑋 𝑡 italic-ϵ T differential-d superscript subscript 𝑋 𝑡 italic-ϵ 1 2 superscript subscript 𝐾 𝑛 italic-ϵ 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 italic-ϵ superscript subscript Θ 𝑛 italic-ϵ superscript subscript 𝜋 𝑛 italic-ϵ delimited-[] 𝜃 Δ 𝑡 \Theta_{n+1}^{(\epsilon)}=\Theta_{n}^{(\epsilon)}+\frac{\sigma_{n}^{(\epsilon)%
}}{\gamma}\int_{t_{n}}^{t_{n+1}}(AX_{t}^{(\epsilon)})^{\rm T}{\rm d}X_{t}^{(%
\epsilon)}-\frac{1}{2}K_{n}^{(\epsilon)}AX_{t_{n}}^{(\epsilon)}\left(\Theta_{n%
}^{(\epsilon)}+\pi_{n}^{(\epsilon)}[\theta]\right)\Delta t. roman_Θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ end_ARG ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT [ italic_θ ] ) roman_Δ italic_t .
(64)
In order to understand the impact of the modified data generating process on the two mean-field EnKBF formulations
(62 ) and (64 ), respectively, we follow [18 ] and investigate the difference between
X t ( ϵ ) subscript superscript 𝑋 italic-ϵ 𝑡 X^{(\epsilon)}_{t} italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and X t † subscript superscript 𝑋 † 𝑡 X^{\dagger}_{t} italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT :
d ( X t ( ϵ ) − X t † ) & = A ( X t ( ϵ ) − X t † ) d t + γ 1 / 2 ϵ M P t ( ϵ ) d t − γ 1 / 2 d W t † = A ( X t ( ϵ ) − X t † ) d t − γ 1 / 2 d P t ( ϵ ) . d subscript superscript 𝑋 italic-ϵ 𝑡 subscript superscript 𝑋 † 𝑡 & 𝐴 subscript superscript 𝑋 italic-ϵ 𝑡 subscript superscript 𝑋 † 𝑡 d 𝑡 superscript 𝛾 1 2 italic-ϵ 𝑀 superscript subscript 𝑃 𝑡 italic-ϵ d 𝑡 superscript 𝛾 1 2 d superscript subscript 𝑊 𝑡 † 𝐴 superscript subscript 𝑋 𝑡 italic-ϵ superscript subscript 𝑋 𝑡 † d 𝑡 superscript 𝛾 1 2 d superscript subscript 𝑃 𝑡 italic-ϵ {\rm d}(X^{(\epsilon)}_{t}-X^{\dagger}_{t})&=A(X^{(\epsilon)}_{t}-X^{\dagger}_%
{t}){\rm d}t+\frac{\gamma^{1/2}}{\epsilon}MP_{t}^{(\epsilon)}{\rm d}t-\gamma^{%
1/2}{\rm d}W_{t}^{\dagger}\\
=A(X_{t}^{(\epsilon)}-X_{t}^{\dagger}){\rm d}t-\gamma^{1/2}{\rm d}P_{t}^{(%
\epsilon)}. roman_d ( italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) & = italic_A ( italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) roman_d italic_t + divide start_ARG italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_ϵ end_ARG italic_M italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT roman_d italic_t - italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT roman_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = italic_A ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) roman_d italic_t - italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT roman_d italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT .
When P t ( ϵ ) subscript superscript 𝑃 italic-ϵ 𝑡 P^{(\epsilon)}_{t} italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is stationary, it is Gaussian with mean zero and covariance
𝔼 stat [ P t ( ϵ ) ⊗ P t ( ϵ ) ] = ϵ ( M + M T ) − 1 = ϵ 2 I . subscript 𝔼 stat delimited-[] tensor-product superscript subscript 𝑃 𝑡 italic-ϵ superscript subscript 𝑃 𝑡 italic-ϵ italic-ϵ superscript 𝑀 superscript 𝑀 T 1 italic-ϵ 2 𝐼 \mathbb{E}_{\rm stat}\left[P_{t}^{(\epsilon)}\otimes P_{t}^{(\epsilon)}\right]%
=\epsilon\,(M+M^{\rm T})^{-1}=\frac{\epsilon}{2}I. blackboard_E start_POSTSUBSCRIPT roman_stat end_POSTSUBSCRIPT [ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ] = italic_ϵ ( italic_M + italic_M start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = divide start_ARG italic_ϵ end_ARG start_ARG 2 end_ARG italic_I .
(66)
Hence P t ( ϵ ) → 0 → subscript superscript 𝑃 italic-ϵ 𝑡 0 P^{(\epsilon)}_{t}\rightarrow 0 italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT → 0 as ϵ → 0 → italic-ϵ 0 \epsilon\rightarrow 0 italic_ϵ → 0 and also
X t ( ϵ ) → X t † → subscript superscript 𝑋 italic-ϵ 𝑡 subscript superscript 𝑋 † 𝑡 X^{(\epsilon)}_{t}\rightarrow X^{\dagger}_{t} italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT → italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
(67)
in L 2 superscript 𝐿 2 L^{2} italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT uniformly in t 𝑡 t italic_t , provided σ ( A ) ⊂ ℂ − 𝜎 𝐴 subscript ℂ \sigma(A)\subset\mathbb{C}_{-} italic_σ ( italic_A ) ⊂ blackboard_C start_POSTSUBSCRIPT - end_POSTSUBSCRIPT and
X 0 ( ϵ ) = X 0 † subscript superscript 𝑋 italic-ϵ 0 subscript superscript 𝑋 † 0 X^{(\epsilon)}_{0}=X^{\dagger}_{0} italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT . This is illustrated in Figure 1 .
Figure 1: SDE driven by mathematical vs. physical Brownian motion (ϵ = 0.01 italic-ϵ 0.01 \epsilon=0.01 italic_ϵ = 0.01 ). The top panel displays both
X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT (blue) and X t ( ϵ ) superscript subscript 𝑋 𝑡 italic-ϵ X_{t}^{(\epsilon)} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT (red) over the long time interval t ∈ [ 0 , 10 ] 𝑡 0 10 t\in[0,10] italic_t ∈ [ 0 , 10 ] , while the lower panel provides a zoomed in perspective over the interval t ∈ [ 0 , 1 ] 𝑡 0 1 t\in[0,1] italic_t ∈ [ 0 , 1 ] .
In order to investigate the problem further, we study the integral
J t n , t n + 1 ( ϵ ) := ∫ t n t n + 1 ( A X t ( ϵ ) ) T d X t ( ϵ ) assign subscript superscript 𝐽 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑋 𝑡 italic-ϵ T differential-d superscript subscript 𝑋 𝑡 italic-ϵ J^{(\epsilon)}_{t_{n},t_{n+1}}:=\int_{t_{n}}^{t_{n+1}}(AX_{t}^{(\epsilon)})^{%
\rm T}{\rm d}X_{t}^{(\epsilon)} italic_J start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT := ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT
(68)
and its relation to (48 ). As for (48 ), we can rewrite (68 ) as
J t n , t n + 1 ( ϵ ) = A T : ( X t n ( ϵ ) ⊗ X t n , t n + 1 ( ϵ ) ) + A T : 𝕏 t n , t n + 1 ( ϵ ) . : subscript superscript 𝐽 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript 𝐴 T tensor-product subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript 𝐴 T : superscript subscript 𝕏 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
italic-ϵ J^{(\epsilon)}_{t_{n},t_{n+1}}=A^{\rm T}:(X^{(\epsilon)}_{t_{n}}\otimes X^{(%
\epsilon)}_{t_{n},t_{n+1}})+A^{\rm T}:\mathbb{X}_{t_{n},t_{n+1}}^{(\epsilon)}. italic_J start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : ( italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) + italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : blackboard_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT .
(69)
We now investigate the limit of the second-order iterated integral
𝕏 t n , t n + 1 ( ϵ ) & = ∫ t n t n + 1 X t n , t ( ϵ ) ⊗ d X t ( ϵ ) = 1 2 X t n , t n + 1 ( ϵ ) ⊗ X t n , t n + 1 ( ϵ ) − 1 2 ∫ t n t n + 1 [ X t n , t ( ϵ ) , d X t ( ϵ ) ] subscript superscript 𝕏 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
& superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 tensor-product subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 𝑡
differential-d superscript subscript 𝑋 𝑡 italic-ϵ tensor-product 1 2 superscript subscript 𝑋 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
italic-ϵ superscript subscript 𝑋 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
italic-ϵ 1 2 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript subscript 𝑋 subscript 𝑡 𝑛 𝑡
italic-ϵ d superscript subscript 𝑋 𝑡 italic-ϵ \mathbb{X}^{(\epsilon)}_{t_{n},t_{n+1}}&=\int_{t_{n}}^{t_{n+1}}X^{(\epsilon)}_%
{t_{n},t}\otimes{\rm d}X_{t}^{(\epsilon)}\\
=\frac{1}{2}X_{t_{n},t_{n+1}}^{(\epsilon)}\otimes X_{t_{n},t_{n+1}}^{(\epsilon%
)}-\frac{1}{2}\int_{t_{n}}^{t_{n+1}}[X_{t_{n},t}^{(\epsilon)},{\rm d}X_{t}^{(%
\epsilon)}] blackboard_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT & = ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t end_POSTSUBSCRIPT ⊗ roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT [ italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT , roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ]
as ϵ → 0 → italic-ϵ 0 \epsilon\to 0 italic_ϵ → 0 [18 ] . Here [ . , . ] [.,.] [ . , . ] denotes the commutator defined by
(54 ).
Proposition 4.1 .
The second-order iterated integral 𝕏 t n , t n + 1 ( ϵ ) subscript superscript 𝕏 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
\mathbb{X}^{(\epsilon)}_{t_{n},t_{n+1}} blackboard_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT satisfies
lim ϵ → 0 𝕏 t n , t n + 1 ( ϵ ) = 𝕏 t n , t n + 1 † + Δ t γ 2 M subscript → italic-ϵ 0 subscript superscript 𝕏 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
Δ 𝑡 𝛾 2 𝑀 \lim_{\epsilon\to 0}\mathbb{X}^{(\epsilon)}_{t_{n},t_{n+1}}=\mathbb{X}^{%
\dagger}_{t_{n},t_{n+1}}+\frac{\Delta t\,\gamma}{2}M roman_lim start_POSTSUBSCRIPT italic_ϵ → 0 end_POSTSUBSCRIPT blackboard_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + divide start_ARG roman_Δ italic_t italic_γ end_ARG start_ARG 2 end_ARG italic_M
(71)
Proof 4.2 .
The proof follows [18 ] and can be summarised as follows:
𝕏 t n , t n + 1 ( ϵ ) & = ∫ t n t n + 1 X t n , t ( ϵ ) ⊗ d X t ( ϵ ) → ∫ t n t n + 1 X t n , t † ⊗ d X t † − γ 1 / 2 ∫ t n t n + 1 X t n , t ( ϵ ) ⊗ d P t ( ϵ ) = 𝕏 t n , t n + 1 † − γ 1 / 2 X t n , t n + 1 ( ϵ ) ⊗ P t n + 1 ( ϵ ) + γ 1 / 2 ∫ t n t n + 1 d X t ( ϵ ) ⊗ P t ( ϵ ) → 𝕏 t n , t n + 1 † + γ 1 / 2 ∫ t n t n + 1 { A X t ( ϵ ) + γ 1 / 2 ϵ M P t ( ϵ ) } ⊗ P t ( ϵ ) d t → 𝕏 t n , t n + 1 † + Δ t γ ϵ M 𝔼 stat [ P t n ( ϵ ) ⊗ P t n ( ϵ ) ] = 𝕏 t n , t n + 1 † + Δ t γ 2 M . subscript superscript 𝕏 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
& superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 tensor-product subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 𝑡
differential-d subscript superscript 𝑋 italic-ϵ 𝑡 → superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 tensor-product subscript superscript 𝑋 † subscript 𝑡 𝑛 𝑡
differential-d subscript superscript 𝑋 † 𝑡 superscript 𝛾 1 2 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 tensor-product subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 𝑡
differential-d subscript superscript 𝑃 italic-ϵ 𝑡 subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
tensor-product superscript 𝛾 1 2 subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript superscript 𝑃 italic-ϵ subscript 𝑡 𝑛 1 superscript 𝛾 1 2 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 tensor-product differential-d subscript superscript 𝑋 italic-ϵ 𝑡 subscript superscript 𝑃 italic-ϵ 𝑡 → subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript 𝛾 1 2 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 tensor-product 𝐴 superscript subscript 𝑋 𝑡 italic-ϵ superscript 𝛾 1 2 italic-ϵ 𝑀 subscript superscript 𝑃 italic-ϵ 𝑡 subscript superscript 𝑃 italic-ϵ 𝑡 differential-d 𝑡 → subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
Δ 𝑡 𝛾 italic-ϵ 𝑀 subscript 𝔼 stat delimited-[] tensor-product superscript subscript 𝑃 subscript 𝑡 𝑛 italic-ϵ superscript subscript 𝑃 subscript 𝑡 𝑛 italic-ϵ subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
Δ 𝑡 𝛾 2 𝑀 \mathbb{X}^{(\epsilon)}_{t_{n},t_{n+1}}&=\int_{t_{n}}^{t_{n+1}}X^{(\epsilon)}_%
{t_{n},t}\otimes{\rm d}X^{(\epsilon)}_{t}\\
\rightarrow\int_{t_{n}}^{t_{n+1}}X^{\dagger}_{t_{n},t}\otimes{\rm d}X^{\dagger%
}_{t}-\gamma^{1/2}\int_{t_{n}}^{t_{n+1}}X^{(\epsilon)}_{t_{n},t}\otimes{\rm d}%
P^{(\epsilon)}_{t}\\
=\mathbb{X}^{\dagger}_{t_{n},t_{n+1}}-\gamma^{1/2}X^{(\epsilon)}_{t_{n},t_{n+1%
}}\otimes P^{(\epsilon)}_{t_{n+1}}+\gamma^{1/2}\int_{t_{n}}^{t_{n+1}}{\rm d}X^%
{(\epsilon)}_{t}\otimes P^{(\epsilon)}_{t}\\
\rightarrow\mathbb{X}^{\dagger}_{t_{n},t_{n+1}}+\gamma^{1/2}\int_{t_{n}}^{t_{n%
+1}}\left\{AX_{t}^{(\epsilon)}+\frac{\gamma^{1/2}}{\epsilon}MP^{(\epsilon)}_{t%
}\right\}\otimes P^{(\epsilon)}_{t}{\rm d}t\\
\rightarrow\mathbb{X}^{\dagger}_{t_{n},t_{n+1}}+\frac{\Delta t\,\gamma}{%
\epsilon}M\,\mathbb{E}_{\rm stat}\left[P_{t_{n}}^{(\epsilon)}\otimes P_{t_{n}}%
^{(\epsilon)}\right]\\
=\mathbb{X}^{\dagger}_{t_{n},t_{n+1}}+\frac{\Delta t\,\gamma}{2}M. blackboard_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT & = ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t end_POSTSUBSCRIPT ⊗ roman_d italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT → ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t end_POSTSUBSCRIPT ⊗ roman_d italic_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t end_POSTSUBSCRIPT ⊗ roman_d italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊗ italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT → blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT { italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + divide start_ARG italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_ϵ end_ARG italic_M italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } ⊗ italic_P start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_d italic_t → blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + divide start_ARG roman_Δ italic_t italic_γ end_ARG start_ARG italic_ϵ end_ARG italic_M blackboard_E start_POSTSUBSCRIPT roman_stat end_POSTSUBSCRIPT [ italic_P start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ⊗ italic_P start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ] = blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + divide start_ARG roman_Δ italic_t italic_γ end_ARG start_ARG 2 end_ARG italic_M .
As discussed in detail in [10 ] already, Proposition 4.1 implies that the scheme (64 )
does not, in general, converge to the scheme (64 ) as ϵ → 0 → italic-ϵ 0 \epsilon\to 0 italic_ϵ → 0 since
J t n , t n + 1 † = lim ϵ → 0 J t n , t n + 1 ( ϵ ) − Δ t γ 2 A T : M . : subscript superscript 𝐽 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript → italic-ϵ 0 subscript superscript 𝐽 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
Δ 𝑡 𝛾 2 superscript 𝐴 T 𝑀 J^{\dagger}_{t_{n},t_{n+1}}=\lim_{\epsilon\to 0}J^{(\epsilon)}_{t_{n},t_{n+1}}%
-\frac{\Delta t\,\gamma}{2}A^{\rm T}:M\,. italic_J start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = roman_lim start_POSTSUBSCRIPT italic_ϵ → 0 end_POSTSUBSCRIPT italic_J start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - divide start_ARG roman_Δ italic_t italic_γ end_ARG start_ARG 2 end_ARG italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : italic_M .
(73)
This observation suggests the following modification
Θ n + 1 ( ϵ ) & = Θ n ( ϵ ) + σ n ( ϵ ) γ ∫ t n t n + 1 ( A X t ( ϵ ) ) T d X t ( ϵ ) − Δ t 2 σ n ( ϵ ) A T : M − 1 2 K n ( ϵ ) A X t n ( ϵ ) ( Θ n ( ϵ ) + π n ( ϵ ) [ θ ] ) Δ t : superscript subscript Θ 𝑛 1 italic-ϵ & superscript subscript Θ 𝑛 italic-ϵ superscript subscript 𝜎 𝑛 italic-ϵ 𝛾 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑋 𝑡 italic-ϵ T differential-d superscript subscript 𝑋 𝑡 italic-ϵ Δ 𝑡 2 superscript subscript 𝜎 𝑛 italic-ϵ superscript 𝐴 T limit-from 𝑀 1 2 superscript subscript 𝐾 𝑛 italic-ϵ 𝐴 superscript subscript 𝑋 subscript 𝑡 𝑛 italic-ϵ superscript subscript Θ 𝑛 italic-ϵ superscript subscript 𝜋 𝑛 italic-ϵ delimited-[] 𝜃 Δ 𝑡
\Theta_{n+1}^{(\epsilon)}&=\Theta_{n}^{(\epsilon)}+\frac{\sigma_{n}^{(\epsilon%
)}}{\gamma}\int_{t_{n}}^{t_{n+1}}(AX_{t}^{(\epsilon)})^{\rm T}{\rm d}X_{t}^{(%
\epsilon)}-\frac{\Delta t}{2}\sigma_{n}^{(\epsilon)}\,A^{\rm T}:M\,\,-\\
\qquad\qquad\frac{1}{2}K_{n}^{(\epsilon)}AX_{t_{n}}^{(\epsilon)}\left(\Theta_{%
n}^{(\epsilon)}+\pi_{n}^{(\epsilon)}[\theta]\right)\Delta t roman_Θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT & = roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ end_ARG ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - divide start_ARG roman_Δ italic_t end_ARG start_ARG 2 end_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : italic_M - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT italic_A italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + italic_π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT [ italic_θ ] ) roman_Δ italic_t
to (64 ).
Please note that it follows from (4 ) that
∫ t n t n + 1 ( A X t ( ϵ ) ) T d X t ( ϵ ) = A T : ( X t n + 1 / 2 ( ϵ ) ⊗ X t n , t n + 1 ( ϵ ) − 1 2 ∫ t n t n + 1 [ X t n , t ( ϵ ) , d X t ( ϵ ) ] ) . : superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑋 𝑡 italic-ϵ T differential-d superscript subscript 𝑋 𝑡 italic-ϵ superscript 𝐴 T tensor-product superscript subscript 𝑋 subscript 𝑡 𝑛 1 2 italic-ϵ superscript subscript 𝑋 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
italic-ϵ 1 2 superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript subscript 𝑋 subscript 𝑡 𝑛 𝑡
italic-ϵ d superscript subscript 𝑋 𝑡 italic-ϵ \int_{t_{n}}^{t_{n+1}}(AX_{t}^{(\epsilon)})^{\rm T}{\rm d}X_{t}^{(\epsilon)}=A%
^{\rm T}:\left(X_{t_{n+1/2}}^{(\epsilon)}\otimes X_{t_{n},t_{n+1}}^{(\epsilon)%
}-\frac{1}{2}\int_{t_{n}}^{t_{n+1}}[X_{t_{n},t}^{(\epsilon)},{\rm d}X_{t}^{(%
\epsilon)}]\right). ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT : ( italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 / 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT [ italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT , roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ] ) .
(75)
Proposition 4.3 .
The discrete-time EnKBF (62 ) converges to (20 ) for fixed Δ t normal-Δ 𝑡 \Delta t roman_Δ italic_t as
ϵ → 0 normal-→ italic-ϵ 0 \epsilon\to 0 italic_ϵ → 0 . Similarly, (4 ) converges to (30 ) under the same
limit.
Proof 4.4 .
The first statement follows from σ n ( ϵ ) = σ n superscript subscript 𝜎 𝑛 italic-ϵ subscript 𝜎 𝑛 \sigma_{n}^{(\epsilon)}=\sigma_{n} italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , the limiting behaviour (67 ), and
lim ϵ → 0 K n ( ϵ ) = K n . subscript → italic-ϵ 0 superscript subscript 𝐾 𝑛 italic-ϵ subscript 𝐾 𝑛 \lim_{\epsilon\to 0}K_{n}^{(\epsilon)}=K_{n}. roman_lim start_POSTSUBSCRIPT italic_ϵ → 0 end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT .
(76)
The second statement additionally requires (73 ) to be substituted into (4 )
when taking the limit ϵ → 0 normal-→ italic-ϵ 0 \epsilon\to 0 italic_ϵ → 0 .
4.1 Numerical implementation
The numerical implementation of (4 ) requires an estimator for the generally unknown M 𝑀 M italic_M in (73 ). This task is challenging as we only have access to X t ( ϵ ) superscript subscript 𝑋 𝑡 italic-ϵ X_{t}^{(\epsilon)} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT without any explicit knowledge of the underlying generating process (4 ). While the estimator proposed in [10 ] is based on the idea of subsampling the data, the frequentist perspective taken in this note suggests the alternative
estimator M est subscript 𝑀 est M_{\rm est} italic_M start_POSTSUBSCRIPT roman_est end_POSTSUBSCRIPT defined by
Δ t γ 2 M est = 𝔼 † [ 𝕏 t n , t n + 1 ( ϵ ) ] , Δ 𝑡 𝛾 2 subscript 𝑀 est superscript 𝔼 † delimited-[] subscript superscript 𝕏 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
\frac{\Delta t\,\gamma}{2}M_{\rm est}=\mathbb{E}^{\dagger}[\mathbb{X}^{(%
\epsilon)}_{t_{n},t_{n+1}}], divide start_ARG roman_Δ italic_t italic_γ end_ARG start_ARG 2 end_ARG italic_M start_POSTSUBSCRIPT roman_est end_POSTSUBSCRIPT = blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ blackboard_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] ,
(81)
which follows from (4.2 f) and (52 ). That is, 𝔼 † [ 𝕏 t n , t n + 1 † ] = 𝒪 ( Δ t 2 ) superscript 𝔼 † delimited-[] subscript superscript 𝕏 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
𝒪 Δ superscript 𝑡 2 \mathbb{E}^{\dagger}[\mathbb{X}^{\dagger}_{t_{n},t_{n+1}}]=\mathcal{O}(\Delta t%
^{2}) blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ blackboard_X start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] = caligraphic_O ( roman_Δ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) for Δ t Δ 𝑡 \Delta t roman_Δ italic_t sufficiently small. Note that second-order iterated integral
X t n , t n + 1 ( ϵ ) superscript subscript 𝑋 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
italic-ϵ X_{t_{n},t_{n+1}}^{(\epsilon)} italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT satisfies (4 ) and is therefore easy to compute.
In practice, the frequentist expectation value can be replaced by an approximation along a given single observation path X t ( ϵ ) subscript superscript 𝑋 italic-ϵ 𝑡 X^{(\epsilon)}_{t} italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , t ∈ [ 0 , T ] 𝑡 0 𝑇 t\in[0,T] italic_t ∈ [ 0 , italic_T ] , under the assumption of ergodicity.
An appropriate choice of the outer or sub-sampling step-size Δ t Δ 𝑡 \Delta t roman_Δ italic_t [28 ] constitutes an important aspect for the practical implementation of the EnKBF formulation (62 ) for finite values of ϵ > 0 italic-ϵ 0 \epsilon>0 italic_ϵ > 0 [27 ] . Consistency of the second-order iterated integrals [14 ] implies
𝕏 t n , t n + 2 ( ϵ ) = 𝕏 t n , t n + 1 ( ϵ ) + 𝕏 t n + 1 , t n + 2 ( ϵ ) + X t n , t n + 1 ( ϵ ) ⊗ X t n + 1 , t n + 2 ( ϵ ) . subscript superscript 𝕏 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 2
subscript superscript 𝕏 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript superscript 𝕏 italic-ϵ subscript 𝑡 𝑛 1 subscript 𝑡 𝑛 2
tensor-product subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 1 subscript 𝑡 𝑛 2
\mathbb{X}^{(\epsilon)}_{t_{n},t_{n+2}}=\mathbb{X}^{(\epsilon)}_{t_{n},t_{n+1}%
}+\mathbb{X}^{(\epsilon)}_{t_{n+1},t_{n+2}}+X^{(\epsilon)}_{t_{n},t_{n+1}}%
\otimes X^{(\epsilon)}_{t_{n+1},t_{n+2}}. blackboard_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = blackboard_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + blackboard_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT .
(82)
A sensible choice of Δ t Δ 𝑡 \Delta t roman_Δ italic_t is dictated by
𝔼 † [ X t n , t n + 1 ( ϵ ) ⊗ X t n + 1 , t n + 2 ( ϵ ) ] = 𝒪 ( Δ t 2 ) , superscript 𝔼 † delimited-[] tensor-product subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 1 subscript 𝑡 𝑛 2
𝒪 Δ superscript 𝑡 2 \mathbb{E}^{\dagger}\left[X^{(\epsilon)}_{t_{n},t_{n+1}}\otimes X^{(\epsilon)}%
_{t_{n+1},t_{n+2}}\right]=\mathcal{O}(\Delta t^{2})\,, blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] = caligraphic_O ( roman_Δ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,
(83)
that is, the sub-sampled data X t n ( ϵ ) superscript subscript 𝑋 subscript 𝑡 𝑛 italic-ϵ X_{t_{n}}^{(\epsilon)} italic_X start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT behaves to leading order like solution increments from the reference model (2 ) at scale Δ t Δ 𝑡 \Delta t roman_Δ italic_t independent of the specific value of ϵ italic-ϵ \epsilon italic_ϵ . Note that, on the other hand,
𝔼 † [ X τ l , τ l + 1 ( ϵ ) ⊗ X τ l + 1 , τ l + 2 ( ϵ ) ] = 𝒪 ( ϵ − 1 Δ τ 2 ) superscript 𝔼 † delimited-[] tensor-product subscript superscript 𝑋 italic-ϵ subscript 𝜏 𝑙 subscript 𝜏 𝑙 1
subscript superscript 𝑋 italic-ϵ subscript 𝜏 𝑙 1 subscript 𝜏 𝑙 2
𝒪 superscript italic-ϵ 1 Δ superscript 𝜏 2 \mathbb{E}^{\dagger}\left[X^{(\epsilon)}_{\tau_{l},\tau_{l+1}}\otimes X^{(%
\epsilon)}_{\tau_{l+1},\tau_{l+2}}\right]=\mathcal{O}(\epsilon^{-1}\Delta\tau^%
{2}) blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_τ start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT , italic_τ start_POSTSUBSCRIPT italic_l + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] = caligraphic_O ( italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Δ italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )
(84)
for an inner step-size Δ τ ∼ ϵ similar-to Δ 𝜏 italic-ϵ \Delta\tau\sim\epsilon roman_Δ italic_τ ∼ italic_ϵ . In other words, a suitable step-size Δ t > 0 Δ 𝑡 0 \Delta t>0 roman_Δ italic_t > 0 can be defined by
making
h ( Δ t ) := Δ t − 2 ‖ 𝔼 † [ X t n , t n + 1 ( ϵ ) ⊗ X t n + 1 , t n + 2 ( ϵ ) ] ‖ assign ℎ Δ 𝑡 Δ superscript 𝑡 2 norm superscript 𝔼 † delimited-[] tensor-product subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
subscript superscript 𝑋 italic-ϵ subscript 𝑡 𝑛 1 subscript 𝑡 𝑛 2
h(\Delta t):=\Delta t^{-2}\left\|\mathbb{E}^{\dagger}\left[X^{(\epsilon)}_{t_{%
n},t_{n+1}}\otimes X^{(\epsilon)}_{t_{n+1},t_{n+2}}\right]\right\| italic_h ( roman_Δ italic_t ) := roman_Δ italic_t start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ∥ blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ italic_X start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] ∥
(85)
as small as possible while still guaranteeing an accurate numerical approximation in (62 ).
4.2 Filtered data
We finally discuss a recently proposed [2 ] robust modification to the parameter estimation problem
in the light of the mean-field EnKBF equations considered in this paper. The essential idea is to filter the observation paths
X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , t ≥ 0 𝑡 0 t\geq 0 italic_t ≥ 0 , via
d Z t † = 1 δ ( X t † − Z t † ) d t + δ noise 2 d V t † , d superscript subscript 𝑍 𝑡 † 1 𝛿 superscript subscript 𝑋 𝑡 † superscript subscript 𝑍 𝑡 † d 𝑡 subscript 𝛿 noise 2 d superscript subscript 𝑉 𝑡 † {\rm d}Z_{t}^{\dagger}=\frac{1}{\delta}(X_{t}^{\dagger}-Z_{t}^{\dagger}){\rm d%
}t+\delta_{\rm noise}\sqrt{2}{\rm d}V_{t}^{\dagger}, roman_d italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) roman_d italic_t + italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT square-root start_ARG 2 end_ARG roman_d italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ,
(87)
where δ > 0 𝛿 0 \delta>0 italic_δ > 0 is a sufficiently small parameter and V t † superscript subscript 𝑉 𝑡 † V_{t}^{\dagger} italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT denotes
independent Brownian motion with δ noise = 1 subscript 𝛿 noise 1 \delta_{\rm noise}=1 italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT = 1 (noise added) or δ noise = 0 subscript 𝛿 noise 0 \delta_{\rm noise}=0 italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT = 0 (no noise added). Extending the methodology proposed in [2 ] to
the mean-field EnKFB equations (2 ), we now consider
d Θ t & = σ t γ ( A Z t † ) T d I t , d I t = d X t † − 1 2 ( Θ t + π t [ θ ] ) A X t † d t , formulae-sequence d subscript Θ 𝑡 & subscript 𝜎 𝑡 𝛾 superscript 𝐴 subscript superscript 𝑍 † 𝑡 T d subscript 𝐼 𝑡 d subscript 𝐼 𝑡 d superscript subscript 𝑋 𝑡 † 1 2 subscript Θ 𝑡 subscript 𝜋 𝑡 delimited-[] 𝜃 𝐴 superscript subscript 𝑋 𝑡 † d 𝑡 {\rm d}\Theta_{t}&=\frac{\sigma_{t}}{\gamma}(AZ^{\dagger}_{t})^{\rm T}{\rm d}I%
_{t},\\
{\rm d}I_{t}={\rm d}X_{t}^{\dagger}-\frac{1}{2}\left(\Theta_{t}+\pi_{t}[\theta%
]\right)AX_{t}^{\dagger}{\rm d}t, roman_d roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT & = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A italic_Z start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ italic_θ ] ) italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT roman_d italic_t ,
with the variance σ t subscript 𝜎 𝑡 \sigma_{t} italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT defined as before.
Let us first investigate the long-time behaviour of the extended data generating system
d X t † & = A X t † d t + γ 1 / 2 d W t † , d Z t † = 1 δ ( X t † − Z t † ) d t + δ noise 2 d V t † , formulae-sequence d superscript subscript 𝑋 𝑡 † & 𝐴 superscript subscript 𝑋 𝑡 † d 𝑡 superscript 𝛾 1 2 d superscript subscript 𝑊 𝑡 † d superscript subscript 𝑍 𝑡 † 1 𝛿 superscript subscript 𝑋 𝑡 † superscript subscript 𝑍 𝑡 † d 𝑡 subscript 𝛿 noise 2 d superscript subscript 𝑉 𝑡 † {\rm d}X_{t}^{\dagger}&=AX_{t}^{\dagger}{\rm d}t+\gamma^{1/2}{\rm d}W_{t}^{%
\dagger},\\
{\rm d}Z_{t}^{\dagger}=\frac{1}{\delta}(X_{t}^{\dagger}-Z_{t}^{\dagger}){\rm d%
}t+\delta_{\rm noise}\sqrt{2}{\rm d}V_{t}^{\dagger}, roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT & = italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT roman_d italic_t + italic_γ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT roman_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT , roman_d italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) roman_d italic_t + italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT square-root start_ARG 2 end_ARG roman_d italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ,
in some detail. Its stationary distribution is Gaussian with mean m ∞ x = m ∞ z = 0 superscript subscript 𝑚 𝑥 superscript subscript 𝑚 𝑧 0 m_{\infty}^{x}=m_{\infty}^{z}=0 italic_m start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT = italic_m start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z end_POSTSUPERSCRIPT = 0 .
The stationary covariance matrices satisfy the relations
0 & = A Σ ∞ x x + Σ ∞ x x A T + γ I , 0 = Σ ∞ x z + Σ ∞ z x − 2 ( Σ ∞ z z − δ noise δ I ) , 0 = A Σ ∞ x z + 1 δ ( Σ ∞ x x − Σ ∞ x z ) . formulae-sequence 0 & 𝐴 superscript subscript Σ 𝑥 𝑥 superscript subscript Σ 𝑥 𝑥 superscript 𝐴 T 𝛾 𝐼 formulae-sequence 0 superscript subscript Σ 𝑥 𝑧 superscript subscript Σ 𝑧 𝑥 2 superscript subscript Σ 𝑧 𝑧 subscript 𝛿 noise 𝛿 𝐼 0 𝐴 superscript subscript Σ 𝑥 𝑧 1 𝛿 superscript subscript Σ 𝑥 𝑥 superscript subscript Σ 𝑥 𝑧 0&=A\Sigma_{\infty}^{xx}+\Sigma_{\infty}^{xx}A^{\rm T}+\gamma I,\\
0=\Sigma_{\infty}^{xz}+\Sigma_{\infty}^{zx}-2(\Sigma_{\infty}^{zz}-\delta_{\rm
noise%
}\delta I),\\
0=A\Sigma_{\infty}^{xz}+\frac{1}{\delta}(\Sigma_{\infty}^{xx}-\Sigma_{\infty}^%
{xz}). 0 & = italic_A roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_x end_POSTSUPERSCRIPT + roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_x end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT + italic_γ italic_I , 0 = roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_z end_POSTSUPERSCRIPT + roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z italic_x end_POSTSUPERSCRIPT - 2 ( roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z italic_z end_POSTSUPERSCRIPT - italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT italic_δ italic_I ) , 0 = italic_A roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_z end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ( roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_x end_POSTSUPERSCRIPT - roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_z end_POSTSUPERSCRIPT ) .
We note that Σ ∞ x x = C superscript subscript Σ 𝑥 𝑥 𝐶 \Sigma_{\infty}^{xx}=C roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_x end_POSTSUPERSCRIPT = italic_C with the matrix C 𝐶 C italic_C defined in (16 ) and
that the symmetric part of Σ ∞ z x superscript subscript Σ 𝑧 𝑥 \Sigma_{\infty}^{zx} roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z italic_x end_POSTSUPERSCRIPT and Σ ∞ x z superscript subscript Σ 𝑥 𝑧 \Sigma_{\infty}^{xz} roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_z end_POSTSUPERSCRIPT , respectively,
are equivalent to Σ ∞ z z − δ noise δ I superscript subscript Σ 𝑧 𝑧 subscript 𝛿 noise 𝛿 𝐼 \Sigma_{\infty}^{zz}-\delta_{\rm noise}\delta I roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z italic_z end_POSTSUPERSCRIPT - italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT italic_δ italic_I . Hence, following (22 ),
we again make the crucial approximation
( A T A ) : Z t † ⊗ X t † ≈ ( A T A ) : Σ ∞ z x = ( A T A ) : ( Σ ∞ z z − δ noise δ I ) : superscript 𝐴 T 𝐴 tensor-product superscript subscript 𝑍 𝑡 † superscript subscript 𝑋 𝑡 † superscript 𝐴 T 𝐴 : superscript subscript Σ 𝑧 𝑥 superscript 𝐴 T 𝐴 : superscript subscript Σ 𝑧 𝑧 subscript 𝛿 noise 𝛿 𝐼 (A^{\rm T}A):Z_{t}^{\dagger}\otimes X_{t}^{\dagger}\approx(A^{\rm T}A):\Sigma_%
{\infty}^{zx}=(A^{\rm T}A):(\Sigma_{\infty}^{zz}-\delta_{\rm noise}\delta I) ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ⊗ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ≈ ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z italic_x end_POSTSUPERSCRIPT = ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : ( roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z italic_z end_POSTSUPERSCRIPT - italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT italic_δ italic_I )
(91)
for d ≫ 1 much-greater-than 𝑑 1 d\gg 1 italic_d ≫ 1 . Let us therefore introduce the shorthand
C ~ = Σ ∞ z z − δ noise δ I . ~ 𝐶 superscript subscript Σ 𝑧 𝑧 subscript 𝛿 noise 𝛿 𝐼 \tilde{C}=\Sigma_{\infty}^{zz}-\delta_{\rm noise}\delta I. over~ start_ARG italic_C end_ARG = roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z italic_z end_POSTSUPERSCRIPT - italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT italic_δ italic_I .
(92)
We also note that
C ~ = C + 𝒪 ( δ ) . ~ 𝐶 𝐶 𝒪 𝛿 \tilde{C}=C+\mathcal{O}(\delta). over~ start_ARG italic_C end_ARG = italic_C + caligraphic_O ( italic_δ ) .
(93)
The frequentist analysis from Section 3 delivers
d μ t = σ t γ ( ( A Z t † ) T d X t † − μ t ( A T A ) : C ~ d t ) {\rm d}\mu_{t}=\frac{\sigma_{t}}{\gamma}\left((AZ_{t}^{\dagger})^{\rm T}{\rm d%
}X_{t}^{\dagger}-\mu_{t}(A^{\rm T}A):\tilde{C}\,{\rm d}t\right) roman_d italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( ( italic_A italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : over~ start_ARG italic_C end_ARG roman_d italic_t )
(94)
for the conditional mean and
d d t σ t = − σ t 2 γ ( A T A ) : C ~ : d d 𝑡 subscript 𝜎 𝑡 superscript subscript 𝜎 𝑡 2 𝛾 superscript 𝐴 T 𝐴 ~ 𝐶 \frac{\rm d}{{\rm d}t}\sigma_{t}=-\frac{\sigma_{t}^{2}}{\gamma}(A^{\rm T}A):%
\tilde{C} divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = - divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : over~ start_ARG italic_C end_ARG
(95)
for the conditional variance of the random variable Θ t subscript Θ 𝑡 \Theta_{t} roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , as defined by the modified mean-field evolution equations
(4.2 ). Furthermore, we find that μ t subscript 𝜇 𝑡 \mu_{t} italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT still provides an asymptotically
unbiased estimator since m t = 𝔼 † [ μ t ] subscript 𝑚 𝑡 superscript 𝔼 † delimited-[] subscript 𝜇 𝑡 m_{t}=\mathbb{E}^{\dagger}[\mu_{t}] italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = blackboard_E start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT [ italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] satisfies
d d t m t = σ t γ ( A T A ) : C ~ ( 1 − m t ) . : d d 𝑡 subscript 𝑚 𝑡 subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴 ~ 𝐶 1 subscript 𝑚 𝑡 \frac{\rm d}{{\rm d}t}m_{t}=\frac{\sigma_{t}}{\gamma}(A^{\rm T}A):\tilde{C}\,(%
1-m_{t}). divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : over~ start_ARG italic_C end_ARG ( 1 - italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) .
(96)
Similarly, the variance, p t subscript 𝑝 𝑡 p_{t} italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of the estimator μ t subscript 𝜇 𝑡 \mu_{t} italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT satisfies
d d t p t = σ t γ ( A T A ) : C ~ ( σ t − 2 p t ) . : d d 𝑡 subscript 𝑝 𝑡 subscript 𝜎 𝑡 𝛾 superscript 𝐴 T 𝐴 ~ 𝐶 subscript 𝜎 𝑡 2 subscript 𝑝 𝑡 \frac{\rm d}{{\rm d}t}p_{t}=\frac{\sigma_{t}}{\gamma}(A^{\rm T}A):\tilde{C}\,(%
\sigma_{t}-2p_{t}). divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : over~ start_ARG italic_C end_ARG ( italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - 2 italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) .
(97)
In summary, we find that the modified EnKBF mean-field equations (4.2 ) behave exactly as the
original equations (2 ) with the only difference that the stationary covariance matrix C = Σ ∞ x x 𝐶 superscript subscript Σ 𝑥 𝑥 C=\Sigma_{\infty}^{xx} italic_C = roman_Σ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x italic_x end_POSTSUPERSCRIPT
is replaced everywhere by (92 ).
Again following [2 ] , given multi-scale observations X t ( ϵ ) superscript subscript 𝑋 𝑡 italic-ϵ X_{t}^{(\epsilon)} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT , t ≥ 0 𝑡 0 t\geq 0 italic_t ≥ 0 ,
we define associated filtered Z t ( ϵ ) superscript subscript 𝑍 𝑡 italic-ϵ Z_{t}^{(\epsilon)} italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT via
d d t Z t ( ϵ ) = 1 δ ( X t ( ϵ ) − Z t ( ϵ ) ) + δ noise 2 d V t † . d d 𝑡 superscript subscript 𝑍 𝑡 italic-ϵ 1 𝛿 superscript subscript 𝑋 𝑡 italic-ϵ superscript subscript 𝑍 𝑡 italic-ϵ subscript 𝛿 noise 2 d superscript subscript 𝑉 𝑡 † \frac{\rm d}{{\rm d}t}Z_{t}^{(\epsilon)}=\frac{1}{\delta}(X_{t}^{(\epsilon)}-Z%
_{t}^{(\epsilon)})+\delta_{\rm noise}\sqrt{2}{\rm d}V_{t}^{\dagger}. divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) + italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT square-root start_ARG 2 end_ARG roman_d italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT .
(98)
The intriguing observation is that
J ~ t n , t n + 1 ( ϵ ) := ∫ t n t n + 1 ( A Z t ( ϵ ) ) T d X t ( ϵ ) assign superscript subscript ~ 𝐽 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
italic-ϵ superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑍 𝑡 italic-ϵ T differential-d superscript subscript 𝑋 𝑡 italic-ϵ \tilde{J}_{t_{n},t_{n+1}}^{(\epsilon)}:=\int_{t_{n}}^{t_{n+1}}(AZ_{t}^{(%
\epsilon)})^{\rm T}{\rm d}X_{t}^{(\epsilon)} over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT := ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT
(99)
converges to
J ~ t n , t n + 1 † := ∫ t n t n + 1 ( A Z t † ) T d X t † assign subscript superscript ~ 𝐽 † subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
superscript subscript subscript 𝑡 𝑛 subscript 𝑡 𝑛 1 superscript 𝐴 superscript subscript 𝑍 𝑡 † T differential-d superscript subscript 𝑋 𝑡 † \tilde{J}^{\dagger}_{t_{n},t_{n+1}}:=\int_{t_{n}}^{t_{n+1}}(AZ_{t}^{\dagger})^%
{\rm T}{\rm d}X_{t}^{\dagger} over~ start_ARG italic_J end_ARG start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT := ∫ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_A italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT
(100)
as ϵ → 0 → italic-ϵ 0 \epsilon\to 0 italic_ϵ → 0 . This simply follows from the fact that both integrals can be interpreted as standard
Riemann–Stieltjes integrals and convergence of X t ( ϵ ) → X t † → superscript subscript 𝑋 𝑡 italic-ϵ superscript subscript 𝑋 𝑡 † X_{t}^{(\epsilon)}\to X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT → italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT and
Z t ( ϵ ) → Z t † → superscript subscript 𝑍 𝑡 italic-ϵ superscript subscript 𝑍 𝑡 † Z_{t}^{(\epsilon)}\to Z_{t}^{\dagger} italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT → italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT as ϵ → 0 → italic-ϵ 0 \epsilon\to 0 italic_ϵ → 0 in
the standard topology of continuous functions is sufficient to conclude the convergence of the associated integrals
J ~ t n , t n + 1 ( ϵ ) → J ~ t n , t n + 1 † → superscript subscript ~ 𝐽 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
italic-ϵ superscript subscript ~ 𝐽 subscript 𝑡 𝑛 subscript 𝑡 𝑛 1
† \tilde{J}_{t_{n},t_{n+1}}^{(\epsilon)}\to\tilde{J}_{t_{n},t_{n+1}}^{\dagger} over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT → over~ start_ARG italic_J end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT . In other words, the extended EnKBF
formulation
d Θ t ( ϵ ) & = σ t ( ϵ ) γ ( A Z t ( ϵ ) ) T d I t ( ϵ ) , d I t ( ϵ ) = d X t ( ϵ ) − 1 2 ( Θ t ( ϵ ) + π t ( ϵ ) [ θ ] ) A X t ( ϵ ) d t , d Z t ( ϵ ) = 1 δ ( X t ( ϵ ) − Z t ( ϵ ) ) d t + δ noise 2 V t † formulae-sequence d superscript subscript Θ 𝑡 italic-ϵ & superscript subscript 𝜎 𝑡 italic-ϵ 𝛾 superscript 𝐴 subscript superscript 𝑍 italic-ϵ 𝑡 T d superscript subscript 𝐼 𝑡 italic-ϵ formulae-sequence d superscript subscript 𝐼 𝑡 italic-ϵ d superscript subscript 𝑋 𝑡 italic-ϵ 1 2 superscript subscript Θ 𝑡 italic-ϵ superscript subscript 𝜋 𝑡 italic-ϵ delimited-[] 𝜃 𝐴 superscript subscript 𝑋 𝑡 italic-ϵ d 𝑡 d superscript subscript 𝑍 𝑡 italic-ϵ 1 𝛿 superscript subscript 𝑋 𝑡 italic-ϵ superscript subscript 𝑍 𝑡 italic-ϵ d 𝑡 subscript 𝛿 noise 2 superscript subscript 𝑉 𝑡 † {\rm d}\Theta_{t}^{(\epsilon)}&=\frac{\sigma_{t}^{(\epsilon)}}{\gamma}(AZ^{(%
\epsilon)}_{t})^{\rm T}{\rm d}I_{t}^{(\epsilon)},\\
{\rm d}I_{t}^{(\epsilon)}={\rm d}X_{t}^{(\epsilon)}-\frac{1}{2}\left(\Theta_{t%
}^{(\epsilon)}+\pi_{t}^{(\epsilon)}[\theta]\right)AX_{t}^{(\epsilon)}{\rm d}t,%
\\
{\rm d}Z_{t}^{(\epsilon)}=\frac{1}{\delta}(X_{t}^{(\epsilon)}-Z_{t}^{(\epsilon%
)}){\rm d}t+\delta_{\rm noise}\sqrt{2}V_{t}^{\dagger} roman_d roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT & = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_γ end_ARG ( italic_A italic_Z start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT , roman_d italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = roman_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT + italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT [ italic_θ ] ) italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT roman_d italic_t , roman_d italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT - italic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT ) roman_d italic_t + italic_δ start_POSTSUBSCRIPT roman_noise end_POSTSUBSCRIPT square-root start_ARG 2 end_ARG italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT
will converge to the correct parameter value θ † = 1 superscript 𝜃 † 1 \theta^{\dagger}=1 italic_θ start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = 1 as t → ∞ → 𝑡 t\to\infty italic_t → ∞ in the limit ϵ → 0 → italic-ϵ 0 \epsilon\to 0 italic_ϵ → 0 , that is,
lim t → ∞ lim ϵ → 0 Θ t ( ϵ ) = θ † . subscript → 𝑡 subscript → italic-ϵ 0 superscript subscript Θ 𝑡 italic-ϵ superscript 𝜃 † \lim_{t\to\infty}\lim_{\epsilon\to 0}\Theta_{t}^{(\epsilon)}=\theta^{\dagger}. roman_lim start_POSTSUBSCRIPT italic_t → ∞ end_POSTSUBSCRIPT roman_lim start_POSTSUBSCRIPT italic_ϵ → 0 end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT = italic_θ start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT .
(102)
This statement is in line with the results from [2 ] . The intriguing point is that the data filtering approach does not require knowledge of the rough path correction term implied by (73 ) while still delivering an unbiased estimator.
5 Numerical example
We consider the linear SDE (2 ) with γ = 1 𝛾 1 \gamma=1 italic_γ = 1 and
A = − 1 2 ( 1 − 1 1 1 ) . 𝐴 1 2 1 1 1 1 A=\frac{-1}{2}\left(\begin{array}[]{cc}1&-1\\
1&1\end{array}\right). italic_A = divide start_ARG - 1 end_ARG start_ARG 2 end_ARG ( start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARRAY ) .
(103)
We find that C = I 𝐶 𝐼 C=I italic_C = italic_I and A T A = 1 / 2 I superscript 𝐴 T 𝐴 1 2 𝐼 A^{\rm T}A=1/2I italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A = 1 / 2 italic_I . Hence ( A T A ) : C = 1 : superscript 𝐴 T 𝐴 𝐶 1 (A^{\rm T}A):C=1 ( italic_A start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT italic_A ) : italic_C = 1 , and the posterior variance simply satisfies σ t = σ 0 / ( 1 + σ 0 t ) subscript 𝜎 𝑡 subscript 𝜎 0 1 subscript 𝜎 0 𝑡 \sigma_{t}=\sigma_{0}/(1+\sigma_{0}t) italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / ( 1 + italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_t ) according to (44 ). We set m prior = 0 subscript 𝑚 prior 0 m_{\rm prior}=0 italic_m start_POSTSUBSCRIPT roman_prior end_POSTSUBSCRIPT = 0 and σ prior = 4 subscript 𝜎 prior 4 \sigma_{\rm prior}=4 italic_σ start_POSTSUBSCRIPT roman_prior end_POSTSUBSCRIPT = 4 for the
Gaussian prior distribution of Θ 0 subscript Θ 0 \Theta_{0} roman_Θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , and the observation interval is [ 0 , T ] 0 𝑇 [0,T] [ 0 , italic_T ] with T = 6 𝑇 6 T=6 italic_T = 6 . We find that σ T = 0.16 subscript 𝜎 𝑇 0.16 \sigma_{T}=0.16 italic_σ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT = 0.16 . Solving (39 ) for given σ t subscript 𝜎 𝑡 \sigma_{t} italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT with initial condition m 0 = 0 subscript 𝑚 0 0 m_{0}=0 italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 yields
m t = 1 − σ t σ 0 subscript 𝑚 𝑡 1 subscript 𝜎 𝑡 subscript 𝜎 0 m_{t}=1-\frac{\sigma_{t}}{\sigma_{0}} italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 1 - divide start_ARG italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG
(104)
and m T = 0.96 subscript 𝑚 𝑇 0.96 m_{T}=0.96 italic_m start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT = 0.96 . The corresponding curves are displayed in red in Figure 2 .
Figure 2: a)–b): frequentist mean, m t subscript 𝑚 𝑡 m_{t} italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and variance, p t subscript 𝑝 𝑡 p_{t} italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , from EnKBF implementation (20 ) with step-size
Δ t = 0.06 Δ 𝑡 0.06 \Delta t=0.06 roman_Δ italic_t = 0.06 ; c)–d): same results from EnKBF implementation (30 ) with inner time-step Δ τ = Δ t / 600 Δ 𝜏 Δ 𝑡 600 \Delta\tau=\Delta t/600 roman_Δ italic_τ = roman_Δ italic_t / 600 .
We also display the curves arising for σ t subscript 𝜎 𝑡 \sigma_{t} italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and m t subscript 𝑚 𝑡 m_{t} italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT from the standard Kalman theory using the approximation (22 ).
Note that the posterior variance, σ t subscript 𝜎 𝑡 \sigma_{t} italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , should provide an upper bound on the frequentist uncertainty p t subscript 𝑝 𝑡 p_{t} italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .
We implement the EnKBF schemes (20 ) and (30 ) with t n = n Δ t subscript 𝑡 𝑛 𝑛 Δ 𝑡 t_{n}=n\,\Delta t italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_n roman_Δ italic_t .
The inner time-step is Δ τ = 10 − 4 Δ 𝜏 superscript 10 4 \Delta\tau=10^{-4} roman_Δ italic_τ = 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT while Δ t = 0.06 Δ 𝑡 0.06 \Delta t=0.06 roman_Δ italic_t = 0.06 , that is, L = 600 𝐿 600 L=600 italic_L = 600 . We repeat the experiment N = 10 4 𝑁 superscript 10 4 N=10^{4} italic_N = 10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT times and compare the outcome with the predicted mean value of m T = 0.96 subscript 𝑚 𝑇 0.96 m_{T}=0.96 italic_m start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT = 0.96 and the posterior variance of σ T = 0.16 subscript 𝜎 𝑇 0.16 \sigma_{T}=0.16 italic_σ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT = 0.16 in Figure 2 . The differences in the computed time evolutions of m t subscript 𝑚 𝑡 m_{t} italic_m start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and p t subscript 𝑝 𝑡 p_{t} italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT are rather minor and support the idea that it is not necessary to assimilate continuous-time data beyond Δ t Δ 𝑡 \Delta t roman_Δ italic_t . We also find that the simple prediction (104 ),
based on standard Kalman filter theory, is not very accurate for this low-dimensional problem (d = 2 𝑑 2 d=2 italic_d = 2 ). The corresponding
approximation for σ t subscript 𝜎 𝑡 \sigma_{t} italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT provides, however, a good upper bound for p t subscript 𝑝 𝑡 p_{t} italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .
Figure 3: Same experimental setting as in Figure 2 but with the data now generated from the multi-scale SDE (4 ).
Again, subsampling the data in intervals of Δ t = 0.06 Δ 𝑡 0.06 \Delta t=0.06 roman_Δ italic_t = 0.06 and high-frequency assimilation with step-size Δ τ = 10 − 4 Δ 𝜏 superscript 10 4 \Delta\tau=10^{-4} roman_Δ italic_τ = 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT lead to very similar results in terms of their frequentist means and variances.
We now replace the data generating SDE model (2 ) by the multi-scale formulation (4 ) with ϵ = 0.01 italic-ϵ 0.01 \epsilon=0.01 italic_ϵ = 0.01 and β = 2 𝛽 2 \beta=2 italic_β = 2 . This parameter choice agrees with the one used in [10 ] . We again find that assimilating the data at the slow time-scale Δ t = 0.06 Δ 𝑡 0.06 \Delta t=0.06 roman_Δ italic_t = 0.06 leads to very similar results obtained from an assimilation at the fast time-scale Δ τ = 10 − 4 Δ 𝜏 superscript 10 4 \Delta\tau=10^{-4} roman_Δ italic_τ = 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT with the EnKBF formulation (4 ), provided the correction term resulting from the second-order iterated integral (73 ) is included. See Figure 3 . We also verified numerically that Δ t = 0.06 Δ 𝑡 0.06 \Delta t=0.06 roman_Δ italic_t = 0.06 constitutes a nearly optimal step-size in the sense of making (85 ) sufficiently small while maintaining numerical accuracy. For example, reducing the outer step-size to Δ t = 0.02 Δ 𝑡 0.02 \Delta t=0.02 roman_Δ italic_t = 0.02 leads to h ( 0.02 ) − h ( 0.06 ) ≈ 10 ℎ 0.02 ℎ 0.06 10 h(0.02)-h(0.06)\approx 10 italic_h ( 0.02 ) - italic_h ( 0.06 ) ≈ 10 in (85 ).
We finally implement the data filtering approaches (4.2 ) and (4.2 ) with δ = 0.1 𝛿 0.1 \delta=0.1 italic_δ = 0.1 using the true signal X t † superscript subscript 𝑋 𝑡 † X_{t}^{\dagger} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT and its multi-scale representation X t ( ϵ ) superscript subscript 𝑋 𝑡 italic-ϵ X_{t}^{(\epsilon)} italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ϵ ) end_POSTSUPERSCRIPT , respectively. The numerical implementation with
a step-size Δ t = Δ τ = 10 − 4 Δ 𝑡 Δ 𝜏 superscript 10 4 \Delta t=\Delta\tau=10^{-4} roman_Δ italic_t = roman_Δ italic_τ = 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT resulted in approximations μ t n subscript 𝜇 subscript 𝑡 𝑛 \mu_{t_{n}} italic_μ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT which converged to the true parameter value θ † = 1 superscript 𝜃 † 1 \theta^{\dagger}=1 italic_θ start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = 1 as t → T = 6 → 𝑡 𝑇 6 t\to T=6 italic_t → italic_T = 6 without the need for including further corrections terms; as expected from the results in Section 4.2 .