1 Introduction
The vector autoregression (VAR) is
widely applied in control, signal processing, time series and econometrics.
The lag-one VAR (VAR(1)) is more general than it seems
since any higher order VAR can be written as a state space model
whose state equation is a VAR(1) [12 ] ,[27 ] .
More recently, the VAR(1) has been used in
reinforcement learning [23 , 1 , 17 , 19 ] .
High dimensional time series have now become very common
across many disciplines, e.g.
epidemiology [28 ] , financial economics [20 ] .
So rank reduced modelling has gained increasing attention
[20 ] ,[4 ] ,[5 ] .
In physical system, stability is crucial.
And this has led to a continuing development
of stability guaranteed estimators, e.g. for
state space modelling.
A common approach employs a perturbation minimization procedure,
where a preliminary unstable least-squares estimator is stabilized
by the smallest additive perturbation.
For example,
Mari et al. [15 ] consider a semidefinite programming (SP) problem
with a Lyapunov stability constraint.
The problem can be solved by standard linear matrix inequality (LMI) methods.
However, the computation becomes prohibitive with higher-dimensional states.
Miller and de Callafon [16 ] extend the SP method to
include more eigenvalue constraints,
besides stability.
Boots et al. [2 ] and the first algorithm in [14 ]
use singular value constraints instead of eigenvalue constraints,
which can be too conservative.
The second algorithm in [14 ] uses a line search method
based on the gradient sampling and
the reliability is sensitive to the user-defined step size parameter.
Tanaka and Katayama [24 ] and Jongeneel et al. [10 ]
project the unstable estimator onto a stable region
by solving linear quadratic regulator (LQR) problems.
Jongeneel et al. [10 ] show that
the computational efficiency is greatly improved in higher-dimension applications,
and provide error analysis and statistical guarantees.
Other approaches include,
Chui and Maciejowski [6 ]
who iteratively augment the unstable estimator
until its largest eigenvalues have the modulus of the user-defined parameter.
However, this method distorts the estimator and introduces additional bias.
Van Gestel et al. [26 ] introduce a regularization term
to the least squares problem
so that the variance of the estimators is reduced and in the meantime,
the stability is guaranteed.
However, the regularization is too conservative.
Umenberger et al. [25 ] consider the state space modelling
and take the maximum likelihood approach to guarantee stability.
However, their method is computationally expensive.
In recent work,
we [21 ]
solved a state space estimation problem by
using a Burg-type forwards-backwards (FB) optimization
[3 , 22 , 18 ] on the error residuals and
remarkably find a closed-form stable solution that is computationally cheap
and involves no tuning parameters.
Among the above works, [10 ] deal with pure VAR(1) and
the others consider the state space setup
where subspace identification methods are needed before fitting a VAR(1) model [27 ] .
However, none of the above stability-enforced methods
deals with the reduced rank modeling.
In this paper, we
extend the work in [21 ] ,
using the FB approach
to obtain a reduced rank (RR)-VAR(1) with guaranteed stability.
The FB approach generates an estimator that
is no more computationally expensive than
the least squares RR method of [20 ] .
Further, no tuning parameters are required.
The rest of the paper is organized as follows.
We first introduce the forwards and backwards VAR(1) models
in Section 2,
and then develop for the first time:
(i)
in Section 3 a closed-form stable estimator for the full-rank VAR(1),
(ii)
In Section 4, based on (i), a closed-form stable estimator for the RR-VAR(1),
(iii)
in Section 5, statistical consistency, and asymptotic efficiency
for the new estimator.
The results are illustrated in comparative simulations in Section 6.
Section 7 contains conclusions.
We use the following notations.
ρ ( A ) 𝜌 𝐴 \rho(A) italic_ρ ( italic_A ) is the spectral radius of A 𝐴 A italic_A .
‖ A ‖ = tr ( A A ′ ) norm 𝐴 tr 𝐴 superscript 𝐴 ′ \|A\|=\sqrt{\operatorname{tr}(AA^{\prime})} ∥ italic_A ∥ = square-root start_ARG roman_tr ( italic_A italic_A start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG is the Frobenius norm of A 𝐴 A italic_A .
A > 0 𝐴 0 A>0 italic_A > 0 means A 𝐴 A italic_A is positive definite.
A ≥ 0 𝐴 0 A\geq 0 italic_A ≥ 0 denotes positive semi-definiteness.
w.p.1 means with probability 1.
2 VAR(1) and the Backwards Model
In this section, we review the (forwards) VAR(1) and its associated backwards model.
The VAR(1) is a Markov process generated by
y t = F y t − 1 + w t , t = 1 , ⋯ , T , formulae-sequence subscript 𝑦 𝑡 𝐹 subscript 𝑦 𝑡 1 subscript 𝑤 𝑡 𝑡 1 ⋯ 𝑇
\displaystyle y_{t}=Fy_{t-1}+w_{t},\quad t=1,\dotsm,T, italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_F italic_y start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT + italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t = 1 , ⋯ , italic_T ,
(2.1)
where y t subscript 𝑦 𝑡 y_{t} italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the observed n − limit-from 𝑛 n- italic_n - vector time series,
w t subscript 𝑤 𝑡 w_{t} italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is a zero-mean driving white noise
with a non-singular covariance matrix Q 𝑄 Q italic_Q ,
and F 𝐹 F italic_F is the transition matrix, assumed to be stable.
The stability of F 𝐹 F italic_F
ensures the existence of a steady state variance matrix Π Π \Pi roman_Π
which obeys a discrete-time (DT) Lyapunov equation
Π = F Π F ′ + Q . Π 𝐹 Π superscript 𝐹 ′ 𝑄 \displaystyle\Pi=F\Pi F^{\prime}+Q. roman_Π = italic_F roman_Π italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_Q .
A sufficient condition for Π Π \Pi roman_Π to be positive definite is that Q 𝑄 Q italic_Q is positive definite.
We introduce the steady state lag-one cross-covariance
Π 10 = E [ y t y t − 1 ′ ] = F Π = Π 01 ′ subscript Π 10 E subscript 𝑦 𝑡 superscript subscript 𝑦 𝑡 1 ′ 𝐹 Π superscript subscript Π 01 ′ \Pi_{10}=\operatorname{E}[y_{t}y_{t-1}^{\prime}]=F\Pi=\Pi_{01}^{\prime} roman_Π start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT = roman_E [ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] = italic_F roman_Π = roman_Π start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ,
so that
F = Π 10 Π − 1 𝐹 subscript Π 10 superscript Π 1 F=\Pi_{10}\Pi^{-1} italic_F = roman_Π start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
We further introduce the associated correlation matrix
R F = Π − 1 2 Π 10 Π − 1 2 = Π − 1 2 F Π 1 2 subscript 𝑅 𝐹 superscript Π 1 2 subscript Π 10 superscript Π 1 2 superscript Π 1 2 𝐹 superscript Π 1 2 R_{F}=\Pi^{-\frac{1}{2}}\Pi_{10}\Pi^{-\frac{1}{2}}=\Pi^{-\frac{1}{2}}F\Pi^{%
\frac{1}{2}} italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT = roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT roman_Π start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT = roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_F roman_Π start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT
which has the same eigenvalues as F 𝐹 F italic_F .
Associated with the VAR(1) is a stationary
backwards model [11 ]
y t − 1 = F b y t + w b , t − 1 , t = T , ⋯ , 1 , formulae-sequence subscript 𝑦 𝑡 1 subscript 𝐹 𝑏 subscript 𝑦 𝑡 subscript 𝑤 𝑏 𝑡 1
𝑡 𝑇 ⋯ 1
\displaystyle y_{t-1}=F_{b}y_{t}+w_{b,t-1},\quad t=T,\dotsm,1, italic_y start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT = italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_w start_POSTSUBSCRIPT italic_b , italic_t - 1 end_POSTSUBSCRIPT , italic_t = italic_T , ⋯ , 1 ,
where F b subscript 𝐹 𝑏 F_{b} italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT is the backwards transition matrix
and w b , t − 1 subscript 𝑤 𝑏 𝑡 1
w_{b,t-1} italic_w start_POSTSUBSCRIPT italic_b , italic_t - 1 end_POSTSUBSCRIPT is a white noise with covariance matrix Q b subscript 𝑄 𝑏 Q_{b} italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT
which is statistically independent of y t subscript 𝑦 𝑡 y_{t} italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .
We note that [11 ]
F b = E [ y t − 1 y t ′ ] E − 1 [ y t y t ] = Π 10 ′ Π − 1 = Π F ′ Π − 1 , subscript 𝐹 𝑏 E subscript 𝑦 𝑡 1 superscript subscript 𝑦 𝑡 ′ superscript E 1 subscript 𝑦 𝑡 subscript 𝑦 𝑡 superscript subscript Π 10 ′ superscript Π 1 Π superscript 𝐹 ′ superscript Π 1 \displaystyle F_{b}=\operatorname{E}[y_{t-1}y_{t}^{\prime}]\operatorname{E}^{-%
1}[y_{t}y_{t}]=\Pi_{10}^{\prime}\Pi^{-1}=\Pi F^{\prime}\Pi^{-1}, italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = roman_E [ italic_y start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] roman_E start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT [ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] = roman_Π start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = roman_Π italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ,
and
Q b subscript 𝑄 𝑏 \displaystyle Q_{b} italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT
= E [ ( y t − 1 − F b y t ) ( y t − 1 − F b y t ) ′ ] absent E subscript 𝑦 𝑡 1 subscript 𝐹 𝑏 subscript 𝑦 𝑡 superscript subscript 𝑦 𝑡 1 subscript 𝐹 𝑏 subscript 𝑦 𝑡 ′ \displaystyle=\operatorname{E}[(y_{t-1}-F_{b}y_{t})(y_{t-1}-F_{b}y_{t})^{%
\prime}] = roman_E [ ( italic_y start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT - italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ( italic_y start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT - italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ]
= Π − F b Π 10 − Π 10 ′ F b ′ + F b Π F b ′ absent Π subscript 𝐹 𝑏 subscript Π 10 superscript subscript Π 10 ′ superscript subscript 𝐹 𝑏 ′ subscript 𝐹 𝑏 Π superscript subscript 𝐹 𝑏 ′ \displaystyle=\Pi-F_{b}\Pi_{10}-\Pi_{10}^{\prime}F_{b}^{\prime}+F_{b}\Pi F_{b}%
^{\prime} = roman_Π - italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT roman_Π start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT - roman_Π start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT roman_Π italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT
= Π − Π 01 Π − 1 Π 10 absent Π subscript Π 01 superscript Π 1 subscript Π 10 \displaystyle=\Pi-\Pi_{01}\Pi^{-1}\Pi_{10} = roman_Π - roman_Π start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Π start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT
= Π − F b Π F b ′ . absent Π subscript 𝐹 𝑏 Π superscript subscript 𝐹 𝑏 ′ \displaystyle=\Pi-F_{b}\Pi F_{b}^{\prime}. = roman_Π - italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT roman_Π italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT .
An elementary argument shows that
F b subscript 𝐹 𝑏 F_{b} italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT has the same eigenvalues of F 𝐹 F italic_F .
Thus F b subscript 𝐹 𝑏 F_{b} italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT is stable iff F 𝐹 F italic_F is stable.
The following results are used below.
Theorem 1. Converse Lyapunov result.
If Π Π \Pi roman_Π is positive definite and Q 𝑄 Q italic_Q is positive semi-definite then
any eigenvalue λ 𝜆 \lambda italic_λ of F 𝐹 F italic_F has | λ | ≤ 1 𝜆 1 |\lambda|\leq 1 | italic_λ | ≤ 1 . If Q 𝑄 Q italic_Q is positive definite
then | λ | < 1 𝜆 1 |\lambda|<1 | italic_λ | < 1 .
Proof .
Let v 𝑣 v italic_v be left eigenvector of F 𝐹 F italic_F with eigenvalue λ 𝜆 \lambda italic_λ . Then
denoting ( v ∗ ) ′ superscript superscript 𝑣 ∗ ′ (v^{\ast})^{\prime} ( italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT as v H superscript 𝑣 𝐻 v^{H} italic_v start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT we find
v H Π v = | λ | 2 v H Π v + v H Q v ⇒ ( 1 − | λ | 2 ) v H Π v = v H Q v ≥ 0 ⇒ | λ | ≤ 1 . missing-subexpression superscript 𝑣 𝐻 Π 𝑣 superscript 𝜆 2 superscript 𝑣 𝐻 Π 𝑣 superscript 𝑣 𝐻 𝑄 𝑣 ⇒ 1 superscript 𝜆 2 superscript 𝑣 𝐻 Π 𝑣 superscript 𝑣 𝐻 𝑄 𝑣 0 ⇒ 𝜆 1 \displaystyle\begin{array}[]{rrcl}&v^{H}\Pi v&=&|\lambda|^{2}v^{H}\Pi v+v^{H}%
Qv\\
\Rightarrow&(1-|\lambda|^{2})v^{H}\Pi v&=&v^{H}Qv\geq 0\Rightarrow|\lambda|%
\leq 1.\end{array} start_ARRAY start_ROW start_CELL end_CELL start_CELL italic_v start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Π italic_v end_CELL start_CELL = end_CELL start_CELL | italic_λ | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_v start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Π italic_v + italic_v start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_Q italic_v end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL ( 1 - | italic_λ | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_v start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Π italic_v end_CELL start_CELL = end_CELL start_CELL italic_v start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_Q italic_v ≥ 0 ⇒ | italic_λ | ≤ 1 . end_CELL end_ROW end_ARRAY
Clearly if Q 𝑄 Q italic_Q is positive definite we get | λ | < 1 𝜆 1 |\lambda|<1 | italic_λ | < 1 .
□ □ \square □
Theorem 2 . Q b subscript 𝑄 𝑏 Q_{b} italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT has full rank iff Q 𝑄 Q italic_Q has full rank.
Proof .
First note that Q b = Π Q c Π subscript 𝑄 𝑏 Π subscript 𝑄 𝑐 Π Q_{b}=\Pi Q_{c}\Pi italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = roman_Π italic_Q start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT roman_Π where Q c = Π − 1 − F ′ Π − 1 F subscript 𝑄 𝑐 superscript Π 1 superscript 𝐹 ′ superscript Π 1 𝐹 Q_{c}=\Pi^{-1}-F^{\prime}\Pi^{-1}F italic_Q start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_F .
Then clearly Q b subscript 𝑄 𝑏 Q_{b} italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT has full rank iff Q c subscript 𝑄 𝑐 Q_{c} italic_Q start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT has full rank.
Suppose for some vector v ≠ 0 𝑣 0 v\neq 0 italic_v ≠ 0 , we have Q b v = 0 subscript 𝑄 𝑏 𝑣 0 Q_{b}v=0 italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_v = 0 .
Then Q c Π v = 0 subscript 𝑄 𝑐 Π 𝑣 0 Q_{c}\Pi v=0 italic_Q start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT roman_Π italic_v = 0 . Thus, Π − 1 Π v = F ′ Π − 1 F Π v superscript Π 1 Π 𝑣 superscript 𝐹 ′ superscript Π 1 𝐹 Π 𝑣 \Pi^{-1}\Pi v=F^{\prime}\Pi^{-1}F\Pi v roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Π italic_v = italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_F roman_Π italic_v ,
i.e. v = F ′ Π − 1 F Π v 𝑣 superscript 𝐹 ′ superscript Π 1 𝐹 Π 𝑣 v=F^{\prime}\Pi^{-1}F\Pi v italic_v = italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_F roman_Π italic_v . Note that we cannot have F Π v = 0 𝐹 Π 𝑣 0 F\Pi v=0 italic_F roman_Π italic_v = 0 since then
v = 0 𝑣 0 v=0 italic_v = 0 which is a contradiction.
Now consider that
Q Π − 1 = I − F Π F ′ Π − 1 ⇒ Q Π − 1 F Π = F Π − F Π F ′ Π − 1 F Π ⇒ Q Π − 1 F Π v = F Π v − F Π F ′ Π − 1 F Π v = F Π v − F Π v = 0 missing-subexpression 𝑄 superscript Π 1 𝐼 𝐹 Π superscript 𝐹 ′ superscript Π 1 ⇒ 𝑄 superscript Π 1 𝐹 Π 𝐹 Π 𝐹 Π superscript 𝐹 ′ superscript Π 1 𝐹 Π ⇒ 𝑄 superscript Π 1 𝐹 Π 𝑣 𝐹 Π 𝑣 𝐹 Π superscript 𝐹 ′ superscript Π 1 𝐹 Π 𝑣 missing-subexpression missing-subexpression 𝐹 Π 𝑣 𝐹 Π 𝑣 0 \displaystyle\begin{array}[]{rrcl}&Q\Pi^{-1}&=&I-F\Pi F^{\prime}\Pi^{-1}\\
\Rightarrow&Q\Pi^{-1}F\Pi&=&F\Pi-F\Pi F^{\prime}\Pi^{-1}F\Pi\\
\Rightarrow&Q\Pi^{-1}F\Pi v&=&F\Pi v-F\Pi F^{\prime}\Pi^{-1}F\Pi v\\
&&=&F\Pi v-F\Pi v=0\end{array} start_ARRAY start_ROW start_CELL end_CELL start_CELL italic_Q roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL start_CELL = end_CELL start_CELL italic_I - italic_F roman_Π italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL italic_Q roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_F roman_Π end_CELL start_CELL = end_CELL start_CELL italic_F roman_Π - italic_F roman_Π italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_F roman_Π end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL italic_Q roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_F roman_Π italic_v end_CELL start_CELL = end_CELL start_CELL italic_F roman_Π italic_v - italic_F roman_Π italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_F roman_Π italic_v end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL = end_CELL start_CELL italic_F roman_Π italic_v - italic_F roman_Π italic_v = 0 end_CELL end_ROW end_ARRAY
which is a contradiction and proves ‘if’; then ‘only if’ follows by running the argument
in reverse.
□ □ \square □
3 Stable Estimators for full-rank VAR(1)
Here we review earlier work of [18 , 22 , 21 ]
and develop a new result
on the full-rank case.
Given data y t , t = 0 , ⋯ , T formulae-sequence subscript 𝑦 𝑡 𝑡
0 ⋯ 𝑇
y_{t},t=0,\cdots,T italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t = 0 , ⋯ , italic_T set
Y 0 = [ y 0 ⋯ y T − 1 ] subscript 𝑌 0 delimited-[] matrix subscript 𝑦 0 ⋯ subscript 𝑦 𝑇 1 Y_{0}=[\begin{matrix}y_{0}&\dotsm&y_{T-1}\end{matrix}] italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_T - 1 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] and
Y 1 = [ y 1 ⋯ y T ] , subscript 𝑌 1 delimited-[] matrix subscript 𝑦 1 ⋯ subscript 𝑦 𝑇 Y_{1}=[\begin{matrix}y_{1}&\dotsm&y_{T}\end{matrix}], italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ,
and define the sample covariances
S i j = 1 T Y i Y j ′ , i , j ∈ { 0 , 1 } formulae-sequence subscript 𝑆 𝑖 𝑗 1 𝑇 subscript 𝑌 𝑖 superscript subscript 𝑌 𝑗 ′ 𝑖
𝑗 0 1 S_{ij}=\frac{1}{T}Y_{i}Y_{j}^{\prime},\quad i,j\in\{0,1\} italic_S start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_T end_ARG italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_i , italic_j ∈ { 0 , 1 } .
We also introduce the forwards and backwards residual mean squared errors
S w , f ( F ) subscript 𝑆 𝑤 𝑓
𝐹 \displaystyle S_{w,f}(F) italic_S start_POSTSUBSCRIPT italic_w , italic_f end_POSTSUBSCRIPT ( italic_F )
= 1 T ( Y 1 − F Y 0 ) ( Y 1 − F Y 0 ) ′ absent 1 𝑇 subscript 𝑌 1 𝐹 subscript 𝑌 0 superscript subscript 𝑌 1 𝐹 subscript 𝑌 0 ′ \displaystyle=\frac{1}{T}(Y_{1}-FY_{0})(Y_{1}-FY_{0})^{\prime} = divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ( italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_F italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ( italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_F italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT
= S 11 − F S 01 − S 10 F ′ + F S 00 F ′ absent subscript 𝑆 11 𝐹 subscript 𝑆 01 subscript 𝑆 10 superscript 𝐹 ′ 𝐹 subscript 𝑆 00 superscript 𝐹 ′ \displaystyle=S_{11}-FS_{01}-S_{10}F^{\prime}+FS_{00}F^{\prime} = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - italic_F italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_F italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT
S w , b ( F ) subscript 𝑆 𝑤 𝑏
𝐹 \displaystyle S_{w,b}(F) italic_S start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT ( italic_F )
= 1 T ( Y 0 − F b Y 1 ) ( Y 0 − F b Y 1 ) ′ absent 1 𝑇 subscript 𝑌 0 subscript 𝐹 𝑏 subscript 𝑌 1 superscript subscript 𝑌 0 subscript 𝐹 𝑏 subscript 𝑌 1 ′ \displaystyle=\frac{1}{T}(Y_{0}-F_{b}Y_{1})(Y_{0}-F_{b}Y_{1})^{\prime} = divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ( italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ( italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT
= S 00 − F b S 10 − S 01 F b ′ + F b S 11 F b ′ , absent subscript 𝑆 00 subscript 𝐹 𝑏 subscript 𝑆 10 subscript 𝑆 01 superscript subscript 𝐹 𝑏 ′ subscript 𝐹 𝑏 subscript 𝑆 11 superscript subscript 𝐹 𝑏 ′ \displaystyle=S_{00}-F_{b}S_{10}-S_{01}F_{b}^{\prime}+F_{b}S_{11}F_{b}^{\prime}, = italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT - italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ,
where
F b = P F ′ P − 1 subscript 𝐹 𝑏 𝑃 superscript 𝐹 ′ superscript 𝑃 1 F_{b}=PF^{\prime}P^{-1} italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = italic_P italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT , where P 𝑃 P italic_P is
a consistent estimator of Π Π \Pi roman_Π to be chosen.
Here F 𝐹 F italic_F is no longer the true value.
The least squares estimator
F ^ L S = S 10 S 00 − 1 subscript ^ 𝐹 𝐿 𝑆 subscript 𝑆 10 superscript subscript 𝑆 00 1 \hat{F}_{LS}=S_{10}S_{00}^{-1} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT minimises tr ( S w , f ) tr subscript 𝑆 𝑤 𝑓
\operatorname{tr}(S_{w,f}) roman_tr ( italic_S start_POSTSUBSCRIPT italic_w , italic_f end_POSTSUBSCRIPT ) ,
but it is NOT guaranteed to be stable.
Continuing, [18 , 22 ] considered the minimizer of the sum
of weighted forwards and backwards sample mean squared errors
J ( F ; P ) = tr { P − 1 ( S w , f + S w , b ) } . 𝐽 𝐹 𝑃
tr superscript 𝑃 1 subscript 𝑆 𝑤 𝑓
subscript 𝑆 𝑤 𝑏
\displaystyle J(F;P)=\operatorname{tr}\{P^{-1}(S_{w,f}+S_{w,b})\}. italic_J ( italic_F ; italic_P ) = roman_tr { italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_S start_POSTSUBSCRIPT italic_w , italic_f end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT ) } .
This yields the following remarkable result.
Theorem 3A . [22 , 21 , 18 ] .
The minimizer of J ( F ; P ) 𝐽 𝐹 𝑃
J(F;P) italic_J ( italic_F ; italic_P ) is F ^ ^ 𝐹 \hat{F} over^ start_ARG italic_F end_ARG ,
the solution obeys the Sylvester equation below and is stable
F ^ S 00 P − 1 + S 11 P − 1 F ^ = 2 S 10 P − 1 . ^ 𝐹 subscript 𝑆 00 superscript 𝑃 1 subscript 𝑆 11 superscript 𝑃 1 ^ 𝐹 2 subscript 𝑆 10 superscript 𝑃 1 \displaystyle\hat{F}S_{00}P^{-1}+S_{11}P^{-1}\hat{F}=2S_{10}P^{-1}. over^ start_ARG italic_F end_ARG italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG = 2 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
Remarks.
(a)
This result is not explicitly stated in [22 ] but is rather a special case
of his results. Further the argument in [22 ] is
very hard to follow. Partly for that reason, [21 ]
gave a simple direct proof. Note that Theorem 4B below
has nearly the same proof.
(b)
The result only holds with the weighting matrix P 𝑃 P italic_P
used to define F b subscript 𝐹 𝑏 F_{b} italic_F start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT , not
for any other weighting matrix.
(c)
Using vec algebra we can get a closed-form solution.
We now have the further remarkable closed-form result.
Theorem 3B . Set P = S 11 𝑃 subscript 𝑆 11 P=S_{11} italic_P = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT . Then the stable estimator
F ^ = F ^ 11 = 2 S 10 ( S 00 + S 11 ) − 1 , ^ 𝐹 subscript ^ 𝐹 11 2 subscript 𝑆 10 superscript subscript 𝑆 00 subscript 𝑆 11 1 \displaystyle\hat{F}=\hat{F}_{11}=2S_{10}(S_{00}+S_{11})^{-1}, over^ start_ARG italic_F end_ARG = over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT = 2 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ,
Proof . Follows from Theorem 3A by setting P = S 11 𝑃 subscript 𝑆 11 P=S_{11} italic_P = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT .
Remarks .
(d)
Below we find that the choice P = S 11 𝑃 subscript 𝑆 11 P=S_{11} italic_P = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT
enables our new results.
(e)
Note that
consistency and asymptotic efficiency
has been established for F ^ L S subscript ^ 𝐹 𝐿 𝑆 \hat{F}_{LS} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT in [12 ] .
One can then show the same for F ^ ^ 𝐹 \hat{F} over^ start_ARG italic_F end_ARG .
We omit
details, concentrating here on
the reduced rank case.
4 Stable Reduced Rank VAR(1) Estimation
We now consider the case where F 𝐹 F italic_F has rank m < n 𝑚 𝑛 m<n italic_m < italic_n .
Then we can factor F = A n × m B m × n 𝐹 subscript 𝐴 𝑛 𝑚 subscript 𝐵 𝑚 𝑛 F=A_{n\times m}B_{m\times n} italic_F = italic_A start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_m × italic_n end_POSTSUBSCRIPT .
We then have the following result.
Theorem 4A . [20 ] [Section 2.3]. Reduced Rank Least Squares.
min rank ( F ) = m J L S : J L S = tr ( S 11 − 1 S w , f ) : subscript rank 𝐹 𝑚 subscript 𝐽 𝐿 𝑆 subscript 𝐽 𝐿 𝑆 tr superscript subscript 𝑆 11 1 subscript 𝑆 𝑤 𝑓
\displaystyle\min_{\operatorname{rank}(F)=m}J_{LS}:J_{LS}=\operatorname{tr}(S_%
{11}^{-1}S_{w,f}) roman_min start_POSTSUBSCRIPT roman_rank ( italic_F ) = italic_m end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT : italic_J start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT = roman_tr ( italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_w , italic_f end_POSTSUBSCRIPT )
has solution
F ^ R L S = S 11 1 2 V ^ * , m V ^ * , m ′ S 11 − 1 2 F ^ L S subscript ^ 𝐹 𝑅 𝐿 𝑆 superscript subscript 𝑆 11 1 2 subscript ^ 𝑉 𝑚
superscript subscript ^ 𝑉 𝑚
′ superscript subscript 𝑆 11 1 2 subscript ^ 𝐹 𝐿 𝑆 \hat{F}_{RLS}=S_{11}^{\frac{1}{2}}\hat{V}_{*,m}\hat{V}_{*,m}^{\prime}S_{11}^{-%
\frac{1}{2}}\hat{F}_{LS} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT
where V ^ * , m = [ v ^ * , 1 ⋯ v ^ * , m ] subscript ^ 𝑉 𝑚
delimited-[] matrix subscript ^ 𝑣 1
⋯ subscript ^ 𝑣 𝑚
\hat{V}_{*,m}=[\begin{matrix}\hat{v}_{*,1}&\dotsm&\hat{v}_{*,m}\end{matrix}] over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , 1 end_POSTSUBSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ]
and v ^ ∗ , r subscript ^ 𝑣 ∗ 𝑟
\hat{v}_{\ast,r} over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT ∗ , italic_r end_POSTSUBSCRIPT are the ‘top’ m − limit-from 𝑚 m- italic_m - eigenvectors of
R ^ * = S 11 − 1 2 S 10 S 00 − 1 S 01 S 11 − 1 2 . subscript ^ 𝑅 superscript subscript 𝑆 11 1 2 subscript 𝑆 10 superscript subscript 𝑆 00 1 subscript 𝑆 01 superscript subscript 𝑆 11 1 2 \hat{R}_{*}=S_{11}^{-\frac{1}{2}}S_{10}S_{00}^{-1}S_{01}S_{11}^{-\frac{1}{2}}. over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT .
The noise covariance estimator is
Q ^ R L S = S 11 1 2 ( I − V ^ * , m D ^ * , m 2 V ^ * , m ′ ) S 11 1 2 subscript ^ 𝑄 𝑅 𝐿 𝑆 superscript subscript 𝑆 11 1 2 𝐼 subscript ^ 𝑉 𝑚
superscript subscript ^ 𝐷 𝑚
2 superscript subscript ^ 𝑉 𝑚
′ superscript subscript 𝑆 11 1 2 \hat{Q}_{RLS}=S_{11}^{\frac{1}{2}}(I-\hat{V}_{*,m}\hat{D}_{*,m}^{2}\hat{V}_{*,%
m}^{\prime})S_{11}^{\frac{1}{2}} over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( italic_I - over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT over^ start_ARG italic_D end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT
where D ^ * , m 2 = diag ( λ ^ * , k ) superscript subscript ^ 𝐷 𝑚
2 diag subscript ^ 𝜆 𝑘
\hat{D}_{*,m}^{2}=\operatorname{diag}(\hat{\lambda}_{*,k}) over^ start_ARG italic_D end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = roman_diag ( over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT ) contains the top m 𝑚 m italic_m eigenvalues of R ^ * subscript ^ 𝑅 \hat{R}_{*} over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT .
Remarks .
(f)
F ^ R L S subscript ^ 𝐹 𝑅 𝐿 𝑆 \hat{F}_{RLS} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT is not guaranteed to be stable.
(g)
We note for future reference that R ^ * subscript ^ 𝑅 \hat{R}_{*} over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT has the same
eigenvalues as L ^ * = S 10 S 00 − 1 S 01 S 11 − 1 subscript ^ 𝐿 subscript 𝑆 10 superscript subscript 𝑆 00 1 subscript 𝑆 01 superscript subscript 𝑆 11 1 \hat{L}_{*}=S_{10}S_{00}^{-1}S_{01}S_{11}^{-1} over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
(h)
We can re-express the results in terms of the SVD of
G ^ * = S 11 − 1 2 S 10 S 00 − 1 2 = V ^ * D ^ * U ^ * ′ subscript ^ 𝐺 superscript subscript 𝑆 11 1 2 subscript 𝑆 10 superscript subscript 𝑆 00 1 2 subscript ^ 𝑉 subscript ^ 𝐷 superscript subscript ^ 𝑈 ′ \hat{G}_{*}=S_{11}^{-\frac{1}{2}}S_{10}S_{00}^{-\frac{1}{2}}=\hat{V}_{*}\hat{D%
}_{*}\hat{U}_{*}^{\prime} over^ start_ARG italic_G end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT = over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT over^ start_ARG italic_D end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT over^ start_ARG italic_U end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT
since R ^ * = G ^ * G ^ * ′ subscript ^ 𝑅 subscript ^ 𝐺 superscript subscript ^ 𝐺 ′ \hat{R}_{*}=\hat{G}_{*}\hat{G}_{*}^{\prime} over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT = over^ start_ARG italic_G end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT .
It follows from the Biename-Cauchy-Schwarz inequality that
the singular values of G ^ * subscript ^ 𝐺 \hat{G}_{*} over^ start_ARG italic_G end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT are < 1 absent 1 <1 < 1 w.p.1.
(i)
[20 ] gives a different formula for Q ^ R L S subscript ^ 𝑄 𝑅 𝐿 𝑆 \hat{Q}_{RLS} over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT
but it is straightforward to show it equals the one given here.
We would now like to set up a FB version of this problem.
But it turns out that for general P 𝑃 P italic_P there is no simple solution.
Fortunately if we choose P = S 11 𝑃 subscript 𝑆 11 P=S_{11} italic_P = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT we can find a simple guaranteed stable estimator.
Theorem 4B .
The solution to
min rank ( F ) = m J ( F ; S 11 ) subscript rank 𝐹 𝑚 𝐽 𝐹 subscript 𝑆 11
\min_{\operatorname{rank}(F)=m}J(F;S_{11}) roman_min start_POSTSUBSCRIPT roman_rank ( italic_F ) = italic_m end_POSTSUBSCRIPT italic_J ( italic_F ; italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) is given by
F ^ R = S 11 1 2 V ^ m V ^ m ′ S 11 − 1 2 F ^ 11 subscript ^ 𝐹 𝑅 superscript subscript 𝑆 11 1 2 subscript ^ 𝑉 𝑚 superscript subscript ^ 𝑉 𝑚 ′ superscript subscript 𝑆 11 1 2 subscript ^ 𝐹 11 \hat{F}_{R}=S_{11}^{\frac{1}{2}}\hat{V}_{m}\hat{V}_{m}^{\prime}S_{11}^{-\frac{%
1}{2}}\hat{F}_{11} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT where
V ^ m = [ v ^ 1 ⋯ v ^ m ] subscript ^ 𝑉 𝑚 delimited-[] matrix subscript ^ 𝑣 1 ⋯ subscript ^ 𝑣 𝑚 \hat{V}_{m}=[\begin{matrix}\hat{v}_{1}&\dotsm&\hat{v}_{m}\end{matrix}] over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ]
and v ^ r subscript ^ 𝑣 𝑟 \hat{v}_{r} over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT are the ‘top’ m 𝑚 m italic_m eigenvectors
(corresponding to the top m 𝑚 m italic_m eigenvalues in D ^ m = diag ( λ ^ k ) subscript ^ 𝐷 𝑚 diag subscript ^ 𝜆 𝑘 \hat{D}_{m}=\operatorname{diag}(\hat{\lambda}_{k}) over^ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = roman_diag ( over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ) of
R ^ = 2 S 11 − 1 2 S 10 ( S 00 + S 11 ) − 1 S 01 S 11 − 1 2 . ^ 𝑅 2 superscript subscript 𝑆 11 1 2 subscript 𝑆 10 superscript subscript 𝑆 00 subscript 𝑆 11 1 subscript 𝑆 01 superscript subscript 𝑆 11 1 2 \displaystyle\hat{R}=2S_{11}^{-\frac{1}{2}}S_{10}(S_{00}+S_{11})^{-1}S_{01}S_{%
11}^{-\frac{1}{2}}. over^ start_ARG italic_R end_ARG = 2 italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT .
Further F ^ R subscript ^ 𝐹 𝑅 \hat{F}_{R} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT is stable.
5 Asymptotic Analysis
Here we show the consistency and central limit theorem (CLT)
for F ^ R subscript ^ 𝐹 𝑅 \hat{F}_{R} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT by reducing them to results for F ^ R L S subscript ^ 𝐹 𝑅 𝐿 𝑆 \hat{F}_{RLS} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT .
For asymptotic results, we need stronger assumptions.
There are a wide range of possibilities, but
to keep things simple, while still retaining
reasonable generality we use:
Assumption A1 .
w t subscript 𝑤 𝑡 w_{t} italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in (2.1) are iid with finite fourth moments,
and Q 𝑄 Q italic_Q is positive definite.
Assumption A2 . F 𝐹 F italic_F has rank m 𝑚 m italic_m and the
non zero eigenvalues of F 𝐹 F italic_F are distinct.
This enables the following result
for F ^ R L S subscript ^ 𝐹 𝑅 𝐿 𝑆 \hat{F}_{RLS} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT (where F 𝐹 F italic_F is again the true value).
Theorem 5A . Assuming m = rank ( F ) 𝑚 rank 𝐹 m=\operatorname{rank}(F) italic_m = roman_rank ( italic_F ) is known
(i)
Under A1: S 00 → 𝑝 Π 𝑝 → subscript 𝑆 00 Π S_{00}\xrightarrow{p}\Pi italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW roman_Π and S 11 → 𝑝 Π 𝑝 → subscript 𝑆 11 Π S_{11}\xrightarrow{p}\Pi italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW roman_Π .
(ii)
Under A1: F ^ L S → 𝑝 F ≡ S 10 → 𝑝 F Π 𝑝 → subscript ^ 𝐹 𝐿 𝑆 𝐹 subscript 𝑆 10 𝑝 → 𝐹 Π \hat{F}_{LS}\xrightarrow{p}F\equiv S_{10}\xrightarrow{p}F\Pi over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_F ≡ italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_F roman_Π .
Under A1, A2: R F = Π − 1 2 F Π 1 2 subscript 𝑅 𝐹 superscript Π 1 2 𝐹 superscript Π 1 2 R_{F}=\Pi^{-\frac{1}{2}}F\Pi^{\frac{1}{2}} italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT = roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_F roman_Π start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT has rank m 𝑚 m italic_m and:
(iii)
V ^ * , m → 𝑝 V m 𝑝 → subscript ^ 𝑉 𝑚
subscript 𝑉 𝑚 \hat{V}_{*,m}\xrightarrow{p}V_{m} over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT the eigenvectors of R F R F ′ subscript 𝑅 𝐹 superscript subscript 𝑅 𝐹 ′ R_{F}R_{F}^{\prime} italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT .
U ^ * , m → 𝑝 U m 𝑝 → subscript ^ 𝑈 𝑚
subscript 𝑈 𝑚 \hat{U}_{*,m}\xrightarrow{p}U_{m} over^ start_ARG italic_U end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_U start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT the eigenvectors of R F ′ R F superscript subscript 𝑅 𝐹 ′ subscript 𝑅 𝐹 R_{F}^{\prime}R_{F} italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT .
λ ^ k → 𝑝 λ k and λ ^ * , k → 𝑝 λ k 𝑝 → subscript ^ 𝜆 𝑘 subscript 𝜆 𝑘 and subscript ^ 𝜆 𝑘
𝑝 → subscript 𝜆 𝑘 \hat{\lambda}_{k}\xrightarrow{p}\lambda_{k}\mbox{ and }\hat{\lambda}_{*,k}%
\xrightarrow{p}\lambda_{k} over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT the k t h superscript 𝑘 𝑡 ℎ k^{th} italic_k start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT largest eigenvalue of R F R F ′ subscript 𝑅 𝐹 superscript subscript 𝑅 𝐹 ′ R_{F}R_{F}^{\prime} italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT .
(iv)
G ^ * → 𝑝 R F = Π − 1 2 F Π 1 2 𝑝 → subscript ^ 𝐺 subscript 𝑅 𝐹 superscript Π 1 2 𝐹 superscript Π 1 2 \hat{G}_{*}\xrightarrow{p}R_{F}=\Pi^{-\frac{1}{2}}F\Pi^{\frac{1}{2}} over^ start_ARG italic_G end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT = roman_Π start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_F roman_Π start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT .
(v)
F ^ R L S → 𝑝 F = A B 𝑝 → subscript ^ 𝐹 𝑅 𝐿 𝑆 𝐹 𝐴 𝐵 \hat{F}_{RLS}\xrightarrow{p}F=AB over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_F = italic_A italic_B .
(vi)
T vec ( F ^ R L S ′ − F ′ ) ⇒ Z ∼ N ( 0 , Σ ) ⇒ 𝑇 vec superscript subscript ^ 𝐹 𝑅 𝐿 𝑆 ′ superscript 𝐹 ′ 𝑍 similar-to 𝑁 0 Σ \sqrt{T}\operatorname{vec}(\hat{F}_{RLS}^{\prime}-F^{\prime})\Rightarrow Z\sim
N%
(0,\Sigma) square-root start_ARG italic_T end_ARG roman_vec ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ⇒ italic_Z ∼ italic_N ( 0 , roman_Σ ) where
Σ = M W M ′ Σ 𝑀 𝑊 superscript 𝑀 ′ \Sigma=MWM^{\prime} roman_Σ = italic_M italic_W italic_M start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with M = [ ( I ⊗ B ′ ) , ( A ⊗ I ) ] 𝑀 tensor-product 𝐼 superscript 𝐵 ′ tensor-product 𝐴 𝐼 M=[(I\otimes B^{\prime}),(A\otimes I)] italic_M = [ ( italic_I ⊗ italic_B start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , ( italic_A ⊗ italic_I ) ] and W 𝑊 W italic_W
(which is complicated) is given in [20 ] (2.36).
Proof . (i),(ii) can be found in [8 ] [chapter 11].
(v),(vi) are from [20 ] [Section 2.5 (Theorem 2.4) and Section 5.2].
For (iii) first note that from (i),(ii) L ^ * → 𝑝 L = F Π F ′ Π − 1 𝑝 → subscript ^ 𝐿 𝐿 𝐹 Π superscript 𝐹 ′ superscript Π 1 \hat{L}_{*}\xrightarrow{p}L=F\Pi F^{\prime}\Pi^{-1} over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_L = italic_F roman_Π italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT
which has the same eigenvalues as R = R F R F ′ 𝑅 subscript 𝑅 𝐹 superscript subscript 𝑅 𝐹 ′ R=R_{F}R_{F}^{\prime} italic_R = italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT . The second part follows
in the same way.
Next,
since distinct eigenvalues and eigenvectors are continuous functions
of the underlying matrix entries
[13 ] then V ^ * , m → 𝑝 V m 𝑝 → subscript ^ 𝑉 𝑚
subscript 𝑉 𝑚 \hat{V}_{*,m}\xrightarrow{p}V_{m} over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT
containing the ‘top’ eigenvalues of R 𝑅 R italic_R .
Further, the eigenvalues → 𝑝 𝑝 → \xrightarrow{p} start_ARROW overitalic_p → end_ARROW as stated.
(iv) now follows from (iii).
To continue for F ^ R subscript ^ 𝐹 𝑅 \hat{F}_{R} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT , we need some lemmas.
Lemma 5B . Let Γ t = E [ y t y t ′ ] subscript Γ 𝑡 E subscript 𝑦 𝑡 superscript subscript 𝑦 𝑡 ′ \Gamma_{t}=\operatorname{E}[y_{t}y_{t}^{\prime}] roman_Γ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_E [ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] and denote ρ = 𝜌 absent \rho= italic_ρ = spectral radius of F 𝐹 F italic_F . Then
‖ Γ t − Π ‖ 2 ≤ c ρ 2 t subscript norm subscript Γ 𝑡 Π 2 𝑐 superscript 𝜌 2 𝑡 \|\Gamma_{t}-\Pi\|_{2}\leq c\rho^{2t} ∥ roman_Γ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - roman_Π ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_c italic_ρ start_POSTSUPERSCRIPT 2 italic_t end_POSTSUPERSCRIPT for some constant c 𝑐 c italic_c .
Proof . Taking variances in (2.1 ) gives
Γ t = F Γ t − 1 F ′ + Q subscript Γ 𝑡 𝐹 subscript Γ 𝑡 1 superscript 𝐹 ′ 𝑄 \Gamma_{t}=F\Gamma_{t-1}F^{\prime}+Q roman_Γ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_F roman_Γ start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_Q . Subtracting the Π Π \Pi roman_Π equation from this gives
Γ t − Π = F ( Γ t − 1 − Π ) F ′ subscript Γ 𝑡 Π 𝐹 subscript Γ 𝑡 1 Π superscript 𝐹 ′ \Gamma_{t}-\Pi=F(\Gamma_{t-1}-\Pi)F^{\prime} roman_Γ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - roman_Π = italic_F ( roman_Γ start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT - roman_Π ) italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT .
Iterating this gives Γ t − Π = F t D ( F ′ ) t subscript Γ 𝑡 Π superscript 𝐹 𝑡 𝐷 superscript superscript 𝐹 ′ 𝑡 \Gamma_{t}-\Pi=F^{t}D(F^{\prime})^{t} roman_Γ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - roman_Π = italic_F start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_D ( italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT where D = Γ 0 − Π 𝐷 subscript Γ 0 Π D=\Gamma_{0}-\Pi italic_D = roman_Γ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - roman_Π .
Now recall Gelfand’s theorem [9 ] : ‖ F t ‖ 1 / t → ρ → superscript norm superscript 𝐹 𝑡 1 𝑡 𝜌 \|F^{t}\|^{1/t}\rightarrow\rho ∥ italic_F start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 1 / italic_t end_POSTSUPERSCRIPT → italic_ρ ,
so that l i m s u p t ‖ ( Γ t − Π ) / ρ 2 t ‖ 1 / t ≤ c 𝑙 𝑖 𝑚 𝑠 𝑢 subscript 𝑝 𝑡 superscript norm subscript Γ 𝑡 Π superscript 𝜌 2 𝑡 1 𝑡 𝑐 limsup_{t}\|(\Gamma_{t}-\Pi)/\rho^{2t}\|^{1/t}\leq c italic_l italic_i italic_m italic_s italic_u italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ( roman_Γ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - roman_Π ) / italic_ρ start_POSTSUPERSCRIPT 2 italic_t end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 1 / italic_t end_POSTSUPERSCRIPT ≤ italic_c .
□ □ \square □
The result follows from this.
Lemma 5C . Under A1: T ‖ S 11 − S 00 ‖ → 𝑝 0 𝑝 → 𝑇 norm subscript 𝑆 11 subscript 𝑆 00 0 \sqrt{T}\|S_{11}-S_{00}\|\xrightarrow{p}0 square-root start_ARG italic_T end_ARG ∥ italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ∥ start_ARROW overitalic_p → end_ARROW 0 .
Proof . We have
S 11 − S 00 = 1 T [ y T y T ′ − y 0 y 0 ′ ] ⇒ ‖ S 11 − S 00 ‖ 2 ≤ 2 T ( ‖ y T ‖ 2 + ‖ y 0 ‖ 2 ) ⇒ T E ‖ S 11 − S 00 ‖ 2 ≤ 2 T [ tr ( Γ T ) + tr ( Γ 1 ) ] missing-subexpression subscript 𝑆 11 subscript 𝑆 00 1 𝑇 delimited-[] subscript 𝑦 𝑇 superscript subscript 𝑦 𝑇 ′ subscript 𝑦 0 superscript subscript 𝑦 0 ′ ⇒ superscript norm subscript 𝑆 11 subscript 𝑆 00 2 2 𝑇 superscript norm subscript 𝑦 𝑇 2 superscript norm subscript 𝑦 0 2 ⇒ 𝑇 E superscript norm subscript 𝑆 11 subscript 𝑆 00 2 2 𝑇 delimited-[] tr subscript Γ 𝑇 tr subscript Γ 1 \displaystyle\begin{array}[]{rrcl}&S_{11}-S_{00}&=&\frac{1}{T}[y_{T}y_{T}^{%
\prime}-y_{0}y_{0}^{\prime}]\\
\Rightarrow&\|S_{11}-S_{00}\|^{2}&\leq&\frac{2}{T}(\|y_{T}\|^{2}+\|y_{0}\|^{2}%
)\\
\Rightarrow&\sqrt{T}\operatorname{E}\|S_{11}-S_{00}\|^{2}&\leq&\frac{2}{\sqrt{%
T}}[\operatorname{tr}(\Gamma_{T})+\operatorname{tr}(\Gamma_{1})]\end{array} start_ARRAY start_ROW start_CELL end_CELL start_CELL italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_T end_ARG [ italic_y start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL ∥ italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL ≤ end_CELL start_CELL divide start_ARG 2 end_ARG start_ARG italic_T end_ARG ( ∥ italic_y start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∥ italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL square-root start_ARG italic_T end_ARG roman_E ∥ italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL ≤ end_CELL start_CELL divide start_ARG 2 end_ARG start_ARG square-root start_ARG italic_T end_ARG end_ARG [ roman_tr ( roman_Γ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) + roman_tr ( roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ] end_CELL end_ROW end_ARRAY
The second term → 0 → absent 0 \rightarrow 0 → 0 while the first term → 0 → absent 0 \rightarrow 0 → 0 from Lemma 5B.
The result then follows.
□ □ \square □
Lemma 5D . Under A1, A2: T ‖ F ^ L S − F ^ 11 ‖ → 𝑝 0 𝑝 → 𝑇 norm subscript ^ 𝐹 𝐿 𝑆 subscript ^ 𝐹 11 0 \sqrt{T}\|\hat{F}_{LS}-\hat{F}_{11}\|\xrightarrow{p}0 square-root start_ARG italic_T end_ARG ∥ over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ∥ start_ARROW overitalic_p → end_ARROW 0 .
Proof . We have
T ( F ^ L S − F ^ 11 ) = T ( S 10 S 00 − 1 − 2 S 10 ( S 00 + S 11 ) − 1 ) = T 2 S 10 [ ( 2 S 00 ) − 1 − ( S 11 + S 00 ) − 1 ] = T S 10 S 00 − 1 [ S 00 + S 11 − 2 S 00 ] ( S 00 + S 11 ) − 1 = F ^ L S [ T ( S 11 − S 00 ) ] ( S 11 + S 00 ) − 1 missing-subexpression missing-subexpression 𝑇 subscript ^ 𝐹 𝐿 𝑆 subscript ^ 𝐹 11 missing-subexpression 𝑇 subscript 𝑆 10 superscript subscript 𝑆 00 1 2 subscript 𝑆 10 superscript subscript 𝑆 00 subscript 𝑆 11 1 missing-subexpression 𝑇 2 subscript 𝑆 10 delimited-[] superscript 2 subscript 𝑆 00 1 superscript subscript 𝑆 11 subscript 𝑆 00 1 missing-subexpression 𝑇 subscript 𝑆 10 superscript subscript 𝑆 00 1 delimited-[] subscript 𝑆 00 subscript 𝑆 11 2 subscript 𝑆 00 superscript subscript 𝑆 00 subscript 𝑆 11 1 missing-subexpression subscript ^ 𝐹 𝐿 𝑆 delimited-[] 𝑇 subscript 𝑆 11 subscript 𝑆 00 superscript subscript 𝑆 11 subscript 𝑆 00 1 \displaystyle\begin{array}[]{rcl}&&\sqrt{T}(\hat{F}_{LS}-\hat{F}_{11})\\
&=&\sqrt{T}(S_{10}S_{00}^{-1}-2S_{10}(S_{00}+S_{11})^{-1})\\
&=&\sqrt{T}2S_{10}[(2S_{00})^{-1}-(S_{11}+S_{00})^{-1}]\\
&=&\sqrt{T}S_{10}S_{00}^{-1}[S_{00}+S_{11}-2S_{00}](S_{00}+S_{11})^{-1}\\
&=&\hat{F}_{LS}[\sqrt{T}(S_{11}-S_{00})](S_{11}+S_{00})^{-1}\end{array} start_ARRAY start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = end_CELL start_CELL square-root start_ARG italic_T end_ARG ( italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - 2 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = end_CELL start_CELL square-root start_ARG italic_T end_ARG 2 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT [ ( 2 italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - ( italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = end_CELL start_CELL square-root start_ARG italic_T end_ARG italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT [ italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - 2 italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ] ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = end_CELL start_CELL over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT [ square-root start_ARG italic_T end_ARG ( italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ) ] ( italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY
Now from Theorem 5A and Lemma 5C
each term converges in probability, so the product also converges in probability.
But T ( S 11 − S 00 ) → 𝑝 0 𝑝 → 𝑇 subscript 𝑆 11 subscript 𝑆 00 0 \sqrt{T}(S_{11}-S_{00})\xrightarrow{p}0 square-root start_ARG italic_T end_ARG ( italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ) start_ARROW overitalic_p → end_ARROW 0 yielding the result.
Theorem 5E . Under A1, A2,
(i)
F ^ R → 𝑝 F 𝑝 → subscript ^ 𝐹 𝑅 𝐹 \hat{F}_{R}\xrightarrow{p}F over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_F .
(ii)
T ( F ^ R L S − F ^ R ) → 𝑝 0 𝑝 → 𝑇 subscript ^ 𝐹 𝑅 𝐿 𝑆 subscript ^ 𝐹 𝑅 0 \sqrt{T}(\hat{F}_{RLS}-\hat{F}_{R})\xrightarrow{p}0 square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) start_ARROW overitalic_p → end_ARROW 0 .
(iii)
T vec ( F R ′ − F ′ ) ⇒ Z ∼ N ( 0 , Σ ) ⇒ 𝑇 vec superscript subscript 𝐹 𝑅 ′ superscript 𝐹 ′ 𝑍 similar-to 𝑁 0 Σ \sqrt{T}\operatorname{vec}(F_{R}^{\prime}-F^{\prime})\Rightarrow Z\sim N(0,\Sigma) square-root start_ARG italic_T end_ARG roman_vec ( italic_F start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ⇒ italic_Z ∼ italic_N ( 0 , roman_Σ ) .
Proof . (i) follows from (ii) and Theorem 5A(v).
(iii) follows from (ii) and Theorem 5A(vi).
(ii) is proved in the appendix.
□ □ \square □
Figure 1: Pole locations of the first 50 50 50 50 repeats.
Figure 2: Pole magnitude histograms of the 1000 1000 1000 1000 repeats: * * * are the true poles.
Unstable poles for LS for each T 𝑇 T italic_T : 25.9 % , 6.9 % , 0.2 % percent 25.9 percent 6.9 percent 0.2
25.9\%,6.9\%,0.2\% 25.9 % , 6.9 % , 0.2 % .
8 Appendix: Proofs
We use the following reduced-rank least squares lemma.
Lemma R . [20 ] [Theorem 2.2]
C ^ ^ 𝐶 \displaystyle\hat{C} over^ start_ARG italic_C end_ARG
= arg min C m × n : rank ( C ) = m tr { ( Z − C X ) ′ P − 1 ( Z − C X ) } absent subscript : subscript 𝐶 𝑚 𝑛 rank 𝐶 𝑚 tr superscript 𝑍 𝐶 𝑋 ′ superscript 𝑃 1 𝑍 𝐶 𝑋 \displaystyle=\arg\min_{C_{m\times n}:\operatorname{rank}(C)=m}\operatorname{%
tr}\{(Z-CX)^{\prime}P^{-1}(Z-CX)\} = roman_arg roman_min start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_m × italic_n end_POSTSUBSCRIPT : roman_rank ( italic_C ) = italic_m end_POSTSUBSCRIPT roman_tr { ( italic_Z - italic_C italic_X ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_Z - italic_C italic_X ) }
= P 1 2 V ^ m V ^ m ′ P − 1 2 Z X ′ ( X X ′ ) − 1 . absent superscript 𝑃 1 2 subscript ^ 𝑉 𝑚 superscript subscript ^ 𝑉 𝑚 ′ superscript 𝑃 1 2 𝑍 superscript 𝑋 ′ superscript 𝑋 superscript 𝑋 ′ 1 \displaystyle=P^{\frac{1}{2}}\hat{V}_{m}\hat{V}_{m}^{\prime}P^{-\frac{1}{2}}ZX%
^{\prime}(XX^{\prime})^{-1}. = italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_Z italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_X italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
where V ^ m = [ v ^ 1 , ⋯ , v ^ m ] subscript ^ 𝑉 𝑚 subscript ^ 𝑣 1 ⋯ subscript ^ 𝑣 𝑚
\hat{V}_{m}=[\hat{v}_{1},\dotsm,\hat{v}_{m}] over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = [ over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ] with
v ^ r subscript ^ 𝑣 𝑟 \hat{v}_{r} over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT the eigenvector corresponding to the r 𝑟 r italic_r -th largest eigenvalue of
the matrix
R ^ = P − 1 2 Z X ′ ( X X ′ ) − 1 X Z ′ P − 1 2 ^ 𝑅 superscript 𝑃 1 2 𝑍 superscript 𝑋 ′ superscript 𝑋 superscript 𝑋 ′ 1 𝑋 superscript 𝑍 ′ superscript 𝑃 1 2 \hat{R}=P^{-\frac{1}{2}}ZX^{\prime}(XX^{\prime})^{-1}XZ^{\prime}P^{-\frac{1}{2}} over^ start_ARG italic_R end_ARG = italic_P start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_Z italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_X italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_X italic_Z start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT .
Proof of Theorem 4B
We show that J ( F ; S 11 ) 𝐽 𝐹 subscript 𝑆 11
J(F;S_{11}) italic_J ( italic_F ; italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) has the form specified in Lemma R.
First, reorganize J ( F ; P ) 𝐽 𝐹 𝑃
J(F;P) italic_J ( italic_F ; italic_P ) to get
J ( F ; P ) 𝐽 𝐹 𝑃
\displaystyle J(F;P) italic_J ( italic_F ; italic_P )
= \displaystyle= =
tr { P − 1 ( S w , f + S w , b ) } tr superscript 𝑃 1 subscript 𝑆 𝑤 𝑓
subscript 𝑆 𝑤 𝑏
\displaystyle\operatorname{tr}\{P^{-1}(S_{w,f}+S_{w,b})\} roman_tr { italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_S start_POSTSUBSCRIPT italic_w , italic_f end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT ) }
= \displaystyle= =
tr { P − 1 ( − 4 S 10 F ′ + F S 00 F ′ + F P F ′ P − 1 S 11 ) } , tr superscript 𝑃 1 4 subscript 𝑆 10 superscript 𝐹 ′ 𝐹 subscript 𝑆 00 superscript 𝐹 ′ 𝐹 𝑃 superscript 𝐹 ′ superscript 𝑃 1 subscript 𝑆 11 \displaystyle\operatorname{tr}\{P^{-1}(-4S_{10}F^{\prime}+FS_{00}F^{\prime}+%
FPF^{\prime}P^{-1}S_{11})\}, roman_tr { italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( - 4 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_F italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_F italic_P italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) } ,
where we dropped a constant term not dependent on F 𝐹 F italic_F .
Now, set P = S 11 𝑃 subscript 𝑆 11 P=S_{11} italic_P = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT to get
J ( F ; S 11 ) 𝐽 𝐹 subscript 𝑆 11
\displaystyle J(F;S_{11}) italic_J ( italic_F ; italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT )
= \displaystyle= =
tr { S 11 − 1 ( − 4 S 10 F ′ + F S 00 F ′ + F S 11 F ′ ) } tr superscript subscript 𝑆 11 1 4 subscript 𝑆 10 superscript 𝐹 ′ 𝐹 subscript 𝑆 00 superscript 𝐹 ′ 𝐹 subscript 𝑆 11 superscript 𝐹 ′ \displaystyle\operatorname{tr}\{S_{11}^{-1}(-4S_{10}F^{\prime}+FS_{00}F^{%
\prime}+FS_{11}F^{\prime})\} roman_tr { italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( - 4 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_F italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_F italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) }
= \displaystyle= =
tr { S 11 − 1 ( − 4 S 10 F ′ + F ( S 00 + S 11 ) F ′ } \displaystyle\operatorname{tr}\{S_{11}^{-1}(-4S_{10}F^{\prime}+F(S_{00}+S_{11}%
)F^{\prime}\} roman_tr { italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( - 4 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_F ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) italic_F start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT }
= \displaystyle= =
‖ S 11 − 1 2 ( 2 S 10 ( S 00 + S 11 ) − 1 2 − F ( S 00 + S 11 ) 1 2 ) ‖ 2 , superscript norm superscript subscript 𝑆 11 1 2 2 subscript 𝑆 10 superscript subscript 𝑆 00 subscript 𝑆 11 1 2 𝐹 superscript subscript 𝑆 00 subscript 𝑆 11 1 2 2 \displaystyle\|S_{11}^{-\frac{1}{2}}(2S_{10}(S_{00}+S_{11})^{-\frac{1}{2}}-F(S%
_{00}+S_{11})^{\frac{1}{2}})\|^{2}, ∥ italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( 2 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT - italic_F ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,
where again we dropped a constant term not dependent on F 𝐹 F italic_F .
Now apply Lemma R to get the first quoted result.
We now prove that Q * = S 11 − 1 − F ^ R ′ S 11 − 1 F ^ R subscript 𝑄 superscript subscript 𝑆 11 1 superscript subscript ^ 𝐹 𝑅 ′ superscript subscript 𝑆 11 1 subscript ^ 𝐹 𝑅 Q_{*}=S_{11}^{-1}-\hat{F}_{R}^{\prime}S_{11}^{-1}\hat{F}_{R} italic_Q start_POSTSUBSCRIPT * end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT is positive definite.
Then Lyapunov’s theorem gives that F ^ R subscript ^ 𝐹 𝑅 \hat{F}_{R} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT is stable.
We have
Q * subscript 𝑄 \displaystyle Q_{*} italic_Q start_POSTSUBSCRIPT * end_POSTSUBSCRIPT
= ( S 11 − 1 − F ^ 11 ′ S 11 − 1 F ^ 11 ) + ( F ^ 11 ′ S 11 − 1 F ^ 11 − F ^ R ′ S 11 − 1 F ^ R ) absent superscript subscript 𝑆 11 1 superscript subscript ^ 𝐹 11 ′ superscript subscript 𝑆 11 1 subscript ^ 𝐹 11 superscript subscript ^ 𝐹 11 ′ superscript subscript 𝑆 11 1 subscript ^ 𝐹 11 superscript subscript ^ 𝐹 𝑅 ′ superscript subscript 𝑆 11 1 subscript ^ 𝐹 𝑅 \displaystyle=(S_{11}^{-1}-\hat{F}_{11}^{\prime}S_{11}^{-1}\hat{F}_{11})+(\hat%
{F}_{11}^{\prime}S_{11}^{-1}\hat{F}_{11}-\hat{F}_{R}^{\prime}S_{11}^{-1}\hat{F%
}_{R}) = ( italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) + ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT )
= S 11 − 1 U b S 11 − 1 + ( F ^ 11 ′ S 11 − 1 F ^ 11 − F ^ R ′ S 11 − 1 F ^ R ) absent superscript subscript 𝑆 11 1 subscript 𝑈 𝑏 superscript subscript 𝑆 11 1 superscript subscript ^ 𝐹 11 ′ superscript subscript 𝑆 11 1 subscript ^ 𝐹 11 superscript subscript ^ 𝐹 𝑅 ′ superscript subscript 𝑆 11 1 subscript ^ 𝐹 𝑅 \displaystyle=S_{11}^{-1}U_{b}S_{11}^{-1}+(\hat{F}_{11}^{\prime}S_{11}^{-1}%
\hat{F}_{11}-\hat{F}_{R}^{\prime}S_{11}^{-1}\hat{F}_{R}) = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT )
U b subscript 𝑈 𝑏 \displaystyle U_{b} italic_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT
= S 11 − F ^ b S 11 F ^ b ′ absent subscript 𝑆 11 subscript ^ 𝐹 𝑏 subscript 𝑆 11 superscript subscript ^ 𝐹 𝑏 ′ \displaystyle=S_{11}-\hat{F}_{b}S_{11}\hat{F}_{b}^{\prime} = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT
F ^ b subscript ^ 𝐹 𝑏 \displaystyle\hat{F}_{b} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT
= S 11 F ^ 11 ′ S 11 − 1 absent subscript 𝑆 11 superscript subscript ^ 𝐹 11 ′ superscript subscript 𝑆 11 1 \displaystyle=S_{11}\hat{F}_{11}^{\prime}S_{11}^{-1} = italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT
For the second term we have
F ^ 11 ′ S 11 − 1 F ^ 11 − F ^ R ′ S 11 − 1 F ^ R superscript subscript ^ 𝐹 11 ′ superscript subscript 𝑆 11 1 subscript ^ 𝐹 11 superscript subscript ^ 𝐹 𝑅 ′ superscript subscript 𝑆 11 1 subscript ^ 𝐹 𝑅 \displaystyle\hat{F}_{11}^{\prime}S_{11}^{-1}\hat{F}_{11}-\hat{F}_{R}^{\prime}%
S_{11}^{-1}\hat{F}_{R} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT
= \displaystyle= =
F ^ 11 ′ S 11 − 1 2 ( I − V ^ m V ^ m ′ ) S 11 − 1 2 F ^ 11 ≥ 0 . superscript subscript ^ 𝐹 11 ′ superscript subscript 𝑆 11 1 2 𝐼 subscript ^ 𝑉 𝑚 superscript subscript ^ 𝑉 𝑚 ′ superscript subscript 𝑆 11 1 2 subscript ^ 𝐹 11 0 \displaystyle\hat{F}_{11}^{\prime}S_{11}^{-\frac{1}{2}}(I-\hat{V}_{m}\hat{V}_{%
m}^{\prime})S_{11}^{-\frac{1}{2}}\hat{F}_{11}\geq 0. over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( italic_I - over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ≥ 0 .
For the first term, we have
2 S 10 = F ^ 11 S 00 + F ^ 11 S 11 ⇒ 2 S 01 = S 00 F ^ 11 ′ + S 11 F ^ 11 ′ ⇒ 2 S 01 = S 00 S 11 − 1 F ^ b S 11 + F ^ b S 11 ⇒ − S 00 S 11 − 1 F ^ b S 11 F ^ b ′ = − 2 S 01 F ^ b ′ + F ^ b S 11 F ^ b ′ ⇒ − Φ U b = S 00 − 2 S 01 F ^ b + F ^ b S 11 F ^ b ′ . missing-subexpression 2 subscript 𝑆 10 subscript ^ 𝐹 11 subscript 𝑆 00 subscript ^ 𝐹 11 subscript 𝑆 11 ⇒ 2 subscript 𝑆 01 subscript 𝑆 00 superscript subscript ^ 𝐹 11 ′ subscript 𝑆 11 superscript subscript ^ 𝐹 11 ′ ⇒ 2 subscript 𝑆 01 subscript 𝑆 00 superscript subscript 𝑆 11 1 subscript ^ 𝐹 𝑏 subscript 𝑆 11 subscript ^ 𝐹 𝑏 subscript 𝑆 11 ⇒ subscript 𝑆 00 superscript subscript 𝑆 11 1 subscript ^ 𝐹 𝑏 subscript 𝑆 11 superscript subscript ^ 𝐹 𝑏 ′ 2 subscript 𝑆 01 superscript subscript ^ 𝐹 𝑏 ′ subscript ^ 𝐹 𝑏 subscript 𝑆 11 superscript subscript ^ 𝐹 𝑏 ′ ⇒ Φ subscript 𝑈 𝑏 subscript 𝑆 00 2 subscript 𝑆 01 subscript ^ 𝐹 𝑏 subscript ^ 𝐹 𝑏 subscript 𝑆 11 superscript subscript ^ 𝐹 𝑏 ′ \displaystyle\begin{array}[]{rrcl}&2S_{10}&=&\hat{F}_{11}S_{00}+\hat{F}_{11}S_%
{11}\\
\Rightarrow&2S_{01}&=&S_{00}\hat{F}_{11}^{\prime}+S_{11}\hat{F}_{11}^{\prime}%
\\
\Rightarrow&2S_{01}&=&S_{00}S_{11}^{-1}\hat{F}_{b}S_{11}+\hat{F}_{b}S_{11}\\
\Rightarrow&-S_{00}S_{11}^{-1}\hat{F}_{b}S_{11}\hat{F}_{b}^{\prime}&=&-2S_{01}%
\hat{F}_{b}^{\prime}+\hat{F}_{b}S_{11}\hat{F}_{b}^{\prime}\\
\Rightarrow&-\Phi U_{b}&=&S_{00}-2S_{01}\hat{F}_{b}+\hat{F}_{b}S_{11}\hat{F}_{%
b}^{\prime}.\end{array} start_ARRAY start_ROW start_CELL end_CELL start_CELL 2 italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL 2 italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL 2 italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT + over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL start_CELL = end_CELL start_CELL - 2 italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL ⇒ end_CELL start_CELL - roman_Φ italic_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT - 2 italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT + over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT . end_CELL end_ROW end_ARRAY
where Φ = − S 00 S 11 − 1 Φ subscript 𝑆 00 superscript subscript 𝑆 11 1 \Phi=-S_{00}S_{11}^{-1} roman_Φ = - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .
Now take the transpose and add the two equations together to get
− Φ U b + U b ( − Φ ) ′ = Φ subscript 𝑈 𝑏 subscript 𝑈 𝑏 superscript Φ ′ absent \displaystyle-\Phi U_{b}+U_{b}(-\Phi)^{\prime}= - roman_Φ italic_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT + italic_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ( - roman_Φ ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT =
2 [ S 00 − S 01 F ^ b ′ − F ^ b S 10 + F ^ b S 11 F ^ b ′ ] 2 delimited-[] subscript 𝑆 00 subscript 𝑆 01 superscript subscript ^ 𝐹 𝑏 ′ subscript ^ 𝐹 𝑏 subscript 𝑆 10 subscript ^ 𝐹 𝑏 subscript 𝑆 11 superscript subscript ^ 𝐹 𝑏 ′ \displaystyle 2[S_{00}-S_{01}\hat{F}_{b}^{\prime}-\hat{F}_{b}S_{10}+\hat{F}_{b%
}S_{11}\hat{F}_{b}^{\prime}] 2 [ italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT + over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ]
= \displaystyle= =
2 T ( Y 0 − F ^ b Y 1 ) ( Y 0 − F ^ b Y 1 ) ′ = 2 S ^ w , b 2 𝑇 subscript 𝑌 0 subscript ^ 𝐹 𝑏 subscript 𝑌 1 superscript subscript 𝑌 0 subscript ^ 𝐹 𝑏 subscript 𝑌 1 ′ 2 subscript ^ 𝑆 𝑤 𝑏
\displaystyle\frac{2}{T}(Y_{0}-\hat{F}_{b}Y_{1})(Y_{0}-\hat{F}_{b}Y_{1})^{%
\prime}=2\hat{S}_{w,b} divide start_ARG 2 end_ARG start_ARG italic_T end_ARG ( italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ( italic_Y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 2 over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT
which we show below is positive definite.
But note that this is a continuous time Lyapunov equation
where Φ Φ \Phi roman_Φ is a stability matrix - it has the
same eigenvalues as − S 00 1 2 S 11 − 1 S 00 1 2 superscript subscript 𝑆 00 1 2 superscript subscript 𝑆 11 1 superscript subscript 𝑆 00 1 2 -S_{00}^{\frac{1}{2}}S_{11}^{-1}S_{00}^{\frac{1}{2}} - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT
which is negative definite with, therefore, negative real eigenvalues.
Thus U b subscript 𝑈 𝑏 U_{b} italic_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT must be positive definite and the
result is established.
To see that S ^ w , b subscript ^ 𝑆 𝑤 𝑏
\hat{S}_{w,b} over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT is positive definite first note that
from A1 and Theorem 2 ⇒ Q b ⇒ absent subscript 𝑄 𝑏 \Rightarrow Q_{b} ⇒ italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT is positive definite.
Now the backward least squares estimator
F ^ b , L S = S 01 S 11 − 1 subscript ^ 𝐹 𝑏 𝐿 𝑆
subscript 𝑆 01 superscript subscript 𝑆 11 1 \hat{F}_{b,LS}=S_{01}S_{11}^{-1} over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT minimises S w , b ( F ) subscript 𝑆 𝑤 𝑏
𝐹 S_{w,b}(F) italic_S start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT ( italic_F ) .
Then for any F 𝐹 F italic_F , completion of squares gives
S w , b ( F ) = Q ^ b , L S + ( F ^ b , L S − F ) S 11 ( F ^ b , L S − F ) ′ subscript 𝑆 𝑤 𝑏
𝐹 subscript ^ 𝑄 𝑏 𝐿 𝑆
subscript ^ 𝐹 𝑏 𝐿 𝑆
𝐹 subscript 𝑆 11 superscript subscript ^ 𝐹 𝑏 𝐿 𝑆
𝐹 ′ \displaystyle S_{w,b}(F)=\hat{Q}_{b,LS}+(\hat{F}_{b,LS}-F)S_{11}(\hat{F}_{b,LS%
}-F)^{\prime} italic_S start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT ( italic_F ) = over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT + ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT - italic_F ) italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT - italic_F ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT
where Q ^ b , L S = S w , b ( F ^ b , L S ) subscript ^ 𝑄 𝑏 𝐿 𝑆
subscript 𝑆 𝑤 𝑏
subscript ^ 𝐹 𝑏 𝐿 𝑆
\hat{Q}_{b,LS}=S_{w,b}(\hat{F}_{b,LS}) over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT ) . Set F = 0 𝐹 0 F=0 italic_F = 0 to get
Q ^ b , L S = S 00 − S 01 S 11 − 1 S 10 subscript ^ 𝑄 𝑏 𝐿 𝑆
subscript 𝑆 00 subscript 𝑆 01 superscript subscript 𝑆 11 1 subscript 𝑆 10 \hat{Q}_{b,LS}=S_{00}-S_{01}S_{11}^{-1}S_{10} over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT which → 𝑝 Q b 𝑝 → absent subscript 𝑄 𝑏 \xrightarrow{p}Q_{b} start_ARROW overitalic_p → end_ARROW italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT .
Now since Q b subscript 𝑄 𝑏 Q_{b} italic_Q start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT is full rank and the eigenvalues of Q ^ b , L S subscript ^ 𝑄 𝑏 𝐿 𝑆
\hat{Q}_{b,LS} over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT are
continuous functions of the entries in Q ^ b , L S subscript ^ 𝑄 𝑏 𝐿 𝑆
\hat{Q}_{b,LS} over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT they cannot
accumulate mass at 0 0 i.e. the smallest eigenvalue of Q ^ b , L S subscript ^ 𝑄 𝑏 𝐿 𝑆
\hat{Q}_{b,LS} over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT
is 0 0 with zero probability. So Q ^ b , L S subscript ^ 𝑄 𝑏 𝐿 𝑆
\hat{Q}_{b,LS} over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT is positive definite w.p.1.
But setting F = F ^ b 𝐹 subscript ^ 𝐹 𝑏 F=\hat{F}_{b} italic_F = over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT above, we see that
S ^ w , b ≥ Q ^ b , L S subscript ^ 𝑆 𝑤 𝑏
subscript ^ 𝑄 𝑏 𝐿 𝑆
\hat{S}_{w,b}\geq\hat{Q}_{b,LS} over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT italic_w , italic_b end_POSTSUBSCRIPT ≥ over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT italic_b , italic_L italic_S end_POSTSUBSCRIPT and so is positive definite w.p.1 as required.
We have
T ( F ^ R L S − F ^ R ) = T S 11 1 2 ( V ^ * , m V ^ * , m ′ S 11 − 1 2 F ^ L S − V ^ m V ^ m ′ S 11 − 1 2 F ^ 11 ) = S 11 1 2 V ^ * , m V ^ * , m ′ S 11 − 1 2 ( F ^ L S − F ^ 11 ) T + S 11 1 2 T ( V ^ * , m V ^ * , m ′ − V ^ m V ^ m ′ ) S 11 − 1 2 F ^ 11 missing-subexpression missing-subexpression missing-subexpression 𝑇 subscript ^ 𝐹 𝑅 𝐿 𝑆 subscript ^ 𝐹 𝑅 missing-subexpression missing-subexpression 𝑇 superscript subscript 𝑆 11 1 2 subscript ^ 𝑉 𝑚
superscript subscript ^ 𝑉 𝑚
′ superscript subscript 𝑆 11 1 2 subscript ^ 𝐹 𝐿 𝑆 subscript ^ 𝑉 𝑚 superscript subscript ^ 𝑉 𝑚 ′ superscript subscript 𝑆 11 1 2 subscript ^ 𝐹 11 missing-subexpression missing-subexpression superscript subscript 𝑆 11 1 2 subscript ^ 𝑉 𝑚
superscript subscript ^ 𝑉 𝑚
′ superscript subscript 𝑆 11 1 2 subscript ^ 𝐹 𝐿 𝑆 subscript ^ 𝐹 11 𝑇 missing-subexpression missing-subexpression superscript subscript 𝑆 11 1 2 𝑇 subscript ^ 𝑉 𝑚
superscript subscript ^ 𝑉 𝑚
′ subscript ^ 𝑉 𝑚 superscript subscript ^ 𝑉 𝑚 ′ superscript subscript 𝑆 11 1 2 subscript ^ 𝐹 11 \displaystyle\begin{array}[]{rrcl}&&&\sqrt{T}(\hat{F}_{RLS}-\hat{F}_{R})\\
&&=&\sqrt{T}S_{11}^{\frac{1}{2}}(\hat{V}_{*,m}\hat{V}_{*,m}^{\prime}S_{11}^{-%
\frac{1}{2}}\hat{F}_{LS}-\hat{V}_{m}\hat{V}_{m}^{\prime}S_{11}^{-\frac{1}{2}}%
\hat{F}_{11})\\
&&=&S_{11}^{\frac{1}{2}}\hat{V}_{*,m}\hat{V}_{*,m}^{\prime}S_{11}^{-\frac{1}{2%
}}(\hat{F}_{LS}-\hat{F}_{11})\sqrt{T}\\
&&+&S_{11}^{\frac{1}{2}}\sqrt{T}(\hat{V}_{*,m}\hat{V}_{*,m}^{\prime}-\hat{V}_{%
m}\hat{V}_{m}^{\prime})S_{11}^{-\frac{1}{2}}\hat{F}_{11}\end{array} start_ARRAY start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R italic_L italic_S end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL = end_CELL start_CELL square-root start_ARG italic_T end_ARG italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT - over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL = end_CELL start_CELL italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_L italic_S end_POSTSUBSCRIPT - over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) square-root start_ARG italic_T end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL + end_CELL start_CELL italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY
From Theorem A, Lemma 5D, and the fact that ‖ V ^ * , m V ^ * , m ′ ‖ 2 = m , superscript norm subscript ^ 𝑉 𝑚
superscript subscript ^ 𝑉 𝑚
′ 2 𝑚 \|\hat{V}_{*,m}\hat{V}_{*,m}^{\prime}\|^{2}=m, ∥ over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_m ,
we get the first term → 𝑝 0 𝑝 → absent 0 \xrightarrow{p}0 start_ARROW overitalic_p → end_ARROW 0 .
For the second term, using Theorem 5A, Theorem 3B we are reduced to showing
T ( V ^ * , m V ^ * , m ′ − V ^ m V ^ m ′ ) → 𝑝 0 𝑝 → 𝑇 subscript ^ 𝑉 𝑚
superscript subscript ^ 𝑉 𝑚
′ subscript ^ 𝑉 𝑚 superscript subscript ^ 𝑉 𝑚 ′ 0 \sqrt{T}(\hat{V}_{*,m}\hat{V}_{*,m}^{\prime}-\hat{V}_{m}\hat{V}_{m}^{\prime})%
\xrightarrow{p}0 square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_ARROW overitalic_p → end_ARROW 0 .
Note that as in the proof of Theorem 5A we deduce that V ^ m → 𝑝 V m 𝑝 → subscript ^ 𝑉 𝑚 subscript 𝑉 𝑚 \hat{V}_{m}\xrightarrow{p}V_{m} over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT .
Next set S ¯ = 1 2 ( S 00 + S 11 ) ¯ 𝑆 1 2 subscript 𝑆 00 subscript 𝑆 11 \bar{S}=\frac{1}{2}(S_{00}+S_{11}) over¯ start_ARG italic_S end_ARG = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) and consider that
T ( R ^ − R ^ * ) = T S 11 − 1 2 S 10 [ S ¯ − 1 − S 00 − 1 ] S 10 ′ S 11 − 1 2 = T S 11 − 1 2 S 10 S 00 − 1 ( S 00 − S ¯ ) S ¯ − 1 S 10 ′ S 11 1 2 = T S 11 − 1 2 S 10 S 00 − 1 1 2 ( S 00 − S 11 ) S ¯ − 1 S 10 ′ S 11 − 1 2 missing-subexpression missing-subexpression missing-subexpression 𝑇 ^ 𝑅 subscript ^ 𝑅 missing-subexpression missing-subexpression 𝑇 superscript subscript 𝑆 11 1 2 subscript 𝑆 10 delimited-[] superscript ¯ 𝑆 1 superscript subscript 𝑆 00 1 superscript subscript 𝑆 10 ′ superscript subscript 𝑆 11 1 2 missing-subexpression missing-subexpression 𝑇 superscript subscript 𝑆 11 1 2 subscript 𝑆 10 superscript subscript 𝑆 00 1 subscript 𝑆 00 ¯ 𝑆 superscript ¯ 𝑆 1 superscript subscript 𝑆 10 ′ superscript subscript 𝑆 11 1 2 missing-subexpression missing-subexpression 𝑇 superscript subscript 𝑆 11 1 2 subscript 𝑆 10 superscript subscript 𝑆 00 1 1 2 subscript 𝑆 00 subscript 𝑆 11 superscript ¯ 𝑆 1 superscript subscript 𝑆 10 ′ superscript subscript 𝑆 11 1 2 \displaystyle\begin{array}[]{rrcl}&&&\sqrt{T}(\hat{R}-\hat{R}_{*})\\
&&=&\sqrt{T}S_{11}^{-\frac{1}{2}}S_{10}[\bar{S}^{-1}-S_{00}^{-1}]S_{10}^{%
\prime}S_{11}^{-\frac{1}{2}}\\
&&=&\sqrt{T}S_{11}^{-\frac{1}{2}}S_{10}S_{00}^{-1}(S_{00}-\bar{S})\bar{S}^{-1}%
S_{10}^{\prime}S_{11}^{\frac{1}{2}}\\
&&=&\sqrt{T}S_{11}^{-\frac{1}{2}}S_{10}S_{00}^{-1}\frac{1}{2}(S_{00}-S_{11})%
\bar{S}^{-1}S_{10}^{\prime}S_{11}^{-\frac{1}{2}}\end{array} start_ARRAY start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_R end_ARG - over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL = end_CELL start_CELL square-root start_ARG italic_T end_ARG italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT [ over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ] italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL = end_CELL start_CELL square-root start_ARG italic_T end_ARG italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT - over¯ start_ARG italic_S end_ARG ) over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL = end_CELL start_CELL square-root start_ARG italic_T end_ARG italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_S start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT - italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ) over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY
and we find T ( R ^ − R ^ * ) → 𝑝 0 𝑝 → 𝑇 ^ 𝑅 subscript ^ 𝑅 0 \sqrt{T}(\hat{R}-\hat{R}_{*})\xrightarrow{p}0 square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_R end_ARG - over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ) start_ARROW overitalic_p → end_ARROW 0
in view of Theorem 5A and lemma 5C.
Consider the ‘k-th’ eigenvector v ^ k subscript ^ 𝑣 𝑘 \hat{v}_{k} over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and eigenvalue λ ^ k subscript ^ 𝜆 𝑘 \hat{\lambda}_{k} over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT of R ^ ^ 𝑅 \hat{R} over^ start_ARG italic_R end_ARG
as well as the corresponding v ^ * , k subscript ^ 𝑣 𝑘
\hat{v}_{*,k} over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT and λ ^ * , k subscript ^ 𝜆 𝑘
\hat{\lambda}_{*,k} over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT of R ^ * subscript ^ 𝑅 \hat{R}_{*} over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT .
Return now to T ( V ^ * , m V ^ * , m ′ − V ^ m V ^ m ′ ) → 𝑝 0 𝑝 → 𝑇 subscript ^ 𝑉 𝑚
superscript subscript ^ 𝑉 𝑚
′ subscript ^ 𝑉 𝑚 superscript subscript ^ 𝑉 𝑚 ′ 0 \sqrt{T}(\hat{V}_{*,m}\hat{V}_{*,m}^{\prime}-\hat{V}_{m}\hat{V}_{m}^{\prime})%
\xrightarrow{p}0 square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT * , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_ARROW overitalic_p → end_ARROW 0 . We rewrite this as
∑ k = 1 m T ( v ^ k v ^ k ′ − v ^ * , k v ^ * , k ′ ) → 𝑝 0 𝑝 → superscript subscript 𝑘 1 𝑚 𝑇 subscript ^ 𝑣 𝑘 superscript subscript ^ 𝑣 𝑘 ′ subscript ^ 𝑣 𝑘
superscript subscript ^ 𝑣 𝑘
′ 0 \sum_{k=1}^{m}\sqrt{T}(\hat{v}_{k}\hat{v}_{k}^{\prime}-\hat{v}_{*,k}\hat{v}_{*%
,k}^{\prime})\xrightarrow{p}0 ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_ARROW overitalic_p → end_ARROW 0 which holds if
T ( v ^ k v ^ k ′ − v ^ * , k v ^ * , k ′ ) → 𝑝 0 𝑝 → 𝑇 subscript ^ 𝑣 𝑘 superscript subscript ^ 𝑣 𝑘 ′ subscript ^ 𝑣 𝑘
superscript subscript ^ 𝑣 𝑘
′ 0 \sqrt{T}(\hat{v}_{k}\hat{v}_{k}^{\prime}-\hat{v}_{*,k}\hat{v}_{*,k}^{\prime})%
\xrightarrow{p}0 square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_ARROW overitalic_p → end_ARROW 0 for each k 𝑘 k italic_k .
This has Frobenius norm T 2 [ 1 − ( v ^ k ′ v ^ * , k ) 2 ] 𝑇 2 delimited-[] 1 superscript superscript subscript ^ 𝑣 𝑘 ′ subscript ^ 𝑣 𝑘
2 \sqrt{T}\sqrt{2[1-(\hat{v}_{k}^{\prime}\hat{v}_{*,k})^{2}]} square-root start_ARG italic_T end_ARG square-root start_ARG 2 [ 1 - ( over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] end_ARG .
If we set cos ( θ k ) = v ^ k ′ v ^ * , k subscript 𝜃 𝑘 superscript subscript ^ 𝑣 𝑘 ′ subscript ^ 𝑣 𝑘
\cos(\theta_{k})=\hat{v}_{k}^{\prime}\hat{v}_{*,k} roman_cos ( italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT * , italic_k end_POSTSUBSCRIPT then the norm is 2 T | sin ( θ k ) | 2 𝑇 subscript 𝜃 𝑘 \sqrt{2T}|\sin(\theta_{k})| square-root start_ARG 2 italic_T end_ARG | roman_sin ( italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) | .
This allows us to apply a special case of a classic result of [7 ]
which is given in [29 ] [the equation below their equation (1)] namely
| sin ( θ k ) | subscript 𝜃 𝑘 \displaystyle|\sin(\theta_{k})| | roman_sin ( italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) |
≤ 1 δ ^ k ‖ R ^ − R ^ * ‖ absent 1 subscript ^ 𝛿 𝑘 norm ^ 𝑅 subscript ^ 𝑅 \displaystyle\leq\frac{1}{\hat{\delta}_{k}}\|\hat{R}-\hat{R}_{*}\| ≤ divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_δ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ∥ over^ start_ARG italic_R end_ARG - over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ∥
δ ^ k subscript ^ 𝛿 𝑘 \displaystyle\hat{\delta}_{k} over^ start_ARG italic_δ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
= m i n ( | λ ^ k − 1 − λ ^ ∗ , k | , | λ ^ k + 1 − λ ^ ∗ , k | ) absent 𝑚 𝑖 𝑛 subscript ^ 𝜆 𝑘 1 subscript ^ 𝜆 ∗ 𝑘
subscript ^ 𝜆 𝑘 1 subscript ^ 𝜆 ∗ 𝑘
\displaystyle=min(|\hat{\lambda}_{k-1}-\hat{\lambda}_{\ast,k}|,|\hat{\lambda}_%
{k+1}-\hat{\lambda}_{\ast,k}|) = italic_m italic_i italic_n ( | over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT ∗ , italic_k end_POSTSUBSCRIPT | , | over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT - over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT ∗ , italic_k end_POSTSUBSCRIPT | )
Since (Theorem 5A) λ ^ l → 𝑝 λ l , λ ^ ∗ , l → 𝑝 λ l formulae-sequence 𝑝 → subscript ^ 𝜆 𝑙 subscript 𝜆 𝑙 𝑝 → subscript ^ 𝜆 ∗ 𝑙
subscript 𝜆 𝑙 \hat{\lambda}_{l}\xrightarrow{p}\lambda_{l},\hat{\lambda}_{\ast,l}\xrightarrow%
{p}\lambda_{l} over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_λ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT ∗ , italic_l end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_λ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT for all l 𝑙 l italic_l then
δ ^ k → 𝑝 δ k = m i n ( | λ k − 1 − λ k | , | λ k + 1 − λ k | ) 𝑝 → subscript ^ 𝛿 𝑘 subscript 𝛿 𝑘 𝑚 𝑖 𝑛 subscript 𝜆 𝑘 1 subscript 𝜆 𝑘 subscript 𝜆 𝑘 1 subscript 𝜆 𝑘 \hat{\delta}_{k}\xrightarrow{p}\delta_{k}=min(|\lambda_{k-1}-\lambda_{k}|,|%
\lambda_{k+1}-\lambda_{k}|) over^ start_ARG italic_δ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_ARROW overitalic_p → end_ARROW italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_m italic_i italic_n ( | italic_λ start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | , | italic_λ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT - italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ) .
However, consider that
T ‖ R ^ − R ^ * ‖ = tr [ T ( R ^ − R ^ * ) T ( R ^ − R ^ * ) ] 𝑇 norm ^ 𝑅 subscript ^ 𝑅 tr 𝑇 ^ 𝑅 subscript ^ 𝑅 𝑇 ^ 𝑅 subscript ^ 𝑅 \displaystyle\sqrt{T}\|\hat{R}-\hat{R}_{*}\|=\sqrt{\operatorname{tr}[\sqrt{T}(%
\hat{R}-\hat{R}_{*})\sqrt{T}(\hat{R}-\hat{R}_{*})]} square-root start_ARG italic_T end_ARG ∥ over^ start_ARG italic_R end_ARG - over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ∥ = square-root start_ARG roman_tr [ square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_R end_ARG - over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ) square-root start_ARG italic_T end_ARG ( over^ start_ARG italic_R end_ARG - over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ) ] end_ARG
but each term → 𝑝 0 𝑝 → absent 0 \xrightarrow{p}0 start_ARROW overitalic_p → end_ARROW 0 and so T | sin ( θ k ) | → 𝑝 0 𝑝 → 𝑇 subscript 𝜃 𝑘 0 \sqrt{T}|\sin(\theta_{k})|\xrightarrow{p}0 square-root start_ARG italic_T end_ARG | roman_sin ( italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) | start_ARROW overitalic_p → end_ARROW 0
and the proof is complete.