Analysis of Symmetric Matrix Valued Functions
Analysis of Symmetric Matrix Valued Functions
To cite this article: H. Mohebi & A. Salemi (2007) Analysis of Symmetric Matrix Valued
Functions, Numerical Functional Analysis and Optimization, 28:5-6, 691-715, DOI:
10.1080/01630560701380999
1. INTRODUCTION
There has been growing interest in the variational analysis of spectral
functions. This growing trend is due to the following reasons. On one
hand, spectral functions have important applications to some fundamental
problems in applied mathematics such as semidefinite programs and
engineering problems (see [7, 13] for many such applications). On the
other hand, efficient nonsmooth analysis tools have only been available in
the past few years (see [21]). Classical interest in spectral functions arose
from their important role in quantum mechanics [9, 22]. Nowadays they
are an inseparable part of optimization [13] and matrix analysis [4, 5].
In modern optimization, the key example is semidefinite programming,
where one encounters problems involving spectral functions. There are
many examples where a property of the spectral function is actually
equivalent to the corresponding property of the underlying symmetric
function. Among them are first-order differentiability [10], convexity
691
692 H. Mohebi and A. Salemi
A = P diag[1 , , n ]P T , (1.1)
for some P ∈ , where diag[1 , , n ] is the diagonal matrix with its ith
diagonal entry i Note that (1.1) is independent of the choice of P ∈
(see [1, 5]).
Let (·) : −→ n be the eigenvalue function such that i (A), i =
1, , n, yield eigenvalues of A for any A ∈ and are ordered in a
nonincreasing order, that is, 1 (A) ≥ · · · ≥ n (A) A function f : n −→
is called symmetric if f is invariant under coordinate permutation, that
is, f (x) = f (Px) for any permutation matrix P and any x ∈ n A spectral
function is a composition of a symmetric function f : n −→ and the
eigenvalue function (·) : −→ n , that is, the spectral function f ◦ :
−→ is defined by
(f ◦ )(A) := f ((A)) (A ∈ )
In the sequel, let denote the set of all permutation matrices in n×n
For given ∈ n , denotes the stabilizer of defined by
:= P ∈ : P =
2. PRELIMINARIES
Let be the space of real symmetric matrices. It is clear that
is a subspace of n×n with dimension n(n + 1)/2 For any A ∈ with
eigenvalues 1 ≥ · · · ≥ n , define
P diag[1 , , n ]P T := P diag[(A)]P T , P ∈
(x, r ) := y ∈ n : y − x ≤ r ,
and
(A, r ) := B ∈ : B − A ≤ r
diag[P ] = P diag[]P T
F (y) − F (z)
lip F (x) := lim sup
y,z−→x y − z
y =z
F (x + th) − F (x)
F (x, h) := lim exists ∀h ∈ k ,
t −→ 0 t
and F is directionally differentiable if F is directionally differentiable at
every x ∈ k We say that F is semidifferentiable at x ∈ k if the limit
F (x + t ĥ) − F (x)
lim
t −→ 0 t
ĥ −→ h
Analysis of Symmetric Matrix Valued Functions 695
exists for every direction h ∈ k It is easy to see that the limit (if it exists)
equals F (x, h) We say that F is differentiable (in the Fréchet sense) at
x ∈ k if there exists a linear mapping F (x) : k −→ l such that
A := P ∈ : P T AP = diag[(A)]
Lemma 2.3. For any A ∈ , there exist scalars > 0 and > 0 such that
(A) − (B) ≤ A − B
and
Finally, we present the following result (see [8, 20]) showing that for
any A ∈ and any H ∈ , the orthonormal eigenvectors of A + tH can be
chosen as a power series in t for all t ∈
Remark 3.2. Note that by part (4) of Lemma 3.1, we conclude that
f satisfies the property () for any symmetric differentiable function
f : n −→ .
For simplicity, let D1 := diag[(f1 ◦ )(B), , (fn ◦ )(B)] and D2 := diag[(f1 ◦)
(A), , (fn ◦ )(A)] Consider
D1 − D2 −→ 0 as A − B −→ 0 (3.4)
It follows from (3.2), (3.3), and (3.4) that f˜ (A) − f˜ (B) −→ 0 as
A − B −→ 0, and hence f˜ is continuous at A.
B := P diag[y]P T −→ P diag[(A)]P T = A
P − Q ≤ H (4.5)
B = Q diag[(B)]Q T , A = P diag[(A)]P T ,
(4.6)
f˜ (B) = Q diag[(f1 ◦ )(B), , (fn ◦ )(B)]Q T ,
and
E := P T EP , H := P T HP , X := diag[(f1 ◦ )(B), , (fn ◦ )(B)], Y :=
diag[(f1 ◦ )(A), , (fn ◦ )(A)], Z := Q T P , C := f [1] ((A)), := (A),
:= (B), i := i (A), and i := i (B), i = 1, , n Then we have
E = Z T XZ − Y − C ◦ H (4.7)
diag[1 , , n ] = P T AP = Z T diag[ 1 , , n ]Z
,
−H
and so
n
i , if i = j
zki zkj k − h̃ij = (4.8)
k=1
0, if i = j
n
1= zii2 + zki2 = zii2 + O(H 2 ) (i = 1, , n), (4.10)
k =i,k=1
700 H. Mohebi and A. Salemi
and
n
0 = zii zij + zji zjj + zki zkj
k =i,j ,k=1
n
n
n
= zki2 fk ( ) − fi () − (Dl fi )() − l + zkl2 k
k=1 l =1 k=1
n
= zii2 fi ( ) − fi () − (Dl fi )() − l + zll2 l + O(H 2 )
l =1
n
− (Dl fi )()(−l + (1 + O(H 2 )) l ) + O(H 2 )
l =1
n
= fi ( ) − fi () − (Dl fi )()( l − l ) + O(H 2 )
l =1
fj () − fi ()
cij = (4.15)
j − i
n
ẽij = zki zkj fk ( ) − cij h̃ij
k=1
n
n
= zki zkj fk ( ) − cij zki zkj k
k=1 k=1
= zii zij fi ( ) + zji zjj fj ( ) − cij (zii zij i + zji zjj j ) + O(H 2 )
= (zii zij + zji zjj )fi ( ) + zji zjj (fj ( ) − fi ( ))
− cij ((zii zij + zji zjj ) i + zji zjj ( j − i )) + O(H 2 )
= zji zjj (fj ( ) − fi ( ) − cij ( j − i )) + O(H 2 )
fj () − fi ()
= zji zjj fj ( ) − fi ( ) − ( j − i ) + O(H 2 )
j − i
= zji zjj ((fj ( ) − fi ( )) − (fj () − fi ())(1 + (( j − j ) − ( i − i ))(j − i )−1 )
+ O(H 2 ) (4.16)
P − Q ≤ H (4.18)
n
1 = zii2 + zki2 = zii2 + O(H 2 ) (i = 1, , n) (4.24)
k =i,k=1
n
zki2 (fk ◦ )(A + H ) − (fi ◦ )(A) − (D(H ))ii = o(H )
k=1
Hence we have
n
zii2 (fi ◦ )(A + H ) + zki2 (fk ◦ )(A + H ) − (fi ◦ )(A) − (D(H ))ii
k =i,k=1
continuous at (A) Then, in view of (4.1) and (4.2), we conclude that f [1]
is continuous at (A). By Lemma 2.3, there exist > 0 and > 0 such that
where
fi
xi
((B)) −→ xfii ((A)) for all i = 1, , n But, we have fi is a symmetric
function (i = 1, , n) Hence, xfij ((B)) −→ xfij ((A)) for all i, j =
1, , n, and so f is continuous at (A), which completes the proof.
where
i (t ) = [P (t )T A(t )P (t )]ii , i = 1, 2, , n,
Hence
We have
P (t )T A(t ) + P (t )T A (t ) = (t )P (t )T + (t )P (t )T ,
P (t )T A(t )P (t ) + P (t )T A (t )P (t ) = (t )P (t )T P (t ) + (t )P (t )T P (t )
Analysis of Symmetric Matrix Valued Functions 707
Because A (t ) = H , we obtain
P (t )T A(t )P (t ) + P (t )T HP (t ) = (t ) + (t )P (t )T P (t )
(t ) = P (t )T HP (t ) and
Now, let H P (t ) = P (t )T P (t ) Then we have
(t ) =
P (t )T A(t )P (t ) + H (t ) + (t )
P (t )
(t ) =
P (t )T P (t ) (t )P (t )T P (t ) + H (t ) + (t )
P (t )
We conclude that
(t ) =
P (t ) (t ) + H (t ) + (t )
P (t ),
and hence
(t ) =
(t ) − H P (t ) (t ) − (t )
P (t )
(t )ii = (t )ii ,
H i = 1, 2, , n, (5.2)
and
(t )ij =
H P (t )ij (j (t ) − i (t )), i = j , i, j = 1, 2, , n (5.3)
P (t )T P (t ) + P (t )T P (t ) = 0,
and hence
P T = −
P , where
P =
P (0) Because P is differentiable at t = 0,
we conclude that
(t ) = P T HP
lim H and lim
P (t ) =
P (5.5)
t −→0 t −→0
(0)ii = (P T HP )ii
i (0) = H
708 H. Mohebi and A. Salemi
and
(0)ij =
H Pij (j (0) − i (0)), i = j ; i, j = 1, 2, , n
(P T HP )ij
Pij = whenever i (0) = j (0), (5.6)
j (0) − i (0)
and
(0)ij = (P T HP )ij = 0
H whenever i (0) = j (0), i = j (5.7)
Therefore, we have
f˜ (A + tH ) − f˜ (A)
lim = Pf [1] ((A); P T HP )P T
t −→0 t
H = P diag[h1 , h2 , , hn ]P T
f˜ (A + tH ) − f˜ (A)
(f˜ ) (A; H ) = lim
t −→0 t
P diag[f1 ((A + tH )) − f1 ((A)), ,
fn ((A + tH )) − fn ((A))]P T
= lim
t −→0 t
Hence,
P T (f˜ ) (A; H )P
f1 ((A + tH )) − f1 ((A)) fn ((A + tH )) − fn ((A))
= lim diag ,,
t −→0 t t
(5.9)
710 H. Mohebi and A. Salemi
i (t ) = [P (t )T (A + tH )P (t )]ii , i = 1, 2, , n
i (t ) = i (A) + thi , i = 1, 2, , n
It follows that
P T (f˜ ) (A; H )P
f1 ((A) + th) − f1 ((A)) fn ((A) + th) − fn ((A))
= lim diag ,,
t −→0 t t
This implies that for each i = 1, 2, , n
Now, for each i ∈ 1, 2, , n, r ∈ and any unit vector u ∈ n define gi :
n −→ by:
fi ((A) + ru), 0 ≤ r ≤
gi ((A) + ru) = (5.11)
fi ((A) + u), r >
(g k )(B) − f˜ (B)
= Q diag g1k ( (B)), , gnk ( (B)) Q T
− Q diag[f1 ( (B)), , fn ( (B))]Q T
= diag g1k ( (B)) − f1 ( (B)) , , gnk ( (B)) − fn ( (B)) , (5.13)
where the second equality follows from the fact that Q T Q = I and
properties of the Frobenius norm · Because gik k≥1 converges
712 H. Mohebi and A. Salemi
uniformly to fi on C (i = 1, 2, , n), it follows from (5.13) that (g k )
k≥1
converges uniformly to f˜ on (A, ) Moreover, in view of (4.2), (5.12),
and continuous differentiability of g k on C (k ≥ 1) and that (B) ∈ C for
all B ∈ (A, ), we obtain
(g
k )(B) = sup (g k )(B)(H )
H =1
+ (g k )[1] ((A)) ◦ (Q T HQ )
≤ sup diag[g k ((A))diag[Q T HQ ]]
H =1
≤ m0 sup Q T HQ = m0 , (5.14)
H =1
(g k )(B) − f˜ (B) ≤ D − F ∀B ∈ (A, ) (5.15)
f˜ (D) − f˜ (F ) = f˜ (D) − (g
k )(D) + (g k )(D) + (g k )(F ) − (gk )(F ) − f˜ (F )
≤ f˜ (D) − (g k )(D) + (g
k )(D) − (g k )(F )
+ (g k )(F ) − f˜ (F )
1
≤ 2D − F +
(g )(F + t (D − F ))(D − F )dt
k
0
≤ (2 + m0 )D − F ∀k ≥ N
Analysis of Symmetric Matrix Valued Functions 713
D := P diag[x1 , , xn ]P T ,
and
F := P diag[y1 , , yn ]P T
A := P diag[x1 , , xn ]P T ,
714 H. Mohebi and A. Salemi
and
B := P diag[y1 , , yn ]P T
Then, we have
ACKNOWLEDGMENT
This research was supported by Mahani Mathematical Research Center.
REFERENCES
1. R. Bhatia (1997). Matrix Analysis. Springer-Verlag, New York.
2. X. Chen and P. Tseng (2003). Non-interior continuation methods for solving semidefinite
complementarity problems. Math. Program. 95:431–474.
3. X. Chen, H.D. Qi, and P. Tseng (2003). Analysis of nonsmooth symmetric-matrix-valued
functions with applications to semidefinite complementarity problems. SIAM J. Optimiz.
13:960–985.
4. R.A. Horn and C.R. Johnson (1985). Matrix Analysis. 2nd ed. Cambridge University Press,
Cambridge.
5. R.A. Horn and C.R. Johnson (1991). Topics in Matrix Analysis. Cambridge University Press,
Cambridge.
6. H. Jiang and D. Ralph (1999). Global and local superlinear convergence analysis of Newton-
type methods for semismooth equations with smooth least squares. In Reformulation: Non-smooth,
Piecewise Smooth, Semismooth and Smoothing Methods, (M. Fukushima and L. Qi, eds.), Appl. Optim.
22. Kluwer Academic Publishers, Dordrecht, pp. 181–209.
7. C. Kanzow and C. Nagel (2002). Semidefinite programs: New search directions, smoothing-type
methods, and numerical results. SIAM J. Optimiz. 13:1–23.
8. T. Kato (1984). Perturbation Theory for Linear Operators. Springer-Verlag, Berlin.
9. E.C. Kemble (1958). The Fundamental Principles of Quantum Mechanics. Dover, New York.
10. A.S. Lewis (1996). Derivatives of spectral functions. Math. Oper. Res. 21:576–588.
11. A.S. Lewis (1999). Nonsmooth analysis of eigenvalues. Math. Program 84:1–24.
12. A.S. Lewis (1996). Convex analysis on the Hermitian matrices. SIAM J. Optimiz. 6:164–177.
13. A.S. Lewis and M.L. Overton (1996). Eigenvalue optimization. In Acta Numerica. Acta Numer. 5.
Cambridge University Press, Cambridge, pp. 149–190.
14. A.S. Lewis and H.S. Sendov (2001). Twice differentiable spectral functions. SIAM J. Matrix Anal.
Appl. 23:368–386.
15. R. Mifflin (1977). Semismooth and semiconvex functions in constrained optimization. SIAM J.
Control Optimiz. 15:959–972.
16. J. von Neumann (1937). Some matrix inequalities and metrization of metric space. Tomsk
University Review 1:286–300.
17. L. Qi (1993). Convergence analysis of some algorithm for solving nonsmooth equations. Math.
Oper. Res. 18:227–244.
18. L. Qi and J. Sun (1993). A nonsmooth version of Newton’s method. Math. Program. 58:353–357.
19. H.D. Qi and X. Yang (2004). Semismoothness of spectral functions. SIAM J. Matrix Anal. Appl.
25:766–783.
20. F. Rellich (1969). Perturbation Theory of Eigenvalue Problems. Gordon and Breach, New York.
Analysis of Symmetric Matrix Valued Functions 715
21. R.T. Rockafellar and R.J.-B. Wets (1998). Variational Analysis. Springer-Verlag, Berlin.
22. L.I. Schiff (1955). Quantum Mechanics. McGraw-Hill, New York.
23. D. Sun and J. Sun (2002). Semismooth matrix valued functions. Math. Oper. Res. 27:150–169.
24. D. Sun and J. Sun (2003). Strong semismoothness of eigenvalues of symmetric matrices and its
application to inverse eigenvalue problem. SIAM J. Numer. Anal. 40:2352–2367.
25. M. Torki (1999). First- and second-order epi-differentiability in eigenvalue optimization. J. Math.
Anal. Appl. 234:391–416.
26. M. Torki (1999). Valeurs Propres de Matrices Symetriques: Analyses de la Sensibilitc d’Ordre
Superieur et Formulations Variationelles pour Leur Estimation. Ph.D. thesis. Paul Sabatier
University, Toulouse, France.
27. M. Torki (2001). Second-order directional derivatives of all eigenvalues of a symmetric matrix.
Non-linear Anal. 46:1133–1150.
28. N.K. Tsing, M.K.H. Fan, and E.I. Verriest (1994). On analyticity of functions involving
eigenvalues. Linear Algebra Appl. 207:159–180.
29. P. Tseng (1998). Merit functions for semidefinite complemetarity problems. Math. Program
83:159–185.