Simple matrix expressions for the curvatures of Grassmannian

Zehua Lai Department of Mathematics, University of Texas, Austin, TX 78712 zehua.lai@austin.utexas.edu , Lek-Heng Lim Computational and Applied Mathematics Initiative, Department of Statistics, University of Chicago, Chicago, IL 60637-1514 lekheng@uchicago.edu and Ke Ye KLMM, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China keyk@amss.ac.cn

Abstract.

We show that modeling a Grassmannian as symmetric orthogonal matrices $\operatorname{Gr}(k,\mathbb{R}^{n})\cong\{Q\in\mathbb{R}^{n\times n}:Q^{% \scriptscriptstyle\mathsf{T}}Q=I,\;Q^{\scriptscriptstyle\mathsf{T}}=Q,\;% \operatorname{tr}(Q)=2k-n\}$ yields exceedingly simple matrix formulas for various curvatures and curvature-related quantities, both intrinsic and extrinsic. These include Riemann, Ricci, Jacobi, sectional, scalar, mean, principal, and Gaussian curvatures; Schouten, Weyl, Cotton, Bach, Plebański, cocurvature, nonmetricity, and torsion tensors; first, second, and third fundamental forms; Gauss and Weingarten maps; and upper and lower delta invariants. We will derive explicit, simple expressions for the aforementioned quantities in terms of standard matrix operations that are stably computable with numerical linear algebra. Many of these aforementioned quantities have never before been presented for the Grassmannian.

1. Introduction

While pure mathematicians typically abhor picking coordinates for manifolds, this is all but inevitable in applied mathematics. A good choice of extrinsic coordinates facilitates computations for the applied mathematician and, as we will see in this article, provides transparent, easy-to-calculate expressions that are useful even for investigations in pure mathematics.

For the Grassmannian of $k$ -planes in $\mathbb{R}^{n}$ , we showed in [27] that points on the manifold may be represented by matrices $Q\in\mathbb{R}^{n\times n}$ that are (i) symmetric $Q^{\scriptscriptstyle\mathsf{T}}=Q$ , (ii) orthogonal $Q^{\scriptscriptstyle\mathsf{T}}Q=I$ , (iii) involutive $Q^{2}=I$ . Clearly any two of these conditions imply the third and thus

(1)

\operatorname{Gr}(k,\mathbb{R}^{n})\cong\{Q\in\mathbb{S}^{n}:Q^{2}=I,\;% \operatorname{tr}(Q)=2k-n\}\eqqcolon\operatorname{Gr}(k,n)

where $\mathbb{S}^{n}$ denotes the Euclidean space of $n\times n$ symmetric matrices. Our motivation in [27] was largely computational — such a coordinate representation of points in $\operatorname{Gr}(k,\mathbb{R}^{n})$ by orthogonal matrices gives immeasurably stabler numerical algorithms compared to other models of the Grassmannian as projection matrices or equivalence classes of matrices.

The goal of this article is to show that, even for calculations by hand, the involution model (1) provides a significant advantage over, say, expressions in [43], which are supposedly simple and already given in terms of linear algebra. Henceforth we define $\operatorname{Gr}(k,n)$ to be the set of matrices on the right of (1) to distinguish it from $\operatorname{Gr}(k,\mathbb{R}^{n})$ , the Grassmannian as an abstract manifold. In our earlier work [27], we derived expressions for basic quantities related to optimization: tangent vector, normal vector, metric, exponential map, geodesic, parallel transport, gradient, Hessian, etc, and showed that they all have simple, easily computable expressions in the involution model. Here we will do the same for various types of curvatures, some of which are notoriously difficult to calculate, but it is nevertheless rewarding as curvatures are likely the most important geometric objects of a Riemannian manifold. One might even argue that Riemannian geometry was created to provide a rigorous platform for studying curvatures.

A secondary goal is to illustrate the ease of using the involution model (1). We believe that many of the expressions derived in this article would be more difficult, some nearing impossible, to derive in other common models of the Grassmannian — as submanifolds of projective spaces, as various homogeneous spaces, or as a manifold of orthogonal projectors, all discussed in Section 7. Moreover, the expressions that we obtained are also more user-friendly, as we will elaborate below after presenting them in Table 1.

Table 1. The point

Q\in\operatorname{Gr}(k,n)

, tangent vectors

X,Y,Z,W\in\mathbb{T}_{Q}\operatorname{Gr}(k,n)

, and normal vector

H\in\mathbb{N}_{Q}\operatorname{Gr}(k,n)

are parameterized as in (11).

\tabulinesep

=0.75ex {tabu}@lll curvature & expression result
\tabucline[1pt ] - first fundamental form $\fff(X,Y)=2\operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0})$ Proposition 4.1
second fundamental form $\operatorname{\fff\fff}(X,Y)=\dfrac{1}{2}V\begin{bmatrix}X_{0}Y_{0}^{% \scriptscriptstyle\mathsf{T}}+Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}}&0\\ 0&-(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}+Y_{0}^{\scriptscriptstyle\mathsf% {T}}X_{0})\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}}$ Theorem 4.3
third fundamental form $\operatorname{\fff\fff\fff}(X,Y)=-\dfrac{1}{2}\left(\dfrac{n}{2k(n-k)}+\dfrac{% n-2}{4}\right)\operatorname{tr}(XY)$ Corollary 4.8
Gauss map $\mathsf{\Gamma}(Q)=\biggl{\{}V\begin{bmatrix}H_{1}&0\\ 0&H_{2}\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}}:H_{1}\in\mathbb{S}^{k},\;% H_{2}\in\mathbb{S}^{n-k}\biggr{\}}$ Proposition 4.2
Weingarten map $\mathsf{S}(H)(X)=\dfrac{1}{2}V\begin{bmatrix}0&H_{1}X_{0}-X_{0}H_{2}\\ (H_{1}X_{0}-X_{0}H_{2})^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}V^{% \scriptscriptstyle\mathsf{T}}$ Corollary 4.5
mean curvature vector $\mathsf{H}=\dfrac{1}{2k(n-k)}V\begin{bmatrix}-(n-k)I_{k}&0\\ 0&kI_{n-k}\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}}$ Corollary 4.6
mean curvature $\mathsf{H}(H)=\dfrac{(k-n)\operatorname{tr}H_{1}+k\operatorname{tr}H_{2}}{2k(n% -k)}$ Corollary 4.6
Gaussian curvature $\mathsf{G}(H)=\dfrac{1}{2^{k(n-k)}}\prod_{i=1}^{k}\prod_{j=1}^{n-k}(\lambda_{k% +j}-\lambda_{i})$ Corollary 4.7
principal curvature $\upkappa_{ij}(H)=\dfrac{1}{2}(\lambda_{k+j}-\lambda_{i})$ , $i=1,\dots,k$ , $j=1,\dots,n-k$ Corollary 4.7
Riemann curvature $\operatorname{\mathsf{Rie}}(X,Y,Z,W)=\dfrac{1}{2}\operatorname{tr}\bigl{(}(XY-% YX)ZW\bigr{)}$ Proposition 5.1
Jacobi curvature $\mathsf{J}(X,Y,Z,W)=\operatorname{tr}(XYZW)-\operatorname{tr}\Bigl{(}Y\Bigl{(}% \dfrac{XZ+ZX}{2}\Bigr{)}W\Bigr{)}$ Corollary 5.2
sectional curvature $\upkappa(X,Y)=\dfrac{1}{4}\dfrac{\|[X,Y]\|^{2}}{\lVert X\rVert^{2}\lVert Y% \rVert^{2}-\operatorname{tr}(XY)^{2}}$ Corollary 5.3
Ricci curvature $\operatorname{\mathsf{Ric}}(X,Y)=\dfrac{n-2}{8}\operatorname{tr}(XY)$ Corollary 5.4
scalar curvature $\operatorname{\mathsf{Sca}}=\dfrac{k(n-k)(n-2)}{8}$ Corollary 5.4
traceless Ricci curvature $\mathsf{Z}(X,Y)=0$ Corollary 5.4
upper delta invariant $\overline{\updelta}_{2,r}=\dfrac{k(n-k)(n-2)}{8}$ Theorem 5.5
lower delta invariant $\underline{\updelta}_{2,r}=\dfrac{k(n-k)(n-2)}{8}-\dfrac{r}{4}$ Theorem 5.5
Schouten curvature $\mathsf{P}(X,Y)=\dfrac{n-2}{16(k(n-k)-1)}\operatorname{tr}(XY)$ Corollary 5.6
Cotton curvature $\mathsf{C}(X,Y,Z)=0$ Corollary 5.7
Bach curvature $\mathsf{B}(X,Y)=\dfrac{(n-2)^{2}}{32(k(n-k)-2)}\operatorname{tr}(XY)$ Corollary 5.9
Weyl curvature $\mathsf{W}(X,Y,Z,W)=\dfrac{1}{2}\operatorname{tr}\bigl{(}(XY-YX)ZW\bigr{)}$ Corollary 5.8
$-\dfrac{(n-2)}{8(k(n-k)-1)}\bigl{(}\operatorname{tr}(XZ)\operatorname{tr}(YW)-% \operatorname{tr}(XW)\operatorname{tr}(YZ)\bigr{)}$

A few of the curvatures in Table 1 are extrinsic, i.e., they depend specifically on our model (1). These include the second and third fundamental forms; the Gauss and Weingarten maps; mean, Gaussian, and principal curvatures. They help us better understand the embedded geometry of Grassmannian given by (1) but they also expedite our calculations of the intrinsic curvatures. These include the Riemann, Ricci, sectional, and scalar curvatures; the Schouten, Cotton, Weyl, and Bach tensors; and the upper and lower delta invariants.

While the value of an intrinsic curvature is independent of our choice of models, the expression or formula that gives this value is not. As is evident from Table 1, the involution model yields simple, stably computable formulas for extrinsic and intrinsic curvatures alike, essentially reducing curvatures of the Grassmannian to matrix analysis and their computations to numerical linear algebra. For example, computing the value of the Riemann curvature, a daunting order- $4$ tensor, is a trivial one-line calculation using our formula (and the proof of this formula is notably also a one-liner). For contrast, we will show in Section 7.2 what the corresponding calculation would entail if we use the most common Grassmannian model $\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)\times% \operatorname{O}(n-k)\bigr{)}$ .

What’s new

For the intrinsic curvatures, the Schouten, Cotton, Weyl, Bach curvature tensors, the upper and lower delta invariants have never been explicitly calculated for a Grassmannian to the best of our knowledge. The formulas for Riemann, Ricci, sectional, and scalar curvatures of the Grassmannian modeled as various quotient spaces (see Section 7.2) are well-known and classical [9, 16, 40, 49] and they have also been calculated in [3, 32] for the projection model (see Section 7.3). The novelty of our calculations for these is that we derived intrinsic curvatures from extrinsic curvatures. This is why we will calculate extrinsic curvatures first.

The formulas for the extrinsic curvatures — second and third fundamental forms; Gauss and Weingarten maps; mean, Gaussian, and principal curvatures — are all new. Of course this is just a consequence of the relative obscurity of the involution model (1). Unlike intrinsic invariants, extrinsic ones are model-dependent, and it is expected that these have never been calculated for a new model. Also, extrinsic curvatures do not apply to the quotient models in Section 7.2 as they are only defined for embedded manifolds.

We emphasize that by ‘formula’ we mean an explicit expression like those in Table 1, involving actual matrices, and has no undetermined quantities. These curvatures may of course be expressed in terms of local coordinates or equivalence classes or horizontal spaces, but these invariably require additional computational overhead, which we will discuss in Section 7. Our formulas do not contain ambiguities that require further choices and effort to resolve.

In addition to the curvatures in Table 1, we will also discuss the cocurvature, nonmetricity, torsion, and Plebański tensors. We will see in Proposition 3.1 and Corollary 6.1 that they are trivially zero. We also proved in Corollary 4.4 that the index of relative nullity vanishes; in Corollary 6.2 that the third fundamental form, the Ricci curvature, the Schouten and Bach tensors are all Codazzi tensors; and in Corollary 6.4 that the Riemann and Weyl curvatures are divergence-free.

2. Notations and conventions

In this article, we use blackboard bold fonts for vector spaces (e.g., tangent and normal spaces, space of $n\times n$ symmetric matrices, etc.) and san serif fonts for all curvatures and curvature-related quantities (e.g., Table 1). The Riemann, Ricci, and scalar curvatures, arguably the three most important quantities, are given three-letter notations $\operatorname{\mathsf{Rie}},\operatorname{\mathsf{Ric}},\operatorname{\mathsf{% Sca}}$ for emphasis.

We write $\mathbb{E}^{m}$ for a Euclidean space of dimension $m$ equipped with its Euclidean inner product $\langle\,\cdot,\cdot\,\rangle$ . For concreteness, one may assume that this Euclidean space is $\mathbb{R}^{n}$ with $\langle x,y\rangle=x^{\scriptscriptstyle\mathsf{T}}y$ , or $\mathbb{S}^{n}$ with $\langle X,Y\rangle=\operatorname{tr}(XY)$ , or $\mathbb{R}^{m\times n}$ with $\langle X,Y\rangle=\operatorname{tr}(X^{\scriptscriptstyle\mathsf{T}}Y)$ . We write $\operatorname{proj}_{\mathbb{W}}:\mathbb{E}^{m}\to\mathbb{E}^{m}$ for the orthogonal projection onto a subspace $\mathbb{W}\subseteq\mathbb{E}^{m}$ . The space of all linear maps between vector spaces $\mathbb{V}$ and $\mathbb{W}$ will be denoted $\operatorname{Hom}(\mathbb{V},\mathbb{W})$ with $\operatorname{Hom}(\mathbb{V},\mathbb{V})$ denoted specially as $\operatorname{End}(\mathbb{V})$ . We write $\operatorname{id}$ for the identity map on any set.

For $X,Y\in\mathbb{R}^{n\times n}$ , we write $[X,Y]=XY-YX$ for the commutator. We write $\mathfrak{so}(n)=\{X\in\mathbb{R}^{n\times n}:X^{\scriptscriptstyle\mathsf{T}}% =-X\}$ for the special orthogonal Lie algebra, i.e., the set of skew-symmetric matrices with $[\,\cdot,\cdot\,]$ as its Lie bracket.

We write $\mathcal{M}$ for a smooth manifold, $C^{\infty}(\mathcal{M})$ for its ring of smooth real-valued functions, $\mathbb{T}_{x}\mathcal{M}$ and $\mathbb{N}_{x}\mathcal{M}$ for its tangent and normal spaces at $x\in\mathcal{M}$ respectively, and $\mathcal{X}(\mathcal{M})$ for its $C^{\infty}(\mathcal{M})$ -module of smooth vector fields. We denote vector fields with an arrow like $\vec{v}$ . The word tensor in this article will always mean a tensor over a $C^{\infty}(\mathcal{M})$ -module, i.e., a smooth tensor field on $\mathcal{M}$ , and will be cast in the form of multilinear maps between tangent and normal spaces. With few exceptions, all multilinear maps in this article are defined at a specific point $x\in\mathcal{M}$ . So as not to be overly verbose, we write “on/of $\mathcal{M}$ ” when we mean “on/of $\mathcal{M}$ at $x$ ” and we sometimes drop the subscript $x$ like in Table 1 when there is no cause for confusion.

We write $\operatorname{Gr}(k,\mathbb{V})$ for the Grassmannian of $k$ -dimensional subspaces in the vector space $\mathbb{V}$ . We emphasize that in this article,

(2)

\operatorname{Gr}(k,n)\coloneqq\{Q\in\mathbb{S}^{n}:Q^{2}=I,\;\operatorname{tr% }(Q)=2k-n\},

i.e., $\operatorname{Gr}(k,n)\subseteq\mathbb{S}^{n}$ is the image of the embedding

(3)

\varepsilon:\operatorname{Gr}(k,\mathbb{R}^{n})\to\mathbb{S}^{n},\quad\mathbb{% W}\mapsto P_{\mathbb{W}}-P_{\mathbb{W}^{\perp}}\eqqcolon Q_{\mathbb{W}},

where $P_{\mathbb{W}}\in\mathbb{S}^{n}$ is the orthogonal projection matrix with image $\mathbb{W}$ . So $\varepsilon$ sends a $k$ -dimensional subspace $\mathbb{W}\subseteq\mathbb{R}^{n}$ to a matrix $Q_{\mathbb{W}}\in\mathbb{S}^{n}$ . It is easy to verify [27] that $Q_{\mathbb{W}}$ has the properties in (2), $\varepsilon$ gives an embedding of Riemannian manifolds, and

\operatorname{Gr}(k,n)=\varepsilon\bigl{(}\operatorname{Gr}(k,\mathbb{R}^{n})% \bigr{)}.

3. Curvature zoo

We will review the definitions of various curvatures and curvature-related quantities. This section is not intended to be pedagogical, and only contains minimal commentaries. The goal is just to collect definitions scattered across standard references [5, 16, 24, 25, 26, 30, 38] and some slightly less standard ones [10, 11, 15, 22, 37] and present them in a unified set of notations (see Section 2) for the reader’s easy reference.

All discussions below assume that $\mathcal{M}$ is a Riemannian manifold with Riemannian metric $\mathsf{g}$ , i.e., $\mathsf{g}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to% \mathbb{R}$ defines an inner product at $x\in\mathcal{M}$ . Section 3.2 applies to $\mathcal{M}$ intrinsically. Section 3.1 applies when $\mathcal{M}$ embedded in an Euclidean space $\mathbb{E}^{m}$ . More precisely by embedding we always mean an isometric embedding $\varepsilon:\mathcal{M}\to\mathbb{E}^{m}$ preserving Riemannian metric, i.e., $\mathsf{g}_{x}(v,w)=\langle d_{x}\varepsilon(v),d_{x}\varepsilon(w)\rangle$ for all $x\in\mathcal{M}$ , $v,w\in\mathbb{T}_{x}\mathcal{M}$ . Henceforth, we will identify $\mathcal{M}$ with its image under the embedding $\varepsilon$ so that we have $\mathcal{M}\subseteq\mathbb{E}^{m}$ and $\mathsf{g}_{x}(v,w)=\langle v,w\rangle$ .

As usual, we will use the Levi-Civita connection throughout. We will say a few words about this choice in Section 3.3

3.1. Extrinsic curvatures

In this section, $\mathcal{M}\subseteq\mathbb{E}^{m}$ is an $n$ -dimensional submanifold of an $m$ -dimensional Euclidean space. For any $x\in\mathcal{M}$ , we have the canonical identification

\mathbb{E}^{m}\cong\mathbb{T}_{x}\mathbb{E}^{m}=\mathbb{T}_{x}\mathcal{M}% \oplus\mathbb{N}_{x}\mathcal{M}

and the corresponding orthogonal projections $\operatorname{proj}_{\mathbb{T}_{x}\mathcal{M}}$ and $\operatorname{proj}_{\mathbb{N}_{x}\mathcal{M}}$ .

The first fundamental form is

\fff_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to\mathbb{R}% ,\quad\fff_{x}(v,w)\coloneqq\langle v,w\rangle.

This is nothing more than the Riemannian metric $\mathsf{g}$ on $\mathcal{M}$ expressed in terms of the inner product $\langle\,\cdot,\cdot\,\rangle$ on $\mathbb{E}^{m}$ .

The Gauss map is defined by

\mathsf{\Gamma}:\mathcal{M}\to\operatorname{Gr}(m-n,\mathbb{E}^{m}),\quad% \mathsf{\Gamma}(x)\coloneqq\mathbb{N}_{x}\mathcal{M}.

Take any $\mathbb{V}\in\operatorname{Gr}(m-n,\mathbb{E}^{m})$ , note that this is an $(m-n)$ -dimensional subspace of $\mathbb{E}^{m}$ and we have

\mathbb{T}_{\mathbb{V}}\operatorname{Gr}(m-n,\mathbb{E}^{m})\cong\operatorname% {Hom}(\mathbb{V},\mathbb{V}^{\perp}).

In particular, $\mathbb{T}_{\mathbb{N}_{x}\mathcal{M}}\operatorname{Gr}(m-n,\mathbb{E}^{m})% \cong\operatorname{Hom}(\mathbb{T}_{x}\mathcal{M},\mathbb{N}_{x}\mathcal{M})$ and we may also regard the Gauss map as

(4)

\mathsf{\Gamma}:\mathcal{M}\to\operatorname{Hom}(\mathbb{T}_{x}\mathcal{M},% \mathbb{N}_{x}\mathcal{M}).

The second fundamental form $\operatorname{\fff\fff}_{x}$ of $\mathcal{M}$ is given by the derivative of the Gauss map in the form of (4), i.e.,

\operatorname{\fff\fff}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}% \mathcal{M}\to\mathbb{N}_{x}\mathcal{M},\quad\operatorname{\fff\fff}_{x}(v,w)% \coloneqq d_{x}\mathsf{\Gamma}(v)(w).

This is likely the most important extrinsic differential geometric invariant of an embedded manifold. Indeed many of our curvatures, including intrinsic ones, will be derived from the second fundamental form. We define the index of relative nullity [15] of $\mathcal{M}$ as

\upnu_{x}\coloneqq\dim\{v\in\mathbb{T}_{x}\mathcal{M}:\operatorname{\fff\fff}_% {x}(v,w)=0\text{~{}for all~{}}w\in\mathbb{T}_{x}\mathcal{M}\},

noting that $\operatorname{\fff\fff}_{x}(v,w)=\operatorname{\fff\fff}_{x}(w,v)$ . For each $\eta\in\mathbb{N}_{x}\mathcal{M}$ , we may regard $\langle\operatorname{\fff\fff}_{x},\eta\rangle$ as an endomorphism on $\mathbb{T}_{x}\mathcal{M}$ and this is called the Weingarten map or shape operator,

\mathsf{S}_{x}(\eta):\mathbb{T}_{x}\mathcal{M}\to\mathbb{T}_{x}\mathcal{M},% \quad\langle\mathsf{S}_{x}(\eta)(v),w\rangle\coloneqq\langle\operatorname{\fff% \fff}_{x}(v,w),\eta\rangle.

This operator is self-adjoint as the second fundamental form is symmetric. The eigenvalues (necessarily real)

\lambda_{1}(\eta),\dots,\lambda_{n}(\eta)\in\mathbb{R},

are called the principal curvatures of $\mathcal{M}$ along $\eta$ . Their product is called the Gaussian curvature of $\mathcal{M}$ along $\eta$

\mathsf{G}_{x}(\eta)\coloneqq\det\mathsf{S}_{x}(\eta)\in\mathbb{R},

and their sum is called the mean curvature of $\mathcal{M}$ along $\eta$

\mathsf{H}_{x}(\eta)\coloneqq\operatorname{tr}\mathsf{S}_{x}(\eta)\in\mathbb{R}.

For any orthonormal basis $\eta_{1},\dots,\eta_{m-n}\in\mathbb{N}_{x}\mathcal{M}$ , the mean curvature vector of $\mathcal{M}$ is

\mathsf{H}_{x}\coloneqq\sum_{i=1}^{m-n}\mathsf{H}_{x}(\eta_{i})\eta_{i}\in% \mathbb{N}_{x}\mathcal{M}

and its value is independent of the choice of orthonormal basis. Clearly, $\mathsf{H}_{x}(\eta)=\langle\mathsf{H}_{x},\eta\rangle$ .

The Gauss–Obata map [37] of $\mathcal{M}$ is

\mathsf{Q}_{x}:\mathbb{T}_{x}\mathcal{M}\to\mathbb{T}_{x}\mathcal{M},\quad% \mathsf{Q}_{x}(v)\coloneqq\sum_{j=1}^{m-n}\mathsf{S}_{x}(\eta_{j})^{2}(v).

and the third fundamental form is

\operatorname{\fff\fff\fff}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}% \mathcal{M}\to\mathbb{R},\quad\operatorname{\fff\fff\fff}_{x}(v,w)\coloneqq% \langle\mathsf{Q}_{x}(v),w\rangle.

The version defined here differs from an alternative version defined in classical differential [14, Equation 21] and algebraic [20, Equations 1.45 and 1.46] geometry. The latter allows for $k$ th fundamental forms for all $k\geq 4$ . Nevertheless the important thing is that both versions agree with the classical third fundamental form for a surface $\mathcal{M}\subseteq\mathbb{R}^{3}$ .

Getting slightly ahead of ourselves, the second fundamental form may also be expressed using the Levi-Civita connection $\nabla$ in (6):

(5)

\operatorname{\fff\fff}_{x}(v,w)=\partial_{v}\vec{w}-\nabla_{\!v}\vec{w}=% \operatorname{proj}_{\mathbb{N}_{x}\mathcal{M}}(\partial_{v}\vec{w})

for any $v,w\in\mathbb{T}_{x}\mathcal{M}$ and where $\vec{w}\in\mathcal{X}(\mathcal{M})$ is any vector field with $\vec{w}(x)=w$ [30, Theorem 8.2]. The value of $\operatorname{\fff\fff}_{x}(v,w)$ is independent of the choice of $\vec{w}$ [30, Proposition 8.1].

3.2. Intrinsic curvatures

In this section $\mathcal{M}$ is a Riemannian manifold with metric tensor $\mathsf{g}$ . One feature of our approach is that we will calculate intrinsic curvatures in extrinsic coordinates given by our involution model, vastly simplifying the work involved. As such it suffices to define the Levi-Civita connection $\nabla$ of $\mathcal{M}$ as an embedded manifold:

(6)

\nabla:\mathcal{X}(\mathcal{M})\times\mathcal{X}(\mathcal{M})\to\mathcal{X}(% \mathcal{M}),\quad(\nabla(\vec{v},\vec{w}))(x)\coloneqq\operatorname{proj}_{% \mathbb{T}_{x}\mathcal{M}}(\partial_{v}\vec{w}),

where $v\coloneqq\vec{v}(x)\in\mathbb{T}_{x}\mathcal{M}$ and $\partial_{v}\vec{w}$ is the standard directional derivative of the vector field, i.e., derivative of the vector-valued function $\vec{w}:\mathcal{M}\to\mathbb{E}^{m}$ along the direction $v\in\mathbb{T}_{x}\mathcal{M}\subseteq\mathbb{E}^{m}$ . Note that this simple definition is possible only because both $\mathcal{M}$ and $\mathbb{T}_{x}\mathcal{M}$ are regarded as subsets of $\mathbb{E}^{m}$ . It is also common to write, for a fixed $v\in\mathbb{T}_{x}\mathcal{M}$ ,

\nabla_{\!v}\vec{w}:\mathcal{X}(\mathcal{M})\to\mathcal{X}(\mathcal{M}),\quad(% \nabla_{\!v}\vec{w})(x)\coloneqq(\nabla(\vec{v},\vec{w}))(x),

as it behaves like a directional derivative. A slight variation of this notation makes $v\in\mathbb{T}_{x}\mathcal{M}$ the variable and fixes $x\in\mathcal{M}$ , giving

\leftidx{{}_{x}\!}{\nabla}\vec{w}:\mathbb{T}_{x}\mathcal{M}\to\mathbb{T}_{x}% \mathcal{M},\quad(\leftidx{{}_{x}\!}{\nabla}\vec{w})(v):=(\nabla_{\!v}\vec{w})% (x).

Since $\leftidx{{}_{x}\!}{\nabla}\vec{w}$ is a linear operator, it has a trace, which defines the divergence for a vector field $\vec{w}$ ,

\operatorname{div}:\mathcal{X}(\mathcal{M})\to C^{\infty}(\mathcal{M}),\quad% \operatorname{div}(\vec{w})(x)\coloneqq\operatorname{tr}(\leftidx{{}_{x}\!}{% \nabla}\vec{w}).

For higher-order tensor fields, the convention is to apply divergence to the last argument: If $\vec{w}_{1},\dots,\vec{w}_{k}\in\mathcal{X}(\mathcal{M})$ , then

\operatorname{div}(\vec{w}_{1}\otimes\cdots\otimes\vec{w}_{k-1}\otimes\vec{w}_% {k})\coloneqq\operatorname{div}(\vec{w}_{k})\vec{w}_{1}\otimes\cdots\otimes% \vec{w}_{k-1},

and extended linearly to all $k$ -tensor fields [38].

We will need two common notions [5] defined for any symmetric bilinear forms on $\mathbb{T}_{x}\mathcal{M}$ . Let $\alpha,\beta:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to% \mathbb{R}$ be symmetric and bilinear, their Kulkarni–Nomizu product $\alpha\varowedge\beta$ is the symmetric quadrilinear form

(7)

\begin{gathered}\alpha\varowedge\beta:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T% }_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}% \to\mathbb{R},\\ \alpha\varowedge\beta(u,v,w,z)\coloneqq\alpha(u,w)\beta(v,z)-\alpha(u,z)\beta(% v,w)-\alpha(v,w)\beta(u,z)+\alpha(v,z)\beta(u,w).\end{gathered}

If a symmetric bilinear form $\beta:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to\mathbb{R}$ satisfies

(\nabla_{\!u}\beta)(v,w)=(\nabla_{\!v}\beta)(u,w)

for all $u,v,w\in\mathbb{T}_{x}\mathcal{M}$ , then it is called a Codazzi tensor.

The Riemann curvature or curvature tensor of $\mathcal{M}$ is

\operatorname{\mathsf{Rie}}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}% \mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to% \mathbb{R},\quad\operatorname{\mathsf{Rie}}_{x}(u,v,w,z)\coloneqq\langle\nabla% _{\!u}\nabla_{\!v}w-\nabla_{\!v}\nabla_{\!u}w-\nabla_{\![u,v]}w,z\rangle.

There is a common variant known by the same name:

\mathsf{R}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to% \operatorname{End}(\mathbb{T}_{x}\mathcal{M}),\quad\mathsf{R}_{x}(u,v)w% \coloneqq\nabla_{\!u}\nabla_{\!v}w-\nabla_{\!v}\nabla_{\!u}w-\nabla_{\![u,v]}w.

Note that $\operatorname{\mathsf{Rie}}_{x}(u,v,w,z)=\langle\mathsf{R}_{x}(u,v)w,z\rangle$ with the slight difference being that $\mathsf{R}_{x}$ is a bilinear map and $\operatorname{\mathsf{Rie}}_{x}$ is a quadrilinear form. There is also a symmetric variant called the Jacobi tensor of $\mathcal{M}$ ,

\mathsf{J}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\times% \mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to\mathbb{R},\quad% \mathsf{J}_{x}(u,v,w,z)\coloneqq\frac{1}{2}\left(\operatorname{\mathsf{Rie}}_{% x}(u,v,w,z)+\operatorname{\mathsf{Rie}}_{x}(w,v,u,z)\right).

If $v,w\in\mathbb{T}_{x}\mathcal{M}$ are linearly independent, then the sectional curvature of $\mathcal{M}$ is

\upkappa_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to% \mathbb{R},\quad\upkappa_{x}(v,w)\coloneqq\frac{\operatorname{\mathsf{Rie}}_{x% }(v,w,w,v)}{\lVert v\wedge w\rVert^{2}}=\frac{\operatorname{\mathsf{Rie}}_{x}(% v,w,w,v)}{\lVert v\rVert^{2}\lVert w\rVert^{2}-\langle v,w\rangle^{2}}.

Let $v_{1},\dots,v_{n}\in\mathbb{T}_{x}\mathcal{M}$ be an orthonormal basis. The Ricci curvature of $\mathcal{M}$ is

\operatorname{\mathsf{Ric}}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}% \mathcal{M}\to\mathbb{R},\quad\operatorname{\mathsf{Ric}}_{x}(v,w)\coloneqq% \sum_{j=1}^{n}\operatorname{\mathsf{Rie}}_{x}(v,v_{j},v_{j},w).

The scalar curvature of $\mathcal{M}$ is

\operatorname{\mathsf{Sca}}_{x}\coloneqq\operatorname{tr}(\operatorname{% \mathsf{Ric}}_{x})=\sum_{1\leq j<k\leq n}\upkappa_{x}(v_{j},v_{k})=\sum_{1\leq j% <k\leq n}\operatorname{\mathsf{Rie}}_{x}(v_{j},v_{k},v_{k},v_{j}).

The last two curvatures gives us the traceless Ricci curvature,

\mathsf{Z}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to% \mathbb{R},\quad\mathsf{Z}_{x}(v,w)\coloneqq\operatorname{\mathsf{Ric}}_{x}(v,% w)-\frac{\operatorname{\mathsf{Sca}}_{x}}{n}\mathsf{g}_{x}(v,w),

important as it gives a $\mathsf{g}$ -orthogonal decomposition of Ricci curvature. A manifold with $\mathsf{Z}=0$ is called an Einstein manifold [5].

The scalar curvature allows a generalization to any $d$ -dimensional subspace $\mathbb{V}\subseteq\mathbb{T}_{x}\mathcal{M}$ ,

\operatorname{\mathsf{Sca}}_{x}(\mathbb{V})\coloneqq\sum_{1\leq j<k\leq d}% \upkappa_{x}(v_{j},v_{k}),

where $v_{1},\dots,v_{d}\in\mathbb{V}$ is any orthonormal basis. Clearly $\operatorname{\mathsf{Sca}}_{x}(\mathbb{T}_{x}\mathcal{M})=\operatorname{% \mathsf{Sca}}_{x}$ . From these one may construct the upper and lower delta invariants [10, 11], given respectively by

	$\displaystyle\overline{\updelta}_{x}(d_{1},\dots,d_{r})$	$\displaystyle\coloneqq\operatorname{\mathsf{Sca}}_{x}-\inf_{\begin{subarray}{c% }\dim\mathbb{V}_{j}=d_{j}\\ \mathbb{V}_{j}\perp\mathbb{V}_{k},\;j<k\end{subarray}}\biggl{[}\sum_{j=1}^{r}% \operatorname{\mathsf{Sca}}_{x}(\mathbb{V}_{j})\biggr{]},$
	$\displaystyle\underline{\updelta}_{x}(d_{1},\dots,d_{r})$	$\displaystyle\coloneqq\operatorname{\mathsf{Sca}}_{x}-\sup_{\begin{subarray}{c% }\dim\mathbb{V}_{j}=d_{j}\\ \mathbb{V}_{j}\perp\mathbb{V}_{k},\;j<k\end{subarray}}\biggl{[}\sum_{j=1}^{r}% \operatorname{\mathsf{Sca}}_{x}(\mathbb{V}_{j})\biggr{]},$

where $d_{1},\dots,d_{r}\in\mathbb{Z}$ are such that $2\leq d_{1}\leq\cdots\leq d_{r}$ and $d_{1}+\dots+d_{r}\leq\dim\mathcal{M}$ .

We will next define a well-known quartet of closely related curvature tensors [5]. The Schouten tensor of $\mathcal{M}$ is

\mathsf{P}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to% \mathbb{R},\quad\mathsf{P}_{x}(v,w)\coloneqq\frac{1}{n-2}\Bigl{(}\operatorname% {\mathsf{Ric}}_{x}(v,w)-\frac{\operatorname{\mathsf{Sca}}_{x}}{2(n-1)}\mathsf{% g}_{x}(v,w)\Bigr{)}.

The Cotton tensor of $\mathcal{M}$ is

\mathsf{C}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\times% \mathbb{T}_{x}\mathcal{M}\to\mathbb{R},\quad\mathsf{C}_{x}(u,v,w)\coloneqq(% \nabla_{\!u}\mathsf{P}_{x})(v,w)-(\nabla_{\!v}\mathsf{P}_{x})(u,w).

The Weyl tensor of $\mathcal{M}$ is

\mathsf{W}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\times% \mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to\mathbb{R},\quad% \mathsf{W}_{x}\coloneqq\operatorname{\mathsf{Rie}}_{x}-\frac{1}{n-2}\mathsf{Z}% _{x}\varowedge\mathsf{g}_{x}-\frac{\operatorname{\mathsf{Sca}}_{x}}{2n(n-1)}% \mathsf{g}_{x}\varowedge\mathsf{g}_{x}.

The Bach tensor of $\mathcal{M}$ is

	$\displaystyle\mathsf{B}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}% \mathcal{M}\to\mathbb{R},$
	$\displaystyle\mathsf{B}_{x}(u,w)\coloneqq\frac{1}{n-3}\sum_{i,j=1}^{n}\nabla^{% 2}_{v_{i},v_{j}}\mathsf{W}_{x}(u,v_{i},v_{j},w)+\frac{1}{n-2}\sum_{i,j=1}^{n}% \operatorname{\mathsf{Ric}}_{x}(v_{i},v_{j})\mathsf{W}_{x}(u,v_{i},v_{j},w),$

We will also describe some intrinsic curvatures that are more typically studied in non-Riemannian geometry, i.e., for a connection $\nabla$ other than the Levi-Civita connection (see Section 3.3).

The torsion tensor [24, Chapter III, Section 5] of $\mathcal{M}$ is

\mathsf{T}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\to% \mathbb{R},\quad\mathsf{T}_{x}(v,w)\coloneqq\nabla_{\!v}w-\nabla_{\!w}v-[v,w].

The nonmetricity tensor [22] of $\mathcal{M}$ is

\mathsf{Q}_{x}:\mathbb{T}_{x}\mathcal{M}\times\mathbb{T}_{x}\mathcal{M}\times% \mathbb{T}_{x}\mathcal{M}\to\mathbb{R},\quad\mathsf{Q}_{x}(u,v,w)\coloneqq-% \nabla_{\!u}\mathsf{g}_{x}(v,w)+\mathsf{g}_{x}(\nabla_{\!u}v,w)+\mathsf{g}_{x}% (v,\nabla_{\!u}w).

For any vector subbundle $\mathbb{V}\mathcal{M}$ of $\mathbb{T}\mathcal{M}$ with projection $\pi:\mathbb{T}\mathcal{M}\to\mathbb{V}\mathcal{M}$ , its cocurvature [35, 26] is

\mathsf{R}^{*}_{\pi}:\mathbb{X}(\mathcal{M})\times\mathcal{X}(\mathcal{M})\to% \mathcal{X}(\mathcal{M}),\quad\mathsf{R}^{*}_{\pi}(\vec{v},\vec{w})\coloneqq(% \operatorname{id}-\pi)\bigl{(}[\pi(\vec{v}),\pi(\vec{w})]\bigr{)}.

In general $\mathsf{R}^{*}_{\pi}$ measures the failure of integrability of $\mathbb{V}\mathcal{M}$ . To put the cocurvature in perspective, the curvature in this context is

\mathsf{R}_{\pi}:\mathcal{X}(\mathcal{M})\times\mathcal{X}(\mathcal{M})\to% \mathcal{X}(\mathcal{M}),\quad\mathsf{R}_{\pi}(\vec{v},\vec{w})\coloneqq\pi% \bigl{(}[(\operatorname{id}-\pi)(\vec{v}),(\operatorname{id}-\pi)(\vec{w})]% \bigr{)}.

Evidently, this is a more general notion than the Riemann curvature $\mathsf{R}$ but we will see how they are related in Proposition 3.1.

3.3. Connections

Our choice of Levi-Civita connection is all but preordained by the Fundamental Theorem of Riemannian Geometry [16, Chapter 2, Theorem 3.6], namely, it is the unique affine connection that is torsion-free and metric-compatible. To reassure the readers, we will add a few more words to justify this choice.

In manifold optimization, one needs a metric to identify tangent space with cotangent space, and the Levi-Civita connection is the most natural one (i.e., torsion free) compatible with our Riemannian metric $\mathsf{g}$ . While the goal of our article is not optimization per se, there are not many options among other common connections either: (i) The Weyl connection is a generalization of Levi-Civita connection to conformal metrics and for our choice of $\mathsf{g}$ the two are identical. (ii) The affine, Cartan, Ehresmann, and Koszul connections are unnatural in our context. (iii) The Gauss–Manin and Grothendieck connections are intended for schemes and incompatible with our consideration of the Grassmannian as a manifold. (iv) Other connections like those of Connes and Weitzenböck are even further removed from our treatment in this article.

Unsurprisingly the non-Riemannian curvatures all turn out to be trivial for a Riemannian manifold. The statement (a) below is just stated for ease of referencing.

Proposition 3.1.

Let $\nabla$ be the Levi-Civita connection on $\mathcal{M}$ .

(a)

The torsion tensor and the nonmetricity tensor vanishes identically, i.e.,

\mathsf{T}_{x}(v,w)=0,\qquad\mathsf{Q}_{x}(u,v,w)=0

for all $x\in\mathcal{M}$ and all $u,v,w\in\mathbb{T}_{x}\mathcal{M}$ .

(b)

Let $\pi:\operatorname{O}(\mathcal{M})\to\mathcal{M}$ be the orthonormal frame bundle on $\mathcal{M}$ , $\mathbb{V}\mathcal{M}\coloneqq\ker(d\pi)$ , and $\widehat{\pi}:\mathbb{T}\operatorname{O}(\mathcal{M})\to\mathbb{V}\mathcal{M}$ the projection induced by $\nabla$ . Then the cocurvature vanishes and the curvature equals the Riemann curvature up to sign, i.e.,

\mathsf{R}^{*}_{\widehat{\pi}}(\vec{v},\vec{w})=0,\qquad\mathsf{R}_{\widehat{% \pi}}(\vec{v},\vec{w})=-\mathsf{R}(\vec{v},\vec{w})

where the first equality holds for all $\vec{v},\vec{w}\in\mathcal{X}(\operatorname{O}(\mathcal{M}))$ , the second for all $\vec{v},\vec{w}\in\mathcal{X}(\mathcal{M})\subseteq\mathcal{X}(\operatorname{O% }(\mathcal{M}))$ .

Proof.

Since the Levi-Civita connection on $\mathcal{M}$ is, by definition, the unique connection that is torsion free and compatible with the metric $\mathsf{g}$ , both $\mathsf{T}$ and $\mathsf{Q}$ vanish.

The horizontal bundle $\ker\widehat{\pi}$ is $\mathbb{T}\mathcal{M}$ , whose integral manifold is $\mathcal{M}$ . So the cocurvature $\mathsf{R}^{*}_{\widehat{\pi}}$ must vanish identically. Since $\operatorname{O}(\mathcal{M})$ consists of orthonormal frames on $\mathcal{M}$ , the typical fiber of $\mathbb{V}(\mathcal{M})=\ker(d\pi)$ is $\mathfrak{so}(n)$ , where $n=\dim\mathcal{M}$ . As $\widehat{\pi}$ is induced by $\nabla$ , it can be regarded as an $\mathfrak{so}(n)$ -valued differential $1$ -form $\omega$ on $\operatorname{O}(\mathcal{M})$ . The projection map $\widehat{\pi}$ gives a decomposition

\mathbb{T}\operatorname{O}(\mathcal{M})\simeq\mathbb{V}\mathcal{M}\oplus% \mathbb{T}\mathcal{M}.

Thus for any $\vec{v},\vec{w}\in\mathcal{X}(\operatorname{O}(\mathcal{M}))$ , we may identify $(\operatorname{id}-\widehat{\pi})(\vec{v})$ and $(\operatorname{id}-\widehat{\pi})(\vec{w})$ with elements in $\mathcal{X}(\mathcal{M})$ and

-\mathsf{R}_{\widehat{\pi}}(\vec{v},\vec{w})=-\omega([(\operatorname{id}-% \widehat{\pi})(\vec{v}),(\operatorname{id}-\widehat{\pi})(\vec{w})])=d\omega% \bigl{(}(\operatorname{id}-\widehat{\pi})(\vec{v}),(\operatorname{id}-\widehat% {\pi})(\vec{w})\bigr{)}.

In other words, $-\mathsf{R}_{\widehat{\pi}}$ turns out to be the curvature $2$ -form of $\omega$ , which by [24, Section III.5] is equal to the Riemann curvature $\mathsf{R}$ . ∎

4. Extrinsic curvatures of the Grassmannian

We are now in a position to calculate various curvatures of the Grassmannian modeled as $\operatorname{Gr}(k,n)$ and express them as simple matrix formulas. Our strategy is first calculate the extrinsic curvatures in this section, notably the second fundamental form, and then use it as the basis for our calculation of intrinsic curvatures in Section 5.

Our ambient Euclidean space of choice is $\mathbb{S}^{n}$ equipped with the standard (also called trace or Frobenius) inner product on $\mathbb{R}^{m\times n}$ given by

\langle X,Y\rangle\coloneqq\operatorname{tr}(X^{\scriptscriptstyle\mathsf{T}}Y% )=\sum_{i=1}^{n}\sum_{j=1}^{n}x_{ij}y_{ij}.

When restricted to $\operatorname{Gr}(k,n)$ , it gives us our Riemannian metric

(8)

\mathsf{g}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}% \operatorname{Gr}(k,n)\to\mathbb{R},\quad\mathsf{g}_{Q}(X,Y)=\operatorname{tr}% (X^{\scriptscriptstyle\mathsf{T}}Y)

for all $Q\in\operatorname{Gr}(k,n)$ . Of course, we have $X^{\scriptscriptstyle\mathsf{T}}=X$ as $X\in\mathbb{S}^{n}$ but we choose to keep the transpose in our notation to remind ourselves that this is the trace inner product.

Given $Q\in\operatorname{Gr}(k,n)\subseteq\mathbb{S}^{n}$ , we have an eigenvalue decomposition $Q=VI_{k,n-k}V^{\scriptscriptstyle\mathsf{T}}$ for some $V\in\operatorname{O}(n)$ and $I_{k,n-k}\coloneqq\operatorname{diag}(I_{k},-I_{n-k})=\operatorname{diag}(1,% \dots,1,-1,\dots,-1)$ . The tangent and normal spaces of $\operatorname{Gr}(k,n)$ at $Q$ are

(9)		$\displaystyle\mathbb{T}_{Q}\operatorname{Gr}(k,n)$	$\displaystyle=\biggl{\{}V\begin{bmatrix}0&X_{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}V^{\scriptscriptstyle% \mathsf{T}}\in\mathbb{S}^{n}:X_{0}\in\mathbb{R}^{k\times(n-k)}\biggr{\}},$
(10)		$\displaystyle\mathbb{N}_{Q}\operatorname{Gr}(k,n)$	$\displaystyle=\biggl{\{}V\begin{bmatrix}H_{1}&0\\ 0&H_{2}\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}}\in\mathbb{S}^{n}:H_{1}\in% \mathbb{S}^{k},\;H_{2}\in\mathbb{S}^{n-k}\biggr{\}}.$

Henceforth we will consistently write any point $Q\in\operatorname{Gr}(k,n)$ , tangent vector $X\in\mathbb{T}_{Q}\operatorname{Gr}(k,n)$ and normal vector $H\in\mathbb{N}_{Q}\operatorname{Gr}(k,n)$ as

(11)

Q=V\begin{bmatrix}I_{k}&0\\ 0&-I_{n-k}\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}},\quad X=V\begin{% bmatrix}0&X_{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}V^{\scriptscriptstyle% \mathsf{T}},\quad H=V\begin{bmatrix}H_{1}&0\\ 0&H_{2}\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}}.

The simple parameterization of these three basic objects in the involution model is a key to the simplicity of our calculations. A convenient orthonormal basis of $\mathbb{T}_{I_{k,n-k}}\operatorname{Gr}(k,n)$ is given by

(12)

\biggl{\{}\dfrac{\sqrt{2}}{2}\begin{bmatrix}0&E_{ij}\\ E_{ij}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\in\mathbb{S}^{n}:i=1,% \dots,k,\;j=1,\dots,n-k\biggr{\}}

where $E_{ij}$ is the $k\times(n-k)$ matrix with one in the $(i,j)$ th entry and zero everywhere else. We refer readers to [27, Section 3] for the proofs of statements in this and the last paragraph.

The next two results require no calculation and are just stated for completeness.

Proposition 4.1 (First fundamental form).

The first fundamental form $\fff_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}\operatorname% {Gr}(k,n)\to\mathbb{R}$ is given by

\fff_{Q}(X,Y)=2\operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0})

with $Q$ , $X$ , $Y$ parameterized as in (11).

Proposition 4.2 (Gauss map).

The Gauss map of $\operatorname{Gr}(k,n)$ in $\mathbb{S}^{n}$ is given by

	$\displaystyle\mathsf{\Gamma}:\operatorname{Gr}(k,n)$	$\displaystyle\to\operatorname{Gr}\bigl{(}\tbinom{n+1}{2}-k(n-k),\mathbb{S}^{n}% \bigr{)},$
	$\displaystyle\mathsf{\Gamma}(Q)$	$\displaystyle=\mathbb{N}_{Q}(\operatorname{Gr}(k,n))=V\left\{\begin{bmatrix}H_% {1}&0\\ 0&H_{2}\end{bmatrix}:H_{1}\in\mathbb{S}^{k},H_{2}\in\mathbb{S}^{n-k}\right\}V^% {\scriptscriptstyle\mathsf{T}},$

with $Q$ parameterized as in (11).

The next calculation is our key to unlocking other calculations in this article.

Theorem 4.3 (Second fundamental form).

The second fundamental form $\operatorname{\fff\fff}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{% T}_{Q}\operatorname{Gr}(k,n)\to\mathbb{N}_{Q}\operatorname{Gr}(k,n)$ is given by

\operatorname{\fff\fff}_{Q}(X,Y)=\frac{1}{2}V\begin{bmatrix}X_{0}Y_{0}^{% \scriptscriptstyle\mathsf{T}}+Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}}&0\\ 0&-X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}-Y_{0}^{\scriptscriptstyle\mathsf{% T}}X_{0}\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}},

with $Q$ , $X$ , $Y$ parameterized as in (11).

Proof.

Since $\operatorname{O}(n)$ acts on $\operatorname{Gr}(k,n)$ transitively and isometrically, it suffices to calculate $\operatorname{\fff\fff}$ at $I_{k,n-k}\in\operatorname{Gr}(k,n)$ . In this case $X,Y\in\mathbb{T}_{I_{k,n-k}}\operatorname{Gr}(k,n)$ may be written as

X=\begin{bmatrix}0&X_{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix},\quad Y=\begin{bmatrix}0&Y% _{0}\\ Y_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}

for some $X_{0},Y_{0}\in\mathbb{R}^{k\times(n-k)}$ . Points near $I_{k,n-k}$ can be parametrized as

\varphi(B,H_{1},H_{2})=\exp\biggl{(}\begin{bmatrix}0&-B\\ B^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\biggr{)}\biggl{(}I_{k,n-k}+% \begin{bmatrix}H_{1}&0\\ 0&H_{2}\end{bmatrix}\biggr{)}\exp\biggl{(}\begin{bmatrix}0&-B\\ B^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}^{\scriptscriptstyle\mathsf{T}}% \biggr{)},

where $B\in\mathbb{R}^{k\times(n-k)}$ , $H_{1}\in\mathbb{S}^{k}$ , and $H_{2}\in\mathbb{S}^{n-k}$ have sufficiently small norms. Clearly, we have $\varphi(B,H_{1},H_{2})\in\operatorname{Gr}(k,n)$ if and only if $H_{1}=0$ and $H_{2}=0$ . Thus we may extend $X$ by

\widetilde{X}\bigl{(}\varphi(B,H_{1},H_{2})\bigr{)}=\exp\biggl{(}\begin{% bmatrix}0&-B\\ B^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\biggr{)}\begin{bmatrix}H_{1}&X% _{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&H_{2}\end{bmatrix}\exp\biggl{(}\begin{% bmatrix}0&-B\\ B^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\biggr{)}^{\scriptscriptstyle% \mathsf{T}}.

Such an $\widetilde{X}$ is an extension of a local vector field around $I_{k,n-k}$ on $\operatorname{Gr}(k,n)$ . By (5),

\operatorname{\fff\fff}_{I_{k,n-k}}(X,Y)=\operatorname{proj}_{\mathbb{N}_{I_{k% ,n-k}}\operatorname{Gr}(k,n)}\bigl{(}\langle\widetilde{\nabla}\widetilde{X}(I_% {k,n-k}),Y\rangle\bigr{)}

where $\widetilde{\nabla}$ denotes the covariant derivative in the Euclidean space $\mathbb{S}^{n}$ , i.e.,

\widetilde{\nabla}\widetilde{X}=\bigl{(}\partial_{B}\widetilde{X},\partial_{H_% {1}}\widetilde{X},\partial_{H_{2}}\widetilde{X}\bigr{)}

Since $Y$ is a tangent vector, we obtain

\bigl{\langle}\widetilde{\nabla}\widetilde{X}(I_{k,n-k}),Y\bigr{\rangle}=\sum_% {i=1}^{k}\sum_{j=1}^{n-k}\frac{\partial\widetilde{X}}{\partial b_{ij}}(I_{k,n-% k})y_{0ij},

where we have written $B=(b_{ij})$ and $Y_{0}=(y_{0ij})$ . Observe that

	$\displaystyle\frac{\partial\widetilde{X}}{\partial b_{ij}}(I_{k,n-k})$	$\displaystyle=-\frac{1}{2}\begin{bmatrix}0&E_{ij}\\ -E_{ij}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\begin{bmatrix}0&X_{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}+\frac{1}{2}\begin{bmatrix}% 0&X_{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\begin{bmatrix}0&E_{ij}\\ -E_{ij}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}$
		$\displaystyle=\frac{1}{2}\begin{bmatrix}-E_{ij}X_{0}^{\scriptscriptstyle% \mathsf{T}}-X_{0}E_{ij}^{\scriptscriptstyle\mathsf{T}}&0\\ 0&E_{ij}^{\scriptscriptstyle\mathsf{T}}X_{0}+X_{0}^{\scriptscriptstyle\mathsf{% T}}E_{ij}\end{bmatrix}$

where the factor $\frac{1}{2}$ is a result of our choice of Riemannian metric on $\mathbb{S}^{n}$ . Therefore we have

	$\displaystyle\sum_{i=1}^{k}\sum_{j=1}^{n-k}\frac{\partial\widetilde{X}}{% \partial b_{ij}}(I_{k,n-k})y_{0ij}$	$\displaystyle=\frac{1}{2}\sum_{i=1}^{k}\sum_{j=1}^{n-k}\begin{bmatrix}-(E_{ij}% X_{0}^{\scriptscriptstyle\mathsf{T}}+X_{0}E_{ij}^{\scriptscriptstyle\mathsf{T}% })y_{0ij}&0\\ 0&(E_{ij}^{\scriptscriptstyle\mathsf{T}}X_{0}+X_{0}^{\scriptscriptstyle\mathsf% {T}}E_{ij})y_{0ij}\end{bmatrix}$
		$\displaystyle=\frac{1}{2}\begin{bmatrix}-X_{0}Y_{0}^{\scriptscriptstyle\mathsf% {T}}-Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}}&0\\ 0&X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}+Y_{0}^{\scriptscriptstyle\mathsf{T% }}X_{0}\end{bmatrix},$

where the last expression is our required $\operatorname{\fff\fff}_{I_{k,n-k}}(X,Y)$ . ∎

We record an observation that follows from an additional step of singular value decomposition.

Corollary 4.4 (Index of relative nullity).

The index of relative nullity $\upnu_{Q}$ of $\operatorname{Gr}(k,n)$ is zero.

Proof.

Let $X\in\mathbb{T}_{Q}\operatorname{Gr}(k,n)$ be such that $\operatorname{\fff\fff}_{Q}(X,Y)=0$ for all $Y\in\mathbb{T}_{Q}\operatorname{Gr}(k,n)$ , with $Q,X,Y$ parametrized as in (11). We claim that $X=0$ . By Theorem 4.3, we must have

(13)

X_{0}Y_{0}^{\scriptscriptstyle\mathsf{T}}+Y_{0}X_{0}^{\scriptscriptstyle% \mathsf{T}}=0,\quad X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}+Y_{0}^{% \scriptscriptstyle\mathsf{T}}X_{0}=0

for any $Y_{0}\in\mathbb{R}^{k\times(n-k)}$ . Let $X_{0}=U\Sigma V^{\scriptscriptstyle\mathsf{T}}$ be a singular value decomposition with $U\in\operatorname{O}(k)$ and $V\in\operatorname{O}(n-k)$ . Then (13) becomes

\Sigma(U^{\scriptscriptstyle\mathsf{T}}Y_{0}V)^{\scriptscriptstyle\mathsf{T}}+% (U^{\scriptscriptstyle\mathsf{T}}Y_{0}V)\Sigma=0,\quad\Sigma(U^{% \scriptscriptstyle\mathsf{T}}Y_{0}V)+(U^{\scriptscriptstyle\mathsf{T}}Y_{0}V)^% {\scriptscriptstyle\mathsf{T}}\Sigma=0.

Since $Y_{0}$ is arbitrary, we may set $X_{0}=\Sigma$ in (13). Now by taking $Y_{0}$ to be an arbitrary diagonal $k\times(n-k)$ matrix, we see that $\Sigma=0$ . Hence $X_{0}=0$ and $X=0$ . ∎

The Weingarten map is an alternative way to express the second fundamental form and thus follows easily from Theorem 4.3.

Corollary 4.5 (Weingarten map).

The Weingarten map $\mathsf{S}_{Q}(H):\mathbb{T}_{Q}\operatorname{Gr}(k,n)\to\mathbb{T}_{Q}% \operatorname{Gr}(k,n)$ along the normal direction $H\in\mathbb{N}_{Q}\operatorname{Gr}(k,n)$ is given by

(14)

\mathsf{S}_{Q}(H)(X)=\frac{1}{2}V\begin{bmatrix}0&H_{1}X_{0}-X_{0}H_{2}\\ (H_{1}X_{0}-X_{0}H_{2})^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}V^{% \scriptscriptstyle\mathsf{T}}

with $Q$ , $X$ , $H$ parameterized as in (11).

Proof.

We plug in the expressions from (11) into $\langle\mathsf{S}_{Q}(H)(X),Y\rangle=\langle\operatorname{\fff\fff}_{Q}(X,Y),H\rangle$ and use standard properties of trace to get

	$\displaystyle\langle\mathsf{S}_{Q}(H)(X),Y\rangle$	$\displaystyle=\frac{1}{2}\bigl{[}\operatorname{tr}\bigl{(}(X_{0}Y_{0}^{% \scriptscriptstyle\mathsf{T}}+Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}})H_{1}% \bigr{)}-\operatorname{tr}\bigl{(}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}+Y% _{0}^{\scriptscriptstyle\mathsf{T}}X_{0})H_{2}\bigr{)}\bigr{]}$
		$\displaystyle=\frac{1}{2}\bigl{[}\operatorname{tr}\bigl{(}(H_{1}X_{0}-X_{0}H_{% 2})\bigr{)}Y_{0}^{\scriptscriptstyle\mathsf{T}})+\operatorname{tr}\bigl{(}(X_{% 0}^{\scriptscriptstyle\mathsf{T}}H_{1}-H_{2}X_{0}^{\scriptscriptstyle\mathsf{T% }})Y_{0}\bigr{)}\bigr{]}$
		$\displaystyle=\operatorname{tr}\biggl{(}\frac{1}{2}\begin{bmatrix}0&H_{1}X_{0}% -X_{0}H_{2}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}H_{1}-H_{2}X_{0}^{\scriptscriptstyle% \mathsf{T}}&0\end{bmatrix}^{\scriptscriptstyle\mathsf{T}}\begin{bmatrix}0&Y_{0% }\\ Y_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\biggr{)},$

and thereby deducing (14). ∎

The calculation of mean curvature is also straightforward.

Corollary 4.6 (Mean curvature).

The mean curvature vector of $\operatorname{Gr}(k,n)$ is given by

\mathsf{H}_{Q}=\frac{1}{k(n-k)}\operatorname{tr}(\operatorname{\fff\fff}_{Q})=% \frac{1}{2k(n-k)}V\begin{bmatrix}-(n-k)I_{k}&0\\ 0&kI_{n-k}\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}}

and the mean curvature of $\operatorname{Gr}(k,n)$ along $H\in\mathbb{N}_{Q}\operatorname{Gr}(k,n)$ is given by

\mathsf{H}_{Q}(H)=\frac{(k-n)\operatorname{tr}H_{1}+k\operatorname{tr}H_{2}}{2% k(n-k)}

with $Q$ , $X$ , $H$ parameterized as in (11).

Proof.

We use the orthonormal basis of $\mathbb{T}_{I_{k,n-k}}\operatorname{Gr}(k,n)$ in (12). A straightforward but slightly messy calculation gives

	$\displaystyle(E_{ij}E_{i^{\prime}j^{\prime}}^{\scriptscriptstyle\mathsf{T}})_{pq}$	$\displaystyle=\begin{cases}1&\text{if }j=j^{\prime}\text{ and }(p,q)=(i,i^{% \prime}),\\ 0&\text{otherwise},\end{cases}$
for any $p,q\in\{1,\dots,k\}$ ; and
	$\displaystyle(E_{ij}^{\scriptscriptstyle\mathsf{T}}E_{i^{\prime}j^{\prime}})_{pq}$	$\displaystyle=\begin{cases}1&\text{if }i=i^{\prime}\text{ and }(p,q)=(j,j^{% \prime})\\ 0&\text{otherwise}\end{cases}$

for any $p,q\in\{1,\dots,n-k\}$ . Using these, we may evaluate

\operatorname{\fff\fff}_{I_{k,n-k}}\biggl{(}\begin{bmatrix}0&E_{ij}\\ E_{ij}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix},\begin{bmatrix}0&E_{i^{% \prime}j^{\prime}}\\ E_{i^{\prime}j^{\prime}}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\biggr{)% }=\frac{1}{2}\begin{bmatrix}-\delta_{jj^{\prime}}(E_{ii^{\prime}}+E_{i^{\prime% }i})&0\\ 0&\delta_{ii^{\prime}}(E_{jj^{\prime}}+E_{j^{\prime}j})\end{bmatrix}

and obtain the required expression by summing over the basis. The mean curvature along $H$ is then calculated from $\mathsf{H}_{Q}(H)=\langle\mathsf{H}_{Q},H\rangle$ . ∎

Corollary 4.7 (Principal and Gaussian curvatures).

Let $Q\in\operatorname{Gr}(k,n)$ and $H\in\mathbb{N}_{Q}\operatorname{Gr}(k,n)$ be parameterized as in (11). Then the Weingarten map $\mathsf{S}_{Q}(H)$ has eigenpairs given by

\left(\frac{1}{2}(\lambda_{k+j}-\lambda_{i}),V\begin{bmatrix}0&Q_{1}E_{ij}Q_{2% }^{\scriptscriptstyle\mathsf{T}}\\ Q_{2}E_{ij}^{\scriptscriptstyle\mathsf{T}}Q_{1}^{\scriptscriptstyle\mathsf{T}}% &0\end{bmatrix}V^{\scriptscriptstyle\mathsf{T}}\right),\quad i=1,\dots,k,\;j=1% ,\dots,n-k,

where $H_{1}=Q_{1}\Lambda_{1}Q_{1}^{\scriptscriptstyle\mathsf{T}}$ and $H_{2}=Q_{2}\Lambda_{2}Q_{2}^{\scriptscriptstyle\mathsf{T}}$ are eigenvalue decompositions with $\Lambda_{1}=\operatorname{diag}(\lambda_{1},\dots,\lambda_{k})$ and $\Lambda_{2}=\operatorname{diag}(\lambda_{k+1},\dots,\lambda_{n})$ .

(a)

The principal curvatures of $\operatorname{Gr}(k,n)$ along $H$ are

\upkappa_{ij}=\frac{1}{2}(\lambda_{k+j}-\lambda_{i}),\quad i=1,\dots,k,\;j=1,% \dots,n-k.

(b)

The Gaussian curvature of $\operatorname{Gr}(k,n)$ along $H$ is

\mathsf{G}_{Q}(H)=\frac{1}{2^{k(n-k)}}\prod_{i=1}^{k}\prod_{j=1}^{n-k}(\lambda% _{k+j}-\lambda_{i}).

Proof.

By (14), we have

\mathsf{S}_{Q}(H)\left(V\begin{bmatrix}0&X_{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}V^{\scriptscriptstyle% \mathsf{T}}\right)=V\mathsf{S}_{I_{k,n-k}}\left(H_{0}\right)\left(\begin{% bmatrix}0&X_{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\right)V^{% \scriptscriptstyle\mathsf{T}},\quad H_{0}\coloneqq\begin{bmatrix}H_{1}&0\\ 0&H_{2}\end{bmatrix}.

Write $\Lambda\coloneqq\operatorname{diag}(\Lambda_{1},\Lambda_{2})=\operatorname{% diag}(\lambda_{1},\dots,\lambda_{n})$ . Then

	$\displaystyle\mathsf{S}_{I_{k,n-k}}(H_{0})\left(\begin{bmatrix}0&X_{0}\\ X_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\right)$	$\displaystyle=\frac{1}{2}\begin{bmatrix}0&H_{1}X_{0}-X_{0}H_{2}\\ (H_{1}X_{0}-X_{0}H_{2})^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}$
		$\displaystyle=\frac{1}{2}\begin{bmatrix}0&Q_{1}\Lambda_{1}Q_{1}^{% \scriptscriptstyle\mathsf{T}}X_{0}-X_{0}Q_{2}\Lambda_{2}Q_{2}^{% \scriptscriptstyle\mathsf{T}}\\ (Q_{1}\Lambda_{1}Q_{1}^{\scriptscriptstyle\mathsf{T}}X_{0}-X_{0}Q_{2}\Lambda_{% 2}Q_{2}^{\scriptscriptstyle\mathsf{T}})^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}$
		$\displaystyle=\frac{1}{2}\begin{bmatrix}Q_{1}&0\\ 0&Q_{2}\end{bmatrix}\begin{bmatrix}0&Y_{0}\\ Y_{0}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\begin{bmatrix}Q_{1}&0\\ 0&Q_{2}\end{bmatrix}^{\scriptscriptstyle\mathsf{T}}$
		$\displaystyle=\begin{bmatrix}Q_{1}&0\\ 0&Q_{2}\end{bmatrix}\mathsf{S}_{I_{k,n-k}}(\Lambda)\begin{bmatrix}Q_{1}&0\\ 0&Q_{2}\end{bmatrix}^{\scriptscriptstyle\mathsf{T}},$

where $Y_{0}\coloneqq\Lambda_{1}(Q_{1}^{\scriptscriptstyle\mathsf{T}}X_{0}Q_{2})-(Q_{% 1}^{\scriptscriptstyle\mathsf{T}}X_{0}Q_{2})\Lambda_{2}$ . So it suffices to diagonalize the linear operator $\mathsf{S}_{I_{k,n-k}}(\Lambda):\mathbb{T}_{I_{k,n-k}}\operatorname{Gr}(k,n)% \to\mathbb{T}_{I_{k,n-k}}\operatorname{Gr}(k,n)$ . Now observe that

\mathsf{S}_{I_{k,n-k}}(\Lambda)\biggl{(}\frac{\sqrt{2}}{2}\begin{bmatrix}0&E_{% ij}\\ E_{ij}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\biggr{)}=\delta_{ip}% \delta_{jq}\frac{\lambda_{k+j}-\lambda_{i}}{2}\biggl{(}\frac{\sqrt{2}}{2}% \begin{bmatrix}0&E_{pq}\\ E_{pq}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}\biggr{)}

for $i=1,\dots,k$ and $j=1,\dots,n-k$ , gives us the required diagonalization, which is an eigenvalue decomposition as (12) is an orthonormal basis. The values of the principal and Gaussian curvatures follow. ∎

The easiest way to calculate the third fundamental form is to get slightly ahead of our discussion and use the expression for Ricci curvature in Corollary 5.4 together with a result of Obata [37, Theorem 1]. Otherwise we would have to start from the definition in Section 3.1.

Corollary 4.8 (Third fundamental form).

The third fundamental form $\operatorname{\fff\fff\fff}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times% \mathbb{T}_{Q}\operatorname{Gr}(k,n)\to\mathbb{R}$ is given by

\operatorname{\fff\fff\fff}_{Q}(X,Y)=-\frac{1}{2}\Bigl{(}\frac{n}{2k(n-k)}+% \frac{n-2}{4}\Bigr{)}\operatorname{tr}(XY)=-\Bigl{(}\frac{n}{2k(n-k)}+\frac{n-% 2}{4}\Bigr{)}\operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}),

with $Q$ , $X$ , $Y$ parameterized as in (11).

Proof.

By [37, Theorem 1], we have

\operatorname{\fff\fff\fff}_{Q}(X,Y)=\langle\operatorname{\fff\fff}_{Q}(X,Y),% \mathsf{H}_{Q}\rangle-\operatorname{\mathsf{Ric}}(X,Y).

By Theorem 4.3, Corollaries 4.6 and 5.4, we have

	$\displaystyle\operatorname{\fff\fff\fff}_{Q}(X,Y)$	$\displaystyle=\frac{1}{4k(n-k)}\bigl{(}-(n-k)\operatorname{tr}(X_{0}Y_{0}^{% \scriptscriptstyle\mathsf{T}}+Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}})-k% \operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}+Y_{0}^{% \scriptscriptstyle\mathsf{T}}X_{0})\bigr{)}-\frac{n-2}{4}\operatorname{tr}(X_{% 0}^{\scriptscriptstyle\mathsf{T}}Y_{0})$
		$\displaystyle=-\Bigl{(}\frac{n}{2k(n-k)}+\frac{n-2}{4}\Bigr{)}\operatorname{tr% }(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}).\qed$

5. Intrinsic curvatures of the Grassmannian

As we will see in Section 7.2, calculating intrinsic curvatures of Grassmannian with intrinsic geometry can get fairly involved. This is particularly striking for the Riemann curvature tensor — our calculation below is essentially one-line using the embedded geometry of the involution model.

Proposition 5.1 (Riemmanian curvature).

The Riemann tensor $\operatorname{\mathsf{Rie}}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times% \mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}\operatorname{Gr}(k,n)% \times\mathbb{T}_{Q}\operatorname{Gr}(k,n)\to\mathbb{R}$ is given by

	$\displaystyle\operatorname{\mathsf{Rie}}_{Q}(X,Y,Z,W)$	$\displaystyle=\frac{1}{2}\operatorname{tr}\bigl{(}(XY-YX)ZW\bigr{)}$
		$\displaystyle=\frac{1}{2}\operatorname{tr}\bigl{(}(X_{0}^{\scriptscriptstyle% \mathsf{T}}Y_{0}Z_{0}^{\scriptscriptstyle\mathsf{T}}+Z_{0}^{\scriptscriptstyle% \mathsf{T}}Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}}-Y_{0}^{\scriptscriptstyle% \mathsf{T}}X_{0}Z_{0}^{\scriptscriptstyle\mathsf{T}}-Z_{0}^{\scriptscriptstyle% \mathsf{T}}X_{0}Y_{0}^{\scriptscriptstyle\mathsf{T}})W_{0}\bigr{)}$

with $Q$ , $X,Y,Z,W$ parameterized as in (11).

Proof.

Using the expression for $\operatorname{\fff\fff}_{Q}$ ,

	$\displaystyle\operatorname{\mathsf{Rie}}_{Q}(X,Y,Z,W)$	$\displaystyle=\langle\operatorname{\fff\fff}_{Q}(Y,Z),\operatorname{\fff\fff}_% {Q}(X,W)\rangle-\langle\operatorname{\fff\fff}_{Q}(X,Z),\operatorname{\fff\fff% }_{Q}(Y,W)\rangle$
		$\displaystyle=\begin{multlined}\frac{1}{4}\langle Y_{0}Z_{0}^{% \scriptscriptstyle\mathsf{T}}+Z_{0}Y_{0}^{\scriptscriptstyle\mathsf{T}},X_{0}W% _{0}^{\scriptscriptstyle\mathsf{T}}+W_{0}X_{0}^{\scriptscriptstyle\mathsf{T}}% \rangle+\frac{1}{4}\langle Y_{0}^{\scriptscriptstyle\mathsf{T}}Z_{0}+Z_{0}^{% \scriptscriptstyle\mathsf{T}}Y_{0},X_{0}^{\scriptscriptstyle\mathsf{T}}W_{0}+W% _{0}^{\scriptscriptstyle\mathsf{T}}X_{0}\rangle\\ -\frac{1}{4}\langle X_{0}Z_{0}^{\scriptscriptstyle\mathsf{T}}+Z_{0}X_{0}^{% \scriptscriptstyle\mathsf{T}},Y_{0}W_{0}^{\scriptscriptstyle\mathsf{T}}+W_{0}Y% _{0}^{\scriptscriptstyle\mathsf{T}}\rangle-\frac{1}{4}\langle X_{0}^{% \scriptscriptstyle\mathsf{T}}Z_{0}+Z_{0}^{\scriptscriptstyle\mathsf{T}}X_{0},Y% _{0}^{\scriptscriptstyle\mathsf{T}}W_{0}+W_{0}^{\scriptscriptstyle\mathsf{T}}Y% _{0}\rangle\end{multlined}\frac{1}{4}\langle Y_{0}Z_{0}^{\scriptscriptstyle% \mathsf{T}}+Z_{0}Y_{0}^{\scriptscriptstyle\mathsf{T}},X_{0}W_{0}^{% \scriptscriptstyle\mathsf{T}}+W_{0}X_{0}^{\scriptscriptstyle\mathsf{T}}\rangle% +\frac{1}{4}\langle Y_{0}^{\scriptscriptstyle\mathsf{T}}Z_{0}+Z_{0}^{% \scriptscriptstyle\mathsf{T}}Y_{0},X_{0}^{\scriptscriptstyle\mathsf{T}}W_{0}+W% _{0}^{\scriptscriptstyle\mathsf{T}}X_{0}\rangle\\ -\frac{1}{4}\langle X_{0}Z_{0}^{\scriptscriptstyle\mathsf{T}}+Z_{0}X_{0}^{% \scriptscriptstyle\mathsf{T}},Y_{0}W_{0}^{\scriptscriptstyle\mathsf{T}}+W_{0}Y% _{0}^{\scriptscriptstyle\mathsf{T}}\rangle-\frac{1}{4}\langle X_{0}^{% \scriptscriptstyle\mathsf{T}}Z_{0}+Z_{0}^{\scriptscriptstyle\mathsf{T}}X_{0},Y% _{0}^{\scriptscriptstyle\mathsf{T}}W_{0}+W_{0}^{\scriptscriptstyle\mathsf{T}}Y% _{0}\rangle$
		$\displaystyle=\frac{1}{4}\langle[[X,Y],Z],W\rangle$
		$\displaystyle=\frac{1}{2}\operatorname{tr}\bigl{(}(XY-YX)ZW\bigr{)},$

where the last equality is obtained by observing that $X,Y,Z,W$ are symmetric matrices. ∎

Corollary 5.2 (Jacobi curvature).

The Jacobi tensor $\mathsf{J}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}% \operatorname{Gr}(k,n)\times\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{% T}_{Q}\operatorname{Gr}(k,n)\to\mathbb{R}$ is

	$\displaystyle\mathsf{J}_{Q}(X,Y,Z,W)$	$\displaystyle=\operatorname{tr}(XYZW)-\operatorname{tr}\Bigl{(}Y\Bigl{(}\frac{% XZ+ZX}{2}\Bigr{)}W\Bigr{)}$
		$\displaystyle=\begin{multlined}\operatorname{tr}\bigl{(}(X_{0}^{% \scriptscriptstyle\mathsf{T}}Y_{0}Z_{0}^{\scriptscriptstyle\mathsf{T}}+Z_{0}^{% \scriptscriptstyle\mathsf{T}}Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}})W_{0}% \bigr{)}\\ -\operatorname{tr}\Bigl{(}Y_{0}^{\scriptscriptstyle\mathsf{T}}\Bigl{(}\frac{X_% {0}Z_{0}^{\scriptscriptstyle\mathsf{T}}+Z_{0}X_{0}^{\scriptscriptstyle\mathsf{% T}}}{2}\Bigr{)}W_{0}\Bigr{)}-\operatorname{tr}\Bigl{(}\Bigl{(}\frac{Z_{0}^{% \scriptscriptstyle\mathsf{T}}X_{0}+X_{0}^{\scriptscriptstyle\mathsf{T}}Z_{0}}{% 2}\Bigr{)}Y_{0}^{\scriptscriptstyle\mathsf{T}}W_{0}\bigr{)},\end{multlined}% \operatorname{tr}\bigl{(}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}Z_{0}^{% \scriptscriptstyle\mathsf{T}}+Z_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}X_{0}^{% \scriptscriptstyle\mathsf{T}})W_{0}\bigr{)}\\ -\operatorname{tr}\Bigl{(}Y_{0}^{\scriptscriptstyle\mathsf{T}}\Bigl{(}\frac{X_% {0}Z_{0}^{\scriptscriptstyle\mathsf{T}}+Z_{0}X_{0}^{\scriptscriptstyle\mathsf{% T}}}{2}\Bigr{)}W_{0}\Bigr{)}-\operatorname{tr}\Bigl{(}\Bigl{(}\frac{Z_{0}^{% \scriptscriptstyle\mathsf{T}}X_{0}+X_{0}^{\scriptscriptstyle\mathsf{T}}Z_{0}}{% 2}\Bigr{)}Y_{0}^{\scriptscriptstyle\mathsf{T}}W_{0}\bigr{)},$

with $Q$ , $X,Y,Z,W$ parameterized as in (11).

Proof.

The expression in Proposition 5.1 and the fact that $X,Y,Z,W$ are symmetric matrices yield

\mathsf{J}_{Q}(X,Y,Z,W)=\frac{1}{2}\operatorname{tr}\bigl{(}(2XYZ-Y(XZ+ZX)W% \bigr{)}

and thus the first expression. Plugging in the parameterizations in (11) for $X,Y,Z,W$ gives the second expression. ∎

Corollary 5.3 (Sectional curvature).

The sectional curvature $\upkappa_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}% \operatorname{Gr}(k,n)\to\mathbb{R}$ is given by

\upkappa_{Q}(X,Y)=\frac{\|[X,Y]\|^{2}}{4(\lVert X\rVert^{2}\lVert Y\rVert^{2}-% \langle X,Y\rangle^{2})}=\frac{\lVert[X_{0},Y_{0}^{\scriptscriptstyle\mathsf{T% }}]\rVert^{2}+\lVert[X_{0}^{\scriptscriptstyle\mathsf{T}},Y_{0}]\rVert^{2}}{16% (\lVert X_{0}\rVert^{2}\lVert Y_{0}\rVert^{2}-\langle X_{0},Y_{0}\rangle^{2})}% \leq\frac{1}{4},

with $Q$ , $X$ , $Y$ parameterized as in (11). If $X,Y$ are orthonormal, i.e., $\lVert X_{0}\rVert=\lVert Y_{0}\rVert=\sqrt{2}/2$ and $\langle X_{0},Y_{0}\rangle=0$ , then

\upkappa_{Q}(X,Y)=\frac{\|[X,Y]\|^{2}}{4}=\frac{1}{4}(\lVert[X_{0},Y_{0}^{% \scriptscriptstyle\mathsf{T}}]\rVert^{2}+\lVert[X_{0}^{\scriptscriptstyle% \mathsf{T}},Y_{0}]\rVert^{2}).

Proof.

This is a straightforward calculation by

\upkappa_{Q}(X,Y)=\frac{\operatorname{\mathsf{Rie}}_{Q}(X,Y,Y,X)}{\lVert X% \rVert^{2}\lVert Y\rVert^{2}-\langle X,Y\rangle^{2}}=\frac{1}{4}\frac{\langle[% [X,Y],Y],X\rangle}{\lVert X\rVert^{2}\lVert Y\rVert^{2}-\langle X,Y\rangle^{2}% }=\frac{\|[X,Y]\|^{2}}{4(\lVert X\rVert^{2}\lVert Y\rVert^{2}-\langle X,Y% \rangle^{2})}

and the observations that

	$\displaystyle[X,Y]=V\left(\begin{bmatrix}X_{0}Y_{0}^{\scriptscriptstyle\mathsf% {T}}-Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}}&0\\ 0&X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}-Y_{0}^{\scriptscriptstyle\mathsf{T% }}X_{0}\end{bmatrix}\right)V^{\scriptscriptstyle\mathsf{T}},$
	$\displaystyle\lVert X\rVert^{2}=2\lVert X_{0}\rVert^{2},\quad\lVert Y\rVert^{2% }=2\lVert Y_{0}\rVert^{2},\quad\lVert\langle X,Y\rangle\rVert^{2}=2\langle X_{% 0},Y_{0}\rangle.$

Since $\upkappa_{Q}(X,Y)$ only depends on the two-dimensional subspace of $\mathbb{T}_{Q}\operatorname{Gr}(k,n)$ spanned by $X$ and $Y$ , it suffices to assume that $X,Y$ are orthonormal. The upper bound $\upkappa_{Q}(X,Y)\leq 1/4$ then follows from the inequality $\lVert[A,B]\rVert^{2}\leq 2\lVert A\rVert^{2}\lVert B\rVert^{2}$ for any $A,B\in\mathbb{R}^{n\times n}$ . ∎

It is known that the Grassmannian manifolds are Einstein [5, Paragraphs 0.25 and 0.26]. Our calculations below confirm the fact.

Corollary 5.4 (Ricci and scalar curvatures).

Let $Q$ , $X$ , $Y$ be parameterized as in (11). The Ricci tensor $\operatorname{\mathsf{Ric}}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times% \mathbb{T}_{Q}\operatorname{Gr}(k,n)\to\mathbb{R}$ is given by

\operatorname{\mathsf{Ric}}_{Q}(X,Y)=\frac{(n-2)}{8}\operatorname{tr}(XY)=% \frac{(n-2)}{4}\operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}).

The scalar curvature $\operatorname{\mathsf{Sca}}_{Q}\in\mathbb{R}$ is given by

\operatorname{\mathsf{Sca}}_{Q}=\frac{k(n-k)(n-2)}{8}.

The traceless Ricci curvature $\mathsf{Z}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}% \operatorname{Gr}(k,n)\to\mathbb{R}$ is given by

\mathsf{Z}_{Q}(X,Y)=0,

which shows that the Grassmannian is an Einstein manifold.

Proof.

We write

X_{ij}\coloneqq\frac{\sqrt{2}}{2}V\begin{bmatrix}0&E_{ij}\\ E_{ij}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}V^{\scriptscriptstyle% \mathsf{T}}

for the elements of the orthonormal basis in (12). Then

	$\displaystyle\operatorname{\mathsf{Ric}}_{Q}(X,Y)$	$\displaystyle=\sum_{i=1}^{k}\sum_{j=1}^{n-k}\operatorname{\mathsf{Rie}}_{Q}(X_% {ij},X,Y,X_{ij})=\sum_{i=1}^{k}\sum_{j=1}^{n-k}\frac{1}{4}\langle[[X_{ij},X],Y% ],X_{ij}\rangle\rangle$
		$\displaystyle=\frac{1}{2}\sum_{i=1}^{k}\sum_{j=1}^{n-k}\operatorname{tr}(X_{ij% }^{2}XY-XX_{ij}YX_{ij})$
		$\displaystyle=\frac{n-2}{8}\operatorname{tr}(XY)=\frac{\operatorname{\mathsf{% Sca}}_{Q}}{k(n-k)}\mathsf{g}_{Q}$

where the last equality shows that $\mathsf{Z}_{Q}$ vanishes identically. ∎

For a homogeneous space like $\operatorname{Gr}(k,n)$ , the upper and lower delta invariants $\overline{\updelta}_{Q}(d_{1},\dots,d_{r})$ and $\underline{\updelta}_{Q}(d_{1},\dots,d_{r})$ are independent of the choice of $Q\in\operatorname{Gr}(k,n)$ . We also restrict our attention to $d_{1}=\cdots=d_{r}=2$ . So for notational simplicity, we just write

\overline{\updelta}_{2,r}\coloneqq\overline{\updelta}_{Q}(\underbrace{2,\dots,% 2}_{\text{$r$ times}}),\quad\underline{\updelta}_{2,r}\coloneqq\underline{% \updelta}_{Q}(\underbrace{2,\dots,2}_{\text{$r$ times}}).

Theorem 5.5 (Delta invariants).

Let $r\leq 2\lfloor k/2\rfloor\lfloor(n-k)/2\rfloor$ . Then the upper and lower delta invariants of $\operatorname{Gr}(k,n)$ are given by

\overline{\updelta}_{2,r}=\frac{k(n-k)(n-2)}{8},\qquad\underline{\updelta}_{2,% r}=\frac{k(n-k)(n-2)}{8}-\frac{r}{4}.

Proof.

We will write $\upkappa=\upkappa_{Q}$ below and take $Q=I_{k,n-k}$ . By Corollary 5.4, we have

	$\displaystyle\overline{\updelta}_{2,r}$	$\displaystyle=\frac{k(n-k)(n-2)}{8}-\inf_{\begin{subarray}{c}\dim\mathbb{V}_{j% }=2,\\ \mathbb{V}_{j}\perp\mathbb{V}_{k},j<k\end{subarray}}\biggl{[}\sum_{j=1}^{r}% \upkappa(X_{j},Y_{j})\biggr{]},$
	$\displaystyle\underline{\updelta}_{2,r}$	$\displaystyle=\frac{k(n-k)(n-2)}{8}-\sup_{\begin{subarray}{c}\dim\mathbb{V}_{j% }=2,\\ \mathbb{V}_{j}\perp\mathbb{V}_{k},j<k\end{subarray}}\biggl{[}\sum_{j=1}^{r}% \upkappa(X_{j},Y_{j})\biggr{]},$

where $\{X_{j},Y_{j}\}$ is an orthonormal basis of the two-dimensional subspace $\mathbb{V}_{j}\subseteq\mathbb{T}_{I_{k,n-k}}\operatorname{Gr}(k,n)$ , $j=1,\dots,r$ . By Corollary 5.3, we have

(15)

0\leq\sum_{j=1}^{r}\upkappa(X_{j},Y_{j})\leq\frac{r}{4}.

It remains to show that upper and lower bounds in (15) are attained by some $\mathbb{V}_{1},\dots,\mathbb{V}_{r}$ .

Set $k_{1}\coloneqq\lfloor k/2\rfloor$ and $k_{2}\coloneqq\lfloor(n-k)/2\rfloor$ . We may partition any $X\in\mathbb{T}_{I_{k,n-k}}\operatorname{Gr}(k,n)$ into a block matrix with $2\times 2$ blocks $B_{pq}\in\mathbb{R}^{2\times 2}$ :

X=\begin{bmatrix}0&\cdots&0&B_{1,1}&\cdots&B_{1,k_{2}+1}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ 0&\cdots&0&B_{k_{1}+1,1}&\cdots&B_{k_{1}+1,k_{2}+1}\\ B_{1,1}^{\scriptscriptstyle\mathsf{T}}&\cdots&B_{k_{1}+1,1}^{% \scriptscriptstyle\mathsf{T}}&0&\cdots&0\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ B_{1,k_{2}+1}^{\scriptscriptstyle\mathsf{T}}&\cdots&B_{k_{1}+1,k_{2}+1}^{% \scriptscriptstyle\mathsf{T}}&0&\cdots&0\end{bmatrix}

except in the last row and column where we are required to have

B_{p,k_{2}+1}\in\mathbb{R}^{2\times(n-k-2k_{2})},\quad B_{k_{1}+1,q}\in\mathbb% {R}^{(k-2k_{1})\times 2},\quad B_{k_{1}+1,k_{2}+1}\in\mathbb{R}^{(k-2k_{1})% \times(n-k-2k_{2})}

for $p=1,\dots,k_{1}$ and $q=1,\dots,k_{2}$ .

Let $\widehat{X}_{ij}\in\mathbb{T}_{I_{k,n-k}}\operatorname{Gr}(k,n)$ be the tangent vector obtained from $X$ by setting $B_{pq}=0$ whenever $(p,q)\neq(i,j)$ . Then clearly we have $\operatorname{tr}\bigl{(}\widehat{X}_{ij}^{\scriptscriptstyle\mathsf{T}}% \widehat{X}_{i^{\prime}j^{\prime}}\bigr{)}=0$ whenever $(i,j)\neq(i^{\prime},j^{\prime})$ . Since $r\leq 2k_{1}k_{2}$ , the problem further reduces to attaining the upper and lower bounds in (15) for $r=2$ on $\operatorname{Gr}(2,4)$ . This is a vast simplification as $X\in\mathbb{T}_{I_{2,2}}\operatorname{Gr}(2,4)$ is just $X=\begin{bmatrix}0&B\\ B^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}$ with $B\in\mathbb{R}^{2\times 2}$ . It remains to exhibit an orthonormal basis $X_{1},Y_{1},X_{2},Y_{2}\in T_{I_{2,2}}\operatorname{Gr}(2,4)$ that gives the upper and lower bounds in (15). Using the formula for sectional curvature in Corollary 5.3, we check that

X_{1}=\frac{\sqrt{2}}{2}\begin{bmatrix}0&0&1&0\\ 0&0&0&0\\ 1&0&0&0\\ 0&0&0&0\end{bmatrix},\quad Y_{1}=\frac{1}{2}\begin{bmatrix}0&0&0&1\\ 0&0&1&0\\ 0&1&0&0\\ 1&0&0&0\end{bmatrix},\quad X_{2}=\frac{\sqrt{2}}{2}\begin{bmatrix}0&0&0&0\\ 0&0&0&1\\ 0&0&0&0\\ 0&1&0&0\end{bmatrix},\quad Y_{2}=\frac{1}{2}\begin{bmatrix}0&0&0&1\\ 0&0&-1&0\\ 0&-1&0&0\\ 1&0&0&0\end{bmatrix}

give the required upper bound $\upkappa(X_{1},Y_{1})+\upkappa(X_{2},Y_{2})=\frac{1}{2}$ , whereas

X_{1}=\frac{1}{2}\begin{bmatrix}0&0&1&0\\ 0&0&0&1\\ 1&0&0&0\\ 0&1&0&0\end{bmatrix},\quad Y_{1}=\frac{1}{2}\begin{bmatrix}0&0&1&0\\ 0&0&0&-1\\ 1&0&0&0\\ 0&-1&0&0\end{bmatrix},\quad X_{2}=\frac{1}{2}\begin{bmatrix}0&0&0&1\\ 0&0&1&0\\ 0&1&0&0\\ 1&0&0&0\end{bmatrix},\quad Y_{2}=\frac{1}{2}\begin{bmatrix}0&0&0&1\\ 0&0&-1&0\\ 0&-1&0&0\\ 1&0&0&0\end{bmatrix}

give the required lower bound $\upkappa(X_{1},Y_{1})+\upkappa(X_{2},Y_{2})=0$ . ∎

We next compute the quartet of tensors named after Schouten, Cotton, Weyl, and Bach.

Corollary 5.6 (Schouten curvature).

The Schouten tensor $\mathsf{P}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}% \operatorname{Gr}(k,n)\to\mathbb{R}$ is given by

\mathsf{P}_{Q}(X,Y)=\frac{(n-2)}{16(k(n-k)-1)}\operatorname{tr}(XY)=\frac{2(n-% 2)}{16(k(n-k)-1)}\operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0})

with $Q$ , $X$ , $Y$ parameterized as in (11).

Proof.

This is a straightforward calculation from definition:

	$\displaystyle\mathsf{P}_{Q}(X,Y)$	$\displaystyle=\frac{1}{k(n-k)-2}\biggl{[}\operatorname{\mathsf{Ric}}_{Q}(X,Y)-% \frac{\operatorname{\mathsf{Sca}}_{Q}}{2(k(n-k)-1)}\mathsf{g}_{Q}(X,Y)\biggr{]}$
		$\displaystyle=\frac{n-2}{16(k(n-k)-1)}\operatorname{tr}(XY).\qed$

Corollary 5.7 (Cotton curvature).

The Cotton tensor of $\operatorname{Gr}(k,n)$ is zero.

Proof.

By Corollary 5.6, $\mathsf{P}$ is a constant multiple of $\mathsf{g}$ . So $\nabla\mathsf{P}=0$ , and so $\mathsf{C}$ is identically zero. ∎

Corollary 5.8 (Weyl curvature).

The Weyl tensor $\mathsf{W}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}% \operatorname{Gr}(k,n)\times\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{% T}_{Q}\operatorname{Gr}(k,n)\to\mathbb{R}$ is given by

	$\displaystyle\mathsf{W}_{Q}(X,Y,Z,W)$	$\displaystyle=\begin{multlined}\frac{1}{2}\operatorname{tr}\bigl{(}(XY-YX)ZW% \bigr{)}\\ -\frac{(n-2)}{8(k(n-k)-1)}\bigl{(}\operatorname{tr}(XZ)\operatorname{tr}(YW)-% \operatorname{tr}(XW)\operatorname{tr}(YZ)\bigr{)}\end{multlined}\frac{1}{2}% \operatorname{tr}\bigl{(}(XY-YX)ZW\bigr{)}\\ -\frac{(n-2)}{8(k(n-k)-1)}\bigl{(}\operatorname{tr}(XZ)\operatorname{tr}(YW)-% \operatorname{tr}(XW)\operatorname{tr}(YZ)\bigr{)}$
		$\displaystyle=\begin{multlined}\frac{1}{2}\operatorname{tr}\bigl{(}(X_{0}^{% \scriptscriptstyle\mathsf{T}}Y_{0}Z_{0}^{\scriptscriptstyle\mathsf{T}}+Z_{0}^{% \scriptscriptstyle\mathsf{T}}Y_{0}X_{0}^{\scriptscriptstyle\mathsf{T}}-Y_{0}^{% \scriptscriptstyle\mathsf{T}}X_{0}Z_{0}^{\scriptscriptstyle\mathsf{T}}-Z_{0}^{% \scriptscriptstyle\mathsf{T}}X_{0}Y_{0}^{\scriptscriptstyle\mathsf{T}})W_{0}% \bigr{)}\\ -\frac{n-2}{2(k(n-k)-1)}\bigl{(}\operatorname{tr}(X_{0}^{\scriptscriptstyle% \mathsf{T}}Z_{0})\operatorname{tr}(Y_{0}^{\scriptscriptstyle\mathsf{T}}W_{0})-% \operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}W_{0})\operatorname{tr}(% Y_{0}^{\scriptscriptstyle\mathsf{T}}Z_{0})\bigr{)}\end{multlined}\frac{1}{2}% \operatorname{tr}\bigl{(}(X_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}Z_{0}^{% \scriptscriptstyle\mathsf{T}}+Z_{0}^{\scriptscriptstyle\mathsf{T}}Y_{0}X_{0}^{% \scriptscriptstyle\mathsf{T}}-Y_{0}^{\scriptscriptstyle\mathsf{T}}X_{0}Z_{0}^{% \scriptscriptstyle\mathsf{T}}-Z_{0}^{\scriptscriptstyle\mathsf{T}}X_{0}Y_{0}^{% \scriptscriptstyle\mathsf{T}})W_{0}\bigr{)}\\ -\frac{n-2}{2(k(n-k)-1)}\bigl{(}\operatorname{tr}(X_{0}^{\scriptscriptstyle% \mathsf{T}}Z_{0})\operatorname{tr}(Y_{0}^{\scriptscriptstyle\mathsf{T}}W_{0})-% \operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}W_{0})\operatorname{tr}(% Y_{0}^{\scriptscriptstyle\mathsf{T}}Z_{0})\bigr{)}$

with $Q$ , $X$ , $Y$ , $Z$ , $W$ parameterized as in (11).

Proof.

Let $m\coloneqq k(n-k)$ . It follows from the vanishing of $\mathsf{Z}_{Q}$ and the expression for $\operatorname{\mathsf{Sca}}_{Q}$ in Corollary 5.4 that

\mathsf{W}_{Q}=\operatorname{\mathsf{Rie}}_{Q}-\frac{1}{m-2}\mathsf{Z}_{Q}% \varowedge\mathsf{g}_{Q}-\frac{\operatorname{\mathsf{Sca}}_{Q}}{2m(m-1)}% \mathsf{g}_{Q}\varowedge\mathsf{g}_{Q}=\operatorname{\mathsf{Rie}}_{Q}-\frac{n% -2}{16(m-1)}\mathsf{g}_{Q}\varowedge\mathsf{g}_{Q}.

Next use the two expressions of $\operatorname{\mathsf{Rie}}_{Q}$ in Proposition 5.1 and expand the Kulkarni–Nomizu product

	$\displaystyle\mathsf{g}_{Q}\varowedge\mathsf{g}_{Q}(X,Y,Z,W)$	$\displaystyle=\begin{multlined}\mathsf{g}_{Q}(X,Z)\mathsf{g}_{Q}(Y,W)-\mathsf{% g}_{Q}(X,W)\mathsf{g}_{Q}(Y,Z)\\ -\mathsf{g}_{Q}(Y,Z)\mathsf{g}_{Q}(X,W)+\mathsf{g}_{Q}(Y,W)\mathsf{g}_{Q}(X,Z)% \end{multlined}\mathsf{g}_{Q}(X,Z)\mathsf{g}_{Q}(Y,W)-\mathsf{g}_{Q}(X,W)% \mathsf{g}_{Q}(Y,Z)\\ -\mathsf{g}_{Q}(Y,Z)\mathsf{g}_{Q}(X,W)+\mathsf{g}_{Q}(Y,W)\mathsf{g}_{Q}(X,Z)$
		$\displaystyle=2\bigl{(}\mathsf{g}_{Q}(X,Z)\mathsf{g}_{Q}(Y,W)-\mathsf{g}_{Q}(X% ,W)\mathsf{g}_{Q}(Y,Z)\bigr{)}$
		$\displaystyle=8\bigl{(}\operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}Z% _{0})\operatorname{tr}(Y_{0}^{\scriptscriptstyle\mathsf{T}}W_{0})-% \operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}W_{0})\operatorname{tr}(% Y_{0}^{\scriptscriptstyle\mathsf{T}}Z_{0})\bigr{)},$

to get the two required expressions for $\mathsf{W}_{Q}(X,Y,Z,W)$ . ∎

Corollary 5.9 (Bach curvature).

The Bach tensor $\mathsf{B}_{Q}:\mathbb{T}_{Q}\operatorname{Gr}(k,n)\times\mathbb{T}_{Q}% \operatorname{Gr}(k,n)\to\mathbb{R}$ is given by

\mathsf{B}_{Q}(X,Y)=\frac{(n-2)^{2}}{32(k(n-k)-2)}\operatorname{tr}(XY)=\frac{% (n-2)^{2}}{16(k(n-k)-2)}\operatorname{tr}(X_{0}^{\scriptscriptstyle\mathsf{T}}% Y_{0})

with $Q$ , $X$ , $Y$ parameterized as in (11).

Proof.

Let $m\coloneqq k(n-k)$ . Using [12, Equation (2-4)], we relate $\mathsf{B}$ to the Cotton tensor $\mathsf{C}$ as

\mathsf{B}_{Q}(X,Y)=\frac{1}{m-2}\biggl{[}\sum_{i=1}^{m}(\nabla_{\!X_{i}}% \mathsf{C}_{Q})(X_{i},X,Y)+\sum_{i=1}^{m}\sum_{j=1}^{m}\operatorname{\mathsf{% Ric}}_{Q}(X_{i},X_{j})\mathsf{W}_{Q}(X,X_{i},X_{j},Y)\biggr{]},

where $X_{1},\dots,X_{m}\in\mathbb{T}_{Q}\operatorname{Gr}(k,n)$ is any orthonormal basis. It follows from the vanishing of $\mathsf{C}$ in Corollary 5.7 that

	$\displaystyle\mathsf{B}_{Q}(X,Y)$	$\displaystyle=\frac{n-2}{8(m-2)}\sum_{i=1}^{m}\mathsf{W}_{Q}(X,X_{i},X_{i},Y)$
		$\displaystyle=\frac{n-2}{8(m-2)}\biggl{[}\sum_{i=1}^{m}\operatorname{\mathsf{% Rie}}_{Q}(X,X_{i},X_{i},Y)-\sum_{i=1}^{m}\frac{n-2}{16(m-1)}\mathsf{g}_{Q}% \varowedge\mathsf{g}_{Q}(X,X_{i},X_{i},Y)\biggr{]}$
		$\displaystyle=\frac{n-2}{8(m-2)}\biggl{[}\operatorname{\mathsf{Ric}}_{Q}(X,Y)-% \frac{2(n-2)}{16(m-1)}\sum_{i=1}^{m}(\mathsf{g}_{Q}(X,X_{i})\mathsf{g}_{Q}(Y,X% _{i})-\mathsf{g}_{Q}(X,Y)\mathsf{g}_{Q}(X_{i},X_{i}))\biggr{]}$
		$\displaystyle=\frac{n-2}{8(m-2)}\operatorname{\mathsf{Ric}}_{Q}(X,Y)-\frac{(n-% 2)^{2}}{64(m-1)(m-2)}(1-m)\mathsf{g}_{Q}(X,Y)$
		$\displaystyle=\frac{(n-2)^{2}}{32(m-2)}\mathsf{g}_{Q}(X,Y).$

Here the first and the second equalities follow from Corollaries 5.4 and 5.8 respectively. The third uses the definition of Ricci curvature and the value $\mathsf{g}_{Q}\varowedge\mathsf{g}_{Q}$ calculated in the proof of Corollary 5.8. The penultimate equality is a result of $X_{1},\dots,X_{m}$ being an orthonormal basis. ∎

6. Geometric insights from these expressions

The intrinsic curvatures in Section 5 are, by definition, independent of the model we choose and apply to $\operatorname{Gr}(k,\mathbb{R}^{n})$ as an abstract manifold. Indeed, the results in this section will all be stated for the Grassmannian $\operatorname{Gr}(k,\mathbb{R}^{n})$ , as opposed to its involution model $\operatorname{Gr}(k,n)$ . The expressions we found in Section 5 by way of the involution model $\operatorname{Gr}(k,n)$ permit us to concretely study the geometry of the abstract manifold $\operatorname{Gr}(k,\mathbb{R}^{n})$ , and thereby obtaining new geometric insights. Even without going out of our way to search for such insights, we can already see a few that, as far as we know, have never been observed before for the Grassmannian.

For example, we may deduce the following, which is mildly surprising because we do not even know how to define a Plebański tensor [39].

Corollary 6.1 (Plebański curvature).

The Plebański tensor of the Grassmannian is zero.

An observant reader might have noticed that this is the only tensor mentioned in Section 1 whose definition did not appear in Section 3. The reason is that we do not know how to define the Plebański tensor in the coordinate-free manner adopted in modern mathematics. Every definition in the literature only gives its coordinates in terms of the coordinates of $\mathsf{Z}$ , the traceless Ricci curvature. But as we do know from Corollary 5.4 that $\mathsf{Z}=0$ for the Grassmannian, its Plebański tensor must be zero as well.

An observant reader might also have noticed that several of the expressions in Table 1 are constant multiples of $\operatorname{tr}(XY)$ . Therein lies two small results:

Corollary 6.2 (Codazzi tensors I).

The Ricci, Schouten, and Bach curvatures of the Grassmannian are Codazzi.

Proof.

Corollaries 5.4, 5.6, and 5.9 show that the tensors in question are all constant multiples of $\mathsf{g}$ and therefore Codazzi since $\nabla\mathsf{g}=0$ . ∎

Corollary 6.3 (Codazzi tensors II).

A symmetric bilinear form $\beta$ on the Grassmannian with constant trace is Codazzi if and only if $\nabla\beta=0$

Proof.

If $\nabla\beta=0$ , then it is Codazzi by definition. For the converse, we invoke the result [4] that any Codazzi tensor with constant trace on a compact Riemannian manifold must have $\nabla\beta=0$ if the sectional curvature $\upkappa\geq 0$ everywhere. By Corollary 5.3, the Grassmannian has $\upkappa\geq 0$ . ∎

The proof of Corollary 6.2 throws up another observation.

Corollary 6.4 (Divergence-free tensors).

The Riemann and Weyl curvatures of the Grassmannian are divergence-free.

Proof.

By Corollary 5.4, $\operatorname{\mathsf{Ric}}$ is a constant multiple of $\mathsf{g}$ and so $\nabla\mathsf{\operatorname{\mathsf{Ric}}}=0$ . Since the Riemann curvature is divergence-free if and only if the Ricci tensor is divergence-free and Codazzi [38, Corollary 9.4.5], it follows from Corollary 6.2 that $\operatorname{div}\mathsf{R}=0$ for the Grassmannian. The relation [12, Equation 2-3] between Weyl and Cotton tensors

\operatorname{div}\mathsf{W}=\frac{\dim\mathcal{M}-2}{\dim\mathcal{M}-3}% \mathsf{C}

for any manifold $\mathcal{M}$ of dimension at least four, taken together with Corollary 5.7 that $\mathsf{C}=0$ , yields $\operatorname{div}\mathsf{W}=0$ . For $\dim\mathcal{M}<4$ , $\mathsf{W}$ is identically zero [12, Remark 2.3]. ∎

The delta invariants obtained in Theorem 5.5 have never before been calculated for a manifold as complex as the Grassmannian. These values may look quotidian to the uninitiated, but they are not. We give an example to show how the value of $\underline{\delta}_{2,r}$ found in Theorem 5.5 vastly improves a classical result.

A geodesic $2$ -sphere is a $2$ -sphere $\mathrm{S}^{2}$ embedded in a Riemannian manifold $\mathcal{M}$ as a totally geodesic submanifold. One fascinating fact about the geometry of the Grassmannian is that it contains a geodesic $2$ -sphere [29, 45, 46, 49]. This is a very unique property. For instance, $\mathbb{R}^{3}$ contains no geodesic $2$ -sphere, even though $\mathrm{S}^{2}$ is, ironically, the unit sphere of $\mathbb{R}^{3}$ . The reason is that $\mathrm{S}^{2}$ is only a Riemannian submanifold but not a totally geodesic submanifold of $\mathbb{R}^{3}$ .

A key result in [47] is that $\operatorname{Gr}(k,\mathbb{R}^{n})$ contains one geodesic $2$ -sphere. We will show below that it in fact contains a product of many geodesic $2$ -spheres. To the best of our knowledge, this insight is new. It is also unusual. For instance, while the $3$ -sphere $\mathrm{S}^{3}$ is known to contain a geodesic $2$ -sphere, it does not contain a product of more than one copy.

Theorem 6.5 (Embedding products of geodesic $2$ -spheres).

(a)

For any $r\leq\min\{\lfloor k/2\rfloor,\lfloor n/4\rfloor\}$ , the product of $r$ copies of $\mathrm{S}^{2}$ can be embedded as a totally geodesic submanifold of $\operatorname{Gr}(k,n)$ .
(b)

For any $r\leq 2\lfloor k/2\rfloor\lfloor(n-k)/2\rfloor$ , an open subset of the product of $r$ copies of $\mathrm{S}^{2}$ can be embedded as a totally geodesic submanifold of $\operatorname{Gr}(k,n)$ .

Proof.

Let $r\leq\min\{\lfloor k/2\rfloor,\lfloor n/4\rfloor\}$ and $\mathbb{V}_{1},\dots,\mathbb{V}_{r}\subseteq\mathbb{R}^{n}$ be any $r$ four-dimensional subspaces that are orthogonal to each other, i.e., $\mathbb{V}_{j}\subseteq\bigl{(}\bigoplus_{i\neq j}\mathbb{V}_{i}\bigr{)}^{\perp}$ for all $j=1,\dots,r$ . Let $\mathbb{W}_{0}\subseteq\bigl{(}\bigoplus_{i=1}^{r}\mathbb{V}_{i}\bigr{)}^{\perp}$ be a $(k-2r)$ -dimensional subspace. We define the embedding

\varepsilon:\operatorname{Gr}(2,\mathbb{V}_{1})\times\cdots\times\operatorname% {Gr}(2,\mathbb{V}_{r})\to\operatorname{Gr}(k,\mathbb{R}^{n}),\quad(\mathbb{W}_% {1},\dots,\mathbb{W}_{r})\mapsto\mathbb{W}_{0}\oplus\mathbb{W}_{1}\oplus\dots% \oplus\mathbb{W}_{r}.

Clearly, the image of $\varepsilon$ is totally geodesic. By [49, Section 4], since $\mathbb{V}_{i}$ is four-dimensional, $\operatorname{Gr}(2,\mathbb{V}_{i})$ contains a geodesic $2$ -sphere¹¹1When $\mathbb{V}$ is four-dimensional, any maximal subset of mutually isoclinic $2$ -planes in $\operatorname{Gr}(2,\mathbb{V})$ is a geodesic $2$ -sphere. $\Sigma^{2}_{i}$ for each $i=1,\dots,r$ . The restriction of $\varepsilon$ to $\Sigma^{2}_{1}\times\dots\times\Sigma^{2}_{r}\cong\mathrm{S}^{2}\times\dots% \times\mathrm{S}^{2}$ ( $r$ copies) gives the desired embedding in (a).

For (b), we will need to use the involution model. By Corollary 5.3, $\upkappa_{Q}(X,Y)\leq\frac{1}{4}$ for any $Q\in\operatorname{Gr}(k,n)$ and orthonormal $X,Y\in\mathbb{T}_{Q}\operatorname{Gr}(k,n)$ . By [49, Theorem 5], $\upkappa_{Q}(X,Y)=\frac{1}{4}$ if and only if $X$ and $Y$ are tangent vectors of a geodesic $2$ -sphere in $\operatorname{Gr}(k,n)$ passing through $Q$ . Note that $X$ and $Y$ must span the tangent space of this geodesic $2$ -sphere. We set $k_{1}\coloneqq\lfloor k/2\rfloor$ , $k_{2}\coloneqq\lfloor(n-k)/2\rfloor$ as in the proof of Theorem 5.5, and also $r\coloneqq 2k_{1}k_{2}$ . Consider the commutative diagram

where $\varphi,\psi$ are the exponential maps on the respective tangent spaces, $U$ is an open subset of $\mathrm{S}^{2}\times\dots\times\mathrm{S}^{2}$ ( $r$ copies) on which $\varphi^{-1}$ is well-defined, and $\rho$ is the linear map defined by

(B_{ij})_{i,j=1}^{k_{1},k_{2}}\mapsto\begin{bmatrix}0&\cdots&0&B_{1,1}&\cdots&% B_{1,k_{2}+1}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ 0&\cdots&0&B_{k_{1}+1,1}&\cdots&B_{k_{1}+1,k_{2}+1}\\ B_{1,1}^{\scriptscriptstyle\mathsf{T}}&\cdots&B_{k_{1}+1,1}^{% \scriptscriptstyle\mathsf{T}}&0&\cdots&0\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ B_{1,k_{2}+1}^{\scriptscriptstyle\mathsf{T}}&\cdots&B_{k_{1}+1,k_{2}+1}^{% \scriptscriptstyle\mathsf{T}}&0&\cdots&0\end{bmatrix}

where we have used the same notation as in the proof of Theorem 5.5 and set $B_{pq}$ to be the zero matrix if either $p=k_{1}+1$ or $q=k_{2}+1$ . As in the proof of Theorem 5.5, we may choose an orthonormal basis $X_{1},Y_{1},X_{2},Y_{2}$ for each copy of $\mathbb{R}^{2\times 2}$ such that $\upkappa_{Q}(\rho(X_{1}),\rho(Y_{1}))=\upkappa_{Q}(\rho(X_{2}),\rho(Y_{2}))=% \frac{1}{4}$ . By shrinking $U$ if necessary, we may assume that $\psi$ is injective on $\rho(\varphi^{-1}(U))$ . Hence $\psi\mathbin{\mathchoice{\vbox{\hbox{$\scriptstyle\circ$}}}{\vbox{\hbox{$% \scriptstyle\circ$}}}{\vbox{\hbox{$\scriptscriptstyle\circ$}}}{\vbox{\hbox{$% \scriptscriptstyle\circ$}}}}\rho\mathbin{\mathchoice{\vbox{\hbox{$\scriptstyle% \circ$}}}{\vbox{\hbox{$\scriptstyle\circ$}}}{\vbox{\hbox{$\scriptscriptstyle% \circ$}}}{\vbox{\hbox{$\scriptscriptstyle\circ$}}}}\varphi^{-1}(U)$ is the required open subset in (b). ∎

Suppose $k$ and $n$ are both even. Then the upper bound in Theorem 6.5(b) is $r=k(n-k)/2$ . This is sharp as the dimension of the product of $r+1$ copies of $\mathrm{S}^{2}$ is $k(n-k)+2$ and it exceeds the dimension of $\operatorname{Gr}(k,n)$ ; so a product of $r+1$ copies of $\mathrm{S}^{2}$ cannot be embedded in $\operatorname{Gr}(k,n)$ .

7. Why we favor the involution model

As we alluded to in Section 1, a secondary goal of this article is to demonstrate the advantages of using the involution model (2). Here we will make some comparisons with other common models of the Grassmannian in algebraic geometry (Section 7.1), differential geometry (Section 7.2), and integral geometry (Section 7.3).

To elaborate, as an abstract manifold, the Grassmannian $\operatorname{Gr}(k,\mathbb{R}^{n})$ is just the set of $k$ -planes in $\mathbb{R}^{n}$ . While any manifold, by definition, can be given local coordinates, experience tells us that they are rarely useful beyond basic proofs — nobody really works with charts and atlases outside a first course in differential geometry. Especially in applied mathematics, but also in pure mathematics, the preferred approach is to give $\operatorname{Gr}(k,\mathbb{R}^{n})$ a system of global, extrinsic coordinates that are easier to work with — this is what we mean by a model for $\operatorname{Gr}(k,\mathbb{R}^{n})$ .

7.1. Plücker model

The standard model of the Grassmannian in algebraic geometry (see [21, Lecture 6] and [41, Chapter 1, Section 4.1]) is as the set of rank-one alternating tensors in projective space, i.e., the image of the Plücker embedding:

\operatorname{Gr}(k,\mathbb{R}^{n})\cong\bigl{\{}[v_{1}\wedge\dots\wedge v_{k}% ]\in\mathbb{P}(\operatorname{\mathsf{\Lambda}}^{k}(\mathbb{R}^{n})):v_{1},% \dots,v_{k}\in\mathbb{R}^{n}\text{ linearly independent}\bigr{\}}.

While this has some desirable mathematical properties [31, Section 1], its main issue is that the ambient space $\mathbb{P}(\operatorname{\mathsf{\Lambda}}^{k}(\mathbb{R}^{n}))$ is a manifold of exceedingly high dimension $\binom{n}{k}-1$ . This not only presents a computational conundrum but also results in complex expressions for even relatively basic quantities. For example, the second fundamental form has been derived in [1, Lemma 2.1 and Proposition 2.3] for the Plücker model and both its calculation and the expression are significantly more involved than those appearing in this article. In fact for even moderate values of $k$ the expressions in [1] are next-to-impossible to use or even compute since they involve lengthy sums of high order tensors.

One observation from the extrinsic curvatures calculated in Section 4 is that the involution model is extremely unlike the Plücker model. For example, the mean curvature of the image of the Plücker embedding is well-known to be zero but it is far from zero in the involution model, as we saw in Corollary 4.6. Given that the mean curvature is determined by the second fundamental form, this shows that the second fundamental forms of both models must be different and therefore so are their Gaussian and principal curvatures.

7.2. Quotient models

The most common models of the Grassmannian in differential geometry (see [25, Chapter VII] and [8, Chapter 9]) are as one of several quotient spaces:

(16)	$\displaystyle\operatorname{Gr}(k,\mathbb{R}^{n})$	$\displaystyle\cong\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(% k)\times\operatorname{O}(n-k)\bigr{)}$
		$\displaystyle\cong\operatorname{V}(k,n)/\operatorname{O}(k)$
		$\displaystyle\cong\operatorname{GL}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{GL% }(k)\times\operatorname{GL}(n-k)\bigr{)}$
		$\displaystyle\cong\operatorname{St}(k,n)/\operatorname{GL}(k),$

where $\operatorname{V}(k,n)\coloneqq\{V\in\mathbb{R}^{n\times k}:V^{% \scriptscriptstyle\mathsf{T}}V=I\}$ and $\operatorname{St}(k,n)\coloneqq\{X\in\mathbb{R}^{n\times k}:\operatorname{rank% }(X)=k\}$ are two common models for the Stiefel manifold of $k$ -frames in $\mathbb{R}^{n}$ . As usual $\operatorname{O}(n)=\operatorname{V}(n,n)$ and $\operatorname{GL}(n)=\operatorname{St}(n,n)$ denote the orthogonal and general linear groups respectively.

By exploiting their homogeneous space structures, the more basic intrinsic curvatures such as Riemann, Ricci, and sectional curvatures of $\operatorname{Gr}(k,\mathbb{R}^{n})$ are standard calculations that are classical in differential geometry [9, 16, 40]. However, the use of quotient spaces inevitably gives rise to formulas involving horizontal lifts of tangent vectors and arbitrary representatives of equivalence classes. This introduces layer upon layer of ambiguities requiring multiple arbitrary choices.

We will walk the reader through the calculation of Riemann curvature in $\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)\times% \operatorname{O}(n-k)\bigr{)}$ to illustrate the case in point. We will delimit equivalence classes in $\llbracket\,\cdot\,\rrbracket$ below. In this model,

(i)

a point $\llbracket Q\rrbracket\in\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}% \operatorname{O}(k)\times\operatorname{O}(n-k)\bigr{)}$ is a coset

\llbracket Q\rrbracket=\left\{Q\begin{bmatrix}Q_{1}&0\\ 0&Q_{2}\end{bmatrix}\in\operatorname{O}(n):(Q_{1},Q_{2})\in\operatorname{O}(k)% \times\operatorname{O}(n-k)\right\},

for some $Q\in\operatorname{O}(n)$ but $Q$ is not canonically given;

(ii)

a tangent vector $\llbracket X\rrbracket_{\llbracket Q\rrbracket}\in\mathbb{T}_{\llbracket Q% \rrbracket}\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)% \times\operatorname{O}(n-k)\bigr{)}$ is an equivalence class of pairs

\llbracket X\rrbracket_{\llbracket Q\rrbracket}=\bigl{\{}\bigl{(}Q,X+\mathfrak% {so}(k)\oplus\mathfrak{so}(n-k)\bigr{)}\in\operatorname{O}(n)\times\mathfrak{% so}(n)\!\!\bigm{/}\!\!\bigl{(}\mathfrak{so}(k)\oplus\mathfrak{so}(n-k)\bigr{)}% \bigr{\}}\!\!\Bigm{/}\sim

where the equivalence relation is defined by

\bigl{(}Q,X+\mathfrak{so}(k)\oplus\mathfrak{so}(n-k)\bigr{)}\sim\bigl{(}Q^{% \prime},X^{\prime}+\mathfrak{so}(k)\oplus\mathfrak{so}(n-k)\bigr{)}

if and only if there is some $(Q_{1},Q_{2})\in\operatorname{O}(k)\times\operatorname{O}(n-k)$ with

(17)		$\displaystyle Q^{\prime}$	$\displaystyle=Q\begin{bmatrix}Q_{1}&0\\ 0&Q_{2}\end{bmatrix},$
(17)		$\displaystyle X^{\prime}+\mathfrak{so}(k)\oplus\mathfrak{so}(n-k)$	$\displaystyle=\begin{bmatrix}Q_{1}&0\\ 0&Q_{2}\end{bmatrix}^{\scriptscriptstyle\mathsf{T}}\bigl{(}X+\mathfrak{so}(k)% \oplus\mathfrak{so}(n-k)\bigr{)}\begin{bmatrix}Q_{1}&0\\ 0&Q_{2}\end{bmatrix}.$

Evidently, in this model even an object as basic as a tangent vector is an equivalence class (defined by $\sim$ ) of equivalence classes (the coset $X+\mathfrak{so}(k)\oplus\mathfrak{so}(n-k)$ ). Every layer of equivalence relations introduces a layer of ambiguity but more importantly it often takes additional effort in the form extra calculations or computations.

Writing down a tangent vector $\llbracket X\rrbracket_{\llbracket Q\rrbracket}$ as a pair of actual matrices $(Q_{0},X_{0})$ requires making three arbitrary choices: first a representative $Q_{0}$ of $\llbracket Q\rrbracket$ , followed by a representative $(Q^{\prime},X^{\prime}+\mathfrak{so}(k)\oplus\mathfrak{so}(n-k))$ of $\llbracket X\rrbracket_{\llbracket Q\rrbracket}$ , and finally a representative $X_{0}$ of $X^{\prime}+\mathfrak{so}(k)\oplus\mathfrak{so}(n-k)$ . Note that these cannot be chosen arbitrarily nor a priori but need to satisfy (17). We will give the details below.

We begin by picking a representative $Q_{0}\in\pi^{-1}(\llbracket Q\rrbracket)\subseteq\operatorname{O}(n)$ where $\pi:\operatorname{O}(n)\to\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}% \operatorname{O}(k)\times\operatorname{O}(n-k)\bigr{)}$ is the quotient map, a Riemannian submersion. To construct the horizontal lift $X_{0}\in\mathbb{T}_{Q}\operatorname{O}(n)$ of a tangent vector $\llbracket X\rrbracket_{\llbracket Q\rrbracket}\in\mathbb{T}_{\llbracket Q% \rrbracket}\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)% \times\operatorname{O}(n-k)\bigr{)}$ , the recommendation in the classic article of Edelman–Arias–Smith [17] is to use the isomorphism

d_{Q}\pi:\left\{Y\in\mathbb{R}^{n\times n}:Q^{\scriptscriptstyle\mathsf{T}}Y=% \begin{bmatrix}0&B\\ -B^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix},\;B\in\mathbb{R}^{k\times(n-k% )}\right\}\to\mathbb{T}_{\llbracket Q\rrbracket}\operatorname{O}(n)\!\!\bigm{/% }\!\!\bigl{(}\operatorname{O}(k)\times\operatorname{O}(n-k)\bigr{)},

defined for any $Q\in\operatorname{O}(n)$ and $\llbracket X\rrbracket_{\llbracket Q\rrbracket}\in\mathbb{T}_{\llbracket Q% \rrbracket}\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)% \times\operatorname{O}(n-k)\bigr{)}$ , and then compute the horizontal lift as $X_{0}\coloneqq(d_{Q_{0}}\pi)^{-1}(\llbracket X\rrbracket_{Q_{0}})$ .

To get $X_{0}$ explicitly as a matrix, we will need to pick a representative $(Q^{\prime},X^{\prime}+\mathfrak{so}(k)\oplus\mathfrak{so}(n-k))$ for $\llbracket X\rrbracket_{\llbracket Q_{0}\rrbracket}$ followed by a representative $\widehat{X}$ of $X^{\prime}+\mathfrak{so}(k)\oplus\mathfrak{so}(n-k))$ . Observe that we cannot simply set $Q^{\prime}$ to be $Q_{0}$ since $(Q_{0},\widehat{X})$ will not satisfy (17) in general. Indeed, as we require

\bigl{(}Q^{\prime},\widehat{X}\bigr{)}\sim(Q_{0},\bigl{(}{Q^{\prime}}^{{% \scriptscriptstyle\mathsf{T}}}Q_{0})^{\scriptscriptstyle\mathsf{T}}\widehat{X}% ({Q^{\prime}}^{{\scriptscriptstyle\mathsf{T}}}Q_{0})\bigr{)},

we will need to compute $X_{0}$ as

X_{0}=Q_{0}({Q^{\prime}}^{{\scriptscriptstyle\mathsf{T}}}Q_{0})^{% \scriptscriptstyle\mathsf{T}}\begin{bmatrix}0&\widehat{B}\\ -\widehat{B}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}({Q^{\prime}}^{{% \scriptscriptstyle\mathsf{T}}}Q_{0})=Q^{\prime}\begin{bmatrix}0&\widehat{B}\\ -\widehat{B}^{\scriptscriptstyle\mathsf{T}}&0\end{bmatrix}Q^{\prime{% \scriptscriptstyle\mathsf{T}}}Q_{0},

with $\widehat{B}$ the upper right $k\times(n-k)$ submatrix of $\widehat{X}$ .

It might appear that to compute the Riemann curvature²²2The expression comes from applying the standard method for calculating Riemann curvature on a quotient model of any symmetric space [23, Theorem 4.2]. at $\llbracket Q\rrbracket\in\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}% \operatorname{O}(k)\times\operatorname{O}(n-k)\bigr{)}$ , we simply follow the procedure above to compute horizontal lifts $X,Y,Z,W\in\mathbb{T}_{Q}\operatorname{O}(n)$ of $\llbracket X\rrbracket_{\llbracket Q\rrbracket},\llbracket Y\rrbracket_{% \llbracket Q\rrbracket},\llbracket Z\rrbracket_{\llbracket Q\rrbracket},% \llbracket W\rrbracket_{\llbracket Q\rrbracket}\in\mathbb{T}_{\llbracket Q% \rrbracket}\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)% \times\operatorname{O}(n-k)\bigr{)}$ and evaluate the expression on the right hand-side:

(18)

\operatorname{\mathsf{Rie}}(\llbracket X\rrbracket,\llbracket Y\rrbracket,% \llbracket Z\rrbracket,\llbracket W\rrbracket)=-\frac{1}{4}\langle[[X,Y],Z],W\rangle.

But this is a notational illusion. Even if we start from the same representative $Q_{0}\in\pi^{-1}(\llbracket Q\rrbracket)\subseteq\operatorname{O}(n)$ , the procedure above will yield horizontal lifts of tangents vectors at different representatives $Q_{X}^{\prime},Q_{Y}^{\prime},Q_{Z}^{\prime},Q_{W}^{\prime}\in\pi^{-1}(% \llbracket Q\rrbracket)$ . There is no guarantee that $Q_{X}^{\prime}=Q_{Y}^{\prime}=Q_{Z}^{\prime}=Q_{W}^{\prime}=Q_{0}$ and extra steps are necessary to align these different representatives. To align $Q_{X}^{\prime}$ with $Q_{0}$ , we need to compute the horizontal lift of $\llbracket X\rrbracket_{\llbracket Q\rrbracket}$ at $Q_{0}$ as

(d_{Q_{0}}\pi)^{-1}(\llbracket X\rrbracket_{\llbracket Q\rrbracket})=Q_{0}P_{X% }^{{\scriptscriptstyle\mathsf{T}}}{Q_{X}^{\prime}}^{\scriptscriptstyle\mathsf{% T}}(d_{Q_{X}^{\prime}}\pi)^{-1}(\llbracket X\rrbracket_{\llbracket Q\rrbracket% })P_{X}=(d_{Q_{X}^{\prime}}\pi)^{-1}(\llbracket X\rrbracket_{\llbracket Q% \rrbracket})P_{X},

where $P_{X}\coloneqq(Q_{X}^{\prime})^{{\scriptscriptstyle\mathsf{T}}}Q_{0}$ . This process has to be repeated four times to get

(d_{Q_{0}}\pi)^{-1}(\llbracket X\rrbracket_{\llbracket Q\rrbracket}),\quad(d_{% Q_{0}}\pi)^{-1}(\llbracket Y\rrbracket_{\llbracket Q\rrbracket}),\quad(d_{Q_{0% }}\pi)^{-1}(\llbracket Z\rrbracket_{\llbracket Q\rrbracket}),\quad(d_{Q_{0}}% \pi)^{-1}(\llbracket W\rrbracket_{\llbracket Q\rrbracket})

before we may evaluate (18).

Contrast this with Proposition 5.1, where the calculation of Riemann curvature in the involution model avoids all these issues and the derivation of its expression is essentially a one-liner. It is important to point out that although the expression in (18) superficially resembles our expression in Proposition 5.1, this is also a notational illusion — they are completely different. The easiest way to see this is by observing that the matrices in (18) are all skew-symmetric whereas those in Proposition 5.1 are all symmetric.

The goal of Edelman, Arias, and Smith in [17] is to extend line search optimization methods to a function $f:\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)\times% \operatorname{O}(n-k)\bigr{)}\to\mathbb{R}$ . By and large this permits them to work with one search direction, i.e., a single tangent vector $\llbracket X\rrbracket_{\llbracket Q\rrbracket}$ , at every point $\llbracket Q\rrbracket$ . As a result, one could get around the problem by optimizing $f\mathbin{\mathchoice{\vbox{\hbox{$\scriptstyle\circ$}}}{\vbox{\hbox{$% \scriptstyle\circ$}}}{\vbox{\hbox{$\scriptscriptstyle\circ$}}}{\vbox{\hbox{$% \scriptscriptstyle\circ$}}}}\pi:\operatorname{O}(n)\to\mathbb{R}$ along horizontal directions. In the calculation of curvatures, we are required to work with four tangent vectors simultaneously and thus the alignment of different representatives of a given pair of equivalence classes cannot be avoided.

Although we have elected to make our point with $\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)\times% \operatorname{O}(n-k)\bigr{)}$ , the issues identified above apply to every quotient model in (16). The root of these issues is that there is no global way to describe tangent vectors of $\operatorname{Gr}(k,\mathbb{R}^{n})$ in any of these quotient models. Indeed, the absence of such a global description is the reason why expressions for various curvatures in [48, 49, 9, 40, 16] can only be given locally. The same reason accounts for the expediency of the involution model — not only does it describe all points on the Grassmannian (2), it describes all tangent vectors at all point in a single unified way (9).

7.3. Projection model

The standard model of the Grassmannian in integral geometry (see [36, Chapter 9] and [34, Chapter 3]) is as the set of projection matrices:

(19)

\operatorname{Gr}(k,\mathbb{R}^{n})\cong\{P\in\mathbb{S}^{n}:P^{2}=P,\;% \operatorname{tr}(P)=k\}.

This is the model closest to the involution model. Indeed we showed in [31, Theorem 6.1] that they are two instances in an infinite family of such models parameterized by the condition number of matrices used. However they are also on two opposite ends: The projection model is the unique model in this family that represents points as singular matrices (infinitely ill-conditioned) whereas the involution model is the unique model in this family that represents points as orthogonal matrices (perfectly conditioned). Every other model in this family represents points with matrices of condition number strictly between one and infinity.

In numerical computations, methods based on projection matrices [44, Lecture 8] are well-known to be significantly less stable than methods based on orthogonal matrices [44, Lecture 10] — in fact this comparison is famously used to illustrate numerical stability of algorithms.

In hand calculations, the singularity of projection matrices in (19) is a handicap, especially when contrasted against the ease of inverting orthogonal matrices (with a reminder that matrices in (2) are automatically orthogonal).

The equations defining (19) are also less convenient than those defining (2). Any calculations involving tangent vectors in the involution model would require one to differentiate $Q^{2}=I$ to get $XQ+QX=0$ . But doing the same in the projection model would require one to differentiate $P^{2}=P$ to get $XP+PX=X$ . The latter is more difficult to use than the former; indeed any calculation involving the latter would usually involve a change of coordinates $P\mapsto 2P-I$ to simplify but that yields exactly the involution model since $Q=2P-I$ [27, Proposition 3.5]. Nevertheless, because of this relation between the two models, every expression we derived for the involution model in this article gives one for the projection model, up to a constant factor.

The result [19, Proposition 13] comes close to obtaining the principal curvatures in Corollary 4.7 using the projection model. Nevertheless, while (19) was used to model the manifold, the tangent spaces in [19] were still modeled as horizontal spaces in the quotient model $\operatorname{O}(n)\!\!\bigm{/}\!\!\bigl{(}\operatorname{O}(k)\times% \operatorname{O}(n-k)\bigr{)}$ , making the messy calculations in Section 7.2 all but unavoidable.

8. Conclusion

In studying curvatures, it is helpful to have an illuminating instance of a manifold $\mathcal{M}$ where all different forms of curvatures can be explicitly calculated and compared side-by-side like in Table 1. We are unaware of any nontrivial examples of this in the literature. The reason is clear in retrospect: For many of these curvatures, their defining equations in terms of local coordinates are near-impossible to calculate for anything more complex than a sphere. But even when embedded in $\mathbb{R}^{m}$ , which one may do with Nash embedding, the resulting extrinsic coordinates are still difficult to use. The key to our simple formulas in this article is that we have embedded our manifold in a space of matrices, and matrices are endowed with far richer structures — we may multiply or decompose them; impose orthogonality or symmetry on them; calculate their determinant, norm, or rank; find their eigen- or singular values and vectors; among a myriad of yet other features

Future work

The multitude of curvatures discussed in this article might lead the reader to think that we have exhausted the topic. This is not the case.

Some tensors are beyond our reach. The obstruction tensor [18, Equation 3.25] is a $2$ -tensor that equals the Bach tensor in Corollary 5.9 for four-dimensional manifolds but $\dim\operatorname{Gr}(k,n)\neq 4$ if $(k,n)\notin\{(1,5),(2,4)\}$ . The Lanczos tensor [28] is a $3$ -tensor that is an antiderivative of the Weyl tensor in Corollary 5.8 and defined as a solution to partial differential equation, which we are not even sure has a solution for $\operatorname{Gr}(k,n)$ . The Bel and Bel–Robinson tensors [7] are $4$ -tensors constructed from the Weyl tensor that have never been calculated before for $\operatorname{Gr}(k,n)$ .

Although we have limited the discussions in this article to the Levi-Civita connection, the natural choice from the perspective of Riemannian geometry, we saw in Section 3.3 that there are other alternatives if we study the Grassmannian in the context of non-Riemannian geometry. In this case the torsion, nonmetricity, and cocurvature tensors discussed at the end of Section 3.2 may no longer be zero. Using a different connection permits us to study yet other curvatures like the contorsion tensor [6, Theorem 6.2.5], a $3$ -tensor that quantifies its deviation from Levi-Civita.

When presented with a complicated $d$ -tensor $T\in\mathbb{V}^{\otimes d}$ , a common gambit in mathematics and physics [2, 5, 33, 42, 43] is to decompose it by decomposing the space in which it lies. More precisely, for any group $G\subseteq\operatorname{GL}(\mathbb{V})$ , we decompose $\mathbb{V}^{\otimes d}=\bigoplus_{\lambda\in\widehat{G}}\mathbb{V}_{\lambda}$ into irreducible $G$ -submodules, giving a decomposition $T=\sum_{\lambda\in\widehat{G}}T_{\lambda}$ with $T_{\lambda}\in\mathbb{V}_{\lambda}$ . An example is the Ricci decomposition of the Riemann curvature into the scalar, traceless Ricci, and Weyl curvatures [5, Chapter 1, Section G],

\operatorname{\mathsf{Rie}}=\mathsf{W}+\frac{1}{k(n-k)-2}\mathsf{Z}\varowedge% \mathsf{g}+\frac{\operatorname{\mathsf{Sca}}}{2k(n-k)\bigl{(}k(n-k)-1\bigr{)}}% \mathsf{g}\varowedge\mathsf{g},

which we have implicitly used in our definition of $\mathsf{W}$ ; here $d=4$ , $\mathbb{V}=\mathbb{T}_{Q}\operatorname{Gr}(k,n)$ , and $G=\operatorname{O}\bigl{(}k(n-k)\bigr{)}$ . It would be interesting to find similar relations among the curvatures in Table 1.

Last but not least, while manifold optimization is not one of our goals here, it remains at the back of our minds. Existing optimization algorithms almost exclusively rely on two quantities — gradient and Hessian. As shown in [13], it is certainly conceivable to use, say, the second fundamental form to optimize a smooth function. This and other curvatures computed in Table 1 may turn out to be useful in this regard.

References

[1] S. Anan’in and C. H. Grossi. Differential geometry of Grassmannians and the Plücker map. Cent. Eur. J. Math., 10(3):873–884, 2012.
[2] L. Bel. Définition d’une densité d’énergie et d’un état de radiation totale généralisée. C. R. Acad. Sci. Paris, 246:3015–3018, 1958.
[3] T. Bendokat, R. Zimmermann, and P.-A. Absil. A Grassmann manifold handbook: basic geometry and computational aspects. Adv. Comput. Math., 50(1):Paper No. 6, 51, 2024.
[4] M. Berger and D. Ebin. Some decompositions of the space of symmetric tensors on a Riemannian manifold. J. Differential Geometry, 3:379–392, 1969.
[5] A. L. Besse. Einstein manifolds, volume 10 of Ergebnisse der Mathematik und ihrer Grenzgebiete. Springer-Verlag, Berlin, 1987.
[6] D. Bleecker. Gauge theory and variational principles, volume 1 of Global Analysis Pure and Applied: Series A. Addison-Wesley Publishing Co., Reading, MA, 1981.
[7] M. A. G. Bonilla and J. M. M. Senovilla. Some properties of the Bel and Bel-Robinson tensors. Gen. Relativity Gravitation, 29(1):91–116, 1997.
[8] W. M. Boothby. An introduction to differentiable manifolds and Riemannian geometry, volume 120 of Pure and Applied Mathematics. Academic Press, Inc., Orlando, FL, revised second edition, 2003.
[9] E. Cartan. Leçons sur la Géométrie des Espaces de Riemann. Gauthier-Villars, Paris, second edition, 1946.
[10] B.-Y. Chen. Some pinching and classification theorems for minimal submanifolds. Arch. Math. (Basel), 60(6):568–578, 1993.
[11] B.-Y. Chen. $\delta$ -invariants, inequalities of submanifolds and their applications. In Topics in differential geometry, pages 29–155. Ed. Acad. Române, Bucharest, 2008.
[12] Q. Chen and C. He. On Bach flat warped product Einstein manifolds. Pacific J. Math., 265(2):313–326, 2013.
[13] H.-B. Cheng, L.-T. Cheng, and S.-T. Yau. Minimization with the affine normal direction. Commun. Math. Sci., 3(4):561–574, 2005.
[14] S. S. Chern. On the minimal immersions of the two-sphere in a space of constant curvature. In Problems in analysis (Sympos. in honor of Salomon Bochner, Princeton Univ., Princeton, N.J., 1969), pages 27–40. Princeton Univ. Press, Princeton, NJ, 1970.
[15] S.-S. Chern and N. H. Kuiper. Some theorems on the isometric imbedding of compact Riemann manifolds in euclidean space. Ann. of Math., 56:422–430, 1952.
[16] M. P. do Carmo. Riemannian geometry. Mathematics: Theory & Applications. Birkhäuser Boston, Inc., Boston, MA, portuguese edition, 1992.
[17] A. Edelman, T. A. Arias, and S. T. Smith. The geometry of algorithms with orthogonality constraints. SIAM J. Matrix Anal. Appl., 20(2):303–353, 1999.
[18] C. Fefferman and C. R. Graham. The ambient metric, volume 178 of Annals of Mathematics Studies. Princeton University Press, Princeton, NJ, 2012.
[19] F. Feppon and P. F. J. Lermusiaux. The extrinsic geometry of dynamical systems tracking nonlinear matrix projections. SIAM J. Matrix Anal. Appl., 40(2):814–844, 2019.
[20] P. Griffiths and J. Harris. Algebraic geometry and local differential geometry. Ann. Sci. École Norm. Sup. (4), 12(3):355–452, 1979.
[21] J. Harris. Algebraic geometry, volume 133 of Graduate Texts in Mathematics. Springer-Verlag, New York, 1995.
[22] F. W. Hehl, J. D. McCrea, E. W. Mielke, and Y. Ne’eman. Metric-affine gauge theory of gravity: field equations, Noether identities, world spinors, and breaking of dilation invariance. Phys. Rep., 258(1-2):1–171, 1995.
[23] S. Helgason. Differential geometry, Lie groups, and symmetric spaces, volume 34 of Graduate Studies in Mathematics. American Mathematical Society, Providence, RI, 2001. Corrected reprint of the 1978 original.
[24] S. Kobayashi and K. Nomizu. Foundations of differential geometry. Vol. I. Wiley Classics Library. John Wiley & Sons, Inc., New York, 1996.
[25] S. Kobayashi and K. Nomizu. Foundations of differential geometry. Vol. II. Wiley Classics Library. John Wiley & Sons, Inc., New York, 1996.
[26] I. Kolář, P. W. Michor, and J. Slovák. Natural operations in differential geometry. Springer-Verlag, Berlin, 1993.
[27] Z. Lai, L.-H. Lim, and K. Ye. Simpler grassmannian optimization. arXiv:2009.13502, 2020.
[28] C. Lanczos. Lagrangian multiplier and Riemannian spaces. Rev. Modern Physics, 21:497–502, 1949.
[29] A. J. Ledger. Geodesic spheres on Grassmann manifolds. Yokohama Math. J., 34(1-2):59–71, 1986.
[30] J. M. Lee. Introduction to Riemannian manifolds, volume 176 of Graduate Texts in Mathematics. Springer, Cham, second edition, 2018.
[31] L.-H. Lim and K. Ye. Degree of the grassmannian as an affine variety. arXiv:2405.05128, 2024.
[32] A. Machado and I. Salavessa. Grassman manifolds as subsets of euclidean spaces, 2021.
[33] A. Matte. Sur de nouvelles solutions oscillatoires de équations de la gravitation. Canad. J. Math., 5:1–16, 1953.
[34] P. Mattila. Geometry of sets and measures in Euclidean spaces, volume 44 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge, 1995.
[35] P. W. Michor. Graded derivations of the algebra of differential forms associated with a connection. In Differential geometry (Peñíscola, 1988), volume 1410 of Lecture Notes in Math., pages 249–261. Springer, Berlin, 1989.
[36] L. I. Nicolaescu. Lectures on the geometry of manifolds. World Scientific Publishing, Hackensack, NJ, third edition, 2021.
[37] M. Obata. The Gauss map of immersions of Riemannian manifolds in spaces of constant curvature. J. Differential Geometry, 2:217–223, 1968.
[38] P. Petersen. Riemannian geometry, volume 171 of Graduate Texts in Mathematics. Springer, Cham, third edition, 2016.
[39] J. Plebański. The algebraic structure of the tensor of matter. Acta Phys. Polon., 26:963–1020, 1964.
[40] H. Samelson. On curvature and characteristic of homogeneous spaces. Michigan Math. J., 5:13–18, 1958.
[41] I. R. Shafarevich. Basic algebraic geometry. 1. Springer, Heidelberg, third edition, 2013.
[42] I. M. Singer and J. A. Thorpe. The curvature of $4$ -dimensional Einstein spaces. In Global Analysis (Papers in Honor of K. Kodaira), pages 355–365. Univ. Tokyo Press, Tokyo, 1969.
[43] R. S. Strichartz. Linear algebra of curvature tensors and their covariant derivatives. Canad. J. Math., 40(5):1105–1143, 1988.
[44] L. N. Trefethen and D. Bau, III. Numerical linear algebra. Society for Industrial and Applied Mathematics, Philadelphia, PA, anniversary edition, 2022.
[45] Q. M. Wang. On totally geodesic spheres in Grassmannians and ${\rm O}(n)$ . Proc. Amer. Math. Soc., 108(3):811–815, 1990.
[46] J. A. Wolf. Geodesic spheres in Grassmann manifolds. Illinois J. Math., 7:425–446, 1963.
[47] Y.-c. Wong. Isoclinic $n$ -planes in Euclidean $2n$ -space, Clifford parallels in elliptic $(2n-1)$ -space, and the Hurwitz matrix equations. Mem. Amer. Math. Soc., 41:iii+112, 1961.
[48] Y.-C. Wong. Differential geometry of Grassmann manifolds. Proc. Nat. Acad. Sci. U.S.A., 57:589–594, 1967.
[49] Y.-C. Wong. Sectional curvatures of Grassmann manifolds. Proc. Nat. Acad. Sci. U.S.A., 60:75–79, 1968.

Simple matrix expressions for the curvatures of Grassmannian

Abstract.

1. Introduction

What’s new

2. Notations and conventions

3. Curvature zoo

3.1. Extrinsic curvatures

3.2. Intrinsic curvatures

3.3. Connections

Proposition 3.1.

Proof.

4. Extrinsic curvatures of the Grassmannian

Proposition 4.1 (First fundamental form).

Proposition 4.2 (Gauss map).

Theorem 4.3 (Second fundamental form).

Proof.

Corollary 4.4 (Index of relative nullity).

Proof.

Corollary 4.5 (Weingarten map).

Proof.

Corollary 4.6 (Mean curvature).

Proof.

Corollary 4.7 (Principal and Gaussian curvatures).

Proof.

Corollary 4.8 (Third fundamental form).

Proof.

5. Intrinsic curvatures of the Grassmannian

Proposition 5.1 (Riemmanian curvature).

Proof.

Corollary 5.2 (Jacobi curvature).

Proof.

Corollary 5.3 (Sectional curvature).

Proof.

Corollary 5.4 (Ricci and scalar curvatures).

Proof.

Theorem 5.5 (Delta invariants).

Proof.

Corollary 5.6 (Schouten curvature).

Proof.

Corollary 5.7 (Cotton curvature).

Proof.

Corollary 5.8 (Weyl curvature).

Proof.

Corollary 5.9 (Bach curvature).

Proof.

6. Geometric insights from these expressions

Corollary 6.1 (Plebański curvature).

Corollary 6.2 (Codazzi tensors I).

Proof.

Corollary 6.3 (Codazzi tensors II).

Proof.

Corollary 6.4 (Divergence-free tensors).

Proof.

Theorem 6.5 (Embedding products of geodesic 2222-spheres).

Proof.

7. Why we favor the involution model

7.1. Plücker model

7.2. Quotient models

7.3. Projection model

8. Conclusion

Future work

References

Theorem 6.5 (Embedding products of geodesic $2$ -spheres).