Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

An SVD-like Decomposition of Bounded-Input Bounded-Output Functions

Brian Charles Brown1, Michael King1, Sean Warnick1, Enoch Yeung2,3, David Grimsman1 1 - Department of Computer Science, Brigham Young University, UT, 84602; 2 - Department of Mechanical Engineering, University of California, Santa Barbara, CA 93106; 3- Department of Bioengineering, University of California, Santa Barbara, CA 93106. This work was funded by DOE Grant #SC0021693, and has been submitted to the 2024 Conference on Decision and Control. Correspondence should be addressed to Brian Brown at bcbrown365@gmail.com.
Abstract

The Singular Value Decomposition (SVD) of linear functions facilitates the calculation of their 2-induced norm and row and null spaces, hallmarks of linear control theory. In this work, we present a function representation that, similar to SVD, provides an upper bound on the 2-induced norm of bounded-input bounded-output functions, as well as facilitates the computation of generalizations of the notions of row and null spaces. Borrowing from the notion of “lifting” in Koopman operator theory, we construct a finite-dimensional lifting of inputs that relaxes the unitary property of the right-most matrix in traditional SVD, Vsuperscript𝑉V^{*}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, to be an injective, norm-preserving mapping to a slightly higher-dimensional space.

I Introduction

Decomposing a function f:np:𝑓superscript𝑛superscript𝑝f:\mathbb{R}^{n}\to\mathbb{R}^{p}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT into smaller or more manageable terms can often lead to valuable insights and tools to analyze f𝑓fitalic_f [23]. For instance, f𝑓fitalic_f can be represented as the weighted sum of sinusoids in the Fourier series, allowing one to identify which frequencies are most significant in computing the output of f𝑓fitalic_f [8]. Other decomposition methods include using radial basis functions [5], wavelets [16], or polynomials [2], each giving a different perspective on f𝑓fitalic_f.

In the case where f𝑓fitalic_f is a linear function of finite-dimensional inputs and outputs, then one can use the well-known singular value decomposition (SVD) to identify inputs whose output has a maximum (or minimum) increase in magnitude [7]. The LU𝐿𝑈LUitalic_L italic_U decomposition and the QR𝑄𝑅QRitalic_Q italic_R decomposition give insight into f1superscript𝑓1f^{-1}italic_f start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, in other words, how to solve the set of linear equations defined by f𝑓fitalic_f and some output [10]. When f𝑓fitalic_f represents a linear dynamical system x˙=f(x)˙𝑥𝑓𝑥\dot{x}=f(x)over˙ start_ARG italic_x end_ARG = italic_f ( italic_x ) (f𝑓fitalic_f maps nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to itself) then methods such as Jordan decomposition [3] or Schur decomposition [10] can be used to identify stability and other properties of the system.

When f𝑓fitalic_f is a linear functional, i.e. is a linear map from nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to \mathbb{R}blackboard_R, the Riesz Representation Theorem implies that f𝑓fitalic_f can be associated with a unique vector un𝑢superscript𝑛u\in\mathbb{R}^{n}italic_u ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT such that f(x)=u,x𝑓𝑥𝑢𝑥f(x)=\langle u,x\rangleitalic_f ( italic_x ) = ⟨ italic_u , italic_x ⟩ for any xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT [1]. This seminal result shows that each linear functional can be represented by an element of its domain, which has widespread benefits in computational physics, for instance. Stacking these functionals leads to a similar result, or decomposition, for linear functions mapping nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to psuperscript𝑝\mathbb{R}^{p}blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT.

Another type of decomposition from which this work draws inspiration is that of Koopman. For a nonlinear dynamical system, the states can be “lifted” to a higher (potentially infinite) dimension, whereby the dynamics of the system are precisely described by the linear Koopman Operator [4]. Much recent work has been devoted to advancing Koopman Operator Theory (see, for instance [22, 13, 15, 21]), including a recent trend to impose that liftings either be invertible [14, 11] or state-inclusive [12]. One key advantage of using the Koopman Operator is that it allows one to use well-known and well-understood tools for analyzing linear dynamical systems, such as the ones described above, in the context of nonlinear systems. While the focus on dynamical systems has generally limited the analysis to functions that map nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to itself, our goal in this work is to study more general functions that map nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to psuperscript𝑝\mathbb{R}^{p}blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT.

The main contribution of this paper is to present a novel decomposition for any arbitrary bounded-input bounded-output function f:np:𝑓superscript𝑛superscript𝑝f:\mathbb{R}^{n}\to\mathbb{R}^{p}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, such that f(x)2<cx2evaluated-atsubscriptnorm𝑓𝑥2bra𝑐𝑥2\|f(x)\|_{2}<c\|x\|_{2}∥ italic_f ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < italic_c ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT for all xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and for some c+<𝑐superscriptc\in\mathbb{R}^{+}<\inftyitalic_c ∈ blackboard_R start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT < ∞. The function f𝑓fitalic_f is decomposed into two parts: a linear part and a norm-preserving injective nonlinear part, as stated precisely in Theorem 1. The primary benefit of this decomposition is that tools used for analyzing linear functions, such as SVD, can be adapted to analyze f𝑓fitalic_f. Indeed, Theorem 1 shows that our decomposition is a generalization of the SVD to a large class of nonlinear functions.

We note that other work in the literature has the goal to generalize the SVD. For instance, both [20] and [9] develop such ideas, but still restricted to linear functions. The work in [6] is aimed at using a generalized SVD for nonlinear dynamical systems, but only to build observers; the generalization is that one decomposes two matrices instead of one. The works in [18] and [19] address the scenario where only a finite number of observations are known about f𝑓fitalic_f, with the goal to identify f𝑓fitalic_f by augmenting the data matrix with columns that are functions (or observables) of the original data. While our approach is somewhat similar, the goal of this work is different in that we seek a representation of the function itself. Furthermore, we believe we are unique in enforcing that our observables are norm-preserving, a key to ensuring that the linear part of the decomposition is as descriptive as possible (see Remark 2 after Theorem 1).

I-A Notation

Per notation common in Koopman operator theory, for some linear mapping K𝐾Kitalic_K and a potentially nonlinear mapping, g𝑔gitalic_g, we use (Kg)(x)𝐾𝑔𝑥(K\circ g)(x)( italic_K ∘ italic_g ) ( italic_x ) to represent the composition of K𝐾Kitalic_K with g𝑔gitalic_g. However, if K𝐾Kitalic_K and g𝑔gitalic_g are finite-dimensional, for example Kp×m𝐾superscript𝑝𝑚K\in\mathbb{R}^{p\times m}italic_K ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_m end_POSTSUPERSCRIPT and g(x)p𝑔𝑥superscript𝑝g(x)\in\mathbb{R}^{p}italic_g ( italic_x ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, then we define (Kg)(x)=Kg(x)𝐾𝑔𝑥𝐾𝑔𝑥(K\circ g)(x)=Kg(x)( italic_K ∘ italic_g ) ( italic_x ) = italic_K italic_g ( italic_x ), i.e. traditional matrix-vector multiplication. We use Vsuperscript𝑉V^{*}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT to denote the Hermitian transpose of some matrix V𝑉Vitalic_V. The matrices U𝑈Uitalic_U, ΣΣ\Sigmaroman_Σ, and Vsuperscript𝑉V^{*}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT will always represent the matrices of the singular value decomposition of some matrix K𝐾Kitalic_K, such that K=UΣV𝐾𝑈Σsuperscript𝑉K=U\Sigma V^{*}italic_K = italic_U roman_Σ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, with Up×p𝑈superscript𝑝𝑝U\in\mathbb{R}^{p\times p}italic_U ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_p end_POSTSUPERSCRIPT, a unitary (and therefore norm-preserving, injective, and surjective) matrix, Vm×msuperscript𝑉superscript𝑚𝑚V^{*}\in\mathbb{R}^{m\times m}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_m end_POSTSUPERSCRIPT, another unitary matrix, and Σp×mΣsuperscript𝑝𝑚\Sigma\in\mathbb{R}^{p\times m}roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_m end_POSTSUPERSCRIPT, a real, non-negative, rectangular-diagonal matrix such that the i𝑖iitalic_i-th diagonal element is given by σisubscript𝜎𝑖\sigma_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, such that σiσj0subscript𝜎𝑖subscript𝜎𝑗0\sigma_{i}\geq\sigma_{j}\geq 0italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ 0 for all i<j𝑖𝑗i<jitalic_i < italic_j. Calligraphic letters, e.g. 𝒳𝒳\mathcal{X}caligraphic_X, are always sets, and |𝒳|𝒳|\mathcal{X}|\in\mathbb{Z}| caligraphic_X | ∈ blackboard_Z is the cardinality of 𝒳𝒳\mathcal{X}caligraphic_X.

For some mapping f:np:𝑓superscript𝑛superscript𝑝f:\mathbb{R}^{n}\to\mathbb{R}^{p}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, fi:n:subscript𝑓𝑖superscript𝑛f_{i}:\mathbb{R}^{n}\to\mathbb{R}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R is the i𝑖iitalic_i-th component functional of f𝑓fitalic_f. For brevity, we will denote f(x)𝑓𝑥f(x)italic_f ( italic_x ) as f𝑓fitalic_f and fi(x)subscript𝑓𝑖𝑥f_{i}(x)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) as fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT when x𝑥xitalic_x is arbitrary. We denote the 2-induced norm of f𝑓fitalic_f as f22=supxf(x)2x2subscriptnorm𝑓22subscriptsupremum𝑥subscriptnorm𝑓𝑥2subscriptnorm𝑥2\|f\|_{2-2}=\sup_{x}\frac{\|f(x)\|_{2}}{\|x\|_{2}}∥ italic_f ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT = roman_sup start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT divide start_ARG ∥ italic_f ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG, which for a linear f𝑓fitalic_f, is given by the maximum singular value, σ1subscript𝜎1\sigma_{1}italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, of the matrix representation of f𝑓fitalic_f. For some mapping v:nm:𝑣superscript𝑛superscript𝑚v:\mathbb{R}^{n}\to\mathbb{R}^{m}italic_v : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, then for some particular x𝑥xitalic_x, v(x)2subscriptnorm𝑣𝑥2\|v(x)\|_{2}∥ italic_v ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT represents the 2-norm of the vector v(x)m𝑣𝑥superscript𝑚v(x)\in\mathbb{R}^{m}italic_v ( italic_x ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT.

We use 𝟏psubscript1𝑝\mathbf{1}_{p}bold_1 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT to represent a vector of all ones of dimension p𝑝pitalic_p. Similarly, eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a vector of all zeros except a 1 at the i𝑖iitalic_i-th index. Occasionally, we will need to refer to the i𝑖iitalic_i-th element of a particular vector, xjsubscript𝑥𝑗x_{j}italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. In this case, we will use xj,isubscript𝑥𝑗𝑖x_{j,i}italic_x start_POSTSUBSCRIPT italic_j , italic_i end_POSTSUBSCRIPT to denote the i𝑖iitalic_i-th element of the j𝑗jitalic_j-th vector x𝑥xitalic_x.

We will also use element-wise operations on vectors. For some σp𝜎superscript𝑝\sigma\in\mathbb{R}^{p}italic_σ ∈ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, we define σ1superscript𝜎1\sigma^{-1}italic_σ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, and σσdirect-product𝜎𝜎\sigma\odot\sigmaitalic_σ ⊙ italic_σ, and σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT as follows:

σ1=[1σ11σ21σp],σσ=σ2=[σ12σ12σ12]formulae-sequencesuperscript𝜎1matrix1subscript𝜎11subscript𝜎21subscript𝜎𝑝direct-product𝜎𝜎superscript𝜎2matrixsuperscriptsubscript𝜎12superscriptsubscript𝜎12superscriptsubscript𝜎12\sigma^{-1}=\begin{bmatrix}\frac{1}{\sigma_{1}}\\ \frac{1}{\sigma_{2}}\\ \vdots\\ \frac{1}{\sigma_{p}}\\ \end{bmatrix},\quad\sigma\odot\sigma=\sigma^{2}=\begin{bmatrix}\sigma_{1}^{2}% \\ \sigma_{1}^{2}\\ \vdots\\ \sigma_{1}^{2}\\ \end{bmatrix}italic_σ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = [ start_ARG start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG end_CELL end_ROW start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG italic_σ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG end_CELL end_ROW end_ARG ] , italic_σ ⊙ italic_σ = italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = [ start_ARG start_ROW start_CELL italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ]

II Representation of Vector-Valued BIBO Functions with Norm-Preserving Transformation

Theorem 1.

Let f:np:𝑓superscript𝑛superscript𝑝f:\mathbb{R}^{n}\rightarrow\mathbb{R}^{p}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT be an arbitrary bounded-input bounded-output function. Then there exists a unitary matrix, Up×p𝑈superscript𝑝𝑝U\in\mathbb{R}^{p\times p}italic_U ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_p end_POSTSUPERSCRIPT, a real, non-negative, rectangular-diagonal matrix Σp×mΣsuperscript𝑝𝑚\Sigma\in\mathbb{R}^{p\times m}roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_m end_POSTSUPERSCRIPT, and a norm-preserving, injective mapping, v:nm:𝑣superscript𝑛superscript𝑚v:\mathbb{R}^{n}\rightarrow\mathbb{R}^{m}italic_v : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, with mp+n𝑚𝑝𝑛m\geq p+nitalic_m ≥ italic_p + italic_n, such that:

f(x)=UΣv(x),for all xnformulae-sequence𝑓𝑥𝑈Σ𝑣𝑥for all 𝑥superscript𝑛f(x)=U\Sigma v(x),\quad\text{for all }x\in\mathbb{R}^{n}italic_f ( italic_x ) = italic_U roman_Σ italic_v ( italic_x ) , for all italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT (1)
Proof.

Let m=n+p𝑚𝑛𝑝m=n+pitalic_m = italic_n + italic_p, δ:np:𝛿superscript𝑛superscript𝑝\delta:\mathbb{R}^{n}\to\mathbb{R}^{p}italic_δ : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, xδ:=[δ(x)x]assignsubscript𝑥𝛿matrix𝛿𝑥𝑥x_{\delta}:=\begin{bmatrix}\delta(x)\\ x\end{bmatrix}italic_x start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT := [ start_ARG start_ROW start_CELL italic_δ ( italic_x ) end_CELL end_ROW start_ROW start_CELL italic_x end_CELL end_ROW end_ARG ], and v:nm:𝑣superscript𝑛superscript𝑚v:\mathbb{R}^{n}\to\mathbb{R}^{m}italic_v : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT given by:

v(x):=x2xδ2xδ.assign𝑣𝑥subscriptnorm𝑥2subscriptnormsubscript𝑥𝛿2subscript𝑥𝛿v(x):=\frac{\|x\|_{2}}{\|x_{\delta}\|_{2}}x_{\delta}.italic_v ( italic_x ) := divide start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT . (2)

Notice that for any well-defined function δ𝛿\deltaitalic_δ, v𝑣vitalic_v is both norm-preserving, i.e. v(x)2=x2,xnformulae-sequencesubscriptnorm𝑣𝑥2subscriptnorm𝑥2for-all𝑥superscript𝑛\|v(x)\|_{2}=\|x\|_{2},\;\forall x\in\mathbb{R}^{n}∥ italic_v ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ∀ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, and injective, i.e. for all x1x2n,v(x1)v(x2)formulae-sequencesubscript𝑥1subscript𝑥2superscript𝑛𝑣subscript𝑥1𝑣subscript𝑥2x_{1}\neq x_{2}\in\mathbb{R}^{n},v(x_{1})\neq v(x_{2})italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≠ italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_v ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ≠ italic_v ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). Choosing the appropriate function δ𝛿\deltaitalic_δ and a corresponding real, non-negative, rectangular-diagonal matrix:

ΣΣ\displaystyle\Sigmaroman_Σ :=[σ100𝟎0σ20𝟎00𝟎σp𝟎]p×massignabsentdelimited-[]subscript𝜎10000subscript𝜎20000missing-subexpression0missing-subexpressionsubscript𝜎𝑝0superscript𝑝𝑚\displaystyle:=\left[\begin{array}[]{cccc|c}\sigma_{1}&0&0&\dots&\bf{0}\\ 0&\sigma_{2}&0&\dots&\bf{0}\\ 0&0&\ddots&&\bf{0}\\ \vdots&\vdots&&\sigma_{p}&\bf{0}\\ \end{array}\right]\in\mathbb{R}^{p\times m}:= [ start_ARRAY start_ROW start_CELL italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL … end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL start_CELL … end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋱ end_CELL start_CELL end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL italic_σ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL start_CELL bold_0 end_CELL end_ROW end_ARRAY ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_m end_POSTSUPERSCRIPT (7)

with (admissible) σiσj0subscript𝜎𝑖subscript𝜎𝑗0\sigma_{i}\geq\sigma_{j}\geq 0italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ 0 for j>i𝑗𝑖j>iitalic_j > italic_i, to satisfy Equation (1), is the key to the proof.

In order to ensure that σiσj0subscript𝜎𝑖subscript𝜎𝑗0\sigma_{i}\geq\sigma_{j}\geq 0italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ 0 for all i>j𝑖𝑗i>jitalic_i > italic_j, let fq(i)subscript𝑓𝑞𝑖f_{q(i)}italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT denote the component functional of f𝑓fitalic_f with the i𝑖iitalic_i-th placement in the relative ranking of the 2-induced norms of the component functionals of f𝑓fitalic_f. For example, if fj22subscriptnormsubscript𝑓𝑗22\|f_{j}\|_{2-2}∥ italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT were the smallest among all fi22,i=1,2,,pformulae-sequencesubscriptnormsubscript𝑓𝑖22𝑖12𝑝\|f_{i}\|_{2-2},\;i=1,2,\dots,p∥ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT , italic_i = 1 , 2 , … , italic_p, then fq(p)=fjsubscript𝑓𝑞𝑝subscript𝑓𝑗f_{q(p)}=f_{j}italic_f start_POSTSUBSCRIPT italic_q ( italic_p ) end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, whereas if fl22subscriptnormsubscript𝑓𝑙22\|f_{l}\|_{2-2}∥ italic_f start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT were the largest, then fq(1)=flsubscript𝑓𝑞1subscript𝑓𝑙f_{q(1)}=f_{l}italic_f start_POSTSUBSCRIPT italic_q ( 1 ) end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT. Let fqsubscript𝑓𝑞f_{q}italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT represent f𝑓fitalic_f with its component functionals re-ordered from largest to smallest induced norm, such that the first index of fq=fq(1)subscript𝑓𝑞subscript𝑓𝑞1f_{q}=f_{q(1)}italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_q ( 1 ) end_POSTSUBSCRIPT, etc. Let ΣΣ\Sigmaroman_Σ be defined given some values σisubscript𝜎𝑖\sigma_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, for i=1,2,,p𝑖12𝑝i=1,2,\dots,pitalic_i = 1 , 2 , … , italic_p, such that:

i=1𝑝fq(i)222σi2<1,𝑝𝑖1superscriptsubscriptnormsubscript𝑓𝑞𝑖222superscriptsubscript𝜎𝑖21\displaystyle\overset{p}{\underset{i=1}{\sum}}\frac{\|f_{q(i)}\|_{2-2}^{2}}{% \sigma_{i}^{2}}<1,overitalic_p start_ARG start_UNDERACCENT italic_i = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG ∥ italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG < 1 , (8)

and consider disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT given by:

di:=σi2x22fq(i)(x)21,assignsubscript𝑑𝑖superscriptsubscript𝜎𝑖2superscriptsubscriptnorm𝑥22subscript𝑓𝑞𝑖superscript𝑥21\displaystyle d_{i}:=\frac{\sigma_{i}^{2}\|x\|_{2}^{2}}{f_{q(i)}(x)^{2}}-1,italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT := divide start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG - 1 , (9)

and a matrix, Ap×p𝐴superscript𝑝𝑝A\in\mathbb{R}^{p\times p}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_p end_POSTSUPERSCRIPT, such that:

A:=[d1111d211dp].assign𝐴matrixsubscript𝑑1111subscript𝑑21missing-subexpression1missing-subexpressionsubscript𝑑𝑝A:=\begin{bmatrix}d_{1}&-1&\dots&-1\\ -1&d_{2}&\dots&-1\\ \vdots&&\ddots&\vdots\\ -1&\dots&&d_{p}\end{bmatrix}.italic_A := [ start_ARG start_ROW start_CELL italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL - 1 end_CELL start_CELL … end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL italic_d start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL - 1 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL - 1 end_CELL start_CELL … end_CELL start_CELL end_CELL start_CELL italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] . (10)

We conjecture that for some choice of admissible σ𝜎\sigmaitalic_σ, Aδ2=x22𝟏𝐴superscript𝛿2superscriptsubscriptnorm𝑥221A\delta^{2}=\|x\|_{2}^{2}\bf{1}italic_A italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_1, and that this equality will imply that f(x)=UΣv(x)𝑓𝑥𝑈Σ𝑣𝑥f(x)=U\Sigma v(x)italic_f ( italic_x ) = italic_U roman_Σ italic_v ( italic_x ) for an appropriate choice of unitary U𝑈Uitalic_U. To check this, we expand the i𝑖iitalic_i-th row of Aδ2=x22𝟏𝐴superscript𝛿2superscriptsubscriptnorm𝑥221A\delta^{2}=\|x\|_{2}^{2}\bf{1}italic_A italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_1 , to find:

(σi2x22fq(i)21)δi2Σjipδj2superscriptsubscript𝜎𝑖2superscriptsubscriptnorm𝑥22superscriptsubscript𝑓𝑞𝑖21superscriptsubscript𝛿𝑖2superscriptsubscriptΣ𝑗𝑖𝑝superscriptsubscript𝛿𝑗2\displaystyle\left(\frac{\sigma_{i}^{2}\|x\|_{2}^{2}}{f_{q(i)}^{2}}-1\right)% \delta_{i}^{2}-\Sigma_{j\neq i}^{p}\delta_{j}^{2}( divide start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG - 1 ) italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - roman_Σ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =x22,absentsuperscriptsubscriptnorm𝑥22\displaystyle=\|x\|_{2}^{2},= ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (11)
(1fq(i)2σi2x22)δi2fq(i)2σi2x22Σjipδj21superscriptsubscript𝑓𝑞𝑖2superscriptsubscript𝜎𝑖2superscriptsubscriptnorm𝑥22superscriptsubscript𝛿𝑖2superscriptsubscript𝑓𝑞𝑖2superscriptsubscript𝜎𝑖2superscriptsubscriptnorm𝑥22superscriptsubscriptΣ𝑗𝑖𝑝superscriptsubscript𝛿𝑗2\displaystyle\left(1-\frac{f_{q(i)}^{2}}{\sigma_{i}^{2}\|x\|_{2}^{2}}\right)% \delta_{i}^{2}-\frac{f_{q(i)}^{2}}{\sigma_{i}^{2}\|x\|_{2}^{2}}\Sigma_{j\neq i% }^{p}\delta_{j}^{2}( 1 - divide start_ARG italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG roman_Σ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =fq(i)2σi2,absentsuperscriptsubscript𝑓𝑞𝑖2superscriptsubscript𝜎𝑖2\displaystyle=\frac{f_{q(i)}^{2}}{\sigma_{i}^{2}},= divide start_ARG italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ,
δi2(σi2fq(i)2δi2x22+σi2fq(i)2Σjipδj2x22)superscriptsubscript𝛿𝑖2superscriptsubscript𝜎𝑖2superscriptsubscript𝑓𝑞𝑖2superscriptsubscript𝛿𝑖2superscriptsubscriptnorm𝑥22superscriptsubscript𝜎𝑖2superscriptsubscript𝑓𝑞𝑖2superscriptsubscriptΣ𝑗𝑖𝑝superscriptsubscript𝛿𝑗2superscriptsubscriptnorm𝑥22\displaystyle\delta_{i}^{2}-\left(\frac{\sigma_{i}^{-2}f_{q(i)}^{2}\delta_{i}^% {2}}{\|x\|_{2}^{2}}+\frac{\sigma_{i}^{-2}f_{q(i)}^{2}\Sigma_{j\neq i}^{p}% \delta_{j}^{2}}{\|x\|_{2}^{2}}\right)italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ( divide start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Σ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) =σi2fq(i)2.absentsuperscriptsubscript𝜎𝑖2superscriptsubscript𝑓𝑞𝑖2\displaystyle=\sigma_{i}^{-2}f_{q(i)}^{2}.= italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .
δi2σi2fq(i)2(δi2+Σjipδj2)x22superscriptsubscript𝛿𝑖2superscriptsubscript𝜎𝑖2superscriptsubscript𝑓𝑞𝑖2superscriptsubscript𝛿𝑖2superscriptsubscriptΣ𝑗𝑖𝑝superscriptsubscript𝛿𝑗2superscriptsubscriptnorm𝑥22\displaystyle\delta_{i}^{2}-\sigma_{i}^{-2}f_{q(i)}^{2}\frac{\left(\delta_{i}^% {2}+\Sigma_{j\neq i}^{p}\delta_{j}^{2}\right)}{\|x\|_{2}^{2}}italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG ( italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + roman_Σ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG =σi2fq(i)2.absentsuperscriptsubscript𝜎𝑖2superscriptsubscript𝑓𝑞𝑖2\displaystyle=\sigma_{i}^{-2}f_{q(i)}^{2}.= italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

Stacking and using element-wise operations yields:

δ2fq2σ2δ22x22superscript𝛿2direct-productsuperscriptsubscript𝑓𝑞2superscript𝜎2superscriptsubscriptnorm𝛿22superscriptsubscriptnorm𝑥22\displaystyle\delta^{2}-f_{q}^{2}\odot\sigma^{-2}\frac{\|\delta\|_{2}^{2}}{\|x% \|_{2}^{2}}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⊙ italic_σ start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT divide start_ARG ∥ italic_δ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG =fq2σ2absentdirect-productsuperscriptsubscript𝑓𝑞2superscript𝜎2absent\displaystyle=f_{q}^{2}\odot\sigma^{-2}\implies= italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⊙ italic_σ start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ⟹
δ2superscript𝛿2\displaystyle\delta^{2}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =fq2σ2x22x22+fq2σ2δ22x22,absentdirect-productsuperscriptsubscript𝑓𝑞2superscript𝜎2superscriptsubscriptnorm𝑥22superscriptsubscriptnorm𝑥22direct-productsuperscriptsubscript𝑓𝑞2superscript𝜎2superscriptsubscriptnorm𝛿22superscriptsubscriptnorm𝑥22\displaystyle=f_{q}^{2}\odot\sigma^{-2}\frac{\|x\|_{2}^{2}}{\|x\|_{2}^{2}}+f_{% q}^{2}\odot\sigma^{-2}\frac{\|\delta\|_{2}^{2}}{\|x\|_{2}^{2}},= italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⊙ italic_σ start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT divide start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⊙ italic_σ start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT divide start_ARG ∥ italic_δ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ,
=fq2σ2(Σinxi2)+(Σipδi2)x22,absentdirect-productsuperscriptsubscript𝑓𝑞2superscript𝜎2superscriptsubscriptΣ𝑖𝑛superscriptsubscript𝑥𝑖2superscriptsubscriptΣ𝑖𝑝superscriptsubscript𝛿𝑖2superscriptsubscriptnorm𝑥22\displaystyle=f_{q}^{2}\odot\sigma^{-2}\frac{(\Sigma_{i}^{n}x_{i}^{2})+(\Sigma% _{i}^{p}\delta_{i}^{2})}{\|x\|_{2}^{2}},= italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⊙ italic_σ start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT divide start_ARG ( roman_Σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + ( roman_Σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ,
=fq2σ2(Σinxj2)+(Σipδi2)x2absentdirect-productsuperscriptsubscript𝑓𝑞2superscript𝜎2superscriptsubscriptΣ𝑖𝑛superscriptsubscript𝑥𝑗2superscriptsubscriptΣ𝑖𝑝superscriptsubscript𝛿𝑖2subscriptnorm𝑥2absent\displaystyle=f_{q}^{2}\odot\sigma^{-2}\frac{(\Sigma_{i}^{n}x_{j}^{2})+(\Sigma% _{i}^{p}\delta_{i}^{2})}{\|x\|_{2}}\implies= italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⊙ italic_σ start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT divide start_ARG ( roman_Σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + ( roman_Σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ⟹
δ𝛿\displaystyle\deltaitalic_δ =fqσ1xδ2x2,absentdirect-productsubscript𝑓𝑞superscript𝜎1subscriptnormsubscript𝑥𝛿2subscriptnorm𝑥2\displaystyle=f_{q}\odot\sigma^{-1}\frac{\|x_{\delta}\|_{2}}{\|x\|_{2}},= italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ⊙ italic_σ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∥ italic_x start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ,
σδdirect-product𝜎𝛿\displaystyle\sigma\odot\deltaitalic_σ ⊙ italic_δ =fqxδ2x2,absentsubscript𝑓𝑞subscriptnormsubscript𝑥𝛿2subscriptnorm𝑥2\displaystyle=f_{q}\frac{\|x_{\delta}\|_{2}}{\|x\|_{2}},= italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT divide start_ARG ∥ italic_x start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ,
σδx2xδ2direct-product𝜎𝛿subscriptnorm𝑥2subscriptnormsubscript𝑥𝛿2\displaystyle\sigma\odot\delta\frac{\|x\|_{2}}{\|x_{\delta}\|_{2}}italic_σ ⊙ italic_δ divide start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG =fq(x),absentsubscript𝑓𝑞𝑥\displaystyle=f_{q}(x),= italic_f start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_x ) ,
Σv(x)Σ𝑣𝑥\displaystyle\Sigma v(x)roman_Σ italic_v ( italic_x ) =f(x).absent𝑓𝑥\displaystyle=f(x).= italic_f ( italic_x ) . (12)

It now remains to identify the δ𝛿\deltaitalic_δ that satisfies Aδ2=x2𝟏𝐴superscript𝛿2superscriptnorm𝑥21A\delta^{2}=\|x\|^{2}\bf{1}italic_A italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ italic_x ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_1, which reduces to finding the inverse of A𝐴Aitalic_A and ensuring that δ2superscript𝛿2\delta^{2}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is positive so that δ𝛿\deltaitalic_δ is real-valued. We will first address the inverse of A𝐴Aitalic_A by using the Woodbury matrix identity. Define A~:=Diag(A)+Iassign~𝐴Diag𝐴𝐼\tilde{A}:=\text{Diag}(A)+Iover~ start_ARG italic_A end_ARG := Diag ( italic_A ) + italic_I, B~:=𝟏passign~𝐵subscript1𝑝\tilde{B}:=\mathbf{1}_{p}over~ start_ARG italic_B end_ARG := bold_1 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT, a column vector of p𝑝pitalic_p ones, C~:=1assign~𝐶1\tilde{C}:=-1over~ start_ARG italic_C end_ARG := - 1, D~:=𝟏pTassign~𝐷superscriptsubscript1𝑝𝑇\tilde{D}:=\mathbf{1}_{p}^{T}over~ start_ARG italic_D end_ARG := bold_1 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, a row vector of p𝑝pitalic_p ones, and γ:=1+1d1+1+1d2+1++1dp+1assign𝛾11subscript𝑑111subscript𝑑211subscript𝑑𝑝1\gamma:=-1+\frac{1}{d_{1}+1}+\frac{1}{d_{2}+1}+\dots+\frac{1}{d_{p}+1}italic_γ := - 1 + divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_ARG + divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + 1 end_ARG + ⋯ + divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + 1 end_ARG. We thus have:

A1superscript𝐴1\displaystyle A^{-1}italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT =(A~+B~C~D~)1,absentsuperscript~𝐴~𝐵~𝐶~𝐷1\displaystyle=(\tilde{A}+\tilde{B}\tilde{C}\tilde{D})^{-1},= ( over~ start_ARG italic_A end_ARG + over~ start_ARG italic_B end_ARG over~ start_ARG italic_C end_ARG over~ start_ARG italic_D end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ,
=A~1A~1B~(C~1+D~A~1B~)1D~A~1,absentsuperscript~𝐴1superscript~𝐴1~𝐵superscriptsuperscript~𝐶1~𝐷superscript~𝐴1~𝐵1~𝐷superscript~𝐴1\displaystyle=\tilde{A}^{-1}-\tilde{A}^{-1}\tilde{B}(\tilde{C}^{-1}+\tilde{D}% \tilde{A}^{-1}\tilde{B})^{-1}\tilde{D}\tilde{A}^{-1},= over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_B end_ARG ( over~ start_ARG italic_C end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + over~ start_ARG italic_D end_ARG over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_B end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_D end_ARG over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ,
=A~1A~11p(1+𝟏pTA~1𝟏)1𝟏pTA~1,absentsuperscript~𝐴1superscript~𝐴1subscript1𝑝superscript1superscriptsubscript1𝑝𝑇superscript~𝐴111superscriptsubscript1𝑝𝑇superscript~𝐴1\displaystyle=\tilde{A}^{-1}-\tilde{A}^{-1}\textbf{1}_{p}(-1+\mathbf{1}_{p}^{T% }\tilde{A}^{-1}\mathbf{1})^{-1}\mathbf{1}_{p}^{T}\tilde{A}^{-1},= over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT 1 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( - 1 + bold_1 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_1 ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_1 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ,
=A~1A~1𝟏p1tr(A~1)1𝟏pTA~1,absentsuperscript~𝐴1superscript~𝐴1subscript1𝑝1trsuperscript~𝐴11subscriptsuperscript1𝑇𝑝superscript~𝐴1\displaystyle=\tilde{A}^{-1}-\tilde{A}^{-1}\mathbf{1}_{p}\frac{1}{\text{tr}(% \tilde{A}^{-1})-1}\mathbf{1}^{T}_{p}\tilde{A}^{-1},= over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_1 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG tr ( over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) - 1 end_ARG bold_1 start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ,
A1=[1(d1+1)0001(d2+1)01dp+1]superscript𝐴1limit-frommatrix1subscript𝑑110001subscript𝑑21missing-subexpressionmissing-subexpressionmissing-subexpression0missing-subexpression1subscript𝑑𝑝1\displaystyle A^{-1}=\begin{bmatrix}\frac{1}{(d_{1}+1)}&0&\dots&0\\ 0&\frac{1}{(d_{2}+1)}&&\vdots\\ \vdots&&\ddots&\\ 0&\dots&&\frac{1}{d_{p}+1}\end{bmatrix}-italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = [ start_ARG start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 ) end_ARG end_CELL start_CELL 0 end_CELL start_CELL … end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + 1 ) end_ARG end_CELL start_CELL end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL ⋱ end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL … end_CELL start_CELL end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + 1 end_ARG end_CELL end_ROW end_ARG ] -
[1d1+11dp+1][1d1+11dp+1]1γ.matrix1subscript𝑑111subscript𝑑𝑝1matrix1subscript𝑑111subscript𝑑𝑝11𝛾\displaystyle\begin{bmatrix}\frac{1}{d_{1}+1}\\ \vdots\\ \frac{1}{d_{p}+1}\end{bmatrix}\begin{bmatrix}\frac{1}{d_{1}+1}&\dots&\frac{1}{% d_{p}+1}\end{bmatrix}\frac{1}{\gamma}.[ start_ARG start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_ARG end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + 1 end_ARG end_CELL end_ROW end_ARG ] [ start_ARG start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_ARG end_CELL start_CELL … end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + 1 end_ARG end_CELL end_ROW end_ARG ] divide start_ARG 1 end_ARG start_ARG italic_γ end_ARG .

Element-wise, this yields

δi2=x22(1di+1+j=1p1(di+1)(dj+1)γ)superscriptsubscript𝛿𝑖2superscriptsubscriptnorm𝑥221subscript𝑑𝑖1superscriptsubscript𝑗1𝑝1subscript𝑑𝑖1subscript𝑑𝑗1𝛾absent\displaystyle\delta_{i}^{2}=\|x\|_{2}^{2}\left(\frac{1}{d_{i}+1}+\sum_{j=1}^{p% }\frac{-1}{(d_{i}+1)(d_{j}+1)\gamma}\right)\impliesitalic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 end_ARG + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT divide start_ARG - 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 ) ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) italic_γ end_ARG ) ⟹ (13)
δi=sgn(fq(i)(x))x2(1di+1+j=1p1(di+1)(dj+1)γ)12.subscript𝛿𝑖sgnsubscript𝑓𝑞𝑖𝑥subscriptnorm𝑥2superscript1subscript𝑑𝑖1superscriptsubscript𝑗1𝑝1subscript𝑑𝑖1subscript𝑑𝑗1𝛾12\displaystyle\delta_{i}=\textrm{sgn}(f_{q(i)}(x))\|x\|_{2}\left(\frac{1}{d_{i}% +1}+\sum_{j=1}^{p}\frac{-1}{(d_{i}+1)(d_{j}+1)\gamma}\right)^{\frac{1}{2}}.italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = sgn ( italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT ( italic_x ) ) ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 end_ARG + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT divide start_ARG - 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 ) ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) italic_γ end_ARG ) start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT . (14)

We now see that in order for δ𝛿\deltaitalic_δ to be real-valued, the expression (1di+1+j=1p1(di+1)(dj+1)γ)1subscript𝑑𝑖1superscriptsubscript𝑗1𝑝1subscript𝑑𝑖1subscript𝑑𝑗1𝛾\left(\frac{1}{d_{i}+1}+\sum_{j=1}^{p}\frac{-1}{(d_{i}+1)(d_{j}+1)\gamma}\right)( divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 end_ARG + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT divide start_ARG - 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 ) ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) italic_γ end_ARG ) must be positive.

δi2>0superscriptsubscript𝛿𝑖20absent\displaystyle\delta_{i}^{2}>0\impliesitalic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT > 0 ⟹
x2(1di+1+j=1𝑝1(di+1)(dj+1)γ)subscriptnorm𝑥21subscript𝑑𝑖1𝑝𝑗11subscript𝑑𝑖1subscript𝑑𝑗1𝛾\displaystyle\|x\|_{2}\left(\frac{1}{d_{i}+1}+\overset{p}{\underset{j=1}{\sum}% }\frac{-1}{(d_{i}+1)(d_{j}+1)\gamma}\right)∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 end_ARG + overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG - 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 ) ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) italic_γ end_ARG ) >0absent0absent\displaystyle>0\implies> 0 ⟹
1di+1>j=1𝑝1(di+1)(dj+1)γ1subscript𝑑𝑖1𝑝𝑗11subscript𝑑𝑖1subscript𝑑𝑗1𝛾absent\displaystyle\frac{1}{d_{i}+1}>\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{i% }+1)(d_{j}+1)\gamma}\impliesdivide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 end_ARG > overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 ) ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) italic_γ end_ARG ⟹
1>j=1𝑝1(dj+1)γ1𝑝𝑗11subscript𝑑𝑗1𝛾\displaystyle 1>\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)\gamma}1 > overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) italic_γ end_ARG (15)

We postulate that γ𝛾\gammaitalic_γ must be negative in order for (15) to be true. Suppose for contradiction that γ>0𝛾0\gamma>0italic_γ > 0. This leads to:

γ>j=1𝑝1(dj+1)𝛾𝑝𝑗11subscript𝑑𝑗1absent\displaystyle\gamma>\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)}\impliesitalic_γ > overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_ARG ⟹
1+j=1𝑝1(dj+1)>j=1𝑝1(dj+1)1𝑝𝑗11subscript𝑑𝑗1𝑝𝑗11subscript𝑑𝑗1absent\displaystyle-1+\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)}>\overset{% p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)}\implies- 1 + overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_ARG > overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_ARG ⟹
1>0,10\displaystyle-1>0,- 1 > 0 ,

which is a contradiction. Therefore, γ𝛾\gammaitalic_γ must be negative so that the direction of the inequality switches when multiplying by γ𝛾\gammaitalic_γ. We thus have:

1>j=1𝑝1(dj+1)γ1𝑝𝑗11subscript𝑑𝑗1𝛾absent\displaystyle 1>\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)\gamma}\implies1 > overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) italic_γ end_ARG ⟹
γ<j=1𝑝1(dj+1)𝛾𝑝𝑗11subscript𝑑𝑗1absent\displaystyle\gamma<\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)}\impliesitalic_γ < overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_ARG ⟹
1+j=1𝑝1(dj+1)<j=1𝑝1(dj+1)1𝑝𝑗11subscript𝑑𝑗1𝑝𝑗11subscript𝑑𝑗1absent\displaystyle-1+\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)}<\overset{% p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)}\implies- 1 + overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_ARG < overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_ARG ⟹
1<0.10\displaystyle-1<0.- 1 < 0 .

Now we need to understand the conditions under which γ𝛾\gammaitalic_γ could be less than zero. The only free parameters in γ𝛾\gammaitalic_γ are the values of σ𝜎\sigmaitalic_σ:

γ=1+j=1𝑝1(dj+1)𝛾1𝑝𝑗11subscript𝑑𝑗1\displaystyle\gamma=-1+\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)}italic_γ = - 1 + overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_ARG <0absent0absent\displaystyle<0\implies< 0 ⟹
j=1𝑝1(dj+1)𝑝𝑗11subscript𝑑𝑗1\displaystyle\overset{p}{\underset{j=1}{\sum}}\frac{1}{(d_{j}+1)}overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_ARG <1absent1\displaystyle<1< 1
j=1𝑝1σj2x22fj(x)2𝑝𝑗11superscriptsubscript𝜎𝑗2superscriptsubscriptnorm𝑥22subscript𝑓𝑗superscript𝑥2\displaystyle\overset{p}{\underset{j=1}{\sum}}\frac{1}{\frac{\sigma_{j}^{2}\|x% \|_{2}^{2}}{f_{j}(x)^{2}}}overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG 1 end_ARG start_ARG divide start_ARG italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG <1,xnformulae-sequenceabsent1for-all𝑥superscript𝑛\displaystyle<1,\quad\forall x\in\mathbb{R}^{n}< 1 , ∀ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT
j=1𝑝fj(x)2σj2x22𝑝𝑗1subscript𝑓𝑗superscript𝑥2superscriptsubscript𝜎𝑗2superscriptsubscriptnorm𝑥22\displaystyle\overset{p}{\underset{j=1}{\sum}}\frac{f_{j}(x)^{2}}{\sigma_{j}^{% 2}\|x\|_{2}^{2}}overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_x ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG <1,xnformulae-sequenceabsent1for-all𝑥superscript𝑛\displaystyle<1,\quad\forall x\in\mathbb{R}^{n}< 1 , ∀ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT

However, by the definition of the 2-induced norm of fjsubscript𝑓𝑗f_{j}italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, |fj(x)|x2fj22,xnformulae-sequencesubscript𝑓𝑗𝑥subscriptnorm𝑥2subscriptnormsubscript𝑓𝑗22for-all𝑥superscript𝑛\frac{|f_{j}(x)|}{\|x\|_{2}}\leq\|f_{j}\|_{2-2},\;\forall x\in\mathbb{R}^{n}divide start_ARG | italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_x ) | end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ≤ ∥ italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT , ∀ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Therefore, using the least upper bound property of the 2-induced norm,

j=1𝑝fq(j)222σj2<1.𝑝𝑗1superscriptsubscriptnormsubscript𝑓𝑞𝑗222superscriptsubscript𝜎𝑗21\displaystyle\overset{p}{\underset{j=1}{\sum}}\frac{\|f_{q(j)}\|_{2-2}^{2}}{% \sigma_{j}^{2}}<1.overitalic_p start_ARG start_UNDERACCENT italic_j = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG divide start_ARG ∥ italic_f start_POSTSUBSCRIPT italic_q ( italic_j ) end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG < 1 . (16)

Thus, σ𝜎\sigmaitalic_σ may be chosen to be large enough such that γ<0𝛾0\gamma<0italic_γ < 0, and when γ<0𝛾0\gamma<0italic_γ < 0, we have shown that δ2superscript𝛿2\delta^{2}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is always positive and thus δ𝛿\deltaitalic_δ is real-valued. In addition, σ𝜎\sigmaitalic_σ may be easily chosen such that σiσj0subscript𝜎𝑖subscript𝜎𝑗0\sigma_{i}\geq\sigma_{j}\geq 0italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ 0 for all i>j𝑖𝑗i>jitalic_i > italic_j.

Furthermore, since we have shown that σ𝜎\sigmaitalic_σ chosen in Equation (8) generates a real-valued δ𝛿\deltaitalic_δ, δ𝛿\deltaitalic_δ is well-defined and, from Equation (12), we see that that if U=Ip×p𝑈𝐼superscript𝑝𝑝U=I\in\mathbb{R}^{p\times p}italic_U = italic_I ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_p end_POSTSUPERSCRIPT (which is unitary), then:

f(x)=UΣv(x),xn,formulae-sequence𝑓𝑥𝑈Σ𝑣𝑥for-all𝑥superscript𝑛f(x)=U\Sigma v(x),\;\forall x\in\mathbb{R}^{n},italic_f ( italic_x ) = italic_U roman_Σ italic_v ( italic_x ) , ∀ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ,

which completes the proof. ∎

Refer to caption
Figure 1: An example of a norm-preserving mapping: The unit disc {\cal H}caligraphic_H in 2superscript2\mathbb{R}^{2}blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is depicted in blue. Two norm-preserving mappings g1:𝒢1:subscript𝑔1subscript𝒢1g_{1}:\mathcal{H}\to{\cal G}_{1}italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT : caligraphic_H → caligraphic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (black and dashed) and g2:𝒢2:subscript𝑔2subscript𝒢2g_{2}:\mathcal{H}\to{\cal G}_{2}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT : caligraphic_H → caligraphic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (orange and solid) are shown. Both are liftings for distinct hypothetical functions, f1(x)=Kg1(x)subscript𝑓1𝑥𝐾subscript𝑔1𝑥f_{1}(x)=K\circ g_{1}(x)italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x ) = italic_K ∘ italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x ) and f2(x)=Kg2(x)subscript𝑓2𝑥𝐾subscript𝑔2𝑥f_{2}(x)=K\circ g_{2}(x)italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) = italic_K ∘ italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) (per the notation in Remark 1). Since f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are functionals, K𝐾Kitalic_K can only have one non-zero singular value. The right singular vector corresponding to this non-zero singular value is shown in red and is denoted as v1subscriptsuperscript𝑣1v^{*}_{1}italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Note then that g2subscript𝑔2g_{2}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT would correspond to a function f𝑓fitalic_f that stretches all elements of \mathcal{H}caligraphic_H uniformly.
Remark 1.

In the proof, Up×p𝑈superscript𝑝𝑝U\in\mathbb{R}^{p\times p}italic_U ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_p end_POSTSUPERSCRIPT was chosen to be identity and we had f(x)=UΣv(x)𝑓𝑥𝑈Σ𝑣𝑥f(x)=U\Sigma v(x)italic_f ( italic_x ) = italic_U roman_Σ italic_v ( italic_x ). However, U𝑈Uitalic_U could be selected to be an arbitrary real, unitary matrix, and an additional, real, unitary matrix Vm×msuperscript𝑉superscript𝑚𝑚V^{*}\in\mathbb{R}^{m\times m}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_m end_POSTSUPERSCRIPT could be chosen as well as an injective, norm-preserving lifting, g𝑔gitalic_g, similar to the lifting given in the construction, or perhaps learned from data, such that v(x)=Vg(x)𝑣𝑥superscript𝑉𝑔𝑥v(x)=V^{*}\circ g(x)italic_v ( italic_x ) = italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∘ italic_g ( italic_x ). Combined with an appropriate ΣΣ\Sigmaroman_Σ, this gives the singular value decomposition of some matrix Km×p𝐾superscript𝑚𝑝K\in\mathbb{R}^{m\times p}italic_K ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_p end_POSTSUPERSCRIPT, composed with a lifting such that:

f(x)=UΣ(Vg)(x)=Kg(x),𝑓𝑥𝑈Σsuperscript𝑉𝑔𝑥𝐾𝑔𝑥f(x)=U\Sigma(V^{*}\circ g)(x)=K\circ g(x),italic_f ( italic_x ) = italic_U roman_Σ ( italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∘ italic_g ) ( italic_x ) = italic_K ∘ italic_g ( italic_x ) , (17)

where f𝑓fitalic_f is a bounded-input bounded-output function.

Remark 2.

The requirement that v(x)2=x2subscriptnorm𝑣𝑥2subscriptnorm𝑥2\|v(x)\|_{2}=\|x\|_{2}∥ italic_v ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is essential for identifying a meaningful ΣΣ\Sigmaroman_Σ. For example, if v(x)𝑣𝑥v(x)italic_v ( italic_x ) were not norm-preserving (or injective), then a trivial solution would always exist where v(x)=f(x)𝑣𝑥𝑓𝑥v(x)=f(x)italic_v ( italic_x ) = italic_f ( italic_x ), and Σ=IΣ𝐼\Sigma=Iroman_Σ = italic_I such that f(x)=If(x)𝑓𝑥𝐼𝑓𝑥f(x)=If(x)italic_f ( italic_x ) = italic_I italic_f ( italic_x ). Instead, the provided constraints require that a set of basis functions be identified that simultaneously 1) are collectively a norm-preserving map of the inputs into an alternate space, and 2) span the image of f(x)𝑓𝑥f(x)italic_f ( italic_x ). The existence of such a finite-dimensional mapping is not immediately obvious and is the main contribution of this theorem.

Remark 3.

Recall that the Riesz Representation Theorem [17] focuses on offering a representation of the form f(x)=k,x𝑓𝑥𝑘𝑥f(x)=\langle k,x\rangleitalic_f ( italic_x ) = ⟨ italic_k , italic_x ⟩ for linear functionals of xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, with k𝑘kitalic_k, a characterizing vector of the functional, also in nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Theorem 1 extends that representation to bounded-input bounded-output functionals, since Equation (17) can be re-written for functionals f:n:𝑓superscript𝑛f:\mathbb{R}^{n}\to\mathbb{R}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R (using the notation from Remark 1) as:

f(x)=k,g(x).𝑓𝑥𝑘𝑔𝑥f(x)=\langle k,g(x)\rangle.italic_f ( italic_x ) = ⟨ italic_k , italic_g ( italic_x ) ⟩ . (18)

In the case of linear functionals, g:nn:𝑔superscript𝑛superscript𝑛g:\mathbb{R}^{n}\to\mathbb{R}^{n}italic_g : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is the identity mapping, i.e. g(x)=x𝑔𝑥𝑥g(x)=xitalic_g ( italic_x ) = italic_x for all xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. In this way, the given theorem provides a generalization of the Riesz representation to bounded-input bounded-output functionals. The norm-preserving and injectivity properties of g𝑔gitalic_g may render this extension useful for future work in nonlinear optimization.

Note that Theorem 1 states that v(x)𝑣𝑥v(x)italic_v ( italic_x ) can always be chosen to be injective. This is a convenience that facilitates computing sets of inputs that are the natural relaxations of the null space and row space of linear functions. To see this, note that the columns of Vsuperscript𝑉V^{*}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, as defined in Remark 1, define a basis for the null space and row space of K𝐾Kitalic_K, depending on whether their associated singular value is zero or non-zero, respectively. If gLsuperscript𝑔𝐿g^{-L}italic_g start_POSTSUPERSCRIPT - italic_L end_POSTSUPERSCRIPT is the left-inverse of g𝑔gitalic_g, and 𝒴𝒴\mathcal{Y}caligraphic_Y is the intersection of the image of g𝑔gitalic_g with the null space of K𝐾Kitalic_K, then {gL(y)|y𝒴}conditional-setsuperscript𝑔𝐿𝑦𝑦𝒴\{g^{-L}(y)|\;y\in\mathcal{Y}\}{ italic_g start_POSTSUPERSCRIPT - italic_L end_POSTSUPERSCRIPT ( italic_y ) | italic_y ∈ caligraphic_Y } is the appropriate relaxation of the null space of f𝑓fitalic_f.

The injectivity of v(x)𝑣𝑥v(x)italic_v ( italic_x ) is also a way of ensuring that all the nonlinear portions of the computations in f𝑓fitalic_f remain reversible until the final linear computation. Furthermore, if σi=0subscript𝜎𝑖0\sigma_{i}=0italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0, then vi(x)subscript𝑣𝑖𝑥v_{i}(x)italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) represents information about x𝑥xitalic_x that is lost during the computation of f(x)𝑓𝑥f(x)italic_f ( italic_x ). Conversely, v1(x)subscript𝑣1𝑥v_{1}(x)italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x ) represents information about x𝑥xitalic_x that most strongly contributes to the 2-norm of f(x)𝑓𝑥f(x)italic_f ( italic_x ).

Note that in the construction, the function v𝑣vitalic_v maps from a lower dimensional space into a higher-dimensional space (sometimes referred to as a “lifting”). This is important for maintaining injectivity without making v(x)𝑣𝑥v(x)italic_v ( italic_x ) difficult to compute. However, it is only necessary that v𝑣vitalic_v map np+1superscript𝑛superscript𝑝1\mathbb{R}^{n}\to\mathbb{R}^{p+1}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p + 1 end_POSTSUPERSCRIPT, as demonstrated in Lemma (1) in the Appendix. Furthermore, if p<n2𝑝𝑛2p<n-2italic_p < italic_n - 2, then v(x)𝑣𝑥v(x)italic_v ( italic_x ) need not be a “lifting” at all, but rather a mapping more aptly called a “lowering.” We leave the detailing of such mappings to future work, noting that they may likely be harder to compute. In contrast, the construction given in the proof is easily computed, as will be demonstrated in Section III.

II-A Bounds on Induced Norms

There are several reasons why bounded-input bounded-output functions are a natural extension of linear functions:

  • All linear functions are bounded-input bounded-output functions.

  • All bounded-input bounded-output functions can be bounded by a linear envelope.

This linear envelope is intricately connected to the σ𝜎\sigmaitalic_σ given in the construction:

Corollary 1.

Let f:np:𝑓superscript𝑛superscript𝑝f:\mathbb{R}^{n}\rightarrow\mathbb{R}^{p}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT be an arbitrary bounded-input bounded-output function. Then there exists a unitary matrix, Up×p𝑈superscript𝑝𝑝U\in\mathbb{R}^{p\times p}italic_U ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_p end_POSTSUPERSCRIPT, a real, non-negative, rectangular-diagonal matrix Σp×mΣsuperscript𝑝𝑚\Sigma\in\mathbb{R}^{p\times m}roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_m end_POSTSUPERSCRIPT, and a norm-preserving, injective mapping, v:nm:𝑣superscript𝑛superscript𝑚v:\mathbb{R}^{n}\rightarrow\mathbb{R}^{m}italic_v : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, with mp+n𝑚𝑝𝑛m\geq p+nitalic_m ≥ italic_p + italic_n, such that:

f(x)=UΣv(x),for all xn,xn,formulae-sequence𝑓𝑥𝑈Σ𝑣𝑥formulae-sequencefor all 𝑥superscript𝑛for-all𝑥superscript𝑛f(x)=U\Sigma v(x),\quad\text{for all }x\in\mathbb{R}^{n},\quad\forall x\in% \mathbb{R}^{n},italic_f ( italic_x ) = italic_U roman_Σ italic_v ( italic_x ) , for all italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , ∀ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ,

and σ1subscript𝜎1\sigma_{1}italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, the maximum entry in ΣΣ\Sigmaroman_Σ, is an upper bound on the 2-induced norm of f𝑓fitalic_f, i.e.

f22x2<x2σ1,xnformulae-sequencesubscriptnorm𝑓22subscriptnorm𝑥2subscriptnorm𝑥2subscript𝜎1for-all𝑥superscript𝑛\|f\|_{2-2}\|x\|_{2}<\|x\|_{2}\sigma_{1},\quad\forall x\in\mathbb{R}^{n}∥ italic_f ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ∀ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT (19)
Proof.
f22subscriptnorm𝑓22\displaystyle\|f\|_{2-2}∥ italic_f ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT =supxf(x)2x2absentsubscriptsupremum𝑥subscriptnorm𝑓𝑥2subscriptnorm𝑥2\displaystyle=\sup_{x}\frac{\|f(x)\|_{2}}{\|x\|_{2}}= roman_sup start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT divide start_ARG ∥ italic_f ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG
=supxUΣv(x)2x2absentsubscriptsupremum𝑥subscriptnorm𝑈Σ𝑣𝑥2subscriptnorm𝑥2\displaystyle=\sup_{x}\frac{\|U\Sigma v(x)\|_{2}}{\|x\|_{2}}= roman_sup start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT divide start_ARG ∥ italic_U roman_Σ italic_v ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG
supxU22Σ22v(x)2x2absentsubscriptsupremum𝑥subscriptnorm𝑈22subscriptnormΣ22subscriptnorm𝑣𝑥2subscriptnorm𝑥2\displaystyle\leq\sup_{x}\frac{\|U\|_{2-2}\|\Sigma\|_{2-2}\|v(x)\|_{2}}{\|x\|_% {2}}≤ roman_sup start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT divide start_ARG ∥ italic_U ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT ∥ roman_Σ ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT ∥ italic_v ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG
=supxΣ22x2x2absentsubscriptsupremum𝑥subscriptnormΣ22subscriptnorm𝑥2subscriptnorm𝑥2\displaystyle=\sup_{x}\frac{\|\Sigma\|_{2-2}\|x\|_{2}}{\|x\|_{2}}= roman_sup start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT divide start_ARG ∥ roman_Σ ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG
=Σ22absentsubscriptnormΣ22\displaystyle=\|\Sigma\|_{2-2}= ∥ roman_Σ ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT

By the constraint in Equation (8) in the construction, σi>fisubscript𝜎𝑖subscript𝑓𝑖\sigma_{i}>f_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all i=1,2,p𝑖12𝑝i=1,2,\dots pitalic_i = 1 , 2 , … italic_p. This makes the inequality strict, i.e. f22<σ1subscriptnorm𝑓22subscript𝜎1\|f\|_{2-2}<\sigma_{1}∥ italic_f ∥ start_POSTSUBSCRIPT 2 - 2 end_POSTSUBSCRIPT < italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. ∎

Thus, σ𝜎\sigmaitalic_σ becomes more meaningful when it is minimized during the construction of the function representation.

In linear functions, any scaling of x=supxf(x)2x2superscript𝑥subscriptsupremum𝑥subscriptnorm𝑓𝑥2subscriptnorm𝑥2x^{*}=\sup_{x}\frac{\|f(x)\|_{2}}{\|x\|_{2}}italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_sup start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT divide start_ARG ∥ italic_f ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG will be stretched by the same amount, σ1subscript𝜎1\sigma_{1}italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, under f𝑓fitalic_f. In the extension to bounded-input bounded-output functions, this is no longer the case. In bounded-input bounded-output functions, the point or set of points that achieve the induced norm of the function may be an irregular set in nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. For example, in the first panel of Figure 2, only a few inputs come close to achieving the upper bound given by ±σ1x2plus-or-minussubscript𝜎1subscriptnorm𝑥2\pm\sigma_{1}\|x\|_{2}± italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

Refer to caption
Figure 2: Numerical example of the lifting proposed in this paper: The top panel represents the function (the blue curve) while the orange curve represents its numerical reconstruction using f(x)=Kg(x)𝑓𝑥𝐾𝑔𝑥f(x)=K\circ g(x)italic_f ( italic_x ) = italic_K ∘ italic_g ( italic_x ), with g(x)𝑔𝑥g(x)italic_g ( italic_x ) norm-preserving and injective. The black dotted diagonal lines represent the upper bound on the induced norm of the function, given by ±σ1x2plus-or-minussubscript𝜎1subscriptnorm𝑥2\pm\sigma_{1}\|x\|_{2}± italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (see Corollary 1). The bottom panel shows the norm-preserving, injective lifting that is given in the construction of the Theorem (1).

III Examples and Experiments

We tested the construction given in Theorem 1 on several bounded-input bounded output functions. In each case, f(xi)=Kg(xi)𝑓subscript𝑥𝑖𝐾𝑔subscript𝑥𝑖f(x_{i})=Kg(x_{i})italic_f ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_K italic_g ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) for all xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT tested. The first function tested was a single-input single-output bounded-input bounded-output function with a 2-induced norm of 1:

f(x)=xsin(x)+xcos(x2)2.𝑓𝑥𝑥𝑥𝑥superscript𝑥22f(x)=\frac{x\sin(x)+x\cos(x^{2})}{2}.italic_f ( italic_x ) = divide start_ARG italic_x roman_sin ( italic_x ) + italic_x roman_cos ( italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG start_ARG 2 end_ARG . (20)

The results of this experiment, as well as a visualization of the computed lifting, can be seen in Figure 2.

We also visualized the lifting for a multi-input single-output function, which, for convenience, we will write as several component functions, listed below:

h1(x)subscript1𝑥\displaystyle h_{1}(x)italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x ) =sin(0.1x1x2),absent0.1subscript𝑥1subscript𝑥2\displaystyle=\sin(0.1x_{1}*x_{2}),= roman_sin ( 0.1 italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∗ italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ,
h2(x)subscript2𝑥\displaystyle h_{2}(x)italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) =0.1cos(3x1/x2),\displaystyle=0.1\cos(3\frac{x_{1}/x_{2}}{)},= 0.1 roman_cos ( 3 divide start_ARG italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ) end_ARG ,
h3(x)subscript3𝑥\displaystyle h_{3}(x)italic_h start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_x ) =0.4sin(20x1),absent0.420subscript𝑥1\displaystyle=0.4\sin(20x_{1}),= 0.4 roman_sin ( 20 italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ,
h4(x)subscript4𝑥\displaystyle h_{4}(x)italic_h start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ( italic_x ) =0.3cos(x2+4),absent0.3subscript𝑥24\displaystyle=0.3\cos(x_{2}+4),= 0.3 roman_cos ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + 4 ) ,
h5(x)subscript5𝑥\displaystyle h_{5}(x)italic_h start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT ( italic_x ) =0.3sin(0.1ex1),absent0.30.1superscript𝑒subscript𝑥1\displaystyle=0.3\sin(0.1e^{x_{1}}),= 0.3 roman_sin ( 0.1 italic_e start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ,
h6(x)subscript6𝑥\displaystyle h_{6}(x)italic_h start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT ( italic_x ) =0.2cos(1x12),absent0.21superscriptsubscript𝑥12\displaystyle=0.2\cos(\frac{1}{x_{1}^{2}}),= 0.2 roman_cos ( divide start_ARG 1 end_ARG start_ARG italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) ,
h7(x)subscript7𝑥\displaystyle h_{7}(x)italic_h start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT ( italic_x ) =0.1sin(0.1(x1+x2)),absent0.10.1subscript𝑥1subscript𝑥2\displaystyle=0.1\sin(0.1(x_{1}+x_{2})),= 0.1 roman_sin ( 0.1 ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ) ,
h8(x)subscript8𝑥\displaystyle h_{8}(x)italic_h start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT ( italic_x ) =0.1cos(0.001x22),absent0.10.001superscriptsubscript𝑥22\displaystyle=0.1\cos(0.001x_{2}^{2}),= 0.1 roman_cos ( 0.001 italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,

such that:

f(x)=x22.5i=18hi(x).𝑓𝑥subscriptnorm𝑥22.58𝑖1subscript𝑖𝑥f(x)=\frac{\|x\|_{2}}{2.5}\overset{8}{\underset{i=1}{\sum}}h_{i}(x).italic_f ( italic_x ) = divide start_ARG ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG 2.5 end_ARG over8 start_ARG start_UNDERACCENT italic_i = 1 end_UNDERACCENT start_ARG ∑ end_ARG end_ARG italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) . (21)

The function and computed lifting can be visualized in Figure 3.

Refer to caption
Figure 3: Numerical example of the lifting proposed in this paper: The top panel represents a multi-input single-output bounded-input bounded-output function. The bottom panel shows the norm-preserving lifting computed using the construction given in this paper. Let K=UΣV𝐾𝑈Σsuperscript𝑉K=U\Sigma V^{*}italic_K = italic_U roman_Σ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT be the SVD of K𝐾Kitalic_K. Then the functional g3(x)subscript𝑔3𝑥g_{3}(x)italic_g start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_x ) is the projection of the lifting g𝑔gitalic_g onto v1superscriptsubscript𝑣1v_{1}^{*}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, which corresponds to σ1=1subscript𝜎11\sigma_{1}=1italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 in this instance, and thus lifting further in g3(x)subscript𝑔3𝑥g_{3}(x)italic_g start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_x ) corresponds to increased stretching of x𝑥xitalic_x. In contrast, g2(x)subscript𝑔2𝑥g_{2}(x)italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) and g3(x)subscript𝑔3𝑥g_{3}(x)italic_g start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_x ) correspond with the projections of the lifting onto v2superscriptsubscript𝑣2v_{2}^{*}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and v3superscriptsubscript𝑣3v_{3}^{*}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, which correspond to σ2=σ3=0subscript𝜎2subscript𝜎30\sigma_{2}=\sigma_{3}=0italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 0, i.e. g1subscript𝑔1g_{1}italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and g2subscript𝑔2g_{2}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT store information that is lost from xf(x)𝑥𝑓𝑥x\rightarrow f(x)italic_x → italic_f ( italic_x ).

IV Conclusion

Here we have demonstrated that every bounded-input bounded-output function, f:np:𝑓superscript𝑛superscript𝑝f:\mathbb{R}^{n}\to\mathbb{R}^{p}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, has an SVD-like decomposition with a finite-dimensional representation of the form:

f(x)=UΣv(x)𝑓𝑥𝑈Σ𝑣𝑥f(x)=U\Sigma v(x)italic_f ( italic_x ) = italic_U roman_Σ italic_v ( italic_x )

By leveraging an injective “lifting,” i.e. v:nn+p:𝑣superscript𝑛superscript𝑛𝑝v:\mathbb{R}^{n}\to\mathbb{R}^{n+p}italic_v : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_n + italic_p end_POSTSUPERSCRIPT, this decomposition facilitates the computation of the extensions of null and row spaces for bounded-input and bounded-output functions. A constraint on the lifting to be norm-preserving causes the 2-induced norm of f𝑓fitalic_f to be upper-bounded by the maximum element of ΣΣ\Sigmaroman_Σ. When p=1𝑝1p=1italic_p = 1, the representation also provides a natural extension of the Reisz Reprsentation Theorem to bounded-input bounded-output functionals.

V Acknowledgments

We express our gratitude to Tyler Jarvis, Mark Transtrum, and Jared Whitehead for their invaluable internal reviews of the early stages of this work.

References

  • [1] George Bachman and Lawrence Narici. Functional analysis. Courier Corporation, 2000.
  • [2] David R Barton and Richard Zippel. Polynomial decomposition algorithms. Journal of Symbolic Computation, 1(2):159–168, 1985.
  • [3] Frédéric Brechenmacher. Histoire du théorème de Jordan de la décomposition matricielle (1870-1930). Formes de représentation et méthodes de décomposition. PhD thesis, Ecole des Hautes Etudes en Sciences Sociales (EHESS), 2006.
  • [4] Steven L. Brunton, Marko Budišić, Eurika Kaiser, and J. Nathan Kutz. Modern koopman theory for dynamical systems. SIAM Review, 64(2):229–340, 2022.
  • [5] M. D. Buhmann. Radial basis functions. Acta Numerica, 9:1–38, 2000.
  • [6] Gbolahan P. Dada and Antonios Armaou. Generalized svd reduced-order observers for nonlinear systems. In 2020 American Control Conference (ACC), pages 3473–3478, 2020.
  • [7] Carl Eckart and Gale Young. The approximation of one matrix by another of lower rank. Psychometrika, 1(3):211–218, 1936.
  • [8] JBJ Fourier. Mémoire sur la propagation de la chaleur dans les corps solides,(nepublikováno) pro institute de france. Paris, podáno, 21, 1807.
  • [9] Michael J Greenacre. Theory and applications of correspondence analysis. 1984.
  • [10] Roger A Horn and Charles R Johnson. Matrix analysis. Cambridge university press, 2012.
  • [11] Yuhong Jin, Lei Hou, and Shun Zhong. Extended dynamic mode decomposition with invertible dictionary learning. Neural Networks, 173:106177, 2024.
  • [12] Charles A. Johnson and Enoch Yeung. A class of logistic functions for approximating state-inclusive koopman operators. In 2018 Annual American Control Conference (ACC), pages 4803–4810, 2018.
  • [13] Alexandre Mauroy, Y Susuki, and Igor Mezić. Koopman operator in systems and control. Springer, 2020.
  • [14] Yuhuang Meng, Jianguo Huang, and Yue Qiu. Koopman operator learning using invertible neural networks. Journal of Computational Physics, 501:112795, 2024.
  • [15] Igor Mezić. Analysis of fluid flows via spectral properties of the koopman operator. Annual review of fluid mechanics, 45:357–378, 2013.
  • [16] Tao Qian, Mang I Vai, and Yuesheng Xu. Wavelet analysis and applications. Springer Science & Business Media, 2007.
  • [17] Walter Rudin. Real and complex analysis. 1987. Cited on, 156:16, 1987.
  • [18] Qinghua Tao, Francesco Tonin, Panagiotis Patrinos, and Johan AK Suykens. Nonlinear svd with asymmetric kernels: feature learning and asymmetric nystr\\\backslash\” om method. arXiv preprint arXiv:2306.07040, 2023.
  • [19] Prabhakar G Vaidya, Nithin Nagaraj, et al. A non-linear generalization of singular value decomposition and its application to cryptanalysis. arXiv preprint arXiv:0711.4910, 2007.
  • [20] Charles F. Van Loan. Generalizing the singular value decomposition. SIAM Journal on Numerical Analysis, 13(1):76–83, 1976.
  • [21] Matthew O Williams, Ioannis G Kevrekidis, and Clarence W Rowley. A data–driven approximation of the koopman operator: Extending dynamic mode decomposition. Journal of Nonlinear Science, 25:1307–1346, 2015.
  • [22] Enoch Yeung, Soumya Kundu, and Nathan Hodas. Learning deep neural network representations for koopman operators of nonlinear dynamical systems, 2017.
  • [23] Blaz Zupan, Marko Bohanec, Ivan Bratko, and Janez Demsar. Machine learning by function decomposition. In ICML, pages 421–429. Citeseer, 1997.

VI Appendix

Lemma 1.

There does not exist an injective, norm-preserving mapping, g:np:𝑔superscript𝑛superscript𝑝g:\mathbb{R}^{n}\rightarrow\mathbb{R}^{p}italic_g : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, with an associated matrix, Kp×p𝐾superscript𝑝𝑝K\in\mathbb{R}^{p\times p}italic_K ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_p end_POSTSUPERSCRIPT, satisfying

f(x)=Kg(x)𝑓𝑥𝐾𝑔𝑥f(x)=K\circ g(x)italic_f ( italic_x ) = italic_K ∘ italic_g ( italic_x ) (22)

for all bounded-input bounded-output functions, f:np:𝑓superscript𝑛superscript𝑝f:\mathbb{R}^{n}\to\mathbb{R}^{p}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, with n,p+𝑛𝑝superscriptn,p\in\mathbb{N}^{+}italic_n , italic_p ∈ blackboard_N start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT.

Proof.

Consider a bounded-input bounded-output function, f:np:𝑓superscript𝑛superscript𝑝f:\mathbb{R}^{n}\rightarrow\mathbb{R}^{p}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, such that

f(x)=[f1(x)f2(x)fp(x)]=[a1x2a2x2apx2]𝑓𝑥matrixsubscript𝑓1𝑥subscript𝑓2𝑥subscript𝑓𝑝𝑥matrixsubscript𝑎1subscriptnorm𝑥2subscript𝑎2subscriptnorm𝑥2subscript𝑎𝑝subscriptnorm𝑥2\displaystyle f(x)=\begin{bmatrix}f_{1}(x)\\ f_{2}(x)\\ \vdots\\ f_{p}(x)\end{bmatrix}=\begin{bmatrix}a_{1}\|x\|_{2}\\ a_{2}\|x\|_{2}\\ \vdots\\ a_{p}\|x\|_{2}\end{bmatrix}italic_f ( italic_x ) = [ start_ARG start_ROW start_CELL italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) end_CELL end_ROW end_ARG ] = [ start_ARG start_ROW start_CELL italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] (23)

Note that f(x)𝑓𝑥f(x)italic_f ( italic_x ) is trivially bounded-input bounded-output since fq(i)(x)2aix2subscriptnormsubscript𝑓𝑞𝑖𝑥2subscript𝑎𝑖subscriptnorm𝑥2\|f_{q(i)}(x)\|_{2}\leq a_{i}\|x\|_{2}∥ italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT for i=1,2,,n𝑖12𝑛i=1,2,\dots,nitalic_i = 1 , 2 , … , italic_n.

Let g:np:𝑔superscript𝑛superscript𝑝g:\mathbb{R}^{n}\rightarrow\mathbb{R}^{p}italic_g : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT be any norm-preserving, injective mapping, i.e. g(x)2=x2subscriptnorm𝑔𝑥2subscriptnorm𝑥2\|g(x)\|_{2}=\|x\|_{2}∥ italic_g ( italic_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT for all xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, and, for all x1x2n,g(x1)g(x2)formulae-sequencesubscript𝑥1subscript𝑥2superscript𝑛𝑔subscript𝑥1𝑔subscript𝑥2x_{1}\neq x_{2}\in\mathbb{R}^{n},g(x_{1})\neq g(x_{2})italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≠ italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_g ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ≠ italic_g ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). One consequence of this last property is that for a set with no repeats, 𝒳n𝒳superscript𝑛\mathcal{X}\subset\mathbb{R}^{n}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, with a given cardinality, c𝑐citalic_c, the set {g(x)|x𝒳}conditional-set𝑔𝑥𝑥𝒳\{g(x)|x\in\mathcal{X}\}{ italic_g ( italic_x ) | italic_x ∈ caligraphic_X } must have the same cardinality when repeats are removed. Let |||\cdot|| ⋅ | be an operation measuring the repeat-removed cardinality of a set. Then, |𝒳|=|{g(x)|x𝒳}|=c𝒳conditional-set𝑔𝑥𝑥𝒳𝑐|\mathcal{X}|=|\{g(x)|x\in\mathcal{X}\}|=c| caligraphic_X | = | { italic_g ( italic_x ) | italic_x ∈ caligraphic_X } | = italic_c.

Because g(x)𝑔𝑥g(x)italic_g ( italic_x ) is a norm-preserving mapping, we can, without loss of generality, consider the actions of f𝑓fitalic_f and g𝑔gitalic_g on a set of inputs, rnsubscript𝑟superscript𝑛\mathcal{H}_{r}\subset\mathbb{R}^{n}caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊂ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, such that xrx2=r𝑥subscript𝑟subscriptnorm𝑥2𝑟x\in\mathcal{H}_{r}\implies\|x\|_{2}=ritalic_x ∈ caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⟹ ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_r. Now consider the set 𝒢r={g(x)|xr}subscript𝒢𝑟conditional-set𝑔𝑥𝑥subscript𝑟\mathcal{G}_{r}=\{g(x)|x\in\mathcal{H}_{r}\}caligraphic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = { italic_g ( italic_x ) | italic_x ∈ caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT }. Note that if n>1𝑛1n>1italic_n > 1, then |𝒢r|=|r|=20subscript𝒢𝑟subscript𝑟superscript2subscript0|\mathcal{G}_{r}|=|\mathcal{H}_{r}|=2^{\aleph_{0}}| caligraphic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT | = | caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT | = 2 start_POSTSUPERSCRIPT roman_ℵ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, and 𝒢rsubscript𝒢𝑟\mathcal{G}_{r}caligraphic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT is necessarily a subset of an origin-centered hypersphere of radius r𝑟ritalic_r and dimension p𝑝pitalic_p.

Define Kp×p𝐾superscript𝑝𝑝K\in\mathbb{R}^{p\times p}italic_K ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × italic_p end_POSTSUPERSCRIPT, an arbitrary linear function, such that its singular value decomposition is given by K=UΣV𝐾𝑈Σsuperscript𝑉K=U\Sigma V^{*}italic_K = italic_U roman_Σ italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, with the standard definitions of U,Σ𝑈ΣU,\Sigmaitalic_U , roman_Σ, and Vsuperscript𝑉V^{*}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Now suppose that f(x)=(Kg)(x)𝑓𝑥𝐾𝑔𝑥f(x)=(K\circ g)(x)italic_f ( italic_x ) = ( italic_K ∘ italic_g ) ( italic_x ). Without loss of generality, we may consider U=I𝑈𝐼U=Iitalic_U = italic_I and V=Isuperscript𝑉𝐼V^{*}=Iitalic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_I, with I𝐼Iitalic_I of the appropriate dimensions. With these values of U𝑈Uitalic_U and Vsuperscript𝑉V^{*}italic_V start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, f(x)=(Kg)(x)fq(i)(x)=σigi(x)𝑓𝑥𝐾𝑔𝑥subscript𝑓𝑞𝑖𝑥subscript𝜎𝑖subscript𝑔𝑖𝑥f(x)=(K\circ g)(x)\implies f_{q(i)}(x)=\sigma_{i}g_{i}(x)italic_f ( italic_x ) = ( italic_K ∘ italic_g ) ( italic_x ) ⟹ italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT ( italic_x ) = italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ).

We now note a relationship between the image of g𝑔gitalic_g and the pre-image of K𝐾Kitalic_K that must be satisfied for f(x)=(Kg)(x)𝑓𝑥𝐾𝑔𝑥f(x)=(K\circ g)(x)italic_f ( italic_x ) = ( italic_K ∘ italic_g ) ( italic_x ) to hold. Consider 𝒴𝒴\mathcal{Y}caligraphic_Y, the set of vectors such that Ky=f(x)𝐾𝑦𝑓𝑥Ky=f(x)italic_K italic_y = italic_f ( italic_x ) for y𝒴𝑦𝒴y\in\mathcal{Y}italic_y ∈ caligraphic_Y and xr𝑥subscript𝑟x\in\mathcal{H}_{r}italic_x ∈ caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT. Given our choice of f𝑓fitalic_f, f(x)𝑓𝑥f(x)italic_f ( italic_x ) is identical for all xr𝑥subscript𝑟x\in\mathcal{H}_{r}italic_x ∈ caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT. If n>1𝑛1n>1italic_n > 1 (i.e. |r|=20subscript𝑟superscript2subscript0|\mathcal{H}_{r}|=2^{\aleph_{0}}| caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT | = 2 start_POSTSUPERSCRIPT roman_ℵ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT), then to simultaneously satisfy injectivity and representation, the intersection of 𝒴𝒴\mathcal{Y}caligraphic_Y and 𝒢rsubscript𝒢𝑟\mathcal{G}_{r}caligraphic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT must have the cardinality of the continuum, i.e. |𝒢r𝒴|=20subscript𝒢𝑟𝒴superscript2subscript0|\mathcal{G}_{r}\cap\mathcal{Y}|=2^{\aleph_{0}}| caligraphic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∩ caligraphic_Y | = 2 start_POSTSUPERSCRIPT roman_ℵ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT.

Now let Pi(g(x))subscript𝑃𝑖𝑔𝑥P_{i}(g(x))italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g ( italic_x ) ) be the projection of g(x)𝑔𝑥g(x)italic_g ( italic_x ) onto the one-dimensional subspace spanned by visubscriptsuperscript𝑣𝑖v^{*}_{i}italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Given the norm-preserving constraints on g(x)𝑔𝑥g(x)italic_g ( italic_x ), we then have that Pi(g(x))=bix2subscript𝑃𝑖𝑔𝑥subscript𝑏𝑖subscriptnorm𝑥2P_{i}(g(x))=b_{i}\|x\|_{2}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g ( italic_x ) ) = italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, for some 1bk11subscript𝑏𝑘1-1\leq b_{k}\leq 1- 1 ≤ italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ 1. Note that since there is no upper bound on σ𝜎\sigmaitalic_σ, σisubscript𝜎𝑖\sigma_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (and therefore bisubscript𝑏𝑖b_{i}italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) can always be chosen such that σibi=aksubscript𝜎𝑖subscript𝑏𝑖subscript𝑎𝑘\sigma_{i}b_{i}=a_{k}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, implying that the intersection of the image of g(x)𝑔𝑥g(x)italic_g ( italic_x ) for xr𝑥subscript𝑟x\in\mathcal{H}_{r}italic_x ∈ caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT and 𝒴𝒴\mathcal{Y}caligraphic_Y can be non-empty.

Let 𝒴isubscript𝒴𝑖\mathcal{Y}_{i}\in\mathcal{H}caligraphic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_H be the set of all vectors reachable from g𝑔gitalic_g and in the pre-image of K𝐾Kitalic_K satisfying

Kyi=[c1c2σigi(gL(yi))cp],for yi𝒴iformulae-sequence𝐾subscript𝑦𝑖matrixsubscript𝑐1subscript𝑐2subscript𝜎𝑖subscript𝑔𝑖superscript𝑔𝐿subscript𝑦𝑖subscript𝑐𝑝for subscript𝑦𝑖subscript𝒴𝑖\displaystyle Ky_{i}=\begin{bmatrix}c_{1}\\ c_{2}\\ \vdots\\ \sigma_{i}g_{i}(g^{-L}(y_{i}))\\ \vdots\\ c_{p}\end{bmatrix},\quad\text{for }y_{i}\in\mathcal{Y}_{i}italic_K italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUPERSCRIPT - italic_L end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_c start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] , for italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT

with cjipsubscript𝑐𝑗𝑖superscript𝑝c_{j\neq i}\in\mathbb{R}^{p}italic_c start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT not specified.

Thus, for y𝒴=(𝑖𝑝𝒴i)𝑦𝒴𝑝𝑖subscript𝒴𝑖y\in\mathcal{Y}=(\overset{p}{\underset{i}{\cap}}\mathcal{Y}_{i})italic_y ∈ caligraphic_Y = ( overitalic_p start_ARG underitalic_i start_ARG ∩ end_ARG end_ARG caligraphic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ),

Ky=[σ1g1(gL(y))σ2g2(gL(y))σpgp(gL(y))]𝐾𝑦matrixsubscript𝜎1subscript𝑔1superscript𝑔𝐿𝑦subscript𝜎2subscript𝑔2superscript𝑔𝐿𝑦subscript𝜎𝑝subscript𝑔𝑝superscript𝑔𝐿𝑦Ky=\begin{bmatrix}\sigma_{1}g_{1}(g^{-L}(y))\\ \sigma_{2}g_{2}(g^{-L}(y))\\ \vdots\\ \sigma_{p}g_{p}(g^{-L}(y))\\ \end{bmatrix}italic_K italic_y = [ start_ARG start_ROW start_CELL italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_g start_POSTSUPERSCRIPT - italic_L end_POSTSUPERSCRIPT ( italic_y ) ) end_CELL end_ROW start_ROW start_CELL italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_g start_POSTSUPERSCRIPT - italic_L end_POSTSUPERSCRIPT ( italic_y ) ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_σ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_g start_POSTSUPERSCRIPT - italic_L end_POSTSUPERSCRIPT ( italic_y ) ) end_CELL end_ROW end_ARG ] (24)

and furthermore r𝒴=subscript𝑟𝒴\mathcal{H}_{r}\cap\mathcal{Y}=\emptysetcaligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∩ caligraphic_Y = ∅, unless σ𝜎\sigmaitalic_σ is chosen such that σiPi(gi(gL(y)))=aigiL(y)2subscript𝜎𝑖subscript𝑃𝑖subscript𝑔𝑖superscript𝑔𝐿𝑦subscript𝑎𝑖subscriptnormsuperscriptsubscript𝑔𝑖𝐿𝑦2\sigma_{i}P_{i}(g_{i}(g^{-L}(y)))=a_{i}\|g_{i}^{-L}(y)\|_{2}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUPERSCRIPT - italic_L end_POSTSUPERSCRIPT ( italic_y ) ) ) = italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_L end_POSTSUPERSCRIPT ( italic_y ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, in which case |r𝒴|=1subscript𝑟𝒴1|\mathcal{H}_{r}\cap\mathcal{Y}|=1| caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∩ caligraphic_Y | = 1, see Figure 4. This non-infinite cardinality violates the injectivity of g𝑔gitalic_g since |𝒢|=20𝒢superscript2subscript0|\mathcal{G}|=2^{\aleph_{0}}| caligraphic_G | = 2 start_POSTSUPERSCRIPT roman_ℵ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, implying that an injective, norm-preserving mapping g(x)𝑔𝑥g(x)italic_g ( italic_x ) can only satisfy f(x)=(Kg)(x)𝑓𝑥𝐾𝑔𝑥f(x)=(K\circ g)(x)italic_f ( italic_x ) = ( italic_K ∘ italic_g ) ( italic_x ) for a single x𝑥xitalic_x when fq(i)(x)=aix2subscript𝑓𝑞𝑖𝑥subscript𝑎𝑖subscriptnorm𝑥2f_{q(i)}(x)=a_{i}\|x\|_{2}italic_f start_POSTSUBSCRIPT italic_q ( italic_i ) end_POSTSUBSCRIPT ( italic_x ) = italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. ∎

Refer to caption
Figure 4: 2-dimensional demonstration of the geometric reasoning in Lemma 1: In this example, g:22:𝑔superscript2superscript2g:\mathbb{R}^{2}\to\mathbb{R}^{2}italic_g : blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, with f:22:𝑓superscript2superscript2f:\mathbb{R}^{2}\to\mathbb{R}^{2}italic_f : blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. The blue, yellow, and red curves correspond to the sets rsubscript𝑟\mathcal{H}_{r}caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT, 𝒴1subscript𝒴1\mathcal{Y}_{1}caligraphic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, and 𝒴2subscript𝒴2\mathcal{Y}_{2}caligraphic_Y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT mentioned in the proof. The key points of the proof are 1) to be norm-preserving, g𝑔gitalic_g must map rrsubscript𝑟subscript𝑟\mathcal{H}_{r}\to\mathcal{H}_{r}caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT → caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT, 2) to satisfy representation of f(x)=ax2𝑓𝑥𝑎subscriptnorm𝑥2f(x)=a\|x\|_{2}italic_f ( italic_x ) = italic_a ∥ italic_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (an arbitrary BIBO function) g𝑔gitalic_g must map r𝒴1subscript𝑟subscript𝒴1\mathcal{H}_{r}\to\mathcal{Y}_{1}caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT → caligraphic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and simultaneously r𝒴2subscript𝑟subscript𝒴2\mathcal{H}_{r}\to\mathcal{Y}_{2}caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT → caligraphic_Y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, and 3) the cardinality of the intersection of rsubscript𝑟\mathcal{H}_{r}caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT, 𝒴1subscript𝒴1\mathcal{Y}_{1}caligraphic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, and 𝒴2subscript𝒴2\mathcal{Y}_{2}caligraphic_Y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is at most 1. This contradicts the injectivity of g𝑔gitalic_g, since, in order for g𝑔gitalic_g to be injective, the set {g(x)|xr}conditional-set𝑔𝑥𝑥subscript𝑟\{g(x)|x\in\mathcal{H}_{r}\}{ italic_g ( italic_x ) | italic_x ∈ caligraphic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } must have cardinality of the continuum, 20superscript2subscript02^{\aleph_{0}}2 start_POSTSUPERSCRIPT roman_ℵ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Therefore a norm-preserving, injective mapping from nnsuperscript𝑛superscript𝑛\mathbb{R}^{n}\to\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT composed with Kn×n𝐾superscript𝑛𝑛K\in\mathbb{R}^{n\times n}italic_K ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT cannot represent all bounded-input bounded-output functions.