Generalized subdifferentials: a Baire categorical approach

Jonathan Borwein

TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY
Volume 353, Number 10, Pages 3875–3893
S 0002-9947(01)02820-3
Article electronically published on May 14, 2001

JONATHAN M. BORWEIN, WARREN B. MOORS, AND XIANFU WANG

Abstract. We use Baire categorical arguments to construct pathological locally Lipschitz functions. The origins of this approach can be traced back to Banach and Mazurkiewicz (1931) who independently used similar categorical arguments to show that “almost every continuous real-valued function defined on [0,1] is nowhere differentiable”. As with the results of Banach and Mazurkiewicz, it appears that it is easier to show that almost every function possesses a certain property than to construct a single concrete example. Among the most striking results contained in this paper are: almost every 1-Lipschitz function defined on a Banach space has a Clarke subdifferential mapping that is identically equal to the dual ball; if $\{T_1, T_2, \ldots, T_n\}$ is a family of maximal cyclically monotone operators defined on a Banach space $X$ then there exists a real-valued locally Lipschitz function $g$ such that $\partial_0 g(x) = \mathrm{co}\{T_1(x), T_2(x), \ldots, T_n(x)\}$ for each $x \in X$; in a separable Banach space each non-empty weak$^*$ compact convex subset in the dual space is identically equal to the approximate subdifferential mapping of some Lipschitz function; and for locally Lipschitz functions defined on separable spaces the notions of strong and weak integrability coincide.

1. Introduction

An important aspect of developing a mathematical theory is in producing both examples and counterexamples that illuminate the content and boundaries of the subject. In this paper we give a general method for constructing examples and counterexamples for the differentiability theory of Lipschitz functions.
Received by the editors March 24, 1999 and, in revised form, February 25, 2000.
1991 Mathematics Subject Classification. Primary 49J52, 54E52.
Key words and phrases. Subdifferentials, differentiability, Baire category, upper semi-continuous set-valued map, $T$-Lipschitz function.
Research of the first author was supported by NSERC and the Shrum endowment of Simon Fraser University. Research of the second author was supported by a Marsden fund grant, VUW 703, administered by the Royal Society of New Zealand.
© 2001 American Mathematical Society
License or copyright restrictions may apply to redistribution; see http://www.ams.org/journal-terms-of-use

The first and perhaps best known counterexample in differentiability theory is the construction of a continuous nowhere differentiable function. The explicit constructions given in the 19th century were later (in 1931) augmented by the use of Baire categorical arguments. Since this time the use of Baire category for the construction of functions (either well behaved or pathological) has been applied to several areas of analysis (see [12], [13] and [23], to name but a few). In this paper we continue this tradition by using Baire category arguments to construct Lipschitz functions that have ‘large’ generalized derivatives. The first and most crucial step towards achieving this result is to produce a candidate complete metric space on which we may apply the Baire category theorem. Here we consider the class of $T$-Lipschitz functions, denoted $X_T$. Loosely speaking (though precise in a smooth space), for a weak$^*$ cusco $T : X \to 2^{X^*}$ from a Banach space $X$ into its dual $X^*$, we say that a locally Lipschitz function $f$ on $X$ is $T$-Lipschitz if $\nabla f(x) \in T(x)$ whenever $\nabla f(x)$ exists. In Lemma 5 we show that for a fixed weak$^*$ cusco $T$ the set of all $T$-Lipschitz functions forms a complete metric space (under an appropriately defined metric).
This simple result provides the basis which enables us to derive a plentiful supply of both examples and counterexamples.

Notation. For a normed linear space $(X, \|\cdot\|)$, we denote by $X^*$ its dual space and set
\[
B_X := \{x \in X : \|x\| \le 1\}, \qquad B_{X^*} := \{x^* \in X^* : \|x^*\| \le 1\}, \qquad S_X := \{x \in X : \|x\| = 1\},
\]
\[
B_\delta[x] := \{y \in X : \|x - y\| \le \delta\}, \qquad B_\delta(x) := \{y \in X : \|x - y\| < \delta\},
\]
\[
I_{B_\delta[x]}(y) := \begin{cases} 0 & \text{if } y \in B_\delta[x], \\ +\infty & \text{otherwise.} \end{cases}
\]
For a non-empty subset $E$ of $X^*$ we denote by $\overline{E}^{w^*}$ the weak$^*$ closure of $E$ and by $\overline{\mathrm{co}}^{w^*} E$ the weak$^*$ closed convex hull of $E$. In a topological space $(A, \tau)$ we shall denote by $\mathcal{B}_A$ the Borel sets on $A$, that is, the $\sigma$-algebra generated by the open subsets of $A$, and as usual $\lambda$ will denote the Lebesgue measure.

The structure of the paper is as follows. In the remainder of Section 1 we will review some basic facts concerning the Clarke and approximate subdifferentials, then in Section 2 we will derive the basic properties of the $T$-Lipschitz functions. In Section 3 we will show that in any separable Banach space $G_f := \{g \in X_T : \partial_a f(x) \subseteq \partial_a g(x) \text{ for all } x \in A\}$ is residual in $(X_T, \rho)$ for each $f \in X_T$ and then derive some of the consequences of this result. Section 4 deals with extending the results from Section 3 to non-separable Banach spaces, while in Section 5 we briefly look at the question of how to determine when $X_T \ne \emptyset$. This section also examines the question of the existence of Lipschitz functions with ‘minimal’ subdifferential mappings. This provides a contrast to the results contained in Sections 3 and 4.

We begin by recalling some preliminary definitions and properties of locally Lipschitz functions defined on Banach spaces.

1.1. Uscos and cuscos. Let $T$ be a set-valued mapping from a topological space $A$ into the dual of a normed linear space $X$. We say that $T$ is weak$^*$ upper semicontinuous on $A$ if for each weak$^*$ open subset $W$ of $X^*$, $\{x \in A : T(x) \subseteq W\}$ is open in $A$.
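To make the definition of upper semicontinuity concrete, here is a small worked check (an illustration added here, not taken from the paper; the map $\Phi$ is a hypothetical example). Take $A = X = \mathbb{R}$, so that $X^* = \mathbb{R}$ and the weak$^*$ topology is the usual one.

```latex
% Hypothetical example map \Phi : \mathbb{R} \to 2^{\mathbb{R}}.
\[
\Phi(x) := \begin{cases} \{-1\} & \text{if } x < 0, \\ [-1,1] & \text{if } x = 0, \\ \{1\} & \text{if } x > 0. \end{cases}
\]
% For an open set W \subseteq \mathbb{R}, put U_W := \{x : \Phi(x) \subseteq W\}. Then
\[
U_W = \begin{cases}
\mathbb{R} & \text{if } [-1,1] \subseteq W, \\
\mathbb{R} \setminus \{0\} & \text{if } -1 \in W \text{ and } 1 \in W \text{ but } [-1,1] \not\subseteq W, \\
(0,\infty) & \text{if } 1 \in W \text{ and } -1 \notin W, \\
(-\infty,0) & \text{if } -1 \in W \text{ and } 1 \notin W, \\
\emptyset & \text{if } -1 \notin W \text{ and } 1 \notin W,
\end{cases}
\]
% which is open in every case, so \Phi is upper semicontinuous on \mathbb{R}.
% By contrast, redefining \Phi(0) := \{0\} destroys the property:
% for W = (-1/2, 1/2) one gets U_W = \{0\}, which is not open.
```

The enlarged value at the origin is exactly what upper semicontinuity forces: the image at a point must absorb the limits of nearby images.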
When the images of $T$ are non-empty and compact we call $T$ a weak$^*$ usco and if, in addition, the images of $T$ are also convex, then we call $T$ a weak$^*$ cusco. We call $T$ a minimal weak$^*$ usco (cusco) if its graph does not properly contain the graph of any other weak$^*$ usco (cusco) on $A$. By the graph of $T$ we mean the set $\mathrm{Gr}(T) := \{(x, x^*) : x^* \in T(x)\}$, which is closed whenever $T$ is an usco. The following result from [1] enables us to generate uscos and cuscos from densely defined set-valued mappings.

Lemma 1 ([1]). Let $T$ be a densely defined set-valued mapping that maps from a topological space $A$ into the dual of a Banach space $X$. If $T$ is locally bounded on $A$ then there exists a unique smallest weak$^*$ usco (weak$^*$ cusco) containing $T$, denoted $\mathrm{USC}(T)$ ($\mathrm{CSC}(T)$) and given by
\[
\mathrm{USC}(T)(x) := \bigcap \{\overline{T(V)}^{w^*} : V \text{ is an open neighbourhood of } x\},
\]
\[
\mathrm{CSC}(T)(x) := \bigcap \{\overline{\mathrm{co}}^{w^*}\, T(V) : V \text{ is an open neighbourhood of } x\}.
\]

The next result reveals one of the key properties enjoyed by minimal uscos (cuscos).

Lemma 2 ([6]). Let $A$ be a non-empty open subset of a Banach space $X$. If $T : A \to 2^{X^*}$ is a weak$^*$ usco (weak$^*$ cusco) and $S : A \to 2^{X^*}$ is a minimal weak$^*$ usco (weak$^*$ cusco) and $T(x) \cap S(x) \ne \emptyset$ for each $x \in A$, then $S(x) \subseteq T(x)$ for all $x \in A$.

1.2. Subderivatives and subdifferentials. Let $f : A \subseteq X \to \mathbb{R}$ be a locally Lipschitz function defined on a non-empty open set $A$ of a Banach space $X$. The Clarke derivative of $f$ at $x \in A$ [11] is given by
\[
f^0(x, v) := \limsup_{t \downarrow 0,\ y \to x} \frac{f(y + tv) - f(y)}{t}.
\]
The upper and lower Dini derivatives of $f$ at $x$ are given by
\[
f^+(x, v) := \limsup_{t \downarrow 0} \frac{f(x + tv) - f(x)}{t}
\quad\text{and}\quad
f^-(x, v) := \liminf_{t \downarrow 0} \frac{f(x + tv) - f(x)}{t}.
\]
The corresponding generalized subdifferentials are defined by
\[
\partial_\sharp f(x) := \{x^* \in X^* : x^*(v) \le f^\sharp(x, v) \text{ for all } v \in X\},
\]
where $\sharp$ is one of $0$, $+$, $-$.
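The difference between these subdifferentials is already visible in one dimension. As a worked illustration (added here as a sketch; the functions $p(x) := |x|$ and $q(x) := -|x|$ on $X = \mathbb{R}$ are hypothetical examples chosen for the computation, not taken from the paper), evaluate the three derivatives at $x = 0$:

```latex
% p(x) = |x|: the Dini quotient at 0 is (|tv| - 0)/t = |v|, and
% |p(y+tv) - p(y)| \le t|v| for every y, so all three derivatives agree.
\[
p^0(0,v) = p^+(0,v) = p^-(0,v) = |v|,
\qquad
\partial_0 p(0) = \partial_+ p(0) = \partial_- p(0) = [-1,1].
\]
% q(x) = -|x|: the Dini quotient at 0 is -|v|, while choosing y = -tv in the
% Clarke quotient gives (q(0) - q(-tv))/t = |v|, the largest value that a
% 1-Lipschitz function allows.
\[
q^0(0,v) = |v|, \qquad q^+(0,v) = q^-(0,v) = -|v|,
\]
\[
\partial_0 q(0) = [-1,1],
\qquad
\partial_+ q(0) = \partial_- q(0)
  = \{x^* \in \mathbb{R} : x^* v \le -|v| \text{ for all } v \in \mathbb{R}\} = \emptyset.
\]
```

So the Clarke subdifferential cannot distinguish $p$ from $q$ at the origin, whereas the Dini subdifferentials can.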
When $X$ is a smooth Banach space (i.e., has an equivalent Gâteaux differentiable renorm), the approximate subdifferential [3] is given by
\[
\partial_a f(x) := \mathrm{USC}(\partial_- f)(x).
\]
If in addition $(X^*, \text{weak}^*)$ is angelic (e.g. when $X$ is weakly Lindelöf determined [14], which includes weakly compactly generated spaces) then
\[
\partial_a f(x) = \{x^* : x^* = w^*\text{-}\lim_{x_n \to x} x_n^*, \text{ and } x_n^* \in \partial_- f(x_n)\}.
\]
In all cases $\partial_0 f$ is a weak$^*$ cusco on $A$ and $\partial_a f$ is a weak$^*$ usco on $A$.

2. Basic properties of the space of $T$-Lipschitz functions

Let $T$ be a weak$^*$ cusco that maps from a non-empty open subset $A$ of a Banach space $X$ into its dual space $X^*$. For such a cusco mapping one may consider the following (possibly empty) set of locally Lipschitz functions defined on $A$, called the $T$-Lipschitz functions on $A$:
\[
X_T := \{f \in \mathbb{R}^A : f \text{ is locally Lipschitz and } \partial_0 f(x) \subseteq T(x) \text{ for all } x \in A\}.
\]
When $X$ is smooth, we have by [21] the following simplified definition:
\[
X_T = \{f \in \mathbb{R}^A : f \text{ is locally Lipschitz and } \nabla f(x) \in T(x) \text{ whenever } \nabla f(x) \text{ exists}\}.
\]
On $X_T$, we may define a metric $\rho$ by $\rho(f, g) := \min\{1, d(f, g)\}$, where $d(f, g) := \sup_{x \in A} |f(x) - g(x)|$. If $T$ is identically equal to some non-empty weak$^*$ compact, convex subset $C$ of $X^*$ then we may simply write $X_C$ in place of $X_T$. Some basic properties of $T$ and $X_T$ are as follows.

Lemma 3. Let $X$ be a Banach space and let $A$ be a metric space, then each weak$^*$ usco from $A$ into $X^*$ is locally bounded on $A$.

Proof. Let us assume, in order to obtain a contradiction, that $T$ is not locally bounded on $A$. Then there exists a point $x_0 \in A$ such that for each $n \in \mathbb{N}$ the set $T(B_{1/n}(x_0)) \setminus nB_{X^*} \ne \emptyset$. Using this we may construct two sequences $(x_n : n \in \mathbb{N})$ in $A$ and $(x_n^* : n \in \mathbb{N})$ in $X^*$ so that $(x_n : n \in \mathbb{N})$ converges to $x_0$ and $x_n^* \in T(x_n) \setminus nB_{X^*}$ for all $n \in \mathbb{N}$. Now if we set $K := \{x_0, x_1, x_2, \ldots, x_n, \ldots$
$\}$, then $K$ is compact and so $T(K)$ is weak$^*$ compact. Therefore, by the uniform boundedness theorem $T(K)$ is bounded, which is impossible since $\{x_n^* : n \in \mathbb{N}\} \subseteq T(K)$ is unbounded. Hence, $T$ must be locally bounded on $A$.

With a little extra effort one can show that each weak$^*$ usco mapping from a $q$-space [19] into $X^*$ is locally bounded. However, we have no need for this extra generality here.

Lemma 4. Let $A$ be a non-empty open subset of a normed linear space $X$ and let $T : A \to 2^{X^*}$ be a weak$^*$ cusco on $A$, then $X_T$ is a convex sub-lattice of the locally Lipschitz functions defined on $A$.

Proof. By Propositions 2.3.12 and 2.3.3 in [11], we have $\partial_0(f \vee g)(x) \subseteq \mathrm{co}\{\partial_0 f(x), \partial_0 g(x)\} \subseteq T(x)$ for all $x \in A$, since $T(x)$ is convex. Similarly, one can show that $\partial_0(f \wedge g)(x) \subseteq T(x)$ for all $x \in A$. Also for any $0 \le \lambda \le 1$ we have
\[
\partial_0(\lambda f + (1 - \lambda) g)(x) \subseteq \lambda \partial_0 f(x) + (1 - \lambda) \partial_0 g(x) \subseteq \lambda T(x) + (1 - \lambda) T(x) = T(x)
\]
for all $x \in A$.

Lemma 5. Let $A$ be a non-empty open subset of a Banach space $X$ and let $T : A \to 2^{X^*}$ be a weak$^*$ cusco on $A$, then $(X_T, \rho)$ is a complete metric space.

Proof. Let $(f_n : n \in \mathbb{N})$ be a Cauchy sequence in $(X_T, \rho)$ and let $f_\infty$ be the pointwise limit of the sequence $(f_n : n \in \mathbb{N})$. Note: $f_\infty$ is well-defined since for each $x \in A$, $(f_n(x) : n \in \mathbb{N})$ is a Cauchy sequence in $\mathbb{R}$. Now since $T$ is locally bounded, $f_\infty$ is locally Lipschitz on $A$. Indeed, if $U$ is a convex open neighbourhood of some point $x_0 \in A$ and $T(U) \subseteq nB_{X^*}$, then for each $f \in X_T$, $|f(x) - f(y)| \le n\|x - y\|$ for all $x, y \in U$. So in particular we have
\[
|f_\infty(x) - f_\infty(y)| = \lim_{k \to \infty} |f_k(x) - f_k(y)| \le n\|x - y\|
\]
for all $x, y \in U$. We now need to show that $\partial_0 f_\infty(x) \subseteq T(x)$ for all $x \in A$. To do this it will suffice to show that for each $x_0 \in A$, $y \in S_X$ and $\varepsilon > 0$,
\[
f_\infty^0(x_0, y) = \max\{x^*(y) : x^* \in \partial_0 f_\infty(x_0)\} \le \max\{x^*(y) : x^* \in T(x_0)\} + \varepsilon.
\]
So let us fix $x_0 \in A$, $y \in S_X$ and $\varepsilon > 0$.
Now since the mapping $x \mapsto \max\{x^*(y) : x^* \in T(x)\}$ is upper semi-continuous on $A$ there exists a $\delta > 0$ such that
\[
\max\{x^*(y) : x^* \in T(z)\} < \max\{x^*(y) : x^* \in T(x_0)\} + \varepsilon
\]
for all $z \in B_{2\delta}(x_0)$. From this and the Lebourg mean-value theorem it follows that
\[
\frac{f(z + \lambda y) - f(z)}{\lambda} \le \max\{x^*(y) : x^* \in T(x_0)\} + \varepsilon
\]
for all $f \in X_T$, $0 < \lambda < \delta$ and $z \in B_\delta(x_0)$. Therefore,
\[
\frac{f_\infty(z + \lambda y) - f_\infty(z)}{\lambda} = \lim_{n \to \infty} \frac{f_n(z + \lambda y) - f_n(z)}{\lambda} \le \max\{x^*(y) : x^* \in T(x_0)\} + \varepsilon
\]
for all $0 < \lambda < \delta$ and $z \in B_\delta(x_0)$. Hence, $f_\infty^0(x_0, y) \le \max\{x^*(y) : x^* \in T(x_0)\} + \varepsilon$. This completes the proof.

The following result follows from Lemmas 1 and 3.

Proposition 1. Let $F$ be a family of real-valued locally Lipschitz functions defined on a non-empty open subset $A$ of a Banach space $X$. Then the functions in $F$ are $T$-Lipschitz for some weak$^*$ cusco $T : A \to 2^{X^*}$ if and only if the family of functions $F$ is locally equi-Lipschitz on $A$.

3. Results on separable Banach spaces

Lemma 6. Let $X$ be a separable Banach space and let $f$ be a locally Lipschitz function defined on a non-empty open subset $A$ of $X$, then there exists a countable set $C \subseteq \mathrm{Gr}(\partial_- f)$ such that $\mathrm{Gr}(\partial_a f) = \overline{C}$, where the closure is taken with respect to the product topology on $X \times X^*$ and $X^*$ is endowed with the weak$^*$ topology.

Proof. For each $m \in \mathbb{N}$ define $A_m := \{x \in A : \partial_- f(x) \subseteq mB_{X^*}\}$. Since $B_{X^*}$ is weak$^*$ compact and metrizable, $A_m \times mB_{X^*}$ is hereditarily separable and thus $\mathrm{Gr}(\partial_- f) \cap (A_m \times mB_{X^*})$ is separable. Hence there exists a countable set $C_m \subseteq \mathrm{Gr}(\partial_- f) \cap (A_m \times mB_{X^*})$ with $\mathrm{Gr}(\partial_- f) \cap (A_m \times mB_{X^*}) \subseteq \overline{C_m}$. Let $C := \bigcup_{m \in \mathbb{N}} C_m$. Then $C$ is countable and
\[
\mathrm{Gr}(\partial_- f) = \mathrm{Gr}(\partial_- f) \cap \bigcup_{m \in \mathbb{N}} (A_m \times mB_{X^*})
= \bigcup_{m \in \mathbb{N}} \bigl[\mathrm{Gr}(\partial_- f) \cap (A_m \times mB_{X^*})\bigr]
\subseteq \bigcup_{m \in \mathbb{N}} \overline{C_m} \subseteq \overline{C}.
\]
Since $\mathrm{Gr}(\partial_a f) = \overline{\mathrm{Gr}(\partial_- f)}$ we have $\mathrm{Gr}(\partial_a f) \subseteq \overline{C} \subseteq \overline{\mathrm{Gr}(\partial_a f)} = \mathrm{Gr}(\partial_a f)$.

Lemma 7 ([21]).
If $X$ is a smooth Banach space and $Y$ is a finite dimensional subspace of $X$, then the distance function $x \mapsto d_Y(x) := \min_{y \in Y} \|x - y\|$ is smooth on $X \setminus Y$ and $d_Y^2$ is smooth on $X$.

Lemma 8. Let $Y$ be a finite dimensional subspace of a smooth Banach space $X$ and let $h : X \to (-\infty, +\infty]$ be a proper lower semi-continuous function. If $\varepsilon, \delta > 0$ and $z_0 \in X$ are given and $h$ satisfies:
(i) $h$ is bounded below on $B_\delta[z_0]$;
(ii) $h(z) - h(z_0) > -\delta \cdot \varepsilon$ for all $z \in z_0 + \delta B_Y$,
then there exist a point $z \in B_\delta(z_0)$ and $z^* \in \partial_- h(z)$ with $\|z^*|_Y\| < 2\varepsilon$.

Proof. First let us choose $K$ sufficiently large so that
\[
\inf\{h(z) + K d_{z_0 + Y}^2(z) : z \in B_\delta[z_0]\} > h(z_0) - \varepsilon \cdot \delta.
\]
By Lemma 7, $d_{z_0 + Y}^2$ is smooth on $X$ and constant along lines parallel to $Y$. Therefore we have that $\nabla(d_{z_0 + Y}^2)(z)|_Y = 0$ for every $z \in X$. Now by the Borwein-Preiss smooth variational principle [9] we obtain $z \in B_\delta(z_0)$ and $x^* \in X^*$ so that $\|x^*\| < 2\varepsilon$ and
\[
0 \in \partial_-(h + K d_{z_0 + Y}^2 + I_{B_\delta[z_0]})(z) + x^* = \partial_- h(z) + K \nabla(d_{z_0 + Y}^2)(z) + x^*.
\]
Therefore if we set $z^* := -(x^* + K \nabla(d_{z_0 + Y}^2)(z))$, then $z^* \in \partial_- h(z)$ and $\|z^*|_Y\| = \|x^*|_Y\| < 2\varepsilon$.

Theorem 1 (The approximate subdifferential). Let $A$ be a non-empty open subset of a separable Banach space $X$ (finite or infinite dimensional) and let $T : A \to 2^{X^*}$ be a weak$^*$ cusco on $A$, then for each $f \in X_T$,
\[
\{g \in X_T : \partial_a f(x) \subseteq \partial_a g(x) \text{ for all } x \in A\}
\]
is residual in $(X_T, \rho)$.

Proof. Since $X$ is separable, we may choose an increasing sequence $(X_n : n \in \mathbb{N})$ of finite dimensional subspaces of $X$ such that $\overline{\bigcup_{n \in \mathbb{N}} X_n} = X$. For each $(x, x^*) \in \mathrm{Gr}(\partial_- f)$ and $n \in \mathbb{N}$ we consider the set
\[
G_{(x, x^*, n)} := \{g \in X_T : \text{there exists a } z \in B_{1/n}(x) \text{ and } z^* \in \partial_- g(z) \text{ so that } \|(x^* - z^*)|_{X_n}\| \le 4/n\}.
\]
The proof is divided into three parts:

(a) For each $(x, x^*) \in \mathrm{Gr}(\partial_- f)$ and $n \in \mathbb{N}$, $\mathrm{int}\, G_{(x, x^*, n)}$ is dense in $(X_T, \rho)$. Suppose $(g_0, \varepsilon) \in X_T \times (0, 1)$ is given.
We need to verify that $B_\varepsilon(g_0) \cap \mathrm{int}\, G_{(x, x^*, n)} \ne \emptyset$. Define $h_1 \in X_T$ by $h_1(z) := f(z) + (g_0(x) - \varepsilon/3 - f(x))$ and the function $h_2 \in X_T$ by $h_2(z) := \min\{g_0(z), h_1(z)\}$. Clearly, $h_2(z) \le g_0(z)$ for all $z \in A$. Next we define $h_3 \in X_T$ by $h_3(z) := \max\{h_2(z), g_0(z) - 2\varepsilon/3\}$ and obtain that $g_0(z) - \frac{2\varepsilon}{3} \le h_3(z) \le g_0(z)$ for all $z \in A$, and so $\rho(h_3, g_0) = \min\{1, d(h_3, g_0)\} < \varepsilon$. We claim that $h_3 \in \mathrm{int}\, G_{(x, x^*, n)}$. To see this, first note that
\[
g_0(x) - \frac{2\varepsilon}{3} < h_3(x) = h_2(x) = h_1(x) = g_0(x) - \frac{\varepsilon}{3} < g_0(x).
\]
Hence there exists an open neighbourhood $U$ of $x$ so that $h_1 = h_2 = h_3$ on $U$. Since $x^* \in \partial_- f(x)$, $f$ is Lipschitz around $x$ and $S_{X_n}$ is compact, there exists $0 < \delta < 1/n$ so that
(i) $B_\delta[x] \subseteq U$;
(ii) $f$ is Lipschitz on $B_\delta[x]$;
(iii) $(f - x^*)(x + \lambda v) - (f - x^*)(x) > -\dfrac{\lambda}{n} \ge -\dfrac{\delta}{n}$ for all $0 < \lambda \le \delta$ and $v \in S_{X_n}$.
We now show that $B_r(h_3) \subseteq G_{(x, x^*, n)}$ for any $0 < r < \delta/(2n)$. To accomplish this, let $g$ be any member of $B_r(h_3)$. Then,
\[
(g - x^*)(z) - (g - x^*)(x) > -\frac{2\delta}{n} \quad \text{for all } z \in x + \delta B_{X_n}.
\]
By Lemma 8, there exists $z \in B_\delta(x)$ and $y^* \in \partial_-(g - x^*)(z)$ with $\|y^*|_{X_n}\| < 4/n$. Thus if we set $z^* := y^* + x^*$, then $z^* \in \partial_- g(z)$ and $\|(z^* - x^*)|_{X_n}\| < 4/n$. This shows that $g \in G_{(x, x^*, n)}$.

(b) Fix $(x, x^*) \in \mathrm{Gr}(\partial_- f)$, then for each $g$ in $G_{(x, x^*)} := \bigcap_{n \in \mathbb{N}} G_{(x, x^*, n)}$, we have $x^* \in \partial_a g(x)$. If $g \in G_{(x, x^*)}$, then for each $n \in \mathbb{N}$ there exists an $x_n \in B_{1/n}(x)$ and $x_n^* \in \partial_- g(x_n)$ so that $\|x_n^*|_{X_n} - x^*|_{X_n}\| \le 4/n$. Since the subspaces $(X_n : n \in \mathbb{N})$ are monotonely increasing, we see that $(x_n^* : n \in \mathbb{N})$ converges to $x^*$ pointwise on $\bigcup_{n \in \mathbb{N}} X_n$. Moreover, since $g$ is locally Lipschitz around $x$, the sequence $(x_n^* : n \in \mathbb{N})$ is norm bounded, and so we have that $(x_n^* : n \in \mathbb{N})$ converges to $x^*$ pointwise on $\overline{\bigcup_{n \in \mathbb{N}} X_n} = X$. Hence $((x_n, x_n^*) : n \in \mathbb{N})$ converges to $(x, x^*)$ in $A \times X^*$, with $A$ endowed with the norm topology and $X^*$ with the weak$^*$ topology. However, as $\mathrm{Gr}(\partial_a g)$ is closed in $A \times X^*$ we obtain that $x^* \in \partial_a g(x)$.

(c) By Lemma 6, we may choose a countable set $C \subseteq \mathrm{Gr}(\partial_- f)$ so that $\mathrm{Gr}(\partial_a f) = \overline{C}$. Let
\[
G := \bigcap \{G_{(x, x^*)} : (x, x^*) \in C\}.
\]
By (a), $G$ is a residual set in $(X_T, \rho)$ and if $g \in G$, then for every $(x, x^*) \in C$ we have $x^* \in \partial_a g(x)$. That is, $C \subseteq \mathrm{Gr}(\partial_a g)$. Now since $\mathrm{Gr}(\partial_a g)$ is closed in the product topology on $A \times X^*$ we have $\mathrm{Gr}(\partial_a f) = \overline{C} \subseteq \mathrm{Gr}(\partial_a g)$. This shows that if $g \in G$ then $\partial_a f(x) \subseteq \partial_a g(x)$ for all $x \in A$.

Corollary 1. Let $\{f_n : n \in \mathbb{N}\}$ be a sequence of locally equi-Lipschitz functions defined on a non-empty open subset $A$ of a separable Banach space $X$. If we define $T : A \to 2^{X^*}$ by $T(x) := \bigcup_{n \in \mathbb{N}} \partial_a f_n(x)$, then
\[
\{g \in X_{\mathrm{CSC}(T)} : \mathrm{USC}(T)(x) \subseteq \partial_a g(x) \subseteq \partial_0 g(x) \subseteq \mathrm{CSC}(T)(x) \text{ for every } x \in A\}
\]
is residual in $(X_{\mathrm{CSC}(T)}, \rho)$.

Proof. By Proposition 1, $\mathrm{CSC}(T)$ exists. For each $n \in \mathbb{N}$, we may apply Theorem 1 to deduce that the set $G_n := \{g \in X_{\mathrm{CSC}(T)} : \partial_a f_n(x) \subseteq \partial_a g(x) \text{ for all } x \in A\}$ is residual in $(X_{\mathrm{CSC}(T)}, \rho)$. Thus the set $G := \bigcap_{n \in \mathbb{N}} G_n$ is residual in $(X_{\mathrm{CSC}(T)}, \rho)$ and if $g \in G$, then we have $\bigcup_{n \in \mathbb{N}} \partial_a f_n(x) \subseteq \partial_a g(x)$ for every $x \in A$. Hence $\mathrm{USC}(T)(x) \subseteq \partial_a g(x) \subseteq \partial_0 g(x) \subseteq \mathrm{CSC}(T)(x)$ for all $x \in A$.

Theorem 2 (The Clarke subdifferential). Let $A$ be a non-empty open subset of a separable Banach space $X$ and let $T : A \to 2^{X^*}$ be a weak$^*$ cusco on $A$, then for each $f \in X_T$,
\[
\{g \in X_T : \partial_0 f(x) \subseteq \partial_0 g(x) \text{ for all } x \in A\}
\]
is residual in $(X_T, \rho)$.

Proof. This follows from Theorem 1 and the fact that $\partial_0 f(x) = \overline{\mathrm{co}}^{w^*} \partial_a f(x)$ for all $x \in A$, [17].

Corollary 2. Let $\{f_n : n \in \mathbb{N}\}$ be a locally equi-Lipschitz family of real-valued functions defined on a non-empty open subset $A$ of a separable Banach space $X$. If we define $T : A \to 2^{X^*}$ by $T(x) := \bigcup_{n \in \mathbb{N}} \partial_0 f_n(x)$, then
\[
\{g \in X_{\mathrm{CSC}(T)} : \partial_0 g(x) = \mathrm{CSC}(T)(x) \text{ for all } x \in A\}
\]
is residual in $(X_{\mathrm{CSC}(T)}, \rho)$.

Corollary 3. Let $f_1, f_2, \ldots$
, $f_n$ be real-valued locally Lipschitz functions defined on a non-empty open subset $A$ of a separable Banach space $X$. If $T : A \to 2^{X^*}$ is defined by $T(x) := \mathrm{co}\{\partial_0 f_1(x), \partial_0 f_2(x), \ldots, \partial_0 f_n(x)\}$, then
\[
\{g \in X_T : \partial_0 g(x) = T(x) \text{ for all } x \in A\}
\]
is residual in $(X_T, \rho)$. In particular, the Clarke subdifferential is closed under the operation of taking finite convex hulls.

Proof. By Corollary 2 it suffices to show that $T$ is a weak$^*$ cusco on $A$. To see that this is indeed the case, consider the set-valued mapping $\Omega : A \to 2^{X^*}$ defined by $\Omega(x) \equiv \bigcup \{\partial_0 f_j(x) : 1 \le j \le n\}$. Clearly $\Omega$ is a weak$^*$ usco on $A$. Hence, by Lemma 7.12 in [20] the mapping $T : A \to 2^{X^*}$ defined by $T(x) := \overline{\mathrm{co}}^{w^*} \Omega(x) = \mathrm{co}\, \Omega(x)$ is a weak$^*$ cusco on $A$. This completes the proof.

Corollary 3 improves the main result of [8], where the minimality of each $\partial_0 f_j$ was required.

Corollary 4. Let $f$ be a real-valued locally Lipschitz function defined on a non-empty open connected subset $A$ of a separable Banach space $X$. Then the following conditions are equivalent:
(i) $f$ is “strongly integrable”, that is, for each real-valued locally Lipschitz function $g$ defined on $A$ with $\partial_0 g(x) \subseteq \partial_0 f(x)$ for all $x \in A$, $f - g \equiv$ constant.
(ii) $f$ is “weakly integrable”, that is, for each real-valued locally Lipschitz function $g$ defined on $A$ with $\partial_0 g(x) = \partial_0 f(x)$ for all $x \in A$, $f - g \equiv$ constant.

Proof. The fact that (i) implies (ii) is obvious. So it suffices for us to justify that (ii) implies (i). Fix $x_0 \in A$ and let $g$ be any member of $X_{\partial_0 f}$. By Corollary 3 we may select, for each $0 < \varepsilon < 1$, a function $g_\varepsilon \in X_{\partial_0 f}$ so that $\rho(g, g_\varepsilon) < \varepsilon$ and $\partial_0 g_\varepsilon(x) = \partial_0 f(x)$ for all $x \in A$.
Then for any $x \in A$,
\[
\begin{aligned}
|(f - g)(x) - (f - g)(x_0)| &\le |(f - g_\varepsilon)(x) - (f - g_\varepsilon)(x_0)| + |(g_\varepsilon - g)(x) - (g_\varepsilon - g)(x_0)| \\
&= |(g_\varepsilon - g)(x) - (g_\varepsilon - g)(x_0)| \quad (\text{since } f - g_\varepsilon \equiv \text{constant on } A) \\
&\le |(g_\varepsilon - g)(x)| + |(g_\varepsilon - g)(x_0)| \le 2\varepsilon.
\end{aligned}
\]
However, as our choice of $\varepsilon$ was arbitrary, we must have that $(f - g)(x) = (f - g)(x_0)$. This shows that $f - g \equiv$ constant on $A$.

Let $I$ be an open interval in $\mathbb{R}$ and let $f : I \to \mathbb{R}$. We say that $f$ is robustly lower (upper) semi-continuous if
\[
f(x) = \liminf_{y \to x,\ y \notin N} f(y) \qquad \Bigl( f(x) = \limsup_{y \to x,\ y \notin N} f(y) \Bigr)
\]
for each Lebesgue null set $N$ of $I$.

Corollary 5. Let $I$ be an open interval in $\mathbb{R}$ and let $\alpha$ and $\beta$ be functions on $I$ such that $\alpha \le \beta$. Then the following are equivalent:
(i) $\alpha$ is robustly lower semi-continuous and $\beta$ is robustly upper semi-continuous;
(ii) there exists a locally Lipschitz function $f$ on $I$ such that $\partial_0 f(x) = [\alpha(x), \beta(x)]$ for all $x \in I$;
(iii) if $T : I \to 2^{\mathbb{R}}$ is defined by $T(x) := [\alpha(x), \beta(x)]$, then $X_T$ is non-empty and $\{g \in X_T : \partial_0 g(x) = [\alpha(x), \beta(x)] \text{ for all } x \in I\}$ is residual in $X_T$.

Proof. (i)$\Rightarrow$(iii) Since $\alpha$ is lower semi-continuous and $\beta$ is upper semi-continuous on $I$, both functions are Lebesgue integrable. Choose any $p \in I$ and define $f_1, f_2 : I \to \mathbb{R}$ by
\[
f_1(x) := \int_p^x \alpha(t)\,dt \quad\text{and}\quad f_2(x) := \int_p^x \beta(t)\,dt.
\]
Let $N \subseteq I$ be any Lebesgue null set so that $f_1'(x) = \alpha(x)$ and $f_2'(x) = \beta(x)$ on $I \setminus N$. Then we have
\[
\partial_0 f_1(x) = \Bigl[\liminf_{t \to x,\ t \notin N} f_1'(t),\ \limsup_{t \to x,\ t \notin N} f_1'(t)\Bigr] = \Bigl[\alpha(x),\ \limsup_{t \to x,\ t \notin N} f_1'(t)\Bigr]
\]
and
\[
\partial_0 f_2(x) = \Bigl[\liminf_{t \to x,\ t \notin N} f_2'(t),\ \limsup_{t \to x,\ t \notin N} f_2'(t)\Bigr] = \Bigl[\liminf_{t \to x,\ t \notin N} f_2'(t),\ \beta(x)\Bigr].
\]
Moreover, since $\alpha(x) \le \beta(x)$ for all $x \in I$ we have
\[
\limsup_{t \to x,\ t \notin N} f_1'(t) \le \limsup_{t \to x,\ t \notin N} \beta(t) = \beta(x)
\quad\text{and}\quad
\liminf_{t \to x,\ t \notin N} f_2'(t) \ge \liminf_{t \to x,\ t \notin N} \alpha(t) = \alpha(x).
\]
Therefore, $T(x) = \mathrm{co}\{\partial_0 f_1(x), \partial_0 f_2(x)\}$ for each $x \in I$ and so the result follows by Corollary 3. (iii)$\Rightarrow$(ii) is clear and (ii)$\Rightarrow$(i) follows from the basic properties of the Clarke subdifferential mapping.

This has recovered and improved the main result in [2].

Example 1.
Theorem 1 fails if $T$ is only assumed to be a weak$^*$ usco. To see this, we consider $T : \mathbb{R} \to 2^{\mathbb{R}}$ defined by $T(x) := \{0, 1\}$. If Theorem 1 held for this $T$, then there would exist a residual set $G$ in $X_T$ where $\partial_a g = T$ for each $g \in G$. However, $T$ is not the approximate subdifferential map of any Lipschitz function: if there existed an $f$ with $\partial_a f = T$, then $\partial_0 f(x) = \mathrm{co}\, \partial_a f(x) = [0, 1]$ for all $x \in \mathbb{R}$ and so by Theorem 2.2 in [2] we would have that $\partial_a f(x) = \partial_0 f(x) = [0, 1]$ for all $x \in \mathbb{R}$; a contradiction.

With some extra work one can show that the sum and lattice rules of subdifferential calculus hold with equality for almost all Lipschitz functions. Indeed, if on $X_{B_{X^*}} \times X_{B_{X^*}}$ we define the complete metric $\rho_1$ by $\rho_1((f_1, f_2), (g_1, g_2)) := \rho(f_1, g_1) + \rho(f_2, g_2)$, then we may obtain the following from a more elaborate version of Theorem 1.

Theorem 3 ([26]). Let $A$ be a non-empty open subset of a separable Banach space $X$, then there exists a residual set $G$ in $(X_{B_{X^*}} \times X_{B_{X^*}}, \rho_1)$ so that for each $(f_1, f_2) \in G$ and $y_1, y_2 \ge 0$,
\[
\partial_0(y_1 f_1 + y_2 f_2)(x) = y_1 \partial_0 f_1(x) + y_2 \partial_0 f_2(x), \qquad \partial_0 f_1(x) = \partial_0 f_2(x) = B_{X^*}
\]
and
\[
\partial_0[\min\{f_1, f_2\}](x) = \partial_0[\max\{f_1, f_2\}](x) = \mathrm{co}\{\partial_0 f_1(x), \partial_0 f_2(x)\}
\]
for all $x \in A$.

Example 2. In this example we examine what can be said about the size of the generalized Jacobian of a Lipschitz mapping acting between Banach spaces. For a vector-valued locally Lipschitz function $F : \mathbb{R}^n \to \mathbb{R}^m$ given by $F(x) := [f_1(x), \ldots, f_m(x)]$, the generalized Jacobian of $F$ at $x$, denoted by $\partial_0 F(x)$, is defined by
\[
\partial_0 F(x) := \mathrm{co}\{\lim \nabla F(x_i) : x_i \to x,\ x_i \notin \Omega_F\},
\]
where $\Omega_F$ denotes the set of points at which $F$ fails to be differentiable. In $X_{B_{\mathbb{R}^n}} \times X_{B_{\mathbb{R}^n}}$ there exists a residual set $G$ such that for every $F \in G$, $\partial_0 F$ is not a minimal cusco.
Indeed, by Theorem 3 we obtain a residual set $G \subset X_{B_{\mathbb{R}^n}} \times X_{B_{\mathbb{R}^n}}$ such that for $F := (f_1, f_2) \in G$ we have $\partial_0(f_1 + f_2) = 2B_{\mathbb{R}^n}$. By Theorem 2.6.6 [11] we have $\partial_0(f_1 + f_2) = (1, 1)\partial_0 F$. Since $2B_{\mathbb{R}^n}$ is not a minimal cusco, $\partial_0 F$ is not minimal for $F \in G$. However, if each $f_i$ is strictly differentiable almost everywhere on $A$ for $i = 1, \ldots, m$, then $\partial_0 F$ is a minimal cusco on $A$. The scalarization formula for the coderivative of $F$ [25, page 366] also shows that the Clarke coderivatives have large images for every $F \in G$ since the Clarke coderivative $D_c^* F(x)(y_1, y_2) \supseteq (y_1 + y_2) B_{\mathbb{R}^n}$ holds for all $x \in A$ and $(y_1, y_2) \in \mathbb{R}_+^2 \setminus \{0\}$. Similar results hold for $F$ having arbitrary $m$ components.

4. Results on general Banach spaces

In general Banach spaces we obtain weaker, but still highly useful results.

Lemma 9. Let $A$ be a non-empty open subset of a Banach space $X$ and let $T : A \to 2^{X^*}$ be a weak$^*$ cusco on $A$. If $f, g \in X_T$ and $E \subseteq \mathbb{R}$ is Lebesgue measurable, then the function $h : A \to \mathbb{R}$ defined by $h(x) := \lambda_E((f - g)(x)) + g(x)$ belongs to $X_T$ and $\rho(h, g) \le \lambda(E)$, where $\lambda_E : \mathbb{R} \to \mathbb{R}$ is defined by $\lambda_E(x) := \int_0^x \chi_E(t)\,dt$.

Proof. Suppose $f, g \in X_T$; we need to show $h \in X_T$. To this end, let us fix $x \in A$ and choose $\delta > 0$ so that $B_{2\delta}(x) \subseteq A$. For each $v \in S_X$ we define the function $K_v : B_\delta(x) \times (0, \delta) \to [0, 1]$ by
\[
K_v(z, \lambda) := \begin{cases}
0 & \text{if } (f - g)(z + \lambda v) = (f - g)(z), \\[4pt]
\dfrac{\int_{(f - g)(z)}^{(f - g)(z + \lambda v)} \chi_E(t)\,dt}{(f - g)(z + \lambda v) - (f - g)(z)} & \text{otherwise.}
\end{cases}
\]
Then we have
\[
\begin{aligned}
\frac{h(z + \lambda v) - h(z)}{\lambda}
&= \frac{g(z + \lambda v) - g(z)}{\lambda}\,(1 - K_v(z, \lambda)) + \frac{f(z + \lambda v) - f(z)}{\lambda}\,K_v(z, \lambda) \\
&\le \max\Bigl\{\frac{g(z + \lambda v) - g(z)}{\lambda},\ \frac{f(z + \lambda v) - f(z)}{\lambda}\Bigr\}
\end{aligned}
\]
for all $(z, \lambda) \in B_\delta(x) \times (0, \delta)$. Therefore $h^0(x, v) \le \max\{g^0(x, v), f^0(x, v)\}$ for all $v \in S_X$ and so $\partial_0 h(x) \subseteq \mathrm{co}\{\partial_0 g(x), \partial_0 f(x)\} \subseteq T(x)$. Since the point $x$ was arbitrary we have that $h \in X_T$. The proof that $\rho(g, h) \le \lambda(E)$ is obvious.
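The two steps that the proof of Lemma 9 leaves to the reader can be written out as follows (a sketch using only the definitions of $h$, $\lambda_E$, $K_v$ and $\rho$ given above; the shorthand $\phi := f - g$ is introduced here for brevity and is not the paper's notation).

```latex
% Step 1: the convex-combination identity for the difference quotient of h.
% By the definition of K_v (both sides vanish when \phi(z+\lambda v) = \phi(z)),
\[
\lambda_E(\phi(z+\lambda v)) - \lambda_E(\phi(z))
  = \int_{\phi(z)}^{\phi(z+\lambda v)} \chi_E(t)\,dt
  = K_v(z,\lambda)\,\bigl(\phi(z+\lambda v) - \phi(z)\bigr),
\]
% and therefore, since h = \lambda_E \circ \phi + g and \phi = f - g,
\[
h(z+\lambda v) - h(z)
  = K_v(z,\lambda)\,\bigl(f(z+\lambda v) - f(z)\bigr)
  + \bigl(1 - K_v(z,\lambda)\bigr)\bigl(g(z+\lambda v) - g(z)\bigr).
\]
% Step 2: the distance estimate. For every x \in A,
\[
|h(x) - g(x)| = \Bigl|\int_0^{\phi(x)} \chi_E(t)\,dt\Bigr|
  \le \int_{\mathbb{R}} \chi_E(t)\,dt = \lambda(E),
\]
% so d(h,g) \le \lambda(E) and hence \rho(h,g) = \min\{1, d(h,g)\} \le \lambda(E).
```

Because $0 \le \chi_E \le 1$, the weight $K_v(z, \lambda)$ lies in $[0, 1]$, which is what turns the identity in Step 1 into the maximum bound used in the proof.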
We will say that a Banach space is smoothable if it has an equivalent smooth renorm (as is the case in all separable, reflexive or WCG spaces).

Theorem 4 (The approximate subdifferential in smooth Banach space). Let $A$ be a non-empty open subset of a smoothable Banach space $X$ and let $T : A \to 2^{X^*}$ be a weak$^*$ cusco on $A$. If $f \in X_T$ and $x \mapsto \partial_a f(x)$ is a minimal weak$^*$ usco, then
\[
\{g \in X_T : \partial_a f(x) \subseteq \partial_a g(x) \text{ for all } x \in A\}
\]
is residual in $(X_T, \rho)$.

Proof. For each $m \in \mathbb{N}$, let $A_m := \mathrm{int}\{t \in A : T(t) \subseteq mB_{X^*}\}$. Then by Lemma 3, $A = \bigcup_{m \in \mathbb{N}} A_m$ and each $g \in X_T$ is $m$-Lipschitz on each convex subset of $A_m$. Let $J := \{J_n : n \in \mathbb{N}\}$ be an enumeration of all the open intervals in $\mathbb{R}$ with rational end-points. For each $(m, n, p, \varepsilon) \in \mathbb{N}^3 \times (0, \infty)$ we consider the set
\[
\begin{aligned}
O_{(m, n, p, \varepsilon)} := \{g \in X_T :{} & \text{for each connected open set } U \text{ with } U + \tfrac{1}{p} B_X \subseteq A_m \text{ and } J_n \subseteq (f - g)(U) \\
& \text{there exists a } z_0 \in U \text{ and } 0 < r_0 < 1/p \text{ so that} \\
& (g - f)(z) - (g - f)(z_0) > -\varepsilon r_0 \text{ for all } z \in B_{r_0}(z_0)\}.
\end{aligned}
\]
The proof is now divided into two parts:

(a) For each $(m, n, p, \varepsilon) \in \mathbb{N}^3 \times (0, \infty)$, $\mathrm{int}\, O_{(m, n, p, \varepsilon)}$ is dense in $(X_T, \rho)$. Suppose $(g_0, \delta) \in X_T \times (0, 1)$; we need to verify that $B_\delta(g_0) \cap \mathrm{int}\, O_{(m, n, p, \varepsilon)} \ne \emptyset$. To this end, suppose $J_n := (r_n, s_n)$ and $0 < \delta' := \min\{(s_n - r_n)/5, \delta\}$. Now let us choose a dense open subset $E$ of $\mathbb{R}$ such that $\mu(E) < \delta'$ and define $h : A \to \mathbb{R}$ by
\[
h(x) := \lambda_E((f - g_0)(x)) + g_0(x) \quad\text{where}\quad \lambda_E(x) := \int_0^x \chi_E(t)\,dt.
\]
By Lemma 9 we have $h \in X_T$ and $\rho(g_0, h) < \delta' \le \delta$. We claim that $h \in \mathrm{int}\, O_{(m, n, p, \varepsilon)} \cap B_\delta(g_0)$. To this end, choose $0 < r < 2m/p$ and $t \in \mathbb{R}$ so that $[t - r, t + r] \subseteq (r_n + 2\delta', s_n - 2\delta') \cap E$ and set $d := \min\{(\varepsilon r)/(4m), \delta'\}$. We will show that $B_d(h) \subseteq O_{(m, n, p, \varepsilon)}$. Let $g \in B_d(h)$ and let $U$ be any connected open subset of $A_m$ with $U + \tfrac{1}{p} B_X \subseteq A_m$ and $J_n \subseteq (f - g)(U)$. Then,
\[
[t - r, t + r] \subseteq (r_n + 2\delta', s_n - 2\delta') \subseteq (f - g_0)(U),
\]
since $(f - g_0)(U)$ is connected (hence convex) and
\[
\|(f - g_0) - (f - g)\|_\infty = \|g_0 - g\|_\infty \le \|g_0 - h\|_\infty + \|g - h\|_\infty < \delta' + d \le 2\delta'.
\]
Choose $z_0 \in U$ so that $(f - g_0)(z_0) = t$; then for any $z \in B_{r_0}(z_0)$ with $r_0 = r/(2m) < 1/p$, $(f - g_0)(z) \in [t - r, t + r] \subseteq E$ and so $h(z) - h(z_0) = f(z) - f(z_0)$. Therefore by our choice of $d$,
\[
(g - f)(z) - (g - f)(z_0) = (g - h)(z) - (g - h)(z_0) > -\frac{\varepsilon r_0}{2} - \frac{\varepsilon r_0}{2} = -\varepsilon r_0
\]
for all $z \in B_{r_0}(z_0)$. This shows that $g \in O_{(m, n, p, \varepsilon)}$.

(b) The set $G := \bigcap \{O_{(n_1, n_2, n_3, 1/n_4)} : (n_1, n_2, n_3, n_4) \in \mathbb{N}^4\}$ is residual in $(X_T, \rho)$ and for each $g \in G$ we have $\partial_a g(x) \cap \partial_a f(x) \ne \emptyset$ for every $x \in A$. Indeed, if this is not the case then there exists a $g \in G$ and $x_0 \in A$ such that $\partial_a g(x_0) \cap \partial_a f(x_0) = \emptyset$. Moreover, since $\{x \in A : \partial_a g(x) \cap \partial_a f(x) = \emptyset\}$ is open in $A$ we may assume that $f$ is strictly differentiable at $x_0 \in A$, [22]. We may now select a finite set $F \subseteq S_X$ and $\alpha \in \mathbb{R}$ so that if $W := \{x^* : |x^*(y)| < \alpha,\ y \in F\}$ then $(\partial_a g(x_0) + W) \cap (\partial_a f(x_0) + W) = \emptyset$, and by the weak$^*$ upper semi-continuity of $x \mapsto \partial_a g(x)$, there exists $r > 0$ so that $\partial_a g(B_r(x_0)) \subseteq \partial_a g(x_0) + W$. Next, we choose $n_4 \in \mathbb{N}$ so that $4/n_4 < \alpha$ and set $Y := \mathrm{sp}\, F$. Then by the strict differentiability of $f$ at $x_0$, the Lipschitzness of $f$ and the compactness of $S_Y$, there exists a $\delta > 0$ so that
\[
\frac{f(z + \lambda y) - f(z)}{\lambda} - \nabla f(x_0)(y) > -\frac{1}{n_4}
\]
for all $\|z - x_0\| \le \delta$, $0 < \lambda \le \delta$ and $y \in S_Y$. We may now select $n_1, n_3 \in \mathbb{N}$ so that $1/n_3 < \delta$ and $B_{2/n_3}(x_0) \subseteq A_{n_1} \cap B_r(x_0)$. Then we set $U := B_{1/n_3}(x_0)$. Now $(f - g)(U)$ is convex, so either $(f - g)(U) = \{a\}$ for some $a \in \mathbb{R}$ or $J_n \subseteq (f - g)(U)$ for some $n$. In the first case we get that $f(x) = g(x) + a$ on $U$, which is impossible since $\partial_a f(x_0) \ne \partial_a g(x_0)$. Therefore, there is some $n_2 \in \mathbb{N}$ so that $J_{n_2} \subseteq (f - g)(U)$.
However, as g ∈ O(n1 ,n2 ,n3 ,1/n4 ) there exists a point z0 ∈ U and 0 < r0 ≤ 1/n3 so that (g − ∇f (x0 ))(z) − (g − ∇f (x0 ))(z0 ) > − 2r0 n4 for all z ∈ z0 + r0 BY , since (f − ∇f (x0 ))(z0 + λy) − (f − ∇f (x0 ))(z0 ) > −λ · n14 for 0 < λ ≤ n13 and y ∈ SY . By Lemma 8 there exists z1 ∈ Br0 (z0 ) and z ∗ ∈ X ∗ so that z ∗ ∈ ∂− (g − ∇f (x0 ))(z1 ) and kz ∗ |Y k < 4/n4 , i.e., ∇f (x0 ) + z ∗ ∈ ∂− g(z1 ). However, ∇f (x0 ) + z ∗ ∈ ∇f (x0 ) + W , which is impossible since ∂− g(z1 ) ∩ (∇f (x0 ) + W ) = ∅. Therefore, it must be the case that for each g ∈ G, ∂a f (x) ∩ ∂a g(x) 6= ∅ for all x ∈ A. The result now follows by Lemma 2. While minimality of x → ∂a f (x) is quite restrictive, it does hold when f is smooth or concave on A. Even this allows for some nice applications: Corollary 6. Let A be a non-empty open subset of a smoothable Banach space X and suppose C ⊆ X ∗ is non-empty, weak∗ compact, convex and weak∗ separable, then {g ∈ XC : ∂a g(x) = C for all x ∈ A} is residual in (XC , ρ). Proof. Let {x∗n : n ∈ N} be a dense subset of (C, weak*) and for each nT∈ N, let Gn := {g ∈ XC : x∗n ∈ ∂a g(x) for all x ∈ A}. Then by Theorem 4, G := n∈N Gn is residual in (XC , ρ) and ∂a g ≡ C for each g ∈ G. Theorem 5 (The Clarke subdifferential in arbitrary Banach space). Let A be a ∗ non-empty open subset of a Banach space X and let T : A → 2X be a weak∗ cusco on A, then for each f ∈ XT , {g ∈ XT : ∂0 g(x) ∩ ∂0 f (x) 6= ∅ for all x ∈ A} is residual in (XT , ρ). In particular, if ∂0 f is a minimal weak∗ cusco, then {g ∈ XT : ∂0 f (x) ⊆ ∂0 g(x) for all x ∈ A} is residual in (XT , ρ). Proof. S For each m ∈ N, let Am := int{t ∈ A : T (t) ⊆ mBX ∗ }. Then by Lemma 3, A = m∈N Am and each g ∈ XT is m–Lipschitz on each convex subset of Am . Let License or copyright restrictions may apply to redistribution; see http://www.ams.org/journal-terms-of-use 3886 JONATHAN M. BORWEIN, WARREN B. 
J := {J_n : n ∈ N} be an enumeration of all the open intervals in R with rational end-points. For each (m, n, ε) ∈ N^2 × (0, +∞) consider the set

O_(m,n,ε) := {g ∈ X_T : for each connected open set U with U + εB_X ⊆ A_m and J_n ⊆ (f − g)(U) there exist z_0 ∈ U and 0 < λ_0 < ε so that (g − f)(z_0 + λ_0 v) − (g − f)(z_0) ≥ −λ_0 ε for all v ∈ S_X}.

(a) int O_(m,n,ε) is dense in (X_T, ρ) for each (m, n, ε) ∈ N^2 × (0, +∞). Suppose (g_0, δ) ∈ X_T × (0, 1). We need to verify that B_δ(g_0) ∩ int O_(m,n,ε) ≠ ∅. To this end, suppose J_n := (r_n, s_n) and δ′ := min{(s_n − r_n)/5, δ}. Now let us choose a dense open subset E of R such that µ(E) < δ′ and define h : A → R by

\[
h(x) := \lambda_E\big((f - g_0)(x)\big) + g_0(x), \qquad \text{where } \lambda_E(x) := \int_0^x \chi_E(t)\,dt .
\]

Then by Lemma 9, h ∈ X_T and ρ(g_0, h) < δ′ ≤ δ. We claim that h ∈ int O_(m,n,ε) ∩ B_δ(g_0). To this end, choose 0 < r < 2mε and t ∈ R so that [t − r, t + r] ⊆ (r_n + 2δ′, s_n − 2δ′) ∩ E and set d := min{(εr)/(4m), δ′}. We show that B_d(h) ⊆ O_(m,n,ε). Let g ∈ B_d(h) and let U be any connected open subset of A_m with U + εB_X ⊆ A_m and J_n ⊆ (f − g)(U). Since (f − g_0)(U) is connected (hence convex) and

‖(f − g) − (f − g_0)‖_∞ = ‖g_0 − g‖_∞ ≤ ‖g_0 − h‖_∞ + ‖g − h‖_∞ ≤ d + δ′ ≤ 2δ′,

we have [t − r, t + r] ⊆ (r_n + 2δ′, s_n − 2δ′) ⊆ (f − g_0)(U). Choose z_0 ∈ U so that (f − g_0)(z_0) = t. Then for every 0 < λ ≤ r/(2m) < ε and v ∈ S_X we have (f − g_0)(z_0 + λv) ∈ [t − r, t + r] ⊆ E. Thus, if we set λ_0 := r/(2m),

\[
\begin{aligned}
\frac{g(z_0 + \lambda_0 v) - g(z_0)}{\lambda_0}
&\ge \frac{h(z_0 + \lambda_0 v) - h(z_0)}{\lambda_0} - \frac{2d}{\lambda_0} \\
&= \frac{f(z_0 + \lambda_0 v) - f(z_0)}{\lambda_0} - \frac{2d}{r/(2m)} \\
&\ge \frac{f(z_0 + \lambda_0 v) - f(z_0)}{\lambda_0} - \varepsilon .
\end{aligned}
\]

This shows that g ∈ O_(m,n,ε).

(b) The set G := ⋂{O_(n_1,n_2,1/n_3) : (n_1, n_2, n_3) ∈ N^3} is residual in (X_T, ρ) and for each g ∈ G we have ∂_0 g(x) ∩ ∂_0 f(x) ≠ ∅ for all x ∈ A. Indeed, if this is not the case, then there exist a g ∈ G and x_0 ∈ A such that ∂_0 g(x_0) ∩ ∂_0 f(x_0) = ∅.
By the strong separation theorem, applied in the locally convex space (X*, weak*), there exist a y ∈ S_X, α ∈ R and ε > 0 such that

\[
-f^0(x_0, -y) = \min\{x^*(y) : x^* \in \partial_0 f(x_0)\} > \alpha + \varepsilon > \alpha - \varepsilon > \max\{x^*(y) : x^* \in \partial_0 g(x_0)\} = g^0(x_0, y).
\]

Now x_0 ∈ A_{n_1} for some n_1 ∈ N and

\[
-f^0(x_0, -y) = \liminf_{\substack{z \to x_0 \\ t \downarrow 0}} \frac{f(z + ty) - f(z)}{t},
\qquad
g^0(x_0, y) = \limsup_{\substack{z \to x_0 \\ t \downarrow 0}} \frac{g(z + ty) - g(z)}{t}.
\]

Therefore there exists an n_3 ∈ N such that 1/n_3 < ε, B_{2/n_3}(x_0) ⊆ A_{n_1} and

\[
\begin{aligned}
\inf\Big\{ \frac{f(z + \lambda y) - f(z)}{\lambda} : 0 < \lambda \le \frac{1}{n_3},\ z \in B_{1/n_3}(x_0) \Big\}
&\ge \alpha + \frac{1}{n_3} \\
&> \alpha - \frac{1}{n_3}
\ge \sup\Big\{ \frac{g(z + \lambda y) - g(z)}{\lambda} : 0 < \lambda \le \frac{1}{n_3},\ z \in B_{1/n_3}(x_0) \Big\}.
\end{aligned}
\tag{$\ast$}
\]

Let U := B_{1/n_3}(x_0); then U + (1/n_3)B_X ⊆ A_{n_1}. Now (f − g)(U) is convex, so either (f − g)(U) = {a} for some a ∈ R or J_n ⊆ (f − g)(U) for some n ∈ N. In the first case, we get that f(x) = g(x) + a on U, which is impossible since ∂_0 f(x_0) ≠ ∂_0 g(x_0). Therefore, there exists some n_2 ∈ N so that J_{n_2} ⊆ (f − g)(U). Since g ∈ G, we have that g ∈ O_(n_1,n_2,1/n_3) and so there exist z_0 ∈ U and 0 < λ_0 < 1/n_3 so that

\[
\frac{g(z_0 + \lambda_0 v) - g(z_0)}{\lambda_0} \ge \frac{f(z_0 + \lambda_0 v) - f(z_0)}{\lambda_0} - \frac{1}{n_3}
\]

for every v ∈ S_X, which contradicts (∗). Therefore, it must be the case that for each g ∈ G we have ∂_0 f(x) ∩ ∂_0 g(x) ≠ ∅ for each x ∈ A. The case when ∂_0 f is a minimal weak* cusco follows from Lemma 2.

Corollary 7. Let A be a non-empty open subset of a Banach space X and let {f_n : n ∈ N} be a sequence of locally equi-Lipschitz real-valued functions. If T : A → 2^{X*} is defined by T(x) := ⋃_{n∈N} ∂_0 f_n(x) and each ∂_0 f_n is a minimal weak* cusco, then {g ∈ X_{CSC(T)} : ∂_0 g(x) = CSC(T)(x) for each x ∈ A} is residual in (X_{CSC(T)}, ρ).

Since each maximal cyclically monotone operator defined on a non-empty open convex subset of a Banach space is the Clarke subgradient of some convex locally Lipschitz function, we have

Corollary 8.
Let A be a non-empty open convex subset of a Banach space X and let {T_1, T_2, . . . , T_n} be a finite family of maximal cyclically monotone operators from A into non-empty subsets of X*. Then there exists a real-valued locally Lipschitz function f defined on A such that ∂_0 f(x) = co{T_1(x), T_2(x), . . . , T_n(x)} for every x ∈ A.

This generalizes Corollary 2 in [8] to non-separable spaces.

Example 3. There are Clarke subdifferentials that cannot be expressed as the cusco generated by a countable family of minimal cuscos. Indeed, let f : R → R be a differentiable and nowhere monotone Lipschitz function [10]. Then {x : f is strictly differentiable at x and f′(x) = 0} is residual in R and so the only possible minimal cusco lying inside ∂_0 f is T ≡ {0}.

Lemma 10 (Lemma 2.5, [15]). A weak* cusco T from a topological space A into subsets of the dual of a Banach space X is a minimal weak* cusco if, and only if, given any open subset U of A and weak* closed convex subset K of X* with T(U) ⊄ K, there exists a non-empty open subset V ⊆ U such that T(V) ∩ K = ∅.

Theorem 6. Let f be a real-valued locally Lipschitz function defined on a non-empty open subset A of a Banach space X. If ∂_0 f is a minimal weak* cusco and T : A → 2^{X*} is defined by T(x) := ∂_0 f(x) + B_{X*}, then {g ∈ X_T : ∂_0 g(x) = T(x) for all x ∈ A} is residual in (X_T, ρ).

Proof. For each n ∈ N choose a maximal disjoint family of open balls {B_{1/n}(x_n^α) : α ∈ Γ_n} in A and define f_n : A → R by f_n(x) := f(x) + d_{C_n}(x), where C_n := {x_n^α : α ∈ Γ_n}. Let G_n be any residual set in (X_T, ρ) such that for each g ∈ G_n, ∂_0 g(x) ∩ ∂_0 f_n(x) ≠ ∅ for all x ∈ A. By [5] we know that the sum of a function whose Clarke subdifferential mapping is a minimal weak* cusco and a regular function has a Clarke subgradient that is a minimal weak* cusco. Therefore, ∂_0 f_n is a
minimal weak* cusco on B_{1/n}(x_n^α) and so ∂_0 f_n(x_n^α) ⊆ ∂_0 g(x_n^α) for each α ∈ Γ_n. Set G := ⋂_{n=1}^∞ G_n. We will show that for each g ∈ G, ∂_0 g(x) = T(x) for all x ∈ A. To this end, let g be any member of G and let us suppose that ∂_0 g(x_0) ≠ T(x_0) for some x_0 ∈ A. Then there is some y ∈ S_X and α ∈ R so that

g^0(x_0, y) = max{x*(y) : x* ∈ ∂_0 g(x_0)} < α < max{x*(y) : x* ∈ T(x_0)} = f^0(x_0, y) + 1.

Now, by the upper semi-continuity of x ↦ g^0(x, y) there exists an open neighbourhood U of x_0 in A such that g^0(z, y) < α for all z ∈ U. On the other hand, from the minimality of x ↦ ∂_0 f(x) (see Lemma 10) there exists a non-empty open subset V of U so that ∂_0 f(V) ∩ {x* ∈ X* : x*(y) ≤ α − 1} = ∅. Therefore, α − 1 < min{x*(y) : x* ∈ ∂_0 f(x)} = −f^0(x, −y) for all x ∈ V. Note that in particular, f^−(x, y) > α − 1 for all x ∈ V. Next, we may choose n ∈ N so that C_n ∩ V ≠ ∅ and calculate f_n^−(z, y) = f^−(z, y) + d′_{C_n}(z, y) for z ∈ C_n ∩ V, to get f_n^−(z, y) > α − 1 + 1 = α. However, this is impossible since f_n^−(z, y) ≤ f_n^0(z, y) ≤ g^0(z, y) < α for all z ∈ U ∩ C_n. Hence for each g ∈ G, ∂_0 g(x) = T(x) for all x ∈ A.

An important special case of the above theorem is the following:

Corollary 9 (The dual ball). Let A be a non-empty open subset of a Banach space X. Then {g ∈ X_{B_{X*}} : ∂_0 g(x) = B_{X*} for all x ∈ A} is residual in (X_{B_{X*}}, ρ).

We may also extend a central case of Theorem 1:

Corollary 10. Let A be a non-empty open subset of an infinite dimensional smoothable Banach space X. Suppose that ||| · ||| is an equivalent norm on X such that ext B_{X*}, the set of extreme points of its associated dual ball B_{X*}, is weak* dense in B_{X*}. Then {g ∈ X_{B_{X*}} : ∂_a g(x) = B_{X*} for all x ∈ A} is residual in (X_{B_{X*}}, ρ).

Proof. Let G := {g ∈ X_{B_{X*}} : ∂_0 g(x) = B_{X*} for all x ∈ A}. We claim that for each g ∈ G, ∂_a g(x) = B_{X*} for all x ∈ A.
To see this, simply note that by the converse of the Krein-Milman theorem ext B_{X*} ⊆ ∂_a g(x) for each g ∈ G and x ∈ A, since B_{X*} = ∂_0 g(x) = co^{w*} ∂_a g(x) and ∂_a g(x) is weak* closed for each x ∈ A.

Remark 1. The hypotheses of the last corollary are satisfied if the norm ||| · ||| on X is smooth.

Corollary 11. Let A be a non-empty open subset of a Banach space X. Then {g ∈ X_{B_{X*}} : for each v ∈ S_X, {x ∈ A : g′(x; v) exists} is first category} is residual in (X_{B_{X*}}, ρ).

Proof. Let G := {g ∈ X_{B_{X*}} : ∂_0 g(x) = B_{X*} for all x ∈ A}. We claim that for each g ∈ G, D_y := {x ∈ A : g′(x; y) exists} is first category in A for each y ∈ S_X. To see this, let us fix y ∈ S_X. Then, by [16] there exists a dense G_δ subset P_y of A where g^0(x; y) = g^+(x; y) and −g^0(x; −y) = −g^+(x; −y) for each x ∈ P_y. We will now show that D_y ⊆ A\P_y. Indeed, if x_0 ∈ P_y ∩ D_y, then

1 = g^0(x_0; y) = g′(x_0; y) = −g′(x_0; −y) = −g^0(x_0; −y) = −1,

which is absurd. Therefore, D_y ⊆ A\P_y and so D_y is of the first category in A.

5. When is X_T non-empty?

Thus far we have not dwelt too much upon the question of when X_T is non-empty. However, below we show that this issue is in fact finitely determined, that is, determined by the behaviour of T on finite dimensional subspaces.

Let A be a non-empty open subset of a Banach space X and let T : A → 2^{X*} be a weak* cusco defined on A. Then for each subspace Y of X with Y ∩ A ≠ ∅ we define T_Y : Y ∩ A → 2^{Y*} by T_Y(x) := {y* ∈ Y* : y* = x*|_Y and x* ∈ T(x)}.

Theorem 7. Let A be a non-empty open connected subset of a Banach space X and let T : A → 2^{X*} be a weak* cusco on A. Then X_T ≠ ∅ if and only if there exists an upwardly directed set (D, ⊆) of finite dimensional subspaces of X such that (i) A ⊆ ⋃_{Y∈D} Y and (ii) X_{T_Y} ≠ ∅ for each Y ∈ D with Y ∩ A ≠ ∅.

Proof.
It is clear that if X_T ≠ ∅ then (i) and (ii) are satisfied. So we will consider the converse question. Fix x_0 ∈ A and define X_{T_Y}(x_0) := {g ∈ X_{T_Y} : g(x_0) = 0}. Note that by possibly making D smaller we may assume that x_0 ∈ Y for all Y ∈ D. Then for each Y ∈ D choose g_Y ∈ X_{T_Y}(x_0). For each such function we consider the following extension g̃_Y : A → R defined by

\[
\tilde g_Y(x) :=
\begin{cases}
g_Y(x) & \text{if } x \in A \cap Y, \\
0 & \text{otherwise.}
\end{cases}
\]

Thus, (g̃_Y : D) is a net in (R_e)^A, which is compact by Tychonoff's theorem. Therefore (g̃_Y : D) has a convergent subnet which converges to some element g ∈ (R_e)^A. It is now routine to check that g is real-valued and locally Lipschitz on A. In fact one can show that ∂_0 g(x) ⊆ T(x) for all x ∈ A and g(x_0) = 0. This shows that g ∈ X_T(x_0) ⊆ X_T.

Remark 2. By applying a similar argument to the above proof we can show that for each x_0 ∈ A, X_T(x_0) is a pointwise compact, convex sub-lattice of X_T.

The problem of determining when X_T ≠ ∅ now reduces to the semi-classical problem of determining when X_{T_Y} ≠ ∅. Below we give an obvious first step in this direction.

Let (M, σ) be a measurable space and let X be a normed linear space. We say that a function f : (M, σ) → X* is weak* measurable if f^{−1}(U) ∈ σ for each weak* open subset U of X*. Note: if X is a separable normed linear space, then this is equivalent to demanding that for each x ∈ X, the mapping x̂ ∘ f : A → R defined by (x̂ ∘ f)(t) := f(t)(x) is measurable. For a non-empty open subset A of a normed linear space X the line integral along a line segment [a, b] ⊆ A of a weak* measurable function f : (A, B_A) → X* is the Lebesgue integral

\[
\int_{[a,b]} f(z)\,dz := \int_0^1 f\big(tb + (1 - t)a\big)(b - a)\,dt .
\]

A polygonal path C in A is an ordered collection of line segments {[a_i, a_{i+1}] : 1 ≤ i ≤ n − 1} for some integer n.
Such a path is said to be closed if a_1 = a_n. The line integral of f on C is defined as

\[
\int_C f(z)\,dz := \sum_{i=1}^{n-1} \int_{[a_i, a_{i+1}]} f(z)\,dz .
\]

For any fixed ε > 0 we will call an ordered collection of line segments P(ε) := {[a_i, b_i] : 1 ≤ i ≤ n − 1} an ε-path from a to b provided

\[
\|a - a_1\| + \sum_{i=1}^{n-1} \|a_{i+1} - b_i\| + \|b_n - b\| < \varepsilon .
\]

Such a path is closed if a = b. For a Borel subset E of A we say that P is an E-admissible ε-path from a to b if P is an ε-path from a to b and λ({t ∈ [0, 1] : tb_i + (1 − t)a_i ∉ E}) = 0 for 1 ≤ i ≤ n − 1. Line integrals on an ε-path are defined similarly as above.

Theorem 8 ([7]). Let A be a non-empty open connected subset of a finite dimensional normed linear space X and let T : A → 2^{X*} be a bounded weak* cusco on A. Then X_T ≠ ∅ if and only if there exist a Borel set E ⊆ A with λ(A\E) = 0 and a weak* measurable selection f : (E, B_E) → X* of T so that

\[
\lim_{\varepsilon \to 0^+} \int_{P(\varepsilon)} f(z)\,dz = 0,
\]

where P(ε) is any closed E-admissible ε-path in A.

We now turn to the existence of Lipschitz functions with 'minimal' subdifferential mappings. These results form a sharp contrast to the existence of Lipschitz functions with maximal subdifferential mappings given in Sections 3 and 4.

Theorem 9 (Minimal approximate subdifferential). Let A be a non-empty open connected subset of a smoothable Banach space X and let f : A → R be a locally Lipschitz function on A. Then there exists a locally Lipschitz g : A → R such that ∂_a g(x) ⊆ ∂_a f(x) for all x ∈ A and ∂_a g is minimal in the sense that for each ĝ with ∂_a ĝ(x) ⊆ ∂_a g(x) for all x ∈ A we have that ∂_a ĝ = ∂_a g.

Proof. Fix x_0 ∈ A and define

P := {Gr(G) ⊆ Gr(∂_a f) : there exists a locally Lipschitz function g on A with ∂_a g ≡ G}.

On P, which is clearly non-empty, we may define a partial order '≤' by G_1 ≤ G_2 if and only if G_1 ⊆ G_2. We will use Zorn's lemma to show that (P, ≤) has a minimal element. To this end, let C be a chain in P. We will show that C has a lower bound in P.
For each G ∈ C let g_G be a locally Lipschitz function on A such that ∂_a g_G ≡ G and g_G(x_0) = f(x_0). Thus, (g_G : C) is a net in (R_e)^A, which is compact. Therefore (g_G : C) has a subnet which converges to some element g ∈ (R_e)^A. As with Theorem 7 it is routine to show that g is real-valued and locally Lipschitz on A. We claim that ∂_a g is a lower bound for C. To prove this claim let us consider any G_0 ∈ C. We need to show that ∂−g(x) ⊆ ∂_a g_{G_0}(x) for all x ∈ A. Indeed, if this is not the case, then there exist an x_1 ∈ A and x*_1 ∈ ∂−g(x_1) such that x*_1 ∉ ∂_a g_{G_0}(x_1). We may now select a finite set F ⊆ S_X and α ∈ R so that if W := {x* ∈ X* : |x*(y)| < 4α, y ∈ F}, then (x*_1 + W) ∩ (∂_a g_{G_0}(x_1) + W) = ∅ and, by the weak* upper semi-continuity of x ↦ ∂_a g_{G_0}(x), there exists an r > 0 so that ∂_a g_{G_0}(B_r(x_1)) ⊆ ∂_a g_{G_0}(x_1) + W. Next we set Y := sp F. Then since g is Lipschitz around x_1 and S_Y is compact there exists a 0 < δ < r so that

(g − x*_1)(x_1 + λv) − (g − x*_1)(x_1) > −λ · α ≥ −δ · α

for all 0 < λ ≤ δ and v ∈ S_Y. Now by the Arzelà-Ascoli theorem the set {g_G|_{x_1+δB_Y} : G ∈ C} is relatively norm compact in (C(x_1 + δB_Y), ‖·‖_∞). Hence the net (g_G|_{x_1+δB_Y} : C) has a subnet which converges to g|_{x_1+δB_Y} with respect to the sup-norm on x_1 + δB_Y. In particular, this means that we can choose G ≤ G_0 so that

(g_G − x*_1)(z) − (g_G − x*_1)(x_1) > −2δ · α

for all z ∈ x_1 + δB_Y. Therefore, by Lemma 8 there exist a z ∈ B_δ(x_1) and z* ∈ X* with z* ∈ ∂−(g_G − x*_1)(z) and ‖z*|_Y‖ < 4α. That is, x*_1 + z* ∈ ∂−g_G(z). However, x*_1 + z* ∈ x*_1 + W, which is impossible since ∂−g_G(z) ∩ (x*_1 + W) ⊆ ∂_a g_{G_0}(z) ∩ (x*_1 + W) = ∅. This shows that ∂_a g(x) ⊆ ∂_a g_{G_0}(x) for all x ∈ A. Hence by Zorn's lemma (P, ≤) has a minimal element which is the desired subgradient.
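As a hedged illustration of the minimality notion in Theorem 9 (this example is not taken from the paper), consider the one-dimensional case X = A = R with f(x) = |x|. The sketch below assumes two standard facts: for a convex function the approximate subdifferential agrees with the convex subdifferential, and for a locally Lipschitz g the set ∂_a g(x) is non-empty with ∂_0 g(x) equal to its closed convex hull. Under these assumptions any admissible g must be f plus a constant, so ∂_a f is already minimal in the sense of Theorem 9.

```latex
% Worked example (illustrative, not from the source): f(x) = |x| on A = \mathbb{R}.
\[
\partial_a f(x) =
\begin{cases}
\{-1\}, & x < 0,\\
[-1,1], & x = 0,\\
\{1\}, & x > 0.
\end{cases}
\]
% Let g be locally Lipschitz with \partial_a g(x) \subseteq \partial_a f(x) for all x.
% Off the origin the inclusion forces a singleton Clarke subdifferential,
% so g is strictly differentiable there:
\[
\partial_0 g(x) = \{\operatorname{sgn}(x)\} \quad (x \neq 0)
\;\Longrightarrow\;
g'(x) = \operatorname{sgn}(x) \quad (x \neq 0).
\]
% Integrating on each half-line and using continuity of g at 0:
\[
g(x) = |x| + g(0) \quad (x \in \mathbb{R}),
\qquad\text{hence}\qquad
\partial_a g = \partial_a f .
\]
```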
One may similarly prove the following:

Theorem 10 (Minimal Clarke subdifferential). Let A be a non-empty open connected subset of a Banach space X and let f : A → R be a locally Lipschitz function on A. Then there exists a locally Lipschitz g : A → R such that ∂_0 g(x) ⊆ ∂_0 f(x) for all x ∈ A and ∂_0 g is minimal in the sense that for each ĝ with ∂_0 ĝ(x) ⊆ ∂_0 g(x) for all x ∈ A we have that ∂_0 ĝ = ∂_0 g.

As with an usco (cusco) mapping which may contain several minimal uscos (cuscos), an approximate (Clarke) subdifferential mapping may contain several minimal approximate (Clarke) subdifferential mappings. On the real line, Theorem 10 is clear by using the fact that every cusco contains a minimal cusco and that every minimal cusco on the line is a Clarke subdifferential map of some locally Lipschitz function. In fact, on the line ∂_0 g is a minimal cusco if and only if ∂_0 g is minimal in the sense of Theorem 10, while the latter is equivalent to ∂_a g being minimal in the sense of Theorem 9, as in one dimension the Clarke subdifferential determines the approximate subdifferential by Theorem 2.2 [2]. However, each strongly integrable function is minimal in the sense defined in Theorems 9 and 10 and there are integrable functions whose generalized subdifferentials are neither minimal weak* cuscos nor minimal weak* uscos (see Example 7.1 in [6]).

In Corollary 3 we saw that the Clarke subdifferential mapping is closed under the operation of taking finite convex hulls. Hence it is natural to ask whether the Clarke subdifferential is closed under the operation of taking finite intersections. The next example shows that this fails in a rather strong way.

Example 4. Let f : R → R be an everywhere differentiable and strictly increasing function such that {x ∈ R : f′(x) = 0} is dense in R and |f(x) − f(y)| < |x − y| for all x, y ∈ R. Let C := epi(f) := {(x, y) ∈ R^2 : f(x) ≤ y}. Next, consider the distance function d_C defined on R^2 by the l_1 norm and the set C.
Let g be any Lipschitz function on R^2 such that ∂_0 g(x) is identically equal to the l_2 norm ball. If we define T : R^2 → 2^{R^2} by T(x) := ∂_0 g(x) ∩ ∂_0 d_C(x) for all x ∈ R^2, then T is a cusco but X_T = ∅. Indeed, if h ∈ X_T, then ∂_0 h(x) ⊆ ∂_0 d_C(x) for all x ∈ R^2. However, as d_C is integrable [6] we must have that ∂_0 h(x) = ∂_0 d_C(x) for all x ∈ R^2. But this is impossible since ∂_0 d_C(x) is not contained in the l_2 ball for all values of x ∈ R^2.

References

1. J. M. Borwein, Minimal CUSCOS and subgradients of Lipschitz functions, in Fixed Point Theory and its Applications, Pitman Research Notes 252 (1991), 57–81. MR 92j:46077
2. J. M. Borwein and S. Fitzpatrick, Characterization of Clarke subgradients among one-dimensional multifunctions, in Proc. of the Optimization Miniconference II, edited by B. M. Glover and V. Jeyakumar, (1995), 61–73.
3. J. M. Borwein and A. Ioffe, Proximal analysis in smooth spaces, Set-Valued Anal. 4 (1996), 1–24. MR 96m:49028
4. J. M. Borwein and W. B. Moors, Null sets and essentially smooth Lipschitz functions, SIAM J. Optim. 8 (1998), 309–323. MR 99g:49013
5. J. M. Borwein and W. B. Moors, Separable determination of integrability and minimality of the Clarke subdifferential mapping, Proc. Amer. Math. Soc. 128 (2000), 215–221. MR 2000e:49025
6. J. M. Borwein and W. B. Moors, Essentially smooth Lipschitz functions, J. Funct. Anal. 149 (1997), 305–351. MR 98i:58028
7. J. M. Borwein, W. B. Moors and Y. Shao, Subgradient representation of multi-functions, J. Austral. Math. Soc. Ser. B. 40 (1998), 1–13. MR 2001b:49020
8. J. M. Borwein, W. B. Moors and X. Wang, Lipschitz functions with prescribed derivatives and subderivatives, Nonlinear Anal. 29 (1997), 53–64. MR 98j:49019
9. J. M. Borwein and D.
Preiss, A smooth variational principle with applications to subdifferentiability and to differentiability of convex functions, Trans. Amer. Math. Soc. 303 (1987), 517–527. MR 88k:49013
10. A. M. Bruckner, J. B. Bruckner and B. S. Thomson, Real Analysis, Prentice-Hall, Inc., 1997.
11. F. H. Clarke, Optimization and Nonsmooth Analysis, Wiley Interscience, New York, 1983. MR 85m:49002
12. B. Dacorogna and P. Marcellini, General existence theorems for Hamilton-Jacobi equations in the scalar and vectorial cases, Acta Math. 178 (1997), 1–37. MR 98d:35029
13. B. Dacorogna and P. Marcellini, Dirichlet problem for nonlinear first order partial differential equations, Optimization methods in partial differential equations (South Hadley, MA, 1996), 43–57, Contemp. Math. 209, Amer. Math. Soc., Providence, RI, 1997. MR 98h:35037
14. Marián J. Fabian, Gâteaux Differentiability of Convex Functions and Topology: Weak Asplund Spaces, Wiley Interscience, New York, 1997. MR 98h:46009
15. J. R. Giles and W. B. Moors, A continuity property related to Kuratowski's index of noncompactness, its relevance to the drop property and its implications for differentiability, J. Math. Anal. and Appl. 178 (1993), 247–268. MR 94m:46022
16. J. R. Giles and S. Sciffer, Locally Lipschitz functions are generically pseudo-regular on separable Banach spaces, Bull. Austral. Math. Soc. 47 (1993), 205–212. MR 94a:58022
17. A. D. Ioffe, Approximate subdifferentials and applications II, Mathematika 33 (1986), 111–128. MR 87k:49028
18. A. D. Ioffe, Approximate subdifferentials and applications III. The metric theory, Mathematika 36 (1989), 1–38. MR 90g:49012
19. E. Michael, A note on closed maps and compact sets, Israel J. Math. 2 (1964), 173–176. MR 31:1659
20. R. R. Phelps, Convex Functions, Monotone Operators and Differentiability, Lecture Notes in Mathematics 1364, Springer-Verlag, Berlin Heidelberg, 1993. MR 94f:46055
21. D. Preiss, Differentiability of Lipschitz functions on Banach spaces, J. Funct. Anal.
91 (1990), 312–345. MR 91g:46051
22. D. Preiss, R. R. Phelps and I. Namioka, Smooth Banach spaces, weak Asplund spaces and monotone or usco mappings, Israel J. Math. 72 (1990), 257–279. MR 92h:46021
23. D. Preiss and J. Tiser, Points of non-differentiability of typical Lipschitz functions, Real Anal. Exchange 20 (1995), 219–226. MR 95m:26006
24. R. T. Rockafellar, The Theory of Subgradients and Its Applications to Problems of Optimization: Convex and Nonconvex Functions, Heldermann-Verlag, Berlin, 1981. MR 83b:90126
25. R. T. Rockafellar and R. J-B. Wets, Variational Analysis, Springer-Verlag, Berlin, 1998. MR 98m:49001
26. Xianfu Wang, Fine and pathological properties of subdifferentials, Ph. D. Thesis, Simon Fraser University, 1999.

Centre for Experimental and Constructive Mathematics, Department of Mathematics and Statistics, Simon Fraser University, Burnaby, B.C. V5A 1S6, Canada
E-mail address: jborwein@cecm.sfu.ca

Department of Mathematics, The University of Waikato, Private bag 3105 Hamilton, New Zealand
E-mail address: moors@math.waikato.ac.nz

Centre for Experimental and Constructive Mathematics, Department of Mathematics and Statistics, Simon Fraser University, Burnaby, B.C. V5A 1S6, Canada
E-mail address: xwang@cecm.sfu.ca
