1 Introduction
The theory of risk measures in mathematical finance has become mainstream, especially since the landmark paper of Artzner etΒ al., (1999). For a comprehensive review, see the books of Delbaen, (2012) and Follmer and Schied, (2016). A risk measure is a functional over some set of random variables (see below formal definitions of the concepts exposed in this introduction), where is then the monetary value for the risk of .
Knightian uncertainty is a very important risk management feature because it prevents perfect information from being attained. In this setup, decision-makers face the consequences of their risk assessments
under partial information. Thus, considering uncertainty sets to determine the value of a risk measure allows us to make robust decisions. For risk measures, in order to deal with such uncertainty, it is usual to consider a worst-case approach, i.e., by considering a risk measure that is a point-wise supremum of a base risk measure over some uncertainty set.
A usual stream is linked to scenarios, where , and thus robustness is over probability chosen, as considered in Wang and Ziegel, (2021), Bellini etΒ al., (2018) and Fadina etΒ al., (2024), for instance. A more general possibility is to deal with uncertainty over the choice of the risk measure, as in Righi, (2023) and Wang and Xu, (2023) for instance, where . In both cases, the uncertainty set is fixed for any ; thus, the analysis is well documented. For instance, the penalty term for , a key feature in the literature of risk measures computed as the convex conjugate, is given as the lower semicontinuous convex envelope of , i.e., the point-wise infimum of the individual penalty terms.
A more intricate setup regards uncertainty regarding the random variables and how they affect risk measures. It is a prominent topic in the mainstream literature since it is linked to model uncertainty and risk. In this case, the uncertainty depends on the random variables as
|
|
|
where is the uncertainty set specific for . Thus, by varying , there is a variation on the set where the supremum is taken. This approach is very relevant for distributionally robust optimization. See Esfahani and Kuhn, (2018) for a detailed discussion.
In this paper, we then study the general properties of worst-case convex risk measures under uncertainty on random variables on spaces. More specifically, we are interested in the properties of the map . Our study is the first paper to deal with such features for general convex risk measures. The goal of most papers in this stream (see the mentioned paper below) is to develop closed forms over specific uncertainty sets, mostly for distortion risk measures or other specific classes of risk measures, instead of the properties of as a risk measure per se. Exceptions are Moresco etΒ al., (2023), where it is studied on a dynamic setup the interplay between the primal properties of and those for , and Righi etΒ al., (2024), where risk measures over sets of random variables are studied. Nonetheless, none of such papers deal with the features we approach in this study or in the same generality we do.
In 1, we prove results that establish its convex conjugate, also known as penalty term, in the specialized risk measures literature. We show that the penalty term becomes
|
|
|
where is some element of , the usual topological dual of , and is the usual set for dual representation of . The key ingredient is to use worst-case expectations , for , as building blocks. With such a penalty term for dual representation, we can provide more concrete formulations for key tools in the risk measures literature, such as the acceptance sets, as well as refine results for closed forms of worst-case convex risk measures for specific choices of the uncertainty sets . Most papers in the literature, such as in Bartl etΒ al., (2020), Bernard etΒ al., (2023), Cornilly etΒ al., (2018), Cornilly and Vanduffel, (2019), Shao and Zhang, 2023b , and Hu etΒ al., (2024), focus on developing closed forms over specific uncertainty sets, mostly for distortion risk measures or other specific classes of risk measures, instead of the more general features we address in this paper.
In 2, from the obtained penalty term, we provide results to establish sub-differentials for worst-case convex risk measures. We then link the sub-differential with the building blocks and characterize it as
|
|
|
|
|
|
|
|
where belongs to the argmin of the penalty term regarding to , and is the argmax of dual representation for . This characterization is crucial for robust optimization problems. Intuitively, this approach introduces an adversary whose problem is inner maximization to account for the impact of the model uncertainty. Such worst-case situations are naturally difficult to address for optimization. In this sense, recent work has been considered, especially by showing the problem is equivalent to usual convex ones or even finite-dimensional as in Pflug etΒ al., (2012), Wozabal, (2014), Cai etΒ al., (2023), Pesenti etΒ al., (2022), Pesenti and Jaimungal, (2023), Blanchet etΒ al., (2022), Li, (2018), Chen and Xie, (2021), Liu etΒ al., (2022). However, none of these papers deal with the topic of the sub-differential as we do in the current paper.
In 3, we develop closed forms for worst-case law invariant convex risk measures under sets for random variables based on mean and variance. More specifically, we obtain for the mean-variance uncertainty set the closed form as
|
|
|
This is a generalization of the results for this set exposed in
Li, (2018), Cornilly etΒ al., (2018), Cornilly and Vanduffel, (2019), Chen and Xie, (2021), Shao and Zhang, 2023b , Shao and Zhang, 2023a , Zhao etΒ al., (2024), Zuo and Yin, (2024) and Cai etΒ al., (2023), which study the class of spectral or concave distortion risk measures. This result may be understood as a generalization even for non-concave distortion risk measures since the cited authors show that the worst-case risk measure of a non-concave distortion is the same as taking its concave envelope, using techniques such as concentration of distributions and isotonic projections in order to make the problem convex. We explore concrete examples of popular risk measures under this setup, with a connection between our result and the cited literature.
In 4, we obtain a closed form for worst-case law invariant convex risk measures over closed balls in the Wasserstein distance. Closed balls around under some suitable distance are typical choices for uncertainty sets, and the Wasserstein metric is prominent since it is related to quantiles in its one-dimensional form. We show that in this case the penalty term simplifies to
|
|
|
where is the desired radius of the ball. Moreover, the closed form becomes
|
|
|
Thus, the key ingredient is the supremum norm of the sub-differential set of at . We also provide other equivalent results for this closed form and identify its argmax elements. These results generalize the literature since the papers deal with specific cases and risk measures. In Bartl etΒ al., (2020) and Li and Tian, (2023), it is investigated the worst-case of optimized certainty equivalents and shortfall risks over such balls, Hu etΒ al., (2024) study the case of expectiles, while in Liu etΒ al., (2022), the result is obtained for concave spectral risk measures. None of such papers expose a general approach as we do. We also expose concrete examples or risk measures, relating our results to the literature.
The remainder of this paper is structured as follows. In SectionΒ 2, we define our setup and prove the general results regarding the worst-case convex risk measure, with emphasis on dual representations and sub-differentials. In SectionΒ 3, we address the case of partial information on sets for random variables based on mean and variance, with a focus on the closed form for the worst-case risk measure. In SectionΒ 4, we study the case of uncertainty on closed balls for the Wasserstein metric in order to specialize results from the general setup and determine equivalent closed forms for the worst-case risk measure.
2 Robust convex risk measures
Consider the real-valued random result of any asset ( is a gain and is a loss) that is defined on a probability space . All equalities and inequalities are considered almost surely in . We define , , and as the indicator function for an event . Let be the space of (equivalent classes of) random variables such that for and for , where is the expectation operator. Further, let and for be, respectively, the distribution function and the (left) quantile of .
For any , we define as its characteristic function on , which assumes if , and , otherwise.
For any , its sub-gradient at is . We say is GΓ’teaux differentiable at when is differentiable at
for any and the derivative defines a continuous linear functional on . When not explicit, it means that definitions and claims are valid for any fixed with its usual p-norm. We denote by the closed convex hull of a set in . As usual, , is the usual dual of . For , we consider the dual pair , where we call weak topology for its weak* topology. Let be the set of all probability measures on that are absolutely continuous with respect to , with RadonβNikodym derivative . With some abuse of notation, we treat probability measures as elements of .
A functional is a risk measure, and it may possess the following properties:
-
(i)
Monotonicity: if , then .
-
(ii)
Translation Invariance: .
-
(iii)
Convexity: .
-
(iv)
Positive Homogeneity: .
-
(v)
Law Invariance: if , then .
-
(vi)
Comonotonic additivity: for any comonotonic pair .
We have that is called monetary if it fulfills (i) and (ii), convex if it is monetary and respects (iii), coherent if it is convex and fulfills (iv), law invariant if it fulfills (v), and comonotone if it has (vi). Unless otherwise stated, we assume that risk measures are normalized in the sense that . The acceptance set of is defined as .
We now focus on exposing our proposed approach for robust convex risk measures. We begin with the formal definition of worst-case risk measure.
Definition 1.
Let be a risk measure. Its worst-case version is given as
|
|
|
where is closed and bounded set with for any .
Remark 1.
-
(i)
When fulfills monotonicity, we have that is real valued because is bounded. More precisely, we have for any that
|
|
|
-
(ii)
There is preservation for the worst-case determination for operations preserved under point-wise supremum. More specifically, we have: if , then ; for any ; for any ; and , where is arbitrary non-empty set.
We now state a simple but useful result regarding the preservation by the worst-case of main properties from the base risk measure .
Proposition 1.
We have that . Also, we have the following sufficient conditions for to preserve properties from :
-
(i)
Monotonicity: if implies for any , there is such that .
-
(ii)
Translation Invariance: if .
-
(iii)
Convexity: .
-
(iv)
Normalization: if .
-
(v)
Positive Homogeneity:
-
(vi)
Law Invariance: implies .
Proof.
The claims for the acceptance and Law Invariance are trivial. For (i)-(v), the claim follows similar steps from Proposition 2 in Moresco etΒ al., (2023).
β
Dual representations are a key feature in the theory of risk measures. From Theorems 2.11 and 3.1 of Kaina and RΓΌschendorf, (2009), a map , , is a convex risk measure if and only if it can be represented as:
|
|
|
where
|
|
|
Moreover, is continuous in the norm and continuous in the bounded -a.s. convergence (Lebesgue continuous). For , Theorem 4.33 Corollary 4.35 in Follmer and Schied, (2016) assures that the claim holds if and only if is Lebesgue continuous. In any case, the maximum can be taken over the weakly compact .
We now prove a dual representation for worst-case convex risk measures. Our building blocks will be worst-case expectations defined as for . In the following, when not made explicit, we assume that is a convex risk measure and the uncertainty sets possess properties (i)-(iv) of 1.
Lemma 1.
We have that:
-
(i)
is a convex risk measure for any , with .
-
(ii)
for any , and for any .
-
(iii)
for any , and .
-
(iv)
for any and any .
Proof.
For (i), let for some . Thus, each is a finite convex risk measure by 1 considering a base risk measure . Hence, it can be represented over
|
|
|
Regarding (ii), for the first claim on , since for any we have by straightforwardly calculation that
|
|
|
Regarding the second claim, we can take with but . Then, if , then also for any . In this case, we get that
|
|
|
For (iii), the first claim follows since . The claim on the acceptance set follows as
|
|
|
|
|
|
|
|
For (iv), by (i), we have each as a finite convex risk measure. Thus, the supremum is not altered when taken over the weakly compact , and is linear and bounded, hence weakly continuous, the supremum is attained in the definition and for some .
β
Theorem 1.
We
have that
|
|
|
(1) |
where is obtained as
|
|
|
(2) |
Proof.
By 1, is a convex risk measure for any , which is represented over . We claim that is weak continuous for each . Fix then and let weakly, i.e. for any . Let now be defined as for any and . By recalling that is weakly compact by Alaoglu Theorem, we then have that is tight, i.e. for each there is a weakly compact subset and such that
|
|
|
Recall that the hypo-graph of a map is defined as
|
|
|
Since for any , we then have have that hypo-converges to , i.e. for any , with the usual product metric on . Thus, under tightness, we have that hypo-convergence implies convergence of the supremum; see Proposition 7.3.5 of Aubin and Frankowska, (2009) for instance. By 1, we have that . Then, we obtain that
|
|
|
Thus, is weak continuous. Now fix and let be given as
|
|
|
This map is linear and continuous in the first argument, taken on the convex set . In contrast, it is convex and weak lower semicontinuous in the second argument, taken on the weakly compact . By 1, we have that . Thus, we obtain that
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The third inequality follows from the Sion minimax theorem, see Sion, (1958), which holds since possesses sufficient properties. By the weak lower semicontinuity of , the infumum is attained in . Since for any , the minimum is not altered if taken over . This concludes the proof.
β
Remark 2.
-
(i)
It is intuitive that while is a supremum on constrained to be taken over , in its turn is a infimum over taken on a subset of , the dual space of , adjusted by the penalty of expectations over all .
-
(ii)
We have that . This inequality can also be deduced from . Further, by 1, the infimum in (2) can be taken only over those such that .
-
(iii)
Notice that we have not used any property beyond convexity and lower semicontinuity for , and in the proof. Thus, the claim remains valid without the Monetary properties, as in general convex analysis, for instance, by letting the proper domain of the penalty be contained in some general subset of instead of .
Positive Homogeneity, and thus coherence, leads to a simpler dual representation. Theorem 2.9 in Kaina and RΓΌschendorf, (2009) assures that a map , , is a coherent risk measure if and only if it can be represented as
|
|
|
where is a nonempty, closed, and convex set that is called the dual set of . For , Corollaries 4.37 and 4.38 in Follmer and Schied, (2016) assures that the claim holds under Lebesgue continuity.
We then have a direct Corollary in the presence of Positive Homogeneity, and thus coherence, of the base risk measure and the uncertainty set.
Corollary 1.
If in addition to the conditions of 1 we have Homogeneity Positivity for both and for any , then is the characteristic function of , where is the dual set of .
Proof.
Under these circumstances, by 1, each is also a coherent risk measure. The claim now follows as
|
|
|
β
In convex analysis, sub-differentials play a critical role in optimization. For a convex risk measure , Theorem 21 and Proposition 14 of Delbaen, (2012), for , and Theorem 3 of RuszczyΕski and Shapiro, (2006), for , assure that
|
|
|
Furthermore,
is GΓ’teaux differentiable at if and only if is a singleton, which in this case the derivative turns out to be defined by , i.e. the map .
We now prove a result for explicit representations for the sub-gradient of the worst-case convex risk measure. As in the case for the penalty term and dual representation, the sub-gradient has as building blocks the auxiliary maps and the base risk measure . With some abuse of notation in the context of Gateaux differential, we treat and the continuous linear functional it defines as the same.
Theorem 2.
We have for any that:
-
(i)
|
|
|
(3) |
where belongs to the argmin of (2) regarding to .
-
(ii)
|
|
|
where is the argmax of for . In particular, is GΓ’teaux differentiable at if and only if is GΓ’teaux differentiable at for any with the same derivative.
-
(iii)
If for any , then
|
|
|
Proof.
Fix . For (i), we have that if and only if . Then, by using the penalty term from 1 we directly have
|
|
|
|
|
|
|
|
Concerning (ii), Theorem 2.4.18 in Zalinescu, (2002) assures that for a family of convex functions over , with a compact topological space, and , if are upper semicontinuous and is continuous, then
|
|
|
where , and . We now claim that we can use such a result in our framework. We have that the maximum on (1) can be taken on the weakly compact . Let for each a functional on be defined as
|
|
|
We have that these maps are convex. Also, we have, as in the proof of 1, that is weak upper semicontinuous for any . Further, it is clear that . Moreover, as a convex risk measure, is continuous. Further, it is straightforward that . Hence, applying the result we have that
|
|
|
|
The claim for the GΓ’teaux derivative is straightforwardly obtained from such sub-differential.
Regarding (iii), the claim follows because for any we have that
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Thus, we obtain that
|
|
|
Since sub-differentials are closed and convex, we can safely take operation.
β
3 Mean and variance
On the next sections, we have that atomless. A case of interest is when the uncertainty set is based on moments of the random variable. In particular, mean and variance as .
fits into our approach for any since it is a closed, bounded, even convex set such that . Furthermore, this family fulfills properties (ii)-(vi) of 1. However, it is not a monotone set; thus, the resulting worst-case risk measure may not be monetary. A consequence is that the penalty term from dual representation must be considered on . We address this case in this section and, thus, restrict our analysis to .
Worst-case formulations under this uncertainty set are well documented for spectral risk measures, which are precisely the risk measures satisfying all properties (i)-(vi). Such maps can be represented as weighting (spectral) schemes of Value at Risk (VaR), which is defined as . Thus, distortion/spectral risk measures are represented as
|
|
|
where is a non-increasing functional such that . For details on such representation, see Follmer and Schied, (2016) for and FilipoviΔ and Svindland, (2012) for . In this case, results in Li, (2018), Cornilly etΒ al., (2018), Cornilly and Vanduffel, (2019), Cai etΒ al., (2023), Pesenti etΒ al., (2022) allow to conclude that
|
|
|
where the 2-norm is taken over . These authors also derive a closed form when is coherent and law invariant, relying on the fact that in this case , where . In this case the worst-case risk measure becomes
|
|
|
We now expose a closed-form solution for the worst-case risk measure when the base is a law invariant convex risk measure. Our result is given in terms of and , which are in general more tractable than .
Theorem 3.
Let be a law invariant convex risk measure and . Then, we have that:
-
(i)
|
|
|
-
(ii)
|
|
|
(4) |
-
(iii)
the argmax for any is
|
|
|
-
(iv)
|
|
|
where is the argmax of for . In particular, is GΓ’teaux differentiable at if and only if is a singleton.
Proof.
For (i), under Law Invariance and atomless, we have the special Kusuoka representation, see for instance Theorem 2.2 of FilipoviΔ and Svindland, (2012),
|
|
|
Since each is law invariant, 1 which implies that the same property holds for the auxiliary maps , which become , where
|
|
|
It is an easy task to show that defines a valid distortion/spectral risk measure for any . Thus, in view of the above discussion, we get that
|
|
|
Moreover, by some calculation we also have . We show it for continuous by recalling that has uniform distribution over . Nonetheless, the general case follows similar steps with more algebra under the modified distribution of given as , where . In this case if is independent of and uniformly distributed over , then we also have that follows an uniform distribution over . We get that
|
|
|
|
Therefore, it is then clear that the auxiliary are coherent, and their penalty term are characteristic functions on dual sets . Thus, the result follows by noticing that the dual sets of the negative expectation and the 2-norm are, respectively, and
For (ii), from 1 we have that the maps are building blocks for as
|
|
|
Thus, we then get for any that
|
|
|
|
|
|
|
|
Regarding (iii), for the argmax, if is constant, then the claim is trivial since . Thus, fix non-constant and let , where is the argmax of (4) for . It is straightforward to verify that . It only remains to show that . We have that . Furthermore, we have that
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Hence, we obtain that
|
|
|
For (iv), we proceed as in 2, by letting for each a functional on be defined as
|
|
|
Moreover, is convex and bounded above in any set . Thus, by Theorem 1.4 in Gao and Xanthos, (2024) we have that is continuous. By recalling that the expectation and the 2-norm are both GΓ’teaux differentiable with respective derivatives and , we have that
|
|
|
Further, it is straightforward that . Hence, applying the result we have that
|
|
|
|
The claim for the GΓ’teaux derivative is straightforwardly obtained from such sub-differential. This concludes the proof.
β
Under the presence of Positive Homogeneity, the problem becomes more tractable, with concrete penalty terms and sub-gradient. We now expose a Corollary regarding this context.
Corollary 2.
If in addition to the conditions of 3, fulfills Positive Homogeneity, then:
-
(i)
|
|
|
-
(ii)
is the characteristic function of
|
|
|
-
(iii)
is GΓ’teaux differentiable at any with derivative
|
|
|
Proof.
For (i), the result holds since is the characteristic function of . Regarding (ii), Positive Homogeneity implies that is the characteristic function of . The result follows by noticing that the dual sets of the negative expectation and the 2-norm are, respectively, and . For (iii), the claim follows by recalling that the expectation and the 2-norm are both GΓ’teaux differentiable with respective derivatives and .
β
We conclude this section by exposing some concrete examples of closed-form expressions under 3. We consider both risk measures that already appear in the literature of worst-case under mean and variance uncertainty sets in order to clarify that our approach nests existing results and risk measures for which closed-form solutions are a novelty.
Example 1.
-
(i)
When is comonotone additive, we then recover the result from the literature, with . A typical example in this situation is Expected Shortfall (ES), that is functional defined as
|
|
|
In this case the spectral function is . The worst-case of ES becomes
|
|
|
Of course, under our approach, we have the same result. The dual set of ES is defined as
|
|
|
Thus, it is clear that . For the converse inequality, for each , we have that . Then, we obtain that
|
|
|
-
(ii)
A risk measure in the conditions of the Theorem that is not comonotone is the Expectile Value at Risk (Exp), linked to the concept of an expectile. It is a functional directly defined as an argmin of a scoring function, which is given by
|
|
|
|
By Bellini etΒ al., (2014), the Exp is a law invariant coherent risk measure for . In addition, this measure is the only example of elicitable coherent risk measure that does not collapse to the mean. See Ziegel, (2016) for details.
The dual set of Exp can be given by
|
|
|
In order to obtain , we must to compute . Due to the nature of , this is a tricky quest. Nonetheless, in Proposition 9 of Bellini etΒ al., (2014) a formulation for Exp is given as
|
|
|
Thus, we can represent it as
|
|
|
In this case, by 3 we have that in order to obtain , we must to compute . According to Hu etΒ al., (2024), this maximum is attained for , leading to . Hence, we have that
|
|
|
-
(iii)
Another example in the setup of Theorem without comonotonic additivity is the Mean plus Semi-Deviation (MSD). Such risk measure is the functional defined by
|
|
|
This risk measure is studied in detail by Fischer, (2003), and it is a well-known law invariant coherent risk measure, which belongs to loss-deviation measures discussed by Righi, (2019).
The dual set of this measure can be represented by
|
|
|
Notice that for any we have that
|
|
|
Since and , we have that . By taking for , we have that . Hence, we have that
|
|
|
|
-
(iv)
A class of law invariant convex, not necessarily coherent, risk measures are the shortfall risks (SR). Such maps are defined as as
|
|
|
where is a strictly convex and increasing loss function, and is an interior point in the range of . The intuition is that such maps connect convex risk measures and the
expected utility theory since maximizing expected utility is equivalent to minimizing the expected loss. A concrete and popular choice for utility/loss function is the power functions given as . We then have that its penalty term is given, according to Example 4.118 of Follmer and Schied, (2016), as
|
|
|
We are then, in order to determine , interested in the value of
|
|
|
By recalling that
, we have that
making , the goal then becomes to determine the value of
|
|
|
where is the bound of the weakly compact . Thus, the critical point is obtained for , which is valid when both and . Assuming this is the case, then we have that
|
|
|
4 Wasserstein balls
Another case of potential interest for uncertainty sets is closed balls under some suitable metric centered at with some specified radius . A prominent example in the literature is the Wasserstein distance or order as
|
|
|
For it is possible to defined . For a detailed discussion on this metric, see Villani, (2021), while Esfahani and Kuhn, (2018) is a reference for its use in robust decision-making.
In this context, our uncertainty sets then become . As closed balls, this kind of uncertainty set directly lies in our framework, with the additional feature of being convex. This family fulfills properties (i), (ii), (iii), and (vi) of 1. It is, however, not normalized, which will also make not possess such a property. It is also not Positive Homogeneity. Consequently, coherence is beyond the scope of this section.
For spectral risk measures, as is for the case exposed in the last section, the worst-case is well documented. For instance, Liu etΒ al., (2022) obtains the following formulation . Outside this context, there are results for specific risk measures, such as Shortfall Risks in Bartl etΒ al., (2020) and Expectiles in Hu etΒ al., (2024).
We now expose a closed-form solution for the worst-case risk measure when the base is a law invariant convex risk measure. Our result is given in terms of p-norms and sub-differentials for both the penalty and the risk measure. This result is easily tractable, especially when the base risk measure is GΓ’teaux differentiable.
Lemma 2.
for any , where . If ,
then .
Proof.
For any , we have that . By taking the supremum over we obtain . If , then the map attains its supremum in , which coincides to .
β
Remark 3.
Two sufficient conditions for to be not-empty, even compact are:
-
(i)
is compact: since is continuous, the supremum is attained. In this case, since is closed, it is also compact.
-
(ii)
is weak continuous and convex: by recalling that is weak lower semicontinuous, it is weak continuous if and only if it is weak upper semicontinuous. Since, in this case, the supremum can be taken over the weakly compact , the supremum is attained. In this case, is also weak compact.
Theorem 4.
Let be convex law invariant and for any . Then, we have:
-
(i)
|
|
|
-
(ii)
|
|
|
where and .
-
(iii)
the argmax is
|
|
|
where solves .
-
(iv)
|
|
|
Proof.
For (i), consider again the the family of maps
|
|
|
In this case the auxiliary maps become . Since the expectation is Lipschitz continuous regarding to the Wasserstein metric, we have by HΓΆlder inequality that the following holds for any :
|
|
|
If , for each , let be such that . Then, it is clear that , and we also have the following convergence:
|
|
|
If , then take such that
|
|
|
Then, direct calculation leads to both and
|
|
|
For , take , which is in . It is straightforward to verify that
|
|
|
Thus, in any case for we have that
|
|
|
By 1, the supremum in is always attained in . Thus, by 2 we obtain that
|
|
|
In this case, the penalty term becomes
|
|
|
Thus, in view of 1, and recalling that is law invariant, the penalty term for becomes
|
|
|
Regarding (ii), from the penalty term obtained in (i), we have that is given as the sup-convolution. See Ekeland and Temam, (1999) or Zalinescu, (2002) for details, between and the concave function defined as
|
|
|
We then have for any that
|
|
|
|
|
|
|
|
|
|
|
|
For any , we have that
|
|
|
By taking the infimum over we have that . Notice that the infimum is attained since the q-norm is weakly lower semicontinuous, and the sub-differential is a weakly compact set.
For the converse relation, take
|
|
|
Of course, . We have for any that
|
|
|
By taking the maximum over we have that .
For the last equality in the claim, take . We then have that
|
|
|
By taking the supremum over , we have that . For the converse inequality, since we have that
|
|
|
Thus, . In this case, . Hence, . The fact that the supremum in the definition of is attained is a direct application of the James Theorem since is weakly compact and the q-norm is the supremum of a linear map, , over the unit ball in .
For (iii), regarding the argmax, for , let be such that . For , let . Notice that . We can take such that . Then, we have that . We also have that . For , let . Recall that . Thus, for any we have that
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
We then have that . Hence, is the argmax.
Concerning (iv), the claim follow since, for any ,
|
|
|
This concludes the proof.
β
From 4, the role of sub-differentials in determining features for the worst-case risk measure is clear. We now expose a Corollary that collects facts regarding sub-differentials of specific for the setup in this section.
Corollary 3.
In the conditions and notations of 4, we have the following for any :
-
(i)
.
-
(ii)
and are GΓ’teaux differentiable at if and only if the derivative coincides.
-
(iii)
if , .
-
(iv)
if , then for any , if and only if .
-
(v)
if , then .
Proof.
For (i), since , by 4 we have that
|
|
|
Thus, .
Concerning (ii), the if part is trivial. For the only if, by 4 we have that . If both and are GΓ’teaux differentiable at , then both sub-differential sets are singletons. Thus, the derivative of and is .
For (iii), let and . We then have that
|
|
|
|
|
|
|
|
|
|
|
|
Thus, . Hence, .
For (iv), let and . Since we have that
|
|
|
|
|
|
|
|
|
|
|
|
We then get that
|
|
|
Thus, . Hence, by the definition of we must to have .
The claim for (v) is trivial since, in this case, .
β
We now expose some concrete examples for closed-form expressions under 4. As in the last section, we consider both risk measures that already appear in the literature of worst-case under uncertainty over closed balls of the Wasserstein metric and risk measures for which closed-form solutions are a novelty.
Example 2.
-
(i)
For spectral risk measures, given in terms of the spectral map as , Liu etΒ al., (2022) obtains the following formulation
|
|
|
We recover such results in our approach as follows. Fix . We have that
|
|
|
for any . Thus, as in the proof of 3, we have that for any . Hence, we obtain the closed form as
|
|
|
For the particular case of ES, we then obtain that
|
|
|
-
(ii)
A special case of the literature is studied in Bartl etΒ al., (2020), where it is investigated the worst-case of optimized certainty equivalents (OCE) and shortfall risks (SR). SR was exposed in 1 and the OCE is a map defined as
|
|
|
where is the loss function as for the SR. See Ben-Tal and Teboulle, (2007) for details on such maps. These authors obtain a robust formulation as
|
|
|
where is a transform defined as
|
|
|
This is in consonance with our approach since, in our case, the infimum is taken over q-norms of elements in the sub-differential of and . We now show that this coincides with our result. We show for OCE over . The claims for SR or follow similarly. By Theorem 4.122 of Follmer and Schied, (2016) or Theorem 4.2 in Ben-Tal and Teboulle, (2007), we have that is represented over , where is the convex conjugate of . This penalty term based on conjugate is sometimes called divergence between and . Further, for each , we have by calculation that . We then obtain the following:
|
|
|
|
|
|
|
|
|
|
|
|
The second to last equation holds since for any , there is with but , which implies .
-
(iii)
For this example, we study again the risk measure induced by expectiles (Exp). It is GΓ’teaux differentiable at any with derivative defined as
|
|
|
Thus, under 3, we have that
|
|
|
where is defined as . Direct calculation shows that this value is equals to if and only if . In which case we obtain that and
|
|
|
This closed form aligns with Theorem 2 in Hu etΒ al., (2024).
Bellini and Di Bernardino, (2017) points out that under some conditions on the map , we have that for any . Under this circumstances, we have that
|
|
|
This can also be interpreted as a worst-case formula for VaR.
-
(iv)
Consider the MSD again. For any , we have the derivative defined as
|
|
|
We then have that for any . Hence, in light of 4, we get that
|
|
|
-
(v)
The Entropic risk measure (ENT) is a map that depends on the userβs risk aversion through the exponential utility function. It is the prime example of a law invariant convex risk measure that is not coherent. Formally, it is the map defined as
|
|
|
Its penalty is the relative entropy as
|
|
|
This risk measure is GΓ’teaux differentiable for any with . Thus, by 4 we have for any that
|
|
|
For a particular case when such that follows a Normal distribution, i.e. , we have that is log-normally distributed. By recalling that , we have by direct calculation that
|
|
|
Hence, we obtain
|
|
|