On the free boundary of an annuity purchase

De Angelis, Tiziano; Stabile, Gabriele

doi:10.1007/s00780-018-00379-8

On the free boundary of an annuity purchase

Open access
Published: 19 December 2018

Volume 23, pages 97–137, (2019)
Cite this article

Download PDF

You have full access to this open access article

Finance and Stochastics Aims and scope Submit manuscript

On the free boundary of an annuity purchase

Download PDF

Tiziano De Angelis¹ &
Gabriele Stabile²

2621 Accesses
9 Citations
Explore all metrics

Abstract

It is known that the decision to purchase an annuity may be associated to an optimal stopping problem. However, little is known about optimal strategies if the mortality force is a generic function of time and the subjective life expectancy of the investor differs from the objective one adopted by insurance companies to price annuities. In this paper, we address this problem by considering an individual who invests in a fund and has the option to convert the fund’s value into an annuity at any time. We formulate the problem as a real option and perform a detailed probabilistic study of the optimal stopping boundary. Due to the generic time-dependence of the mortality force, our optimal stopping problem requires new solution methods to deal with nonmonotonic optimal boundaries.

Risk-minimization for life insurance liabilities with basis risk

Article 05 September 2015

The life care annuity: enhancing product features and refining pricing methods

Article Open access 10 July 2024

Annuity contract valuation under dependent risks

Article 16 May 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In an ageing world, an accurate management of retirement wealth is crucial for financial well-being. It is important for working individuals to carefully consider the existing offer of financial and insurance products designed for retirement, beyond the state pension. This offer includes for example occupational pension funds and tax-advantaged retirement accounts (e.g. Individual Retirement Account (US)). Most of these products rely on annuities to turn retirement wealth into guaranteed lifetime retirement income. Life annuities provide a lifelong stream of guaranteed income in exchange for a (single or periodic) premium. The purchase of an annuity helps individuals to manage the longevity risk, i.e., the risk of outliving their financial wealth, but it is usually an irreversible transaction. In fact, most annuity contracts impose steep penalties for partial or complete cancellation by the policyholder, especially in the early years of the contract.

Timing an annuity purchase (so-called annuitisation) is a complex financial decision that depends on several risk factors as e.g. market risk, longevity risk, potential future need of liquid funds, and bequest motive. The study of this topic has motivated a whole research field since the seminal contribution of Yaari [19], who showed that individuals with no bequest motive should convert all their retirement wealth into annuities.

After Yaari, several authors have analysed the annuitisation decision under the so-called all-or-nothing institutional arrangement, where a lifetime annuity is purchased in a single transaction (as opposed to gradual annuitisation). Initially, an individual’s wealth is invested in the financial market, and at the time of an annuity purchase, it is converted into a lifetime annuity. The central idea in this literature is to compare the value deriving from an immediate annuitisation with the value of deferring it while investing in the financial market. Therefore, a strict analogy holds with the problem of exercising an American option, and the annuitisation decision can be considered as the exercise of a real option.

Milevsky [12] proposed a model where an individual defers annuitisation for as long as the financial investment’s returns guarantee a consumption flow which is at least equal to the one provided by the annuity payments. In particular, [12] adopts a criterion based on controlling the probability of a consumption shortfall.

Other papers study the optimal annuitisation time in the context of utility maximisation, and formulate the problem as one of optimal stopping and control. The investor aims at maximising the expected utility of consumption (pre-retirement) and of annuity payments (post-retirement).

Assuming a constant force of mortality and CRRA utility, Stabile [18] analytically solves a time-homogeneous optimal stopping problem. He proves that if the individual has the same degree of risk aversion before and after the annuitisation, then an annuity is purchased either immediately or never (the so-called now-or-never policy). On the other hand, if the individual is more risk averse during the annuity payout phase, the annuity is purchased as soon as the wealth falls below a constant threshold (the optimal stopping boundary).

A constant force of mortality is also assumed in Gerrard et al. [7] and Liang et al. [11]. The model in [7] is analogous to the one studied in [18], but with quadratic utility functions, and the authors find a closed-form solution: If $(X_{t})_{t\ge 0}$ represents the individual’s wealth process, then it is optimal to stop when $X$ leaves a specific interval (hence the optimal stopping boundary is formed by the endpoints of such an interval). In [11], in contrast to the previous papers, the authors assume that the individual may continue to invest and consume after annuitisation. By using martingale methods, explicit solutions are provided in the case of CRRA utility functions. Contrarily to [7], the optimal annuitisation in [11] occurs when the wealth process enters a specific interval, whose endpoints form the optimal stopping boundary.

Assuming a time-dependent force of mortality, Milevsky and Young [13] analyse both the all-or-nothing market and the more general anything-anytime market, where gradual annuitisation strategies are allowed. For the all-or-nothing market, they find that the optimal annuitisation time is deterministic as an artifact of CRRA utility. Thus, the annuitisation decision is independent from the individual’s wealth.

Our work is more closely related to work by Hainaut and Deelstra [9]. They consider an individual whose retirement wealth is invested in a financial fund which eventually must be converted into an annuity. The fund is modelled by a jump-diffusion process and pays dividends at a constant rate. The mortality force is a time-dependent, deterministic function and the individual aims at maximising the market value of future cashflows before and after annuitisation. According to the insurance practice, it is assumed that the individual can only purchase the annuity by a given maximal age. The authors in [9] cast the problem as an optimal stopping one and write a variational inequality for the value function. They then use the Wiener–Hopf factorisation and a time stepping method to solve the variational problem numerically. Hainaut and Deelstra argue that the decision to purchase the annuity should be triggered by either an upper or a lower, time-dependent threshold in the time-wealth plane. Numerical examples are provided in [9] where the annuitisation occurs when the value of the financial fund is high enough or, alternatively, low enough.

In this paper, we perform a detailed mathematical study of the optimal stopping problem associated to an annuitisation decision similar to that considered in [9]. In the interest of a rigorous analysis of the optimal stopping boundary, we simplify the dynamics of the financial fund by considering a geometric Brownian motion with no jumps. As in [9], we look at the maximisation of future expected cashflows for an individual who joins the fund and has the opportunity to purchase an annuity on a time horizon $[0,T]$. Time 0 is the time when the individual joins the fund and time $T$ is the time by which the individual reaches the maximal age for an annuity purchase. The present value of future expected cashflows, evaluated at the optimum, gives us the so-called value function $V$.

Notice that a closer inspection of the problem formulation in (2.4) below shows that at time $T$, the fund is converted into an annuity (the same occurs in [9]). This means that the individual will eventually purchase the annuity at time $T$, but she also has an option to buy it earlier. One could think of this feature as part of the fund’s contract specifications or as a commitment of the investor at time 0. It is, however, important to remark that the methods developed in this paper apply also to the case $T=+\infty $, up to some minor changes (see also Remark 3.3 for further details).

One of the key features of the model presented here is the use of a rather general time-dependent, deterministic mortality force. This is a realistic assumption commonly made in the actuarial profession. As in [13], we consider two different mortality forces: a subjective one, used by the individual to weigh the future cashflows (denoted $\mu ^{S}$), and an objective one, used by the insurance company to price the annuity (denoted $\mu ^{O}$). The interplay between these two different mortality forces contributes to some key qualitative aspects of the optimal annuitisation decision (see Sect. 5 for more details). Interestingly, the generic time-dependent structure of the mortality force constitutes also the major technical challenge in the mathematical study of the problem.

On the one hand, standard optimal stopping results ensure that the time–wealth plane splits into a continuation region $\mathcal{C}$, where the option to wait has strictly positive value, and a stopping region $\mathcal{S}$, where the annuity should be immediately purchased. Denoting by $(X_{t})_{t\ge 0}$ the process that represents the fund’s value (or equally, the individual’s retirement wealth), an optimal stopping rule is given by stopping at the first time the two-dimensional process $(t,X_{t})_{t\ge 0}$ enters the set $\mathcal{S}$. Moreover, under some mild technical assumptions, we prove in Proposition 4.2 that these two sets are split by an optimal boundary (free boundary, in the language of PDEs) which only depends on time, i.e., $t\mapsto b(t)$.

On the other hand, technical difficulties arise when trying to infer properties of the boundary $b$. In fact, due to the generic time-dependence of the mortality force, it is not possible to establish any monotonicity of the mapping $t\mapsto b(t)$. It is well known in optimal stopping and free boundary theory that monotonicity of $b$ is the key to a rigorous study of the regularity of the boundary (e.g. continuity) and of the value function (e.g. continuous differentiability). The interested reader may consult [15, Chaps. VII and VIII] for a collection of relevant examples, and the introduction in [5] for a deeper discussion.

We overcome this major technical hurdle by proving that the optimal boundary is in fact a locally Lipschitz-continuous function of time. In order to achieve this goal, we rely only on probabilistic methods which are new and specifically designed to tackle our problem. This approach draws from similar ideas in [5], but we emphasise that our problem falls outside the class of problems addressed in that paper (see the discussion prior to Theorem 4.8 below).

Once Lipschitz regularity is proved, we then obtain also that the value function $V$ is continuously differentiable in $t$ and $x$, at all points of the $(t,x)$-plane and in particular across the boundary of $\mathcal{C}$. This is a stronger result than the more usual smooth-fit condition, which states that $z\mapsto V_{x}(t,z)$ is continuous across the optimal boundary. Finally, we find nonlinear integral equations that characterise uniquely the free boundary and the value function.

The analysis in this paper is completed by solving numerically the integral equation for some specific examples and studying their sensitivity to variations in the model’s parameters. It is important to remark that the optimal boundary turns out to be nonmonotonic in some of our examples, under natural assumptions on the parameters. This shows that the new approach developed in this paper is indeed necessary to study the annuitisation problem.

In summary, our contribution is at least twofold. On the one hand, we add to the literature concerning annuitisation problems in the all-or-nothing framework by addressing models with time-dependent mortality force. As we have discussed above and to the best of our knowledge, such models were only considered in [13] (which produces only deterministic optimal strategies) and in [9] (mostly in a numerical way). We provide a rigorous theoretical analysis of the optimal annuitisation strategy in terms of the optimal boundary $b$. Our study also reveals behaviours not captured by [9] as e.g. lack of monotonicity of $b$. The latter may reflect the change over time in the investor’s priorities, due to (deterministic) variations in the mortality force. On the other hand, it is rather remarkable that we started by considering an applied problem, with a somewhat canonical and seemingly innocuous formulation, but we soon realised that its rigorous analysis is far from trivial. Therefore we developed methods which are new in the probabilistic literature on optimal stopping and of independent interest.

Finally, in order to relate our work to the PDE literature in this area, it may be worth noticing that [6] (and later [2]) studies a free boundary problem motivated by optimal retirement. In that paper, an investor can decide to retire earlier than a given terminal time $T$. Early retirement benefits are defined by a function $\varPsi (t,s)$ of time $t$ and the current salary $s$. The problem is addressed exclusively with variational inequalities, and the free boundary depends on time since $t\mapsto \varPsi (t,s)$ increases linearly. However, contrarily to our model, the mortality force in [6] and [2] is assumed constant.

The rest of the paper is organised as follows. In Sect. 2, we introduce the financial and actuarial assumptions and then the optimal annuitisation problem. In Sect. 3, we provide some continuity properties of the value function and useful probabilistic bounds on its gradient. In Sect. 4, we present sufficient conditions under which the shape of the continuation and stopping regions can be established, and we study the regularity of the optimal boundary. Moreover, we find nonlinear integral equations that characterise uniquely the free boundary and the value function. In Sect. 5, we present some numerical examples to illustrate the range of applicability of our assumptions. In Sect. 6, we provide some final remarks and extensions.

2 Problem formulation

In our model, we consider an individual (or investor) and an insurance company who are faced with two distinct sources of randomness: a financial market and the survival probability of the individual. We assume that the individual and the insurance company have different beliefs about the demographic risk, while they share the same views on the financial market. It is therefore convenient to construct initially two probability spaces: one that models the financial market and another that models the demographic component. The time horizon of the problem is fixed and denoted by $T<+\infty $.

2.1 Financial and demographic models

For the financial market, we consider a complete probability space $(\varOmega ,\mathcal{F},\mathbb{P})$ carrying a 1-dimensional Brownian motion $(B_{t})_{t\ge 0}$. The filtration generated by $B$ is denoted by $(\mathcal{F}_{t})_{t\ge 0}$, and it is augmented with ℙ-nullsets. The portion of the individual’s wealth allocated for an annuity purchase and invested in a financial fund^{Footnote 1} prior to the annuitisation is modelled by a stochastic process $(X_{t})_{t\ge 0}$. Its dynamics reads

$$\begin{aligned} dX_{t}^{x} = (\theta -\alpha )X_{t}^{x} \,dt+\sigma X_{t}^{x} \,dB_{t}, \qquad X_{0}^{x} = x> 0, \end{aligned}$$

(2.1)

where $\theta $ is the average continuous return of the financial investment, $\alpha $ is the constant dividend rate and $\sigma >0$ is the volatility coefficient.

For the demographic risk, we consider another probability space. Given a measurable space $(\varOmega ',\mathcal{F}')$, we let $\mathbb{Q}^{S}$ and $\mathbb{Q}^{O}$ denote two probability measures on $(\varOmega ', \mathcal{F}')$ and assume that $(\varOmega ',\mathcal{F}',\mathbb{Q}^{i})$, $i=S,O$, are both complete. The measure $\mathbb{Q}^{S}$ is associated with the subjective survival probability of the individual. In contrast, $\mathbb{Q}^{O}$ refers to the objective survival probability used by the insurance company to price annuities, and it is public information.

The individual is aged $\eta >0$ at time zero in our problem, and this value is given and fixed throughout the paper. The time of death of the individual is represented by a random variable $\varGamma _{D}:(\varOmega ', \mathcal{F}')\to (\mathbb{R}_{+},\mathcal{B}(\mathbb{R}_{+}))$, and for $i=S,O$, we define the hazard functions

$$ _{s} p^{i}_{\eta +t}:=\mathbb{Q}^{i}[\varGamma _{D}>\eta +t+s\,|\,\varGamma _{D}>\eta +t] $$

with $s,t\ge 0$. These represent the subjective/objective probability that an individual who is alive at age $\eta +t$ will survive to age $\eta +t+s$ (we follow standard actuarial notation for ${}_{s} p^{i} _{\eta +t}$). Let $\mu ^{i}:[0,+\infty )\to [0,+\infty )$ for $i = S,O$ be deterministic functions, representing the subjective and objective mortality forces, respectively. Then for $i=S,O$, we have

$$\begin{aligned} _{s} p^{i}_{\eta +t}=\exp \bigg( -\int _{0}^{s}\mu ^{i}(\eta +t+u)\,du \bigg) \quad \text{for $t,s\ge 0$}. \end{aligned}$$

(2.2)

The different survival probability functions adopted by insurer and individual account for the imperfect information available to the insurer on the individual’s risk profile.

Finally, we say that $\mathcal{M}^{S}:=(\varOmega \times \varOmega ', \mathcal{F}\otimes \mathcal{F}',\mathbb{P}\times \mathbb{Q}^{S})$ is the probability space for the individual and $\mathcal{M}^{O}:=(\varOmega \times \varOmega ',\mathcal{F}\otimes \mathcal{F}',\mathbb{P}\times \mathbb{Q}^{O})$ the probability space for the insurance company.

Remark 1

The functions $\mu ^{S}$ and $\mu ^{O}$ are given at the outset and are not updated during the optimisation. Updating in a nontrivial way would require the use of a stochastic dynamics for the mortality force which in general would lead to a more complex problem.

2.2 The optimisation problem

The insurance company uses its probabilistic model, based on objective survival probabilities, to price annuities. In particular, according to standard actuarial theory, the value at time $t>0$ of a life annuity that is payable continuously at a rate of one monetary unit per year (purchased by the individual aged $\eta +t$) is given by

$$ a^{O}_{\eta +t}= \int _{0}^{\infty } e^{-\widehat{\rho }u} {}_{u} p ^{O}_{\eta +t} \,du. $$

Here $\widehat{\rho }>0$ is a constant interest rate guaranteed by the insurer.

In our model, the fund is automatically converted into an annuity at time $T$, but the individual has the option to annuitise prior to $T$. If she decides to annuitise at a time $t\in [0,T]$, with the fund’s value equal to $X$, then the annuity payout rate is constant and reads

$$ P_{\eta +t}=\frac{X-K}{a^{O}_{\eta +t}}, $$

(2.3)

where the constant $K$ is either a fixed acquisition fee ($K>0$) or a tax incentive ($K<0$). The case $K=0$ leads to trivial solutions as explained in Remark 3.2 below. From the modelling point of view, $T<+\infty $ reflects the fact that insurance companies typically have a maximum age limit for the purchase of an annuity (this is noticed also in [9]).

The optimisation criterion pursued by the individual is the maximisation of the present value of future expected cashflows, via the optimal timing of the annuity purchase under the model $\mathcal{M}^{S}$. Letting $\mathbb{E}^{S}[\cdot ]$ be the expectation under the measure $\mathbb{P}\times \mathbb{Q}^{S}$, if the individual is alive at time $t$, the optimisation problem reads

(2.4)

where $\mathcal{T}_{t,T}$ is the set of $(\mathcal{F}_{s})_{s\ge 0}$-stopping times taking values in $[t,T]$ and $\rho >0$ is a discount rate. Before annuitisation, i.e., for $s<\tau $, the individual receives dividends from the fund at rate $\alpha $. After annuitisation, i.e., for $s>\tau $, she gets the continuous annuity payment at the constant annual rate $P_{\eta + \tau }$. In case the individual dies before the time of the annuity purchase, i.e., on the event $\{\varGamma _{D} \leq \tau \}$, she leaves a bequest equal to her wealth.

Remark 2

Thanks to a result in [1], we show in the Appendix that there is no loss of generality in using stopping times from $\mathcal{T} _{t,T}$. That is, we obtain the same value in (2.4) as if we were using stopping times of the enlarged filtration $(\mathcal{G} _{t})_{t\ge 0}$, where $\mathcal{G}_{t}=\mathcal{F}_{t}\vee \sigma ( \{\varGamma _{D}>s\},0\le s\le t)$.

Due to the assumed independence between the demographic uncertainty and the fund’s returns (i.e., $\varGamma _{D}$ being independent of $(B_{t})_{t\ge 0}$) and since the optimisation is over $( \mathcal{F}_{t})_{t\ge 0}$-stopping times, the value function can be rewritten by using Fubini’s theorem and (2.2) as

$$\begin{aligned} V_{t}=\mathop{\mathrm{ess\,sup}}_{\tau \in \mathcal{T}_{t,T} } \mathbb{E}\bigg[ \int _{t}^{\tau } e^{-\int _{t}^{s}r(u) \,du}\beta (s) X _{s} \,ds+e^{-\int _{t}^{\tau }r(u)\,du}G(\tau ,X_{\tau })\bigg| \mathcal{F}_{t}\bigg] \end{aligned}$$

(2.5)

where $\mathbb{E}[\cdot \,]$ is the expectation under ℙ, $r(t):=\rho +\mu ^{S}(\eta +t)$, $\beta (t):=\alpha +\mu ^{S}(\eta +t)$, $G(t,x)=f(t)(x-K)$ and

$$\begin{aligned} f(t)=\frac{a^{S}_{\eta +t}}{a^{O}_{\eta +t}}. \end{aligned}$$

(2.6)

Here $a^{S}_{\eta +t}$ is the individual’s subjective valuation of the annuity, i.e.,

$$ a^{S}_{\eta +t}=\int _{0}^{\infty } e^{- \rho u} {}_{u} p^{S}_{\eta +t} \,du. $$

The function $f(\cdot )$ in (2.6) is the so-called “money’s worth”.

Since we are in a Markovian setting, we have $\mathbb{E}[\cdot \,| \mathcal{F}_{t}]=\mathbb{E}[\cdot \,|X_{t}]$. In particular, if $X_{t}=x>0$ ℙ-a.s., we find it convenient to use the notation

$$ \mathbb{E}_{t,x}[\cdot \,]:=\mathbb{E}[\cdot \,|X_{t}=x]=\mathbb{E}[ \cdot \,|\mathcal{F}_{t}]. $$

Moreover, the process $X$ is time-homogeneous so that

$$ \mathrm{Law}\big((u,X_{u})_{u\ge t}\big|X_{t}=x\big)=\mathrm{Law} \big((t+s,X_{s})_{s\ge 0}\big|X_{0}=x\big). $$

Using the above notations, for any given $(t,x)\in [0,T]\times (0,+ \infty )$, we can rewrite (2.5) as

$$\begin{aligned} V(t,x) &=\sup _{0 \leq \tau \leq T-t} \mathbb{E}\bigg[ \int _{0}^{ \tau } e^{-\int _{0}^{s}r(t+u) \,du}\beta (t+s) X_{s}^{x} \,ds \\ & \phantom{=:\sup _{0 \leq \tau \leq T-t} \mathbb{E}\bigg[}+e^{-\int _{0}^{\tau }r(t+u)\,du}G(t+\tau ,X^{x}_{\tau })\bigg], \end{aligned}$$

(2.7)

where we also write $s_{1}\le \tau \le s_{2}$ for $\tau \in \mathcal{T}_{s_{1},s_{2}}$ (this should cause no confusion because all stopping times in this paper belong to $\mathcal{T}_{s_{1},s_{2}}$ for some $s_{1}\le s_{2}$).

We notice that the state process in our problem formulation (2.7) is a time–space Markov process $(Y_{s})_{s\in [0,T-t]}$ defined by $Y_{0}=(t,x)$ and $Y_{s}:=(t+s,X^{x}_{s})$, $s \in (0,T-t]$.

2.3 The variational problem

Before closing this section, we introduce the variational problem naturally associated to (2.7). Let ℒ be the second order differential operator associated to the diffusion (2.1), i.e.,

$$ (\mathcal{L}F)(x)=(\theta -\alpha )xF_{x}(x)+ \frac{\sigma ^{2} x^{2}}{2} F_{xx}(x) \quad \text{for $F\in C^{2}(\mathbb{R}_{+})$}. $$

Assuming for a moment that $V$ is regular enough, by applying the dynamic programming principle and Itô’s formula, we expect that the value function should satisfy the following variational inequality: for $(t,x)\in (0,T)\times \mathbb{R}_{+}$,

$$ \textrm{max} \big\{ \big(V_{t}+\mathcal{L}V-r(\cdot )V\big)(t,x)+ \beta (t)x, G(t,x)-V(t,x) \big\} = 0, $$

(2.8)

with terminal condition $V(T,x)=G(T,x)$, $x\in \mathbb{R}_{+}$. In the rest of the paper, we show that (2.8) holds in the a.e. sense with $V \in C^{1}([0,T) \times \mathbb{R}_{+})\cap C([0,T] \times \mathbb{R}_{+})$ and $V_{xx}\in L^{\infty }_{\mathrm{loc}}([0,T) \times \mathbb{R}_{+})$. Moreover, we study the geometry of the set where $V=G$, i.e., the so-called stopping region.

3 Properties of the value function

In this section, we provide some continuity properties of the value function and useful probabilistic bounds on its gradient. In what follows, given a set $A\subseteq [0,T]\times \mathbb{R}_{+}$, we sometimes write $A\cap \{t< T\}:=A\cap ([0,T)\times \mathbb{R}_{+})$. Also, we make the next standing assumption in the rest of the paper.

Assumption 1

$\mu ^{S}(\cdot )$ and $\mu ^{O}(\cdot )$ are continuously differentiable on $[0,+\infty )$.

To study the optimisation problem (2.7), we find it convenient to introduce the function

$$ v(t,x)= V(t,x)-G(t,x), \quad \text{$(t,x)\in [0,T]\times \mathbb{R}_{+}$}, $$

(3.1)

which may be financially understood as the value of the option to delay the annuity purchase.

We can easily compute

$$\begin{aligned} H(t,x):=\big(G_{t}+\mathcal{L} G-r(\cdot )G\big)(t,x)+\beta (t)x=g(t)x+K \ell (t), \end{aligned}$$

(3.2)

where

$$\begin{aligned} g(t) &:= f'(t)+\beta (t)\big(1- f(t)\big)+ (\theta - \rho )f(t), \\ \ell (t) &:= r(t)f(t)- f'(t). \end{aligned}$$

(3.3)

An application of Itô’s formula gives

$$\begin{aligned} &\mathbb{E}\big[ e^{-\int _{0}^{\tau }r(t+u)\,du}G(t+\tau ,X_{\tau }^{x}) \big] \\ &=G(t,x)+\mathbb{E}\bigg[ \int _{0}^{\tau }e^{-\int _{0}^{s}r(t+u)\,du} \big(H(t+s,X^{x}_{s})-\beta (t+s)X^{x}_{s}\big) \,ds\bigg], \end{aligned}$$

and therefore it is straightforward to verify (see (2.7)) that

$$ v(t,x)=\sup _{0 \leq \tau \leq T-t} \mathbb{E}\bigg[ \int _{0}^{\tau }e ^{-\int _{0}^{s}r(t+u)\,du}H(t+s,X_{s}^{x}) \,ds\bigg]. $$

(3.4)

Notice that (3.4) includes a deterministic discount rate which is not time-homogeneous. Optimal stopping problems of this kind are relatively rare in the literature. They feature technical difficulties which are more conveniently handled by considering a discounted version of the problem. Hence we introduce

$$\begin{aligned} w(t,x) &:= e^{-\int _{0}^{t} r(s)\,ds}v(t,x) \\ & \phantom{:}=\sup _{0 \leq \tau \leq T-t} \mathbb{E}\bigg[ \int _{0}^{ \tau }e^{-\int _{0}^{t+s}r(u)\,du}H(t+s,X_{s}^{x}) \,ds\bigg]. \end{aligned}$$

(3.5)

Since the problem for $w$ is equivalent to the one for $v$ and $V$, we focus from now on on the analysis of (3.5).

From (2.7), it is clear that $V(t,x) \geq G(t,x)$ for all $(t,x)\in [0,T]\times \mathbb{R}_{+}$ so that $w$ is nonnegative. Moreover, it is straightforward to check that $w(t,x)$ is finite for all $(t,x)\in [0,T]\times \mathbb{R}_{+}$, thanks to well-known properties of $X$ and to Assumption 3.1.

As usual in optimal stopping theory, we let

$$\begin{aligned} \mathcal{C} &= \{ (t,x)\in [0,T]\times \mathbb{R}_{+}: w(t,x)>0 \}, \\ \mathcal{S} &= \{ (t,x)\in [0,T]\times \mathbb{R}_{+}: w(t,x)=0 \} \end{aligned}$$

be the so-called continuation and stopping regions, respectively. We denote by $\partial \mathcal{C}$ the boundary of the set $\mathcal{C}$ and introduce the first entry time of $(t+\cdot ,X^{x}_{\cdot })$ into $\mathcal{S}$, i.e.,

$$ \tau _{*}(t,x):= \inf \{ s \in [0,T-t] : (t+s,X_{s}^{x}) \in \mathcal{S} \} $$

(3.6)

for $(t,x)\in [0,T]\times \mathbb{R}_{+}$.

Since $(t,x)\mapsto H(t,x)$ is continuous, it is not difficult to see that for any fixed stopping time $\widetilde{\tau }\ge 0$, setting $\tau :=\widetilde{\tau }\wedge (T-t)$, the map

$$\begin{aligned} (t,x)\mapsto \mathbb{E}\bigg[ \int _{0}^{\tau }e^{-\int _{0}^{t+s}r(u)\,du}H(t+s,X ^{x}_{s}) \,ds\bigg] \end{aligned}$$

is continuous as well. It follows that $w$ is lower semi-continuous and therefore $\mathcal{C}$ is open and $\mathcal{S}$ is closed. Moreover, the finiteness of $w$ and standard optimal stopping results (see [15, Corollary I.2.9]) guarantee that (3.6) is optimal for $w(t,x)$.

For future frequent use, we introduce here a new probability measure $\widetilde{\mathbb{P}}$ on $\mathcal{F}_{T}$ defined by its Radon–Nikodým derivative

$$ Z_{T}:=\frac{d\,\widetilde{\mathbb{P}}}{d\,\mathbb{P}}\bigg|_{ \mathcal{F}_{T}}=\exp \bigg(\sigma B_{T}-\frac{\sigma ^{2}}{2} T \bigg) $$

(3.7)

and notice that

$$\begin{aligned} X_{t}^{x}=x\,Z_{t}\, e^{(\theta -\alpha )t}, \quad t\in [0,T]. \end{aligned}$$

(3.8)

It is well known that ℙ and $\widetilde{\mathbb{P}}$ are equivalent on $\mathcal{F}_{t}$ for all $t\in [0,T]$.

Remark 2

If $K=0$ in (2.3), problem (3.5) reduces to a deterministic problem. Noticing that

because $\{\tau >s\}$ is $\mathcal{F}_{s}$-measurable, and thanks to Fubini’s theorem, one has

Then, using $Z_{\tau }$ to change the measure (cf. (3.7)), we obtain

$$ w(t,x)=x \, \sup _{0 \leq \tau \leq T-t} \widetilde{\mathbb{E}}\bigg[ \int _{0}^{\tau }e^{-\int _{0}^{t+s}r(u)\,du}g(t+s)e^{(\theta -\alpha )s} \,ds\bigg]. $$

The latter is equivalent to the deterministic problem of maximising the function

$$ F(t+\cdot \,):=\int _{0}^{\,\cdot }e^{-\int _{0}^{t+s}r(u)\,du}g(t+s)e ^{(\theta -\alpha )s} \,ds. $$

As a result, the optimal annuitisation time only depends on $t$ (as in [13]).

Remark 3

If we allow $T=+\infty $ and assume that

$$ \mathbb{E}\left [\int _{0}^{\infty }e^{-\int _{0}^{t}r(u)\,du}|H(t,X_{t})|\,dt\right ]< + \infty , $$

then our problem (3.5) remains well posed. We notice that the finite horizon $T$ only features in (3.5) as part of the definition of the admissible stopping times. Then it is intuitively clear that the major mathematical challenges related to the time-dependence of (3.5) arise from the properties of the map $t\mapsto e^{-\int _{0}^{t}r(u)\,du}H(t,x)$. Such properties remain the same for $T = +\infty $; hence the analysis presented below is also informative for the study of the case $T = +\infty $.

The next proposition starts to analyse the regularity of $w$ and provides a probabilistic characterisation for its gradient which is crucial for our subsequent analysis of the boundary of $\mathcal{C}$.

Proposition 4

The value function$w$is convex in$x$for each$t \in [0,T]$and locally Lipschitz-continuous on$[0,T] \times \mathbb{R}_{+}$. Moreover, for almost every$(t,x) \in [0,T] \times \mathbb{R}_{+}$, we have

$$\begin{aligned} w_{x}(t,x)=\widetilde{\mathbb{E}}\bigg[ \int _{0}^{\tau _{*}}e^{-\int _{0}^{t+s}r(u)\,du} g(t+s) e^{(\theta -\alpha )s} \,ds\bigg] \end{aligned}$$

(3.9)

and there exists a constant$C > 0$, independent of$(t,x)$, such that

$$\begin{aligned} -C\left (1+\frac{1}{T-t}\right )(x\, \widetilde{\mathbb{E}}[\tau _{*}]+ \mathbb{E}[\tau _{*}]) \leq w_{t}(t,x) \leq C (x\, \widetilde{\mathbb{E}}[\tau _{*}]+ \mathbb{E}[\tau _{*}]). \end{aligned}$$

(3.10)

Proof

The proof is divided into several steps.

Step 1 (convexity). Since $x\mapsto e^{-\int _{0}^{t} r(u)\,du}H(t,x)$ is linear, it is not difficult to show that for $x,y\in \mathbb{R}_{+}$, $\lambda \in (0,1)$ and $x_{\lambda }:= \lambda x+(1-\lambda )y$, we have

$$\begin{aligned} &\mathbb{E}\left [\int ^{\tau }_{0}e^{-\int _{0}^{t+s} r(u)\,du}H(t+s,X ^{x_{\lambda }}_{s})\,ds\right ] \\ &=\lambda \mathbb{E}\bigg[\int ^{\tau }_{0}e^{-\int _{0}^{t+s} r(u)\,du}H(t+s,X ^{x}_{s})\,ds\bigg] \\ & \phantom{=:}+(1-\lambda )\mathbb{E}\left [\int ^{\tau }_{0}e^{-\int _{0}^{t+s} r(u)\,du}H(t+s,X^{y}_{s})\,ds\right ] \\ &\le \lambda w(t,x)+(1-\lambda )w(t,y) \end{aligned}$$

for any stopping time $\tau $. Taking the supremum over $\tau \in [0,T-t]$, the claim follows.

Step 2 (Lipschitz-continuity). Fix $(t,x)\in [0,T]\times \mathbb{R}_{+}$ and pick $\varepsilon >0$. First we show that

$$\begin{aligned} |w(t,x\pm \varepsilon )-w(t,x)|\le c\,\varepsilon , \end{aligned}$$

(3.11)

with $c>0$ independent of $(t,x)$. Let $\tau _{*}=\tau _{*}(t,x)$ be optimal in $w(t,x)$, hence admissible and suboptimal in $w(t,x+\varepsilon )$, so that we have

$$\begin{aligned} &w(t,x+\varepsilon )-w(t,x) \\ &\geq \mathbb{E}\bigg[ \int _{0}^{\tau _{*}} e^{-\int _{0}^{t+s}r(u)\,du} \left (H(t+s,X_{s}^{x+\varepsilon }) -H(t+s,X_{s}^{x})\right ) \,ds \bigg] \\ &= \varepsilon \mathbb{E}\bigg[ \int _{0}^{\tau _{*}} e^{-\int _{0}^{t+s}r(u)\,du} g(t+s) \frac{X_{s}^{x+\varepsilon }-X_{s} ^{x}}{\varepsilon } \,ds\bigg] \\ &= \varepsilon \mathbb{E}\bigg[ \int _{0}^{\tau _{*}} e^{-\int _{0}^{t+s}r(u)\,du} g(t+s) X^{1}_{s} \,ds\bigg] \\ &= \varepsilon \widetilde{\mathbb{E}}\bigg[ \int _{0}^{\tau _{*}} e ^{-\int _{0}^{t+s}r(u)\,du} g(t+s)e^{(\theta -\alpha )s} \,ds\bigg], \end{aligned}$$

(3.12)

where we used (3.8) for the last equality. For the upper bound, we repeat the above argument with $\tau _{\varepsilon }^{+}:=\tau _{*}(t,x+ \varepsilon )$ optimal for $w(t,x+\varepsilon )$ and find

$$\begin{aligned} w(t,x+\varepsilon )-w(t,x) & \leq \varepsilon \widetilde{\mathbb{E}} \bigg[ \int _{0}^{\tau _{\varepsilon }^{+}}e^{-\int _{0}^{t+s}r(u)\,du} g(t+s) e^{(\theta -\alpha )s} \,ds\bigg]. \end{aligned}$$

Since $\tau _{*}$ and $\tau ^{+}_{\varepsilon }$ are smaller than $T-t$, we have $|w(t,x+\varepsilon )-w(t,x)|\leq c\, \varepsilon $ for a suitable $c>0$ independent of $(t,x)$. By applying symmetric arguments, we can also prove that $|w(t,x-\varepsilon )-w(t,x)|\leq c\, \varepsilon $ so that (3.11) holds. For future reference, we also notice that

$$\begin{aligned} &w(t,x)-w(t,x-\varepsilon ) \\ &\leq \mathbb{E}\bigg[ \int _{0}^{\tau _{*}} e^{-\int _{0}^{t+s}r(u)\,du} \left (H(t+s,X_{s}^{x}) -H(t+s,X_{s}^{x-\varepsilon })\right ) \,ds \bigg] \\ &= \varepsilon \widetilde{\mathbb{E}}\bigg[ \int _{0}^{\tau _{*}} e ^{-\int _{0}^{t+s}r(u)\,du} g(t+s)e^{(\theta -\alpha )s} \,ds\bigg]. \end{aligned}$$

(3.13)

Next we show that for all $\delta >0$, $x\in \mathbb{R}_{+}$ and any $t\in [0,T-\delta ]$, we have

$$\begin{aligned} |w(t\pm \varepsilon ,x)-w(t,x)|\le c_{\delta }\,\varepsilon \end{aligned}$$

(3.14)

for some $c_{\delta }>0$ only depending on $\delta $, and for all $\varepsilon \le T-t$. Let $\tau _{*}=\tau _{*}(t,x)$ be optimal in $w(t,x)$ and define $\nu _{\varepsilon }:=\tau _{*}\wedge (T-t-\varepsilon )$ for $\varepsilon >0$. Since $\nu _{\varepsilon }$ is admissible and suboptimal for $w(t+\varepsilon ,x)$, we get

$$\begin{aligned} &w(t+\varepsilon ,x)-w(t,x) \\ & \geq \mathbb{E}\bigg[ \int _{0}^{\nu _{\varepsilon }} e^{-\int _{0} ^{t+\varepsilon +s} r(u) \,du}H(t+\varepsilon + s,X_{s}^{x})\,ds- \int _{0}^{\tau _{*}} e^{-\int _{0}^{t+s} r(u)\,du} H(t+ s,X_{s}^{x})\,ds\bigg] \\ &= \mathbb{E}\bigg[\int _{0}^{\nu _{\varepsilon }} \big( e^{-\int _{0} ^{t+\varepsilon +s} r(u) \,du}H(t+\varepsilon +s,X_{s}^{x})-e^{-\int _{0}^{t+s} r(u)\,du} H(t+ s,X_{s}^{x}) \big)\,ds \bigg] \\ & \phantom{=:} -\mathbb{E}\bigg[\int _{\nu _{\varepsilon }}^{\tau _{*}} e^{-\int _{0} ^{t+s} r(u)\,du} H(t+s,X_{s}^{x}) \,ds \bigg]. \end{aligned}$$

(3.15)

Now we use that

$$\begin{aligned} &\big|e^{-\int _{0}^{t+\varepsilon +s} r(u) \,du}H(t+\varepsilon +s,X _{s}^{x})-e^{-\int _{0}^{t+s} r(u)\,du} H(t+s,X_{s}^{x})\big| \\ &\leq \int ^{\varepsilon }_{0}\left |\frac{d}{d\,z}e^{-\int _{0}^{t+s+z} r(u)\,du} H(t+s+z,X_{s}^{x}) \right |d\,z \\ &\le \int ^{\varepsilon }_{0}\left | -r(t+s+z)H(t+s+z,X_{s}^{x}) + \frac{ \partial }{\partial z}H(t+s+z,X_{s}^{x}) \right |d\,z \\ &\le c_{1}(1+ X_{s}^{x})\varepsilon , \end{aligned}$$

(3.16)

where the last estimate follows by (3.2), (3.3) and Assumption 3.1, with a uniform constant $c_{1}>0$ (recall that $r(\cdot ) \ge 0$). Plugging the above expression into the first term of (3.15) and using the mean value theorem for the second, we get

$$\begin{aligned} w(t+\varepsilon ,x)-w(t,x)\ge \mathbb{E}\bigg[- c_{1}\varepsilon \int _{0}^{\nu _{\varepsilon }} (1+ X_{s}^{x})d\,s-H(t+\zeta ,X_{\zeta } ^{x}) (\tau _{*}-\nu _{\varepsilon })\bigg], \end{aligned}$$

where $\zeta (\omega )\in (\nu _{\varepsilon }(\omega ),\tau _{*}( \omega ))$. Notice that

and from (3.2), one obtains the bound

In conclusion, noticing that $\nu _{\varepsilon }\le \tau _{*}$ and recalling (3.7) and (3.8), by a change of measure, we have

(3.17)

for a different constant $C>0$. Using the Markov inequality, we obtain

$$\begin{aligned} \mathbb{P}[\tau _{*}\ge T-t-\varepsilon ] &\le \frac{\mathbb{E}[\tau _{*}]}{T-t-\varepsilon }, \\ \widetilde{\mathbb{P}}[\tau _{*}\ge T-t-\varepsilon ] &\le \frac{ \widetilde{\mathbb{E}}[\tau _{*}]}{T-t-\varepsilon }, \end{aligned}$$

which plugged back into (3.17) give

$$\begin{aligned} w(t+\varepsilon ,x)-w(t,x) \ge - C\varepsilon (\mathbb{E}[\tau _{*}]+ x \,\widetilde{\mathbb{E}}[\tau _{*}])\bigg(1+\frac{1}{T-t-\varepsilon } \bigg). \end{aligned}$$

(3.18)

By using similar estimates and observing that $\sigma _{\varepsilon } ^{+}:=\tau _{*}(t+\varepsilon ,x)$ is admissible and suboptimal for $w(t,x)$, we get

$$\begin{aligned} &w(t+\varepsilon ,x)-w(t,x) \\ & \leq \mathbb{E}\bigg[\int _{0}^{\sigma _{\varepsilon }^{+}} \big( e ^{-\int _{0}^{t+\varepsilon +s} r(u) \,du}H(t+\varepsilon +s,X_{s}^{x})-e ^{-\int _{0}^{t+s} r(u)\,du} H(t+s,X_{s}^{x}) \big)\,ds \bigg] \\ &\le c_{1}\varepsilon \mathbb{E}\bigg[\int _{0}^{\sigma _{\varepsilon }^{+}} (1+ X_{s}^{x})d\,s\bigg]\le C\varepsilon (\mathbb{E}[ \sigma _{\varepsilon }^{+}]+x\,\widetilde{\mathbb{E}}[\sigma _{\varepsilon }^{+}]), \end{aligned}$$

(3.19)

where we have used again (3.16) and the change of measure.

Symmetric arguments then give

$$\begin{aligned} w(t,x)-w(t-\varepsilon ,x) \le C\varepsilon (\mathbb{E}[\tau _{*}]+x\, \widetilde{\mathbb{E}}[\tau _{*}]) \end{aligned}$$

(3.20)

and

$$\begin{aligned} w(t,x)-w(t-\varepsilon ,x) \ge - C\varepsilon (\mathbb{E}[ \sigma _{\varepsilon }^{-}]+ x\,\widetilde{\mathbb{E}}[ \sigma _{\varepsilon }^{-}])\bigg(1+\frac{1}{T-t}\bigg) \end{aligned}$$

(3.21)

with $\sigma _{\varepsilon }^{-}:=\tau _{*}(t,x-\varepsilon )$. Equations (3.18)–(3.21) imply (3.14), and combining (3.11) and (3.14), we conclude that $w$ is in $C([0,T]\times \mathbb{R}_{+})$, locally Lipschitz and differentiable a.e. in $[0,T)\times \mathbb{R}_{+}$.

Step 3 (gradient bounds). Let $(t,x)\in [0,T)\times \mathbb{R} _{+}$ be a point of differentiability of $w$. Dividing (3.12) and (3.13) by $\varepsilon $ and letting $\varepsilon \to 0$ gives (3.9), as claimed. Moreover, dividing (3.18) by $\varepsilon $ and letting $\varepsilon \to 0$, we obtain the lower bound in (3.10). Finally, dividing (3.20) by $\varepsilon $ and letting $\varepsilon \to 0$, we get the upper bound in (3.10). □

The continuity of $w$ and standard optimal stopping theory guarantee that for all $t\in [0,T]$, the process

$$\begin{aligned} W_{s}:=w(t+s,X^{x}_{s})+\int _{0}^{s}e^{-\int _{0}^{t+u}r(v)\,dv}H(t+u,X ^{x}_{u}) \,du \end{aligned}$$

(3.22)

is a continuous supermartingale for all $s\in [0,T-t]$, and $(W_{s\wedge \tau _{*}})_{s\in [0,T-t]}$ is a martingale.

The next corollary follows by standard PDE arguments used normally in the optimal stopping literature; see e.g. [10, Theorem 2.7.7].

Corollary 5

The function $w$ is $C^{1,2}$ inside $\mathcal{C}$ and solves the boundary value problem

$$\begin{aligned} (w_{t}+\mathcal{L}w)(t,x) & = -e^{-\int _{0}^{t}r(u)\,du} H(t,x), \quad (t,x)\in \mathcal{C}, \\ w(t,x) &= 0, \quad (t,x)\in \partial \mathcal{C}\cap \{t< T\}, \\ w(T,x) &= 0, \quad x\in \mathbb{R}_{+}. \end{aligned}$$

(3.23)

It may appear that (3.23) is given in a slightly unusual form, but one should remember that $w(t,x)=e^{-\int _{0}^{t}r(u)\,du}v(t,x)$ (see (3.5)) so that for $v$, we obtain the more canonical expression

$$\begin{aligned} \big(v_{t}+\mathcal{L}v-r(\cdot )\big)(t,x) = - H(t,x), \quad (t,x)\in \mathcal{C}. \end{aligned}$$

(3.24)

The next technical lemma states some properties of $w$ that will be useful to study the regularity of the boundary $\partial \mathcal{C}$. Its proof is given in the Appendix.

Lemma 6

Assume$g(t)< 0$for$t\in (0,T)$. Then:

(i)
$x\mapsto w(t,x)$is nonincreasing for all$t\in [0,T]$.
(ii)
For any$t\in [0,T]$, we have
$$\begin{aligned} \lim _{x\to \infty }w(t,x)=0. \end{aligned}$$
(3.25)
(iii)
For all$t_{1}< t_{2}$in$[0,T]$, we have$\mathcal{S}\cap ((t_{1},t_{2})\times \mathbb{R}_{+})\neq \emptyset $.

It is worth noticing that (iii) does not exclude that there may exist $t\in (0,T)$ such that $\mathcal{S}\cap (\{t\}\times \mathbb{R}_{+})= \emptyset $.

4 Properties of the optimal boundary

In this section, we provide sufficient conditions for the boundary $\partial \mathcal{C}$ to be represented by a function $b$ of time. We establish connectedness of the sets $\mathcal{C}$ and $\mathcal{S}$ with respect to the $x$-variable and finally study Lipschitz-continuity of $t\mapsto b(t)$. It is worth emphasising that this study is mathematically challenging because of the lack of monotonicity of the map $t\mapsto b(t)$ and falls outside the scope of the existing probabilistic literature on optimal stopping and free boundary problems. In Sect. 5, we show that the Gompertz–Makeham mortality law (a mainstream model in actuarial science) leads naturally to the set of assumptions that we make below.

An initial insight on the shape of $\mathcal{C}$ is obtained by noticing that the set

$$\begin{aligned} \mathcal{R}:=\{(t,x)\in [0,T]\times \mathbb{R}_{+}: H(t,x)>0\} \end{aligned}$$

(4.1)

is contained in $\mathcal{C}$. In fact, if $(t,x)\in \mathcal{R}$, then

$$ w(t,x)\ge \mathbb{E}\left [\int _{0}^{\tau _{\mathcal{R}}}e^{-\int _{0} ^{t+s}r(t+u)\,du}H(t+s,X^{x}_{s})\,ds\right ]>0 $$

with $\tau _{\mathcal{R}}$ being the first exit time of $(t+s,X^{x} _{s})$ from ℛ. For all $t\in [0,T]$ such that $g(t)\neq 0$, the boundary $\partial \mathcal{R}$ is given by the curve

$$\begin{aligned} \gamma (t):=-K \ell (t)/g(t). \end{aligned}$$

(4.2)

Moreover, for each $t\in [0,T]$, we denote the $t$-section of ℛ by

$$\begin{aligned} \mathcal{R}_{t}:=\{x\in \mathbb{R}_{+}: (t,x)\in \mathcal{R}\}. \end{aligned}$$