Sde

Stochastic Dierential
Equations
Do not worry about your problems with mathematics, I assure you

mine are far greater.
Albert Einstein.
Florian Herzog
2013
Stochastic Dierential Equations (SDE)
A ordinary dierential equation (ODE)
dx(t)
= f (t, x) , dx(t) = f (t, x)dt , (1)
dt
with initial conditions x(0) = x0 can be written in integral form
t
x(t) = x0 + f (s, x(s))ds , (2)
0
where x(t) = x(t, x0, t0) is the solution with initial conditions x(t0) = x0. An
example is given as
dx(t)
= a(t)x(t) , x(0) = x0 . (3)
dt
Stochastic Systems, 2013 2

When we take the ODE (3) and assume that a(t) is not a deterministic parameter
but rather a stochastic parameter, we get a stochastic dierential equation (SDE). The
stochastic parameter a(t) is given as
a(t) = f (t) + h(t)(t) , (4)
where (t) denotes a white noise process.

Thus, we obtain
dX(t)
= f (t)X(t) + h(t)X(t)(t) . (5)
dt
When we write (5) in the dierential form and use dW (t) = (t)dt, where dW (t)
denotes dierential form of the Brownian motion,we obtain:
dX(t) = f (t)X(t)dt + h(t)X(t)dW (t) . (6)

In general an SDE is given as
dX(t, ) = f (t, X(t, ))dt + g(t, X(t, ))dW (t, ) , (7)
where denotes that X = X(t, ) is a random variable and possesses the initial
condition X(0, ) = X0 with probability one. As an example we have already
encountered
dY (t, ) = (t)dt + (t)dW (t, ) .
Furthermore, f (t, X(t, )) R, g(t, X(t, )) R, and W (t, ) R. Similar as
in (2) we may write (7) as integral equation
t t
X(t, ) = X0 + f (s, X(s, ))ds + g(s, X(s, ))dW (s, ) . (8)
0 0

Stochastic Integrals
T
For the calculation of the stochastic integral 0 g(t, )dW (t, ), we assume that
g(t, ) is only changed at discrete time points ti (i = 1, 2, 3, ..., N 1), where
0 = t0 < t1 < t2 < . . . < tN 1 < tN < T . We dene the integral
T
S= g(t, )dW (t, ) , (9)
0
as the Riemannum

N ( )
SN () = g(ti1, ) W (ti, ) W (ti1, ) . (10)
i=1
with N .

A random variable S is called the Ito integral of a stochastic process g(t, ) with
respect to the Brownian motion W (t, ) on the interval [0, T ] if
[(
N ( )]
lim E S g(ti1, ) W (ti, ) (W (ti1, ) = 0, (11)
N
i=1
for each sequence of partitions (t0, t1, . . . , tN ) of the interval [0, T ] such that
maxi(ti ti1) 0. The limit in the above denition converges to the stochastic
integral in the mean-square sense. Thus, the stochastic integral is a random variable,
the samples of which depend on the individual realizations of the paths W (., ).

The simplest possible example is g(t) = c for all t. This is still a stochastic
process, but a simple one. Taking the denition, we actually get
T N (
)
c dW (t, ) = c lim W (ti, ) W (ti1, )
0 N
i=1
= c lim [(W (t1, )W (t0, )) + (W (t2, )W (t1, )) + . . .

N
+(W (tN , )W (tN 1, ))

= c (W (T, ) W (0, )) ,
where W (T, ) and W (0, ) are standard Gaussian random variables. With
W (0, ) = 0, the last result becomes
T
c dW (t, ) = c W (T, ) .
0

T
Example: g(t, ) = W (t, ) 0
W (t, ) dW (t, ) =

N ( )
= lim W (ti1, ) W (ti, ) W (ti1, )
N
i=1
[1 N
1
N ]
2 2 2
= lim (W (ti, ) W (ti1, )) (W (ti, ) W (ti1, ))
N 2 2
i=1 i=1
1 N
2 1 2
= lim (W (ti, ) W (ti1, )) + W (T, ) , (12)
2 N i=1 2
where we have used the following algebraic relationship y(x y) = yx y 2 +

2 x 2 x = 2 x 2 y 2 (x y) .
1 2 1 2 1 2 1 2 1 2

N
We take now a detailed look at :limN i=1 (W (ti , ) W (ti1, ))2.

N
2

N
2
E[ lim (W (ti, ) W (ti1, )) ] = lim E[(W (ti, ) W (ti1, )) ]
N N
i=1 i=1

N
= lim (ti ti1)
N
i=1
= T

N
2

N
2
Var[ lim (W (ti, ) W (ti1, )) ] = lim Var[(W (ti, ) W (ti1, )) ]
N N
i=1 i=1

N
2
= 2 lim (ti ti1) .
N
i=1

By reducing the partition, the variance becomes zero,

N
2

N
lim (ti ti1) max(ti ti1) lim (ti ti1)
N i N
i=1 i=1
= max(ti ti1) T
i
= 0, (13)
N
since ti1 ti 0. Since the expected value of i=1 (ti ti1)2 is T and the
variance becomes zero, we get

N
2
(W (ti, ) W (ti1, )) = T (14)
i=1

The stochastic integral has the solution

T
1 2 1
W (t, ) dW (t, ) = W (T, ) T (15)
0 2 2
This is incontrast to our intuition from standard calculus. In the case of a deterministic
T
integral 0 x(t)dx(t) = 12 x2(t), whereas the Ito integral diers by the term 12 T .
This example shows that the rules of dierentiation (in particular the chain rule)
and integration need to be re-formulated in the stochastic calculus.

Properties of Ito Integrals.

T
E[ g(t, ) dW (t, )] = 0 .
0
Proof:
T
N ( )
E[ g(t, )dW (t, )] = E[ lim g(ti1, ) W (ti, ) W (ti1, ) ]
0 N
i=1

N ( )
= lim E[g(ti1, )] E[ W (ti, ) W (ti1, ) ]
N
i=1
= 0.
The expectation of stochastic integrals is zero. This is what we would expect anyway.

Properties of Ito Integrals.

[ T ] T
2
Var g(t, )dW (t, ) = E[g (t, )]dt . (16)
0 0
Proof:
[ T ] [ T ]
2
Var g(t, )dW (t, ) = E ( g(t, )dW (t, ))
0 0
[(
N ( ))2]
= E lim g(ti1, ) W (ti, ) W (ti1, )
N
i=1


N
N
= lim E[g(ti1, )g(tj1, )
N
i=1 j=1
(W (ti, ) W (ti1, ))(W (tj , ) W (tj1, ))]

N ( )2
2
= lim E[g (ti1, )] E[ W (ti, ) W (ti1, ) ]
N
i=1

N
2
= lim E[g (ti1, )] (ti ti1)
N
i=1
T
2
= E[g (t, )]dt . (17)
0

The calculation of the variance of the Ito Integrals shows two important properties:
[( )2 ] T [ 2 ]
T
E 0
g(t, )dW (t, ) = 0 E g (t, ) dt
T
0
E[g 2(t, )]dt <
The second property is the condition of existence for Ito integrals. The next property is
the linearity of Ito integrals:
T
[a1 g1(t, ) + a2 g2(t, )]dW (t, )
0
T T
= a1 g1(t, )dW (t, ) + a2 g2(t, )dW (t, ) , (18)
0 0
for numbers a1, a2 and stochastic functions g1(t, ), g2(t, ).

Itos lemma
As mentioned shown in the second example, the rules of classical calculus are not valid
for stochastic integrals and dierential equations. It is the equivalent to the chain rule
in classical calculus. The problem can be stated as follows:
Given a stochastic dierential equation
dX(t) = f (t, X(t))dt + g(t, X(t))dW (t) , (19)
and another process Y (t) which is a function of X(t),
Y (t) = (t, X(t)) ,
where the function (t, X(t)) is continuously dierentiable in t and twice continuously
dierentiable in X , nd the stochastic dierential equation for the process Y (t):
dY (t) = f(t, X(t))dt + g(t, X(t))dW (t) .

Itos lemma
In the case when we assume that g(t, X(t)) = 0, we know the result: the chain rule
for standard calculus. The result is given by
dy(t) = (t(t, x) + x(t, x)f (t, x))dt . (20)
In the case of stochastic problems, we reason as follows: The Taylor expansion of

(t, X(t)) yields
1 2
dY (t) = t(t, X)dt + tt(t, X)dt + x(t, X)dX(t)
2
1 2
+ xx(t, X)(dX(t)) + h.o.t . (21)
2

Itos lemma
We use (19) for dX(t) and get
dY (t) = t(t, X)dt + x(t, X)[f (t, X(t))dt + g(t, X(t))dW (t)]
1 (
2 2 2 2 2
+tt(t, X)dt + xx(t, X) f (t, X(t))dt + g (t, X(t))dW (t)
2
)
+2f (t, X(t))g(t, X(t))dt dW (t) + h.o.t . (22)
The dierentials of higher order (dt, dW ) become fast zero, dt2 0 and
dtdW (t) 0. The stochastic term dW 2(t) according to the rules of Brownian
motion is given as
2
dW (t, ) = dt . (23)

Itos lemma
Omitting higher order terms and using the properties of Brownian motion, we arrive at
1 2
dY (t) = [t(t, X) + x(t, X)f (t, X(t)) + xx(t, X)g (t, X(t))]dt
2
+x(t, X)g(t, X(t))dW (t) . (24)
Reordering the terms yields the scalar version of Itos Lemma:
dY (t) = f(t, X(t))dt + g(t, X(t))dW (t) , (25)

f(t, X(t)) = t(t, X) + x(t, X)f (t, X(t))
1 2
+ xx(t, X)g (t, X(t)) , (26)
2
g(t, X(t)) = x(t, X)g(t, X(t)) . (27)

Itos lemma
The term 12 xx(t, X)g 2(t, X(t)) is often called the Ito corretion term, since this
does not occur in the det. case.
We apply Itos formula for the following problem: (t, X) = X 2 with the SDE
dX(t) = dW (t). From the SDE, we get X(t) = W (t) and calculate the partial
(t,X) 2 (t,X) (t,X)
derivatives of X = 2X , X 2
= 2, and t = 0. The Ito lemma yields
2
d(W (t)) = 1dt + 2W (t)dW (t) . (28)
We rewrite the equation and use W (0) = 0

t
2
W (t) = 1t + 2 W (t)dW (t) ,
0
t
1 2 1
W (t)dW (t) = W (t) t . (29)
0 2 2

Itos lemma
We now allow that the process X(t) is in Rn. We let W (t) be an m-dimensional
standard Brownian motion and f (t, X(t)) Rn and g(t, X(t)) Rnm. Consider
a scalar process Y (t) dened by Y (t) = (t, X(t)), where (t, X) is a scalar
function which is continuously dierentiable with respect to t and twice continuously
dierentiable with respect to X . The Ito formula can be written in vector notation as
follows:
dY (t) = f(t, X(t))dt + g(t, X(t))dW (t) , (30)

f(t, X(t)) = t(t, X(t)) + x(t, X(t)) f (t, X(t))
1 ( T
)
+ tr xx(t, X(t))g(t, X(t))g (t, X(t))) , (31)
2
g(t, X(t)) = x(t, X(t)) g(t, X(t)) , (32)
where tr denotes the trace operator.

Itos lemma
Consider the following stochastic dierential equation:
dS(t) = S(t)dt + S(t)dW (t) , (33)
We want to nd the SDE for the process Y related to S as follows: Y (t) = (t, S) =
(t,S) 2 (t,S) (t,S)
ln(S(t)) . The partial derivatives are: S = S1 , S 2
= S12 , and t = 0.
Therefore, according to Ito we get,
( (t, S) (t, S) 1 2(t, S) 2 2 )
dY (t) = + S(t) + S (t) dt
t S 2 S 2
( (t, S) )
+ S(t) dW (t) , (34)
S
1 2
dY (t) = ( )dt + dW (t) . (35)
2

Itos lemma
Since the right hand side of (35) is independent of Y (t), we are able to compute the
stochastic integral:
t t
1 2
Y (t) = Y0 + ( )dt + dW , (36)
0 2 0
1 2
Y (t) = Y0 + ( )t + W (t) . (37)
2
Since Y (t) = ln S(t) we have found a solution for S(t) :
1 2
ln(S(t)) = ln(S(0)) + ( )t + W (t) , (38)
2
( 1 2
S(t) = S(0)e 2 )t+W (t) , (39)
where W (t) is a standard BM.

Itos lemma
Show for U (t) = X1(t)X2(t) with
dX1(t) = f1(t, X1)dt + g1(t, X1)dW (t) ,

dX2(t) = f2(t, X2)dt + g2(t, X2)dW (t) ,
that following formula is valid:
dU (t) = dX1(t)X2(t) + X1(t)dX2(t) + g1(t, X1)g2(t, X2)dt (40)
We show that we obtain the same result as in the previous formula by apply Itos
lemma. By (40) liefert
dU (t) = [X2(t)f1(t, X1) + X1(t)f2(t, X2) + g1(t, X1)g2(t, X2)]dt

+[X2(t)g1(t, X1) + X1(t)g2(t, X2)]dW (t)

Itos lemma
[ ]
2U 0 1
The partial derivatives of U are : U
X = (X2(t), X1(t))T , X 2
= and
1 0
U
t = 0.
U U T
dU (t) = [ + [f1(t, X1), f2(t, X2)]
t X
[ ])
1 ( 2U g1(t, X1)2 g1(t, X1)g2(t, X2)
+ tr ]dt
2 X 2 g1(t, X1)g2(t, X2) g2(t, X2)2
U T
+ [g1(t, X1), g2(t, X2)] dW (t)
X
= [X2(t)f1(t, X1) + X1(t)f2(t, X2) + g1(t, X1)g2(t, X2)]dt
+[X2(t)g1(t, X1) + X1(t)g2(t, X2)]dW (t)

We classify SDEs into two large groups, linear SDEs and non-linear SDEs. Furthermore,
we distinguish between scalar linear and vector-valued linear SDEs.
We start with the easy case, the scalar linear linear SDEs. An SDE
dX(t) = f (t, X(t))dt + g(t, X(t))dW (t) , (41)
for a one-dimensional stochastic process X(t) is called a linear (scalar) SDE if and
only if the functions f (t, X(t)) and g(t, X(t)) are ane functions of X(t) R and
thus
f (t, X(t)) = A(t)X(t) + a(t) ,

g(t, X(t)) = [B1(t)X(t) + b1(t), , Bm(t)X(t) + bm(t)] ,
where A(t), a(t) R, W (t) Rm is an m-dimensional Brownian motion, and

Bi(t), bi(t) R, i = 1, , m. Hence, f (t, X(t)) R and g(t, X(t)) R1m.

The linear SDE possesses the following solution
( t [
m ]
1
X(t) = (t) x0 + (s) a(s) Bi(s)bi(s) ds
0 i=1
m
t )
1
+ (s)bi(s)dWi(s) , (42)
i=1 0
where we denote (t) as the fundamental matrix, which we obtain from
( t [ m
Bi2(s) ] m t )
(t) = exp A(s) ds + Bi(s)dWi(s) , (43)
0 i=1
2 i=1 0
The solution is similar to the solution of ODEs.

Let us assume that W (t) R, a(t) = 0, b(t) = 0, A(t) = A, B(t) = B . We

want to compute the solution of the SDE
dX(t) = AX(t)dt + BX(t)dW (t) , X(t) = x0 , (44)
We can solve it using (42) and (43):

1 B 2 )t+BW (t)
(A 2
(t) = e , (45)
and (42) is easy to calculate since
(A 1 2
x(t) = (t)x0 = x0e 2 B )t+BW (t) . (46)

The expectation m(t) = E[X(t)]and the second moment P (t) = E[X 2(t)] for

m
dX(t) = (A(t)X(t) + a(t))dt + (Bi(t)X(t) + b(t))dWi(t) . (47)
i=1
can be calculated by solving the following system of ODEs:
m(t) = A(t)m(t) + a(t) , m(0) = x0 , (48)

(
m ) (
m )
2
P (t) = 2A(t) + Bi (t) P (t) + 2m(t) a(t) + Bi(t)bi(t)
i=1 i=1

m )
2 2
+ bi (t) , P (0) = x0 . (49)
i=1

The ODE for the expectation is derived by applying the expectation operator on both
sides of (42).

m
E[dX(t)] = E[(A(t)X(t) + a(t))dt + (Bi(t)X(t) + bi(t))dWi(t) ]
i=1
E[dX(t)] = (A(t) E[X(t)] +a(t))dt

| {z } | {z }
dm(t) =m(t)

m
+ E[(Bi(t)X(t) + bi(t))] E[dWi(t) ]
| {z }
i=1 =0
dm(t) = (A(t)m(t) + a(t))dt . (50)

In order to compute the second moment, we need to derive the SDE for Y (t) = X 2(t):
[ m (
)2 ]
dY (t) = 2X(t)(A(t)X(t) + a(t)) + Bi(t)X(t) + bi(t) dt
i=1
m (
)
+2X(t) Bi(t)X(t) + bi(t) dWi(t) (51)
i=1
[ m (

2 2 2
dY (t) = 2A(t)X (t) + 2X(t)a(t) + Bi (t)X (t) + 2Bi(t)bi(t)X(t)
i=1
)] m (
)
2
+bi (t) dt + 2X(t) Bi(t)X(t) + bi(t) dWi(t) (52)
i=1

Furthermore, we apply the expectation operator to (52) and use P (t) = E[X 2(t)] =
E[Y (t)] and m(t) = E[X(t)].
[ m (

2 2 2
E[dY (t)] = 2A(t)E[X (t)] + 2a(t)E[X(t)] + Bi (t)E[X (t)]
i=1
)]
2
+2Bi(t)bi(t)E[X(t)] + bi (t) dt
[ m (
) ]
+E 2X(t) Bi(t)X(t) + bi(t) dWi(t)
i=1
[
dP (t) = 2A(t)P (t) + 2a(t)m(t)
m (
)]
2 2
+ Bi (t)P (t) + 2Bi(t)bi(t)m(t) + bi (t) dt
i=1

In the case that Bi(t) = 0, i = 1, . . . , m, we are able to directly compute the

distribution. The scalar linear SDE

m
dX(t) = (A(t)X(t) + a(t))dt + bi(t)dWi(t), (53)
i=1
with X(0) = x0 is normaly distributed

P (X(t)|x0) N (m(t), V (t)) with expected value m(t) and variance V (t), which
are solutions of the following ODEs,
m(t) = A(t)m(t) + a(t) , m(0) = x0 , (54)

m
2
V (t) = 2A(t)V (t) + bi (t) , V (0) = 0 . (55)
i=1

There are some specic scalar linear SDEs which are found to be quite useful in practice.
The simplest case of SDE is where the drift and the diusion coecients are independent
of the information received over time
dS(t) = dt + dW (t) , S(0) = S0 . (56)
This model has been used to simulate commodity prices, such as metals or agricultural
products.
The mean is E[S(t)] = t + S0 and the variance Var[S(t)] = 2t. S(t) possesses
a behavior of uctuations around the straight line S0 + t.The process is normally
distributed with the given mean and variance.

The standard model of stock prices is the geometric Brownian motion as given by
dS(t) = S(t)dt + S(t)dW (t, ) , S(0) = S0 .
2 2t 2 t
t
The mean is given by E[S(t)] = S0e and its variance by Var[S(t)] = S0 e (e
1). This model forms the starting point for the famous Black-Scholes formula for option
pricing. The geometric Brownian motion has two main features which make it popular
for stock
The rst property is that S(t) > 0 for all t [0, T ] and the second is that all returns
are in scale with the current price. This process has a log-normal probability density
function.

Another very popular class of SDEs are mean reverting linear SDEs. The model is
obtained by
dS(t) = [ S(t)]dt + dW (t, ) , S(0) = S0 . (57)
A special case of this SDE where = 0 is called Ohrnstein-Uhlenbeck process.

Equation (57) models a process which naturally falls back to its equilibrium level of .
The expected price is E[S(t)] = ( S0)e t and the variance is
2 ( 2 t
)
Var[S(t)] = 1e .
2

In the long run, the following (unconditional) approximations are valid
lim E[S(t)] =
t
and
2
lim Var[S(t)] = .
t 2
2
This analysis shows that the process uctuates around and has a variance of 2
which depends on the parameter : the higher , the lower the variance.
This is obvious since the higher , the faster the process reverts back to its mean
value.
This process is a stationary process which is normally distributed.

A popular extension is where the diusion term is in scale with the current value, i.e.,
the geometric mean reverting process:
dS(t) = [ S(t)]dt + S(t)dW (t, ) , S(0) = S0 .
In this model S(t) 0, if S0 0, > 0, and > 0.
The rst mean reversion model(57) may produce negative values even for > 0.
Since the second mean-reversion model has always positive realizations, it is also
called log-normal mean reversion. This type of model is used to model interest rate or
volatilities.

In control engineering science, the most important (scalar) case is

m
dX(t) = (A(t)X(t) + C(t)u(t)) dt + bi(t) dWi . (58)
i=1
In this equation, X(t) is normally distributed because the Brownian motion is just
multiplied by time-dependent factors.
When we compute an optimal control law for this SDE, the deterministic optimal control
law (ignoring the Brownian motion) and the stochastic optimal control law are the same.
This feature is called certainty equivalence. For this reason, the stochastics are often
ignored in control engineering.

The logical extension of scalar SDEs is to allow X(t) Rn to be a vector. The rest of
this section proceeds in a similar fashion as for scalar linear SDEs. A stochastic vector
dierential equation
dX(t) = f (t, X(t))dt + g(t, X(t))dW (t)
with the initial condition X(0) = x0 Rn for an n-dimensional stochastic process

X(t) is called a linear SDE if the functions f (t, X(t)) Rn and g(t, X(t)) Rnm
are ane functions of X(t) and thus
f (t, X(t)) = A(t)X(t) + a(t) ,

g(t, X(t)) = [B1(t)X(t) + b1(t), , Bm(t)X(t) + bm(t)] ,
where A(t) Rnn, a(t) Rn, W (t) Rm is an m-dimensional Brownian motion,

and Bi(t) Rnn, bi(t) Rn.

Alternatively, the vector-valued linear SDE can be written as

m
dX(t) = (A(t)X(t) + a(t))dt + (Bi(t)X(t) + bi(t))dWi(t) . (59)
i=1
A common extension of the above equation is the following form of a controlled

stochastic dierential equation as given by
dX(t) = (A(t)X(t) + C(t)u(t) + a(t)) dt

m
+ (Bi(t)X(t) + Di(t)u(t) + bi(t)) dWi , (60)
i=1
where u(t) Rk , C(t) Rnk , Di(t) Rnk .

The linear SDE (59) has the following solution:
( t [
m ]
1
X(t) = (t) x0 + (s) a(s) Bi(s)bi(s) ds
0 i=1
m
t )
1
+ (s)bi(s)dWi(s) , (61)
i=1 0
where the fundamental matrix (t) Rnn is the solution of the homogenous
stochastic dierential equation.

The fundamental matrix (t) Rnn is the solution of the homogenous stochastic
dierential equation:

m
d(t) = A(t)(t)dt + Bi(t)(t)dWi(t) , (62)
i=1
with initial condition (0) = I , I Rnn e now prove that (61) and (62) are
solutions of (59). We rewrite (61) as
( t )
1
X(t) = (t) x0 + (t)dY (t)
0
[
m ]
m
dY (t) = a(t) Bi(t)bi(t) dt + bi(t)dWi(t) .
i=1 i=1

( t )
1
X(t) = (t)Z(t) , Z(t) = x0 + (t)dY (t)
0
1
dZ(t) = (t)dY (t)
We use the Ito formula to calculate X(t) = (t)Z(t):

m
1
dX(t) = (t)dZ(t) + d(t)Z(t) + Bi(t)(t)(t) bi(t)dt
i=1

m
m
= dY (t) + A(t)(t)Z(t)dt + Bi(t)(t)Z(t)dWi(t) + Bi(t)bi(t)dt
i=1 i=1

Noting that Z(t) = 1(t)X(t) and using the SDE for Y (t), we get

m
m
dX(t) = dY (t) + A(t)(t)Z(t)dt + Bi(t)(t)Z(t)dWi(t) + Bi(t)bi(t)dt
i=1 i=1
[
m ]
m
= a(t) Bi(t)bi(t) dt + bi(t)dWi(t) + A(t)X(t)dt
i=1 i=1

m
m
+ Bi(t)X(t)dWi(t) + Bi(t)bi(t)dt
i=1 i=1

m
= [a(t) + A(t)X(t)]dt + (Bi(t)X(t) + bi(t))dWi(t) .
i=1
This completes the proof.

The expectation m(t) = E[X(t)] Rn and the second moment matrix P (t) =
E[X(t)X T (t)] Rnn can be computed as follows:
m(t) = A(t)m(t) + a(t) , m(0) = x0 , (63)

T T T
P (t) = A(t)P (t) + P (t)A (t) + a(t)m (t) + m(t)a (t)

m
T T
+ [Bi(t)P (t)Bi (t) + Bi(t)m(t)bi (t)
i=1
T T T T
+bi(t)m (t)Bi (t) + bi(t)bi(t) ] , P (0) = x0x0 . (64)
The covariance matrix for the system of linear SDEs is given by als
T
V (t) = Var{x(t)} = P (t) m(t)m (t) . (65)

The special case

m
dX(t) = (A(t)X(t) + a(t))dt + bi(t)dWi(t)
i=1
with the initial condition X(0) = x0 Rn is normally distributed, i.e.,
P (X(t)|x0) N (m(t), V (t))
where
m(t) = A(t)m(t) + a(t) m(0) = x0
T

m
T
V (t) = A(t)V (t) + V (t)A (t) + bibi (t) V (0) = 0 .
i=1

As rst example of a linear vector valued SDE, we consider a two dimensional geometric
Brownian motion:
( )
dS1(t) = 1S1(t)dt + S1(t) 11dW1(t) + 12dW2(t) , (66)
( )
dS2(t) = 2S2(t)dt + S2(t) 21dW1(t) + 22dW2(t) . (67)
Written in matrix form S = (S1, S2)T , the same SDE is given as:
( ) ( ) ( ) ( )
1 0 0 11 0 12 0
A(t) = a(t) = B1(t) = B2(t) =
0 1 0 0 21 0 22
Both processes S1(t) and S2(t) are correlated if 12 = 21 = 0. This model can be
easily extended to n processes.

The observed volatility for real existing price processes, such as stocks or bonds is itself
a stochastic process. The following model describes this observation:
dP (t) = dt + (t)dW1(t) , P (0) = P0 ,

d(t) = ( (t))dt + (t)1dW2(t) , (0) = 0 .
where is the average volatility, 1 a volatility, and the mean reversion rate of
the volatility process (t). If this model is used for stock prices, the transformation
P (t) = ln(S(t)) is useful. The two Brownian motions dW1(t) and dW2(t) are
correlated, hence corr[dW1(t), dW2(t)] = . This model captures the behavior of
real existing prices better and its distribution of returns shows fatter tails.

Die system (68) can be rewritten as linear SDE:

( ) ( ) ( )
0 0 0 1
A(t) = a(t) = B1(t) =
0 0 1
( )
0 0
B2(t) = ,
0 1 1 2
wobei x(t) = (P (t), (t))T . The system (68) has the property, that the variance
of P (t) depends on the initial condition 0 For the parameters = 0.1, = 2,
= 0.2, 1 = 0.5 and = 0.5, we calculate the standard deviation of P (t) with
0 = 0.1 and alternatively with 0 = 0.8. The expected value of (t) has the
following evaluation over time m(t) = + (0 )et and thus the variance of
P (t) depends on 0.

0.7
0=0.1
=0.8
0
0.6
0.5
Standardabweichung
0.4
0.3
0.2
0.1
0
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
time
Abbildung 1: Stand. dev. of P (t) for dierent initial conditions of (t)

In comparison with linear SDEs, nonlinear SDEs are less well understood. No general
solution theory exists. And there are no explicit formulae for calculating the moments.
In this section, we show some examples of nonlinear SDEs and their properties.
In general, a scalar square root process can be written as

with
f (t, X(t)) = A(t)X(t) + a(t)

g(t, X(t)) = B(t) X(t) ,
where A(t), a(t), and B(t) are real scalars. The nonlinear mean reverting SDEs dier
from the linear scalar equations by their nonlinear diusion term. For this process, the
distribution and moments can be calculated.

For a specic square root process with A(t) = 0, a(t) = 1 and B(t) = 2 we are
able to derive the analytical solution: The SDE

dX(t) = 1dt + 2 X(t)dW (t) , X(0) = xo ,
has the solution X(t) = (W (t) + x0)2We verify the solution using Ito formula. We
use (t) = X(t) = (Y (t) + x0)2 and dY (t) = dW (t). The partial derivatives are
t = 0, Y = 2(Y (t) + x0), and Y Y = 2. Thus
1
d(t) = [t + Y 0 + Y Y 1]dt + Y 1dW (t) ,
2

d(t) = 1dt + 2(Y (t) + x0)dW (t) , dX(t) = 1dt + 2 X(t)dW (t) ,

since X(t) = Y (t) + x0.

Another widely used mean reversion model is obtained by

dS(t) = [ S(t)]dt + S(t)dW (t) , S(0) = S0 . (68)
This model is also known as the Cox-Ross-Ingersol processes.The process shows a

less volatile behavior than its linear geometric counterpart and it has a non-central
chi-square distribution. The process is often used to model short-term interest rates or
stochastic volatility processes for stock prices. Another often used square root process
is similar to the geometric Brownian motion, but with a square root diusion term
instead of the linear diusion term. Its model is given by

dS(t) = S(t)dt + S(t)dW (t) , S(0) = S0 . (69)

t
The expected value
( for (69) is
) E [S(t)] = S 0 e and the variance is obtained by
2S
Var[S(t)] = 0 e2t et .
Another widely used mean reversion model is obtained by
dS(t) = S(t)[ ln(S(t))]dt + S(t)dW (t) . (70)
Using the transformation P (t) = ln(S(t)) yields the linear mean reverting and
normally distributed process P (t):
2
dP (t) = [( ) P (t)]dt + dW (t) , (71)
2
Because of the transformation, S(t) is log-normally distributed. This model is used
to model stock prices, stochastic volatilities, and electricity prices. Because S(t) is
log-normally distributed, S(t) is always positive.

In this part, we introduce three major methods to compute solution of SDEs.

The rst method is based on the Ito integral and has already been used for linear
solutions.
We introduce numerical methods to compute path-wise solutions of SDEs.
The third method is based on partial dierential equations, where the problem of
nding the probability density function of the solution is transformed into solving a
partial dierential equation.

The stochastic process X(t) governed by the stochastic dierential equation

X(0) = X0
is explicitly described by the integral form

t t
X(t, ) = X0 + f (s, X(s)) ds + g(s, X(s)) dW (s) ,
0 0
where the rst integral is a path-wise Riemann integral and the second integral is an
Ito integral.
In this denition, it is assumed that the functions f (t, X(t)) and g(t, X(t)) are
suciently smooth in order to guarantee the existence of the solution X(t).

There are several ways of nding analytical solutions. One way is to guess a soluti-
on and use the Ito calculus to verify that it is a solution for the SDE under consideration.
We assume that the following nonlinear SDE

dX(t) = dt + 2 X(t) dW (t) ,
has the solution

2
X(t) = (W (t) + X0) .
In order to verify thisclaim, we use the Ito calculus. We have

X(t) = (W ) where
(W ) = (W (t) + X0)2, so that (W ) = 2(W (t) + X0) and (W ) = 2.

Using Itos rule, we get
dX(t) = fe(t, X)dt + g

e(t, X)dW (t)
1
fe(t, X)
2
= (W ) 1 + (W )(2 X(t)) = 1
2

e(T, X)
g = (W )(2 X(t)) = 2(W (t) + X0) .

Since X(t) = (W (t) + X0)2 we know that (W (t) + X0) = X(t) and thus
the Ito calculation generated the original SDE where we started at.

For some classes of SDEs, analytical formulas exist to nd the solution, e.g. consider
the following SDE:
dX(t) = f (t, X(t))dt + (t)dW (t) , X(0) = x0 (72)
where X(t) Rn, f (t, X(t)) Rn is an arbitrary function, (t) Rnm and
dW (t) Rm. This class of SDEs has the following general solution:
X(t) = Y (t) + F (t) (73)

dY (t) = f (t, Y (t) + F (t))dt , Y (0) = x0 (74)
dF (t) = (t)dW (t) , F (0) = 0 . (75)
t
The SDE for F (t) can be integrated, i.e. F (t) = 0
(s)dW (s). When (t) =
than F (t) = W (t).

SinceF (t) is know,, we are able to solve for Y (t) in in function of F (t).
Using Ito lemman, we show that X(t) = Y (t) + F (t) and this solves the SDE
dX(t) = dY (t) + dF (t) = f (t, Y (t) + F (t))dt + (t)dW (t)

= f (t, X(t))dt + (t)dW (t) (76)
This solution is not very suprising, since X(t) is the sum of the process of Y (t) and
the BM of F (t).

For another class of SDEs, exist an analytical formula for their solution:
dX(t) = f (t, X(t))dt + c(t)X(t)dW (t) , X(0) = x0 , (77)
where f (t, X(t)) R, c(t) R and dW R. DThe solution can be derived as

follows:
1
X(t) = F (t)Y (t) (78)
2
dF (t) = F (t)c (t)dt F (t)c(t)dW (t) , F (0) = 1 (79)
1
dY (t) = F (t)f (t, F Y (t))dt (80)
The proof is similar to the rst case, sice the diusion is linear.

Calculate the analytical solution for
dt
dX(t) = + X(t)dW (t) , X(0) = x0 .
X(t)
1 2 tW (t) F (t) F 2(t)
F (t) = e 2 , dY (t) = 1 dt = dt
F (t)Y Y
t
2 1 2 2
dY (t)Y (t) = F (t)dt , Y (t) = F (s)ds + C0
2 0
( t )1
2 2 s2W (s) 2
Y (t) = x0 + 2 e ds
0

1 2 t+W (t) ( 2 t
2 s2W (s)
)1
2
X(t) = e 2 x0 +2 e ds
0

However, most SDEs, especially nonlinear SDEs, do not have analytical solutions so
that one has to resort to numerical approximation schemes in order to simulate sample
paths of solutions to the given equation.
The simplest scheme is obtained by using a rst-order approximation. This is called the
Euler scheme
X(tk ) = X(tk1) + f (tk1, X(tk1))t + g(tk1, X(tk1))W (tk ) .
The Brownian motion term can be approximated as follows:

W (tk ) = (tk ) t ,
where the (.) is a discrete-time Gaussian white process with mean 0 and standard
deviation 1.

Sde

Uploaded by

Document Informationclick to expand document information

Copyright:

Available Formats

Sde

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Sde

Uploaded by

Copyright:

Available Formats

Stochastic Dierential

Do not worry about your problems with mathematics, I assure you

A ordinary dierential equation (ODE)

Stochastic Systems, 2013 2

a(t) = f (t) + h(t)(t) , (4)

where (t) denotes a white noise process.

dX(t) = f (t)X(t)dt + h(t)X(t)dW (t) . (6)

Stochastic Systems, 2013 3

In general an SDE is given as

dX(t, ) = f (t, X(t, ))dt + g(t, X(t, ))dW (t, ) , (7)

Stochastic Systems, 2013 4

Stochastic Systems, 2013 5

Stochastic Systems, 2013 6

= c lim [(W (t1, )W (t0, )) + (W (t2, )W (t1, )) + . . .

+(W (tN , )W (tN 1, ))

Stochastic Systems, 2013 7

where we have used the following algebraic relationship y(x y) = yx y 2 +

Stochastic Systems, 2013 8

Stochastic Systems, 2013 9

By reducing the partition, the variance becomes zero,

Stochastic Systems, 2013 10

The stochastic integral has the solution

Stochastic Systems, 2013 11

Properties of Ito Integrals.

Stochastic Systems, 2013 12

Properties of Ito Integrals.

Stochastic Systems, 2013 13

(W (ti, ) W (ti1, ))(W (tj , ) W (tj1, ))]

Stochastic Systems, 2013 14

for numbers a1, a2 and stochastic functions g1(t, ), g2(t, ).

Stochastic Systems, 2013 15

dX(t) = f (t, X(t))dt + g(t, X(t))dW (t) , (19)

and another process Y (t) which is a function of X(t),

Y (t) = (t, X(t)) ,

dY (t) = f(t, X(t))dt + g(t, X(t))dW (t) .

Stochastic Systems, 2013 16

dy(t) = (t(t, x) + x(t, x)f (t, x))dt . (20)

In the case of stochastic problems, we reason as follows: The Taylor expansion of

Stochastic Systems, 2013 17

We use (19) for dX(t) and get

Stochastic Systems, 2013 18

Reordering the terms yields the scalar version of Itos Lemma:

dY (t) = f(t, X(t))dt + g(t, X(t))dW (t) , (25)

Stochastic Systems, 2013 19

We rewrite the equation and use W (0) = 0

Stochastic Systems, 2013 20

dY (t) = f(t, X(t))dt + g(t, X(t))dW (t) , (30)

where tr denotes the trace operator.

Stochastic Systems, 2013 21

Consider the following stochastic dierential equation:

dS(t) = S(t)dt + S(t)dW (t) , (33)

Stochastic Systems, 2013 22

where W (t) is a standard BM.

Stochastic Systems, 2013 23

Show for U (t) = X1(t)X2(t) with

dX1(t) = f1(t, X1)dt + g1(t, X1)dW (t) ,

that following formula is valid:

dU (t) = dX1(t)X2(t) + X1(t)dX2(t) + g1(t, X1)g2(t, X2)dt (40)