Langevin Dynamics with Variable Coefficients and Nonconservative Forces: From Stationary States to Numerical Methods

Sachs, Matthias; Leimkuhler, Benedict; Danos, Vincent

doi:10.3390/e19120647

Open AccessFeature PaperArticle

Langevin Dynamics with Variable Coefficients and Nonconservative Forces: From Stationary States to Numerical Methods

by

Matthias Sachs

^1,2,

Benedict Leimkuhler

^2,*

and

Vincent Danos

³

¹

Department of Mathematics, Duke University, Durham, NC 27708, USA

²

The School of Mathematics and the Maxwell Institute of Mathematical Sciences, James Clerk Maxwell Building, University of Edinburgh, Edinburgh EH9 3FD, UK

³

Département d’informatique, École Normale Supérieure, 45 rue d’Ulm, F-75230 Paris CEDEX 05, France

^*

Author to whom correspondence should be addressed.

Entropy 2017, 19(12), 647; https://doi.org/10.3390/e19120647

Submission received: 27 September 2017 / Revised: 20 November 2017 / Accepted: 22 November 2017 / Published: 29 November 2017

(This article belongs to the Special Issue Understanding Molecular Dynamics via Stochastic Processes)

Download

Browse Figures

Versions Notes

Abstract

:

Langevin dynamics is a versatile stochastic model used in biology, chemistry, engineering, physics and computer science. Traditionally, in thermal equilibrium, one assumes (i) the forces are given as the gradient of a potential and (ii) a fluctuation-dissipation relation holds between stochastic and dissipative forces; these assumptions ensure that the system samples a prescribed invariant Gibbs-Boltzmann distribution for a specified target temperature. In this article, we relax these assumptions, incorporating variable friction and temperature parameters and allowing nonconservative force fields, for which the form of the stationary state is typically not known a priori. We examine theoretical issues such as stability of the steady state and ergodic properties, as well as practical aspects such as the design of numerical methods for stochastic particle models. Applications to nonequilibrium systems with thermal gradients and active particles are discussed.

Keywords:

Langevin dynamics; fluctuation-dissipation theorems; nonequilibrium simulation; molecular dynamics; sampling; local thermal equilibrium; temperature gradients

Graphical Abstract

1. Introduction

Langevin dynamics is a system of stochastic differential equations of the form

\begin{matrix} \frac{d q}{d t} & = & M^{- 1} p, \end{matrix}

(1)

\begin{matrix} \frac{d p}{d t} & = & F (q) - Γ (q) p + Σ (q) \dot{W} . \end{matrix}

(2)

Here

q, p

represent vectors of the positions and momenta of the particles comprising together

2 n \in N

degrees of freedom. The mass matrix

M

is symmetric positive definite. The force

F

is usually taken to be conservative, i.e.,

F (q) = - \nabla_{q} U (q)

for some potential energy function U. Langevin dynamics describes a physical system of particles moving under prescribed interaction forces and subject to collisions with particles of a “heat bath.” The friction matrix

Γ

, which is here allowed to vary with position, typically models drag on a set of distinguished particles due to the interactions with the surrounding environment, whereas the matrix

Σ

, also potentially varying with position, characterizes the stochastic effects of collisions.

W

represents a vector of independent and uncorrelated Wiener processes in

R^{m}

,

W = {[W_{1}, \dots W_{m}]}^{T}

, where the components of its formal time derivative

\dot{W}

are independent white noise processes such that

E [{\dot{W}}_{i} (t) {\dot{W}}_{j} (t^{'})] = δ_{i, j} δ (t - t^{'}) .

(3)

and

E [{\dot{W}}_{i} (t)] = 0 .

(4)

In case

Γ (q) \equiv γ I

where

γ

is a nonnegative scalar and constant, one may take

Σ (q) \equiv σ I

, where

σ = \sqrt{2 γ k_{B} T}

, T is the temperature and

k_{B}

is Boltzmann’s constant. The resulting system can then be shown, under conditions reviewed in Section 2, to have a unique invariant distribution with density

ρ_{β} (q, p) \propto \exp (- β H (q, p))

, where

H (q, p) = p^{T} M^{- 1} p / 2 + U (q)

represents the total energy of the isolated deterministic system and

β^{- 1} = k_{B} T

.

The emphasis in this article is on the general form (1) and (2), for which we will relax several typical assumptions. First, the system is not assumed to admit a universal fluctuation-dissipation relation, instead we assume only certain nondegeneracy conditions. Second, we do not assume a conservative force field. Such generalized forms of Langevin dynamics can be used to model diffusion (or thermal) gradients by particle simulation [1,2,3], as well as a variety of models for flocking [4,5,6], protein folding [7] and bacterial suspensions [8]. Problems with nonconservative forces are considered often in the physics literature, but the precise characterization of the convergence to stationary conditions is rarely discussed. In the general case, the form of the stationary distribution is non-obvious and the uniqueness of the steady state and its asymptotic stability are not well understood. An important contribution of this article is to show the existence of a steady state for the model (1) and (2) under relaxed conditions and to explicitly construct a Lyapunov function, thus allowing us to conclude the geometric convergence of observable averages.

Our approach to proving ergodicity is dependent on the fact that the friction and noise terms in the equations depend only on positions. On the other hand, within this class, and subject to the nondegeneracy condition on the friction and noise tensors, our treatment is general and has a considerable range of applications. Formally (1) and (2) includes dissipative particle dynamics (DPD), which is a momentum-conserving, 2nd order, gradient-type system [9,10], but we specifically exclude in this article cases for which

Γ

is not positive definite, which occur as a direct consequence of momentum conservation in DPD; ergodic properties for DPD systems in one dimension were discussed in previous work of Shardlow and Yan [11]. See also [12,13,14] for some special cases of Langevin-type systems with velocity dependent coefficients to which our formulation is not applicable.

To perform numerical discretization of the system (1) and (2), we proceed by splitting. This involves performing an additive decomposition of the stochastic vector field into several component parts, each of which can be exactly integrated (in the sense of distributions). The resulting stochastic maps can be composed to approximate the solution in a single timestep. Even for a particular choice of decomposition, there are many ways to combine the sequence of steps. The properties of various choices of the splitting method have been examined in detail for constant

Γ

and

Σ

in [15]. In that work it has been shown that the numerical methods constructed using certain components inherit the ergodicity properties of the SDE system, whereas the invariant measure of the numerical method differs from that of the SDE. The error in the invariant measure can be characterized in terms of the order of accuracy of averages of test functions. In general, it is found that symmetric compositions are preferred as they exhibit, under mild conditions, even order of accuracy for the invariant measure (thus one obtains second order accuracy for the same computational work as a first order scheme).

In the case of the more general systems considered in this article, the calculation of the Ornstein-Uhlenbeck solution, which is required at each step of our splitting-based numerical methods, becomes potentially demanding from a computational standpoint. In the methods of [15,16] this is done exactly, however in the current setting the solution of a Lyapunov equation would need to be obtained at each step in time, a calculation that would dominate the computational load in a large scale simulation. Therefore, we offer an efficient numerical procedure based on multiple timestepping [17], relying on a further splitting of the OU equation at each interior timestep.

In the last section of this article, we illustrate the theory in three numerical examples. The first is a particle system with an imposed diffusion gradient, as in [7,18,19,20,21]. Here the issue is to capture the correct density fluctuations as a function of position. In the second model, we incorporate a stirring force which alters the equilibrium state of a physical model. In the last example, we discuss the use of a blended Langevin model which draws on ideas from the literature of flocking, in particular the Cucker-Smale system. We use our theory to infer attractive steady states for the system, and characterize flocking tendencies by the use of two order parameters: one modelling the formation of consensus and the other characterizing the peculiarity which can be viewed as the average internal energy of isolated clumps of matter.

2. Stationary States of SDEs and Their Stability

In this section, we briefly outline the general theory on which our analysis of the ergodic properties of the system (1) and (2) in the next section is based. We are following in large parts the presentation in the review articles [22,23] and we refer the reader to these articles for a more detailed and comprehensive presentation.

Let

{(x (t))}_{t \geq 0}

denote the solution of an Itô diffusion process of the form

\dot{x} = a (x) + B (x) \dot{W}, x (0) \sim μ_{0},

(5)

taking values in a suitable domain. In this general presentation,

x (t) \in Ω_{x}

may represent either a position vector or else the combined vector of positions and momenta, i.e.,

x (t) = (q (t), p (t)) \in Ω_{q} \times R^{n}

. In the latter case, we would typically either assume a compact configurational domain, as for example when periodic boundary conditions are used in the position space, i.e.,

Ω_{q} = L T^{n}

, where

L > 0

and

T = R / Z

denotes the 1-torus, or else an unbounded domain, e.g.,

Ω_{q} = R^{n}

.

{(W (t))}_{t \geq 0}

represents a standard Wiener process in

R^{m}

and the coefficients

a

,

B

are assumed to be smooth, i.e.,

a \in C^{\infty} (Ω_{x}, R^{2 n})

, and

B \in C^{\infty} (Ω_{x}, R^{2 n \times m})

. The initial state of

x

is specified by the probability measure

μ_{0}

, which throughout this article we assume to be such that

x (0)

has finite mean and variance, i.e.,

\int_{Ω_{x}} {∥ x ∥}_{2}^{2} μ_{0} (d x) < \infty .

2.1. The Associated Semigroup of Evolution Operators and Their Adjoints

The time evolution of the expectations of observables under the dynamics of the SDE is described by the semigroup of evolution operators

{(P_{t})}_{t \geq 0}

(P_{t} φ) (x) : = E [φ (x (t)) | x (0) = x],

for

φ \in S, x \in Ω_{x}

, where the expectation is taken with respect to the Wiener measure associated with the driving noise process

{(W (t))}_{t \geq 0}

and

S \subseteq M (Ω_{x}, R)

denotes a set of test functions or observables, which is contained in

M (Ω_{x}, R)

, the set of all real valued measurable functions defined on the domain

Ω_{x}

. If the test function set

S

is chosen appropiately, (e.g.,

S = C_{b}^{\infty} (Ω_{x}, R)

, where

C_{b}^{\infty} (Ω_{x}, R)

denotes the set of all smooth bounded real valued functions defined on

Ω_{x}

) the action of

P_{t}

corresponds to the solution of an initial value problem, i.e.,

(P_{t} φ) (x) = u (x, t)

, where u solves

\frac{\partial}{\partial t} u (x, t) = L u (x, t), u (x, 0) = φ (x) .

(6)

The operator

L

, defined such that

(L φ) (x) = \lim_{τ \to 0} \frac{E [φ (x (τ)) | x (0) = x] - φ (x)}{τ},

for all

φ \in S

, is referred to as the infinitesimal generator. As the action of the operators

{(P_{t})}_{t \geq 0}

is given as the solution of the differential Equation (6), it is common to use the notation

P_{t} : = e^{t L}

. In the case of an Itô diffusion process (5), and with the regularity assumptions on the coefficients

a, B

stated above, it can be shown that the infinitesimal generator takes the form

L = a \cdot \nabla + \frac{1}{2} B^{T} B : \nabla^{2},

where “

:

” denotes the Frobenius product, i.e.,

B^{T} B : \nabla^{2} = \sum_{i, j = 1}^{n} {[B^{T} B]}_{i, j} \partial_{x_{i}} \partial_{x_{j}},

See, e.g., [24] for the case

S = C_{b}^{\infty} (Ω_{x}, R)

, and [25] for extensions from that core to more general test function sets such as the ones considered below.

It is often convenient to adopt a dual perspective by considering the evolution of the density of the law

μ_{t}

of

x (t)

in time. Let

μ_{t} (d x) = ρ (x, t) d x

, for

t \geq 0

, in the sense of distributions. The corresponding semigroup

{(P_{t}^{†})}_{t \geq 0}

of transfer operators is then naturally defined as the formal adjoint operators of

P_{t}

, i.e.,

\int_{Ω_{x}} (P_{t} φ) (x) ρ_{0} (x) d x = \int_{Ω_{x}} φ (x) P_{t}^{†} ρ_{0} (x) d x,

assuming

x (0) \sim μ_{0}

.

Similarly as for its adjoint, the action of

P_{t}^{†}

corresponds to the solution of an initial value problem, which in this case is known as the Fokker-Planck equation. More specifically,

\frac{\partial}{\partial t} ρ (x, t) = L^{†} ρ (x, t), ρ (x, 0) = ρ_{0} (x),

(7)

where

ρ (\cdot, t)

denotes the probability density of

μ_{t}

and

L^{†}

—the Fokker-Planck operator—can be shown to correspond to the

L^{2}

-adjoint of the infinitesimal generator

L

, i.e.,

L^{†} ρ = - \nabla \cdot (a ρ) + \nabla^{2} : (\frac{1}{2} B^{T} B ρ),

thus

e^{t L^{†}} = P_{t}^{†} .

2.2. Hypoellipticity and Existence of a Smooth Transition Kernel

Note that (7) is in general to be interpreted in a weak sense as, up to this point, we have not made any assumptions on the regularity of

ρ

. However, within the scope of this article it is sufficient to consider the case where the differential operator

\partial_{t} - L^{†}

is hypoelliptic. A differential operator A is said to be hypoelliptic, if for any g solving the differential equation

A g = f

, it follows that g is of higher regularity than f in the sense that

f \in H_{s}^{loc} \Rightarrow g \in H_{s + ϵ}^{loc},

with

ϵ > 0

, where

H_{s}^{loc}

denotes the local Sobolev space of order

s \in N

. This means that if

\partial_{t} - L^{†}

is hypoelliptic, then the solution of (7) is smooth in the sense that

ρ \in C^{\infty} (Ω_{x}, (0, \infty))

irrespective of the regularity of

ρ_{0}

. A common way to establish hypoellipticity of a differential operator is via Hörmander’s theorem ([26], Theorem 22.2.1, on p. 353):

Theorem 1.

Let A be a differential operator of the form

A = a_{0} \cdot \nabla + \sum_{i = 1}^{M} {(a_{i} \cdot \nabla)}^{†} (a_{i} \cdot \nabla),

where

a_{i}, 0 \leq i \leq M

are

C^{\infty}

vector fields in

R^{n}

and ^† indicates the formal

L^{2}

adjoint. Iteratively define a collection of vector fields by

V_{0} = {a_{i} : i \geq 0}, V_{k + 1} = V_{k} \cup {[v, a_{i}] : v \in V_{k}, 0 \leq i \leq M},

(8)

where

[X, Y] = (\nabla Y) X - (\nabla X) Y,

denotes the commutator of vector fields

X, Y \in C^{\infty} (Ω_{x}, R^{n})

and

(\nabla X), (\nabla Y)

their Jacobian matrices. If

\forall x \in R^{n}, lin \{v (x) : v \in ⋃_{k \in N} V_{k}\} = R^{n},

(9)

then the operator A is hypoelliptic.

The condition (9) is commonly referred to as Hörmander’s condition. In the particular case of

A = \partial_{t} - L^{†}

one can easily verify that (9) is exactly satisfied if

V_{0} = {B_{i} : i \geq 1}, V_{k + 1} = V_{k} \cup {[v, B_{i}] : v \in V_{k}, 0 \leq i \leq M} .

with

B_{0} = a

and

B_{i}

refers to the i-th column of the diffusion tensor

B

in (5). This particular version of Hörmander’s condition adapted to the parabolic PDE of the form (7) is referred to as the parabolic Hörmander condition.

The direct consequence of the parabolic Hörmander condition is the smoothness of the underlying transition kernel describing the evolution of the probability measure associated to the SDE.

2.3. Ergodicity and Convergence in Law

Let now

μ (d x) = ρ (x) d x

be a probability measure with density

ρ \in C^{2} (Ω_{x}, [0, \infty))

. When we say that

μ

is an invariant measure of the SDE (5) or that the latter preserves the probability measure

μ

, we mean that the probability density

ρ

is a stationary solution of the Fokker-Planck equation associated to the SDE, i.e.,

L^{†} ρ = 0 .

(10)

Note that for (10) to be well posed in a strong sense it is sufficient that

L

is hypoelliptic, i.e., the Fokker-Planck operator

L^{†}

satisfies the Hörmander condition. It is also straightforward to see that every convex combination of two invariant measures is again an invariant measure of the respective SDE which by definition means that the set of invariant measures of a particular SDE is convex.

Let for

t > 0

and some suitable observable

φ

,

{\bar{φ}}_{t} : = \frac{1}{t} \int_{0}^{t} φ (x (s)) d s,

denote the finite time average of

φ

evaluated along the path of the solution

x

of (5). Similarly, we write

μ [φ] : = \int_{Ω_{x}} φ (x) μ (d x) .

for the

μ

-weighted spatial average of

φ

. The process

x

is said to be ergodic with respect to the invariant probability measure

μ

if for all

φ \in L^{1} (μ)

and for almost all realizations of the Wiener process

W

, and

μ_{0}

-almost all initial values

x (0)

, trajectory averages coincide with expectations with respect to the measure

μ

in the asymptotic limit

t \to \infty

, i.e.,

\lim_{t \to \infty} {\bar{φ}}_{t} = μ [φ], a . s .

(11)

It can be shown (see [27]) that ergodicity follows if (i) there exists an invariant measure with positive smooth density and (ii)

\partial_{t} - L^{†}

is hypoelliptic.

Assume that

x (t)

converges in law towards a unique invariant measure

μ

, i.e.,

\lim_{t \to \infty} E [φ (x (t)) | x (0) = x] = μ [φ],

(12)

for all

φ \in C_{b}^{\infty} (Ω_{x}, R)

,

μ_{0}

-almost all

x \in Ω_{x}

. A common way to characterize the convergence of the expectation

E [φ (x (t)) | x (0) = x]

, or, in semi group notation the convergence of

e^{t L} φ (x)

to the

μ

-weighted average

μ [φ]

, is via functional decay estimates of the semi-group operators

e^{t L}

. For this purpose a set of test functions

S

is fixed and equipped with a norm

{∥ \cdot ∥}_{S}

such that

E : = (S, {∥ \cdot ∥}_{S})

forms a Banach-space. Of particular interest in this context is exponential convergence of

e^{t L} φ

towards

μ [φ]

in the respective norm, i.e.,

{∥ e^{t L} φ - μ [φ] ∥}_{S} \leq C e^{- κ t} {∥ φ - μ [φ] ∥}_{S},

(13)

where

C, κ

are positive constants, the latter corresponding to the spectral gap of the generator

L

in the functional space

E_{0} = (S_{0}, {∥ \cdot ∥}_{S})

, where

S_{0} \subseteq S

denotes the subset of test functions with vanishing mean, i.e.,

S_{0} = \{φ \in S : μ [φ] = 0\} .

Let the operator

Π

denote the orthogonal projection from

S

onto

S_{0}

, i.e.,

Π φ = φ - μ [φ], φ \in S .

Denote further by

{∥ A ∥}_{B (E)}

the operator norm

{∥ A ∥}_{B (E)} : = \sup_{\begin{matrix} φ \in E \\ φ \neq 0 \end{matrix}} \frac{{∥ A φ ∥}_{S}}{{∥ φ ∥}_{S}},

of an operator

A : E \to E

. Equation (13) implies that

e^{t L} Π

when considered as an operator on E is bounded in the operator norm

B (E)

as

{∥ e^{t L} Π ∥}_{B (E)} \leq C e^{- κ t} .

(14)

2.4. Finite-Time Averages and the Central Limit Theorem

Ergodicity of an SDE ensures that time averages of infinitely long trajectories almost surely coincide with spatial averages with respect to the target measure. However, ergodicity per se does not address the statistical properties of finite-time averages apart from the convergence of the time average

{\bar{φ}}_{t}

as

t \to \infty

. For practical applications, where for a given unique invariant measure, the aim is to approximate the

μ

-weighted average

μ [φ]

of a test function

φ \in S

, it is important that fluctuations of finite-time averages

{\bar{φ}}_{t}

(i.e., the Monte Carlo error in the finite-time approximations) around the infinite-time value

\lim_{t \to \infty} {\bar{φ}}_{t} = μ [φ]

can be quantified. For this purpose it typically regarded as necessary that a central limit theorem holds, i.e.,

\sqrt{t} ({\bar{φ}}_{t} - μ [φ]) \sim N (0, σ_{φ}^{2}), as t \to \infty,

(15)

where

σ_{φ}^{2}

is commonly referred to as the asymptotic variance of the observable

φ

. A sufficient condition for a central limit theorem of the form (15) to hold for the solution

x

of (5) and the observable

φ

is that the Poisson equation

- L Φ = Π φ,

(16)

has a solution which belongs to

L^{2} (μ)

(see [28]). The asymptotic variance then takes the form

σ_{φ}^{2} = - 2 \int_{Ω_{x}} (L^{- 1} Π φ) Π φ d μ .

Let

L_{0}^{2} (μ)

denote the subspace of

L^{2} (μ)

consisting of functions with vanishing mean. Note that for

Φ \in L^{2} (μ)

it is a priori not clear how to interpret (16) since only under additional regularity assumptions (16) can be interpreted in a weak sense. A common way to makes sense of (16) is by deriving bounds for the operator

L^{- 1}

in

B (E_{0})

where

E_{0}

is some subspace of

L_{0}^{2} (μ)

. If

L^{- 1}

is bounded in

B (E_{0})

, this then directly implies

Φ \in L^{2} (μ)

in (16) for

φ \in E_{0}

. The relationship between the spectral properties of the operator

L^{- 1} Π

and the convergence properties of the solution process

x

, or more specifically the decay properties of the semi-group operator

e^{t L} Π

as

t \to \infty

can be made more precise via the formal identity

L^{- 1} Π = - \int_{0}^{\infty} e^{t L} Π d t,

(17)

which follows from

- \int_{0}^{\infty} L e^{t L} φ d t = - \int_{0}^{\infty} (\frac{d}{d t} e^{t L} φ) d t = φ,

(18)

for

φ \in {ϕ \in S_{0} : L ϕ \in S_{0}}

. Using the identity (17), one directly finds that (14) is a sufficient condition for

L^{- 1} Π

to be bounded since

\begin{matrix} ∥ {L^{- 1} Π φ ∥}_{S} & = {∥ \int_{0}^{\infty} e^{t L} Π φ d t ∥}_{S} \\ \leq \int_{0}^{\infty} {∥ e^{t L} Π φ ∥}_{S} d t \\ \leq C \int_{0}^{\infty} e^{- κ t} ∥ {Π φ ∥}_{S} d t \\ \leq \frac{C}{κ} ∥ {Π φ ∥}_{S} . \end{matrix}

(19)

We conclude that the exponential decay (13) implies the central limit theorem (15) for

φ

in the corresponding function space

E_{0}

. Estimates for

E_{0} = H^{1} (μ) \cap L_{0}^{2} (μ)

can be obtained using the framework of hypocoercivity as presented in [29]. In [30] techniques are introduced to show the decay estimate (13) for

E_{0} = L_{0}^{2} (μ)

. In this article we will use Lyapunov function-based techniques which allow to show exponential convergence as in (13) in some weighted

L^{\infty}

spaces, which we specify in the next section.

2.5. Exponential Convergence in Weighted $L^{\infty}$ Spaces

Another way of deriving exponential decay estimates for the semi-group

{(e^{t L})}_{t \geq 0}

, which are sufficient to establish a central limit theorem for certain observables, is by means of well established Lyapunov techniques. These techniques have been formulated originally for discrete-time Markov processes/Markov chains [31,32,33] and have been subsequently extended to continuous time solutions processes of SDEs [22,23,34,35]. The function space on which decay estimates are shown in these references are Banach spaces of the form

(L_{K}^{\infty}, {∥ \cdot ∥}_{L_{K}^{\infty}})

, with

L_{K}^{\infty} : = \{φ measurable | \frac{φ}{K} \in L^{\infty} (Ω_{x})\},

(20)

where

K \in C^{2} (Ω_{x}, [1, \infty))

is a positive function, and

{∥ φ ∥}_{L_{K}^{\infty}} : = {∥ \frac{φ}{K} ∥}_{L^{\infty}} .

(21)

Exponential convergence in the sense of

{∥ e^{t L} φ - μ [φ] ∥}_{L_{K}^{\infty}} \leq C e^{- κ t} {∥ φ - μ [φ] ∥}_{L_{K}^{\infty}},

(22)

is typically shown provided that the following two assumptions are satisfied.

Assumption 1 (Infinitesimal Lyapunov condition).

There is a function

K \in C^{2} (Ω_{x}, [1, \infty))

with

\lim_{∥x∥ \to \infty} K (x) = \infty

, and real numbers

a \in (0, \infty), b \in (0, \infty)

such that,

L K \leq - a K + b .

(23)

Assumption 2.

For some

t > 0

there exists a constant

η \in (0, 1)

and a probability measure ν such that

\inf_{x \in C} e^{t L^{†}} δ_{x} (d y) \geq η ν (d y)

where

C = {x \in Ω_{x} : K (x) \leq K_{\max}}

for some

K_{\max} > 1 + 2 b / a,

where

a, b

are the same constants as in (23).

As outlined in [23,35], Assumption 2 follows if

L

satisfies the parabolic Hörmander condition and the SDE (5) is controllable in the sense that there is a

t > 0

such that for any pair

x_{-}, x_{+} \in C

, there exists a continuous control

u \in L^{1} ([0, t], Ω_{x})

, such that the solution

\tilde{x}

of the differential equation

\dot{\tilde{x}} = a (\tilde{x}) + B (\tilde{x}) u,

satisfies

\tilde{x} (0) = x_{-}

and

\tilde{x} (t) = x_{+}

. The following theorem is derived in [22] using results from [33]. Similar results can be found e.g., in [23,36].

Theorem 2.

Suppose that Assumptions 1 and 2 hold. Then, the solution of the SDE (5) admits a unique invariant probability measure μ, such that

\int_{Ω_{x}} K d μ < \infty .

(24)

Moreover, there exist

C > 0

and

κ > 0

such that (22) holds for all

φ \in L_{K}^{\infty} (Ω_{x})

and any

t \in [0, \infty)

.

Finally, let us demonstrate how decay estimates in spaces

L_{K}^{\infty}

can be used to derive a central limit theorem for certain functions. Let

V

be a Lyapunov function such that the conditions for Theorem 2 are satisfied for

K = V

. Note that if the conditions of Theorem 2 are also satisfied for

V^{2}

, then this implies that a central limit theorem holds for all observables

φ \in L_{V}^{\infty}

, since (24) being valid for

K = V^{2}

implies

L_{V}^{\infty} \subset L^{2} (μ) .

Thus, the inequality (19) for

S = L_{V^{2}}^{\infty}

again implies that the solution

Φ

of (16) is contained in

L^{2} (μ)

for

φ \in L_{V}^{\infty}

, so that by [28] indeed a central limit theorem of the form (15) holds for

φ \in L_{V}^{\infty}

. This motivates to show (22) for a wide class of Lyapunov functions. In the next section, we consider the case of the Langevin dynamics with position dependent coefficients (1) and (2) and show (22) for Lyapunov functions being of the form of polynomials of even but arbitrarily high degree.

3. Langevin Dynamics with Configuration-Dependent Diffusion

We now return to the examples from the introduction, specifically Langevin dynamics with space-dependent friction (1) and (2), and show that in case the variable friction tensor

Γ

is positive definite and the diffusion tensor

Σ

has full rank, at all points of the phase space, then the system satisfies the conditions described in the previous section for ergodicity and exponential decay estimates. As illustrations, we consider three applications: (i) a particle model with a temperature gradient; (ii) a simple 2-dimensional Langevin diffusion with a non-conservative force term; (iii) an illustrative Langevin dynamics model for of flocking/swarming, as commonly arises in studies of active particle systems.

3.1. Geometric Ergodicity of Langevin Dynamics With Space-Dependent Coefficients

The main theorem of this section generalizes well known results regarding the ergodicity properties and exponential decay estimates of the associated evolution operator for the underdamped Langevin with constant scalar friction and diffusion coefficient, such as presented in [23,35]. Specifically we extend existing results by

allowing the systematic force $F$ to be non-conservative,
explicitly considering the case of $Γ$ and $Σ$ being matrix-valued functions of $q$ .

In the current case, no assumption is made regarding a fluctuation-dissipation relation or the form of the invariant density. We will see that under certain conditions on the coefficients

Γ

and

Σ

and the non-conservative force

F

, which are detailed below, the proof of the extended ergodicity criterion for Langevin dynamics follows in a straightforward way from the corresponding proof of the constant-coefficient result [35]. These generalization are nonetheless of high practical relevance and allow us to conclude ergodicity for a wide range of relevant modelling applications.

Let, for a square matrix

A \in R^{n \times n}

,

σ (A) : = {λ \in R : \exists v \in R^{n}, v \neq 0, λ v = A v},

denote its spectrum. The following assumption on the spectrum of

Γ

and

Σ Σ^{T}

ensures the existence of a suitable Lyapunov function in the case these matrices are not constant in

q

.

Assumption 3.

(i): The spectrum $σ (Γ (q))$ is uniformly bounded in $q \in Ω_{q}$ from above and away from 0, i.e., there are positive constants $λ_{\min}, λ_{\max}$ , such that

$λ_{\min} \leq \inf_{q \in Ω_{q}} \min σ (Γ (q)),$

and

$\sup_{q \in Ω_{q}} \max σ (Γ (q)) \leq λ_{\max} .$
(ii): The diffusion matrix has full rank, i.e.,

$rank (Σ (q)) = n,$

for all $q \in Ω_{q}$ , and the spectrum of $Σ Σ^{T}$ is bounded in the sense that

$\exists \bar{σ} > 0 : \sup_{q \in Ω_{q}} \max σ (Σ (q) Σ^{T} (q)) \leq \bar{σ} .$

Obviously, Assumption 3 is automatically satisfied for

Ω_{q} = L T^{n}

as long as the coefficients

Γ, Σ

are smooth and

Γ (q)

is positive definite and

Σ (q)

has full rank at every point

q \in Ω_{q}

. The next assumption ensures the existence of a suitable Lyapunov function in the case of a non-conservative force. Again, it is trivially satisfied for

Ω_{q} = L T^{n}

and

F \in C^{\infty} (T^{n}, R)

.

Assumption 4.

There exists a potential function

U \in C^{2} (Ω_{q}, R)

with the following properties

(i): there exists $G \in R$ such that

$\forall q \in Ω_{q}, 〈 q, F (q) 〉 \leq - 〈 q, \nabla_{q} U (q) 〉 + G .$

for all $q \in Ω_{q}$ .
(ii): the potential function is bounded from below, i.e., there exists $u_{\min} > - \infty$ such that

$\forall q \in Ω_{q}, U (q) \geq u_{\min} .$
(iii): there exist constants $D, E > 0$ and $F \in R$ such that

$\forall q \in Ω_{q}, 〈 q, \nabla_{q} U (q) 〉 \geq D U (q) + E {∥ q ∥}_{2}^{2} + F .$

(25)

We point out that if

F

is a conservative force, then Assumption 4 reduces to the same asymptotic growth criteria commonly assumed in the derivation of geometric ergodicity of Langevin dynamics with constant coefficients

Γ

and

Σ

on an unbounded configurational domain,

Ω_{q} = R^{n}

(See again e.g., [23,35]).

Theorem 3.

In (1) and (2), let the force, the friction and diffusion tensors be smooth functions, i.e.,

F \in C^{\infty} (Ω_{q}, R^{n})

,

Γ \in C^{\infty} (Ω_{q}, R^{n \times n})

,

Σ \in C^{\infty} (Ω_{q}, R^{n \times m})

such that Assumptions 3 and 4 hold. There is a unique invariant probability measure

\tilde{μ} (d q, d p) : = \tilde{ρ} (q, p) d q d p,

with a

(F, Γ, Σ)

-dependent density

\tilde{ρ} \in C^{\infty} (Ω_{q} \times R^{n}, R_{+})

, and for all

l \in N

there are constants

κ_{l}, C_{l} > 0

so that

{∥ e^{t L} φ - \tilde{μ} [φ] ∥}_{L_{K_{l}}^{\infty}} \leq C_{l} e^{- t κ_{l}} {∥ φ - \tilde{μ} [φ] ∥}_{L_{K_{l}}^{\infty}},

(26)

for all

φ \in L_{K_{l}}^{\infty}

and

t \geq 0

, where

K_{l} (q, p) = {(〈 p, p 〉 + 1)}^{l},

(27)

in the case of

Ω_{q} = L T^{n}

and

K_{l} (q, p) = {(c U (q) + a 〈 q, q 〉 + 2 b 〈 q, p 〉 + \frac{c}{2} 〈 p, p 〉)}^{l},

(28)

with suitably chosen positive constants

a, b, c > 0

in the case of

Ω_{q} = R^{n}

. Furthermore, the probability measure

\tilde{μ}

is such, that

\int_{Ω_{q} \times R^{n}} K_{l} d \tilde{μ} < \infty .

Proof.

Lyapunov condition

We first show that Assumption 1 holds for

K_{l}

as defined in (27) and (28) in the respective setups. Note that

L = L_{H} + L_{O},

with

\begin{matrix} L_{H} & = p \cdot \nabla_{q} - \nabla_{q} U (q) \cdot \nabla_{p}, \\ L_{O} & = - p^{T} Γ (q) \cdot \nabla_{p} + \frac{1}{2} Σ^{T} (q) Σ (q) : \nabla_{q}^{2} . \end{matrix}

We first show the existence of constants

a_{l}, b_{l}

such that Assumption 1 is satisfied for

a = a_{l}, b = b_{l}

in the case

l = 1

. For

l \geq 2

, the existence of suitable constants

a_{l}, b_{l}

follows inductively.

By Assumption 4(i), it follows that

\begin{matrix} L_{H} K_{1} (q, p) & = 2 b 〈 q, F (q) 〉 + 2 a 〈 p, q 〉 + 2 b 〈 p, q 〉 \\ \leq - 2 b 〈 q, \nabla_{q} U (q) 〉 + G + 2 a 〈 p, q 〉 + 2 b 〈 p, q 〉 . \end{matrix}

Similarly, by Assumption 3(ii), we find

\begin{matrix} L_{O} K_{1} (q, p) & = - 2 b 〈 q, Γ (q) p 〉 - c 〈 p, Γ (q) p 〉 + c tr (Σ^{T} (q) Σ (q)) \\ \leq - 2 b 〈 q, Γ (q) p 〈 - c 〈 p, Γ (q) p 〉 + c n λ_{\max} . \end{matrix}

Putting both inequalities together thus yields

\begin{matrix} L K_{1} (q, p) & \leq - 2 b (D U (q) + {E ∥ q ∥}_{2}^{2} + F) + 2 (a + b) 〈 p, q 〉 - 2 b 〈 q, Γ (q) p 〉 - c 〈 p, Γ (q) p 〉 + c n λ_{\max} \\ = - 2 b D U (q) - {(\begin{matrix} q \\ p \end{matrix})}^{T} M (\begin{matrix} q \\ p \end{matrix}) - 2 b F + c n λ_{\max}, \end{matrix}

with

M = (\begin{matrix} E b I_{n} & b Γ (q) - a I_{n} \\ b Γ (q) - a I_{n} & - 2 b I_{n} + c Γ (q) \end{matrix}),

where we used Assumption 4(iii) in the inequality. The existence of suitable constants a and b such that Assumption 1 is satisfied for the case

K = K_{1}

, follows directly if the matrix

M

is positive definite. The positive definiteness of the block matrix

M

is implied if (See e.g., [37]), the matrices

- 2 b I_{n} + c Γ (q)

and the Schur complement

E b I_{n} - {(b Γ (q) - a I_{n})}^{T} {(- 2 b I_{n} + c Γ (q))}^{- 1} (b Γ (q) - a I_{n})

are both positive definite. Indeed, since the spectrum of

Γ

is uniformly bounded on

Ω_{q} = R^{n}

according to Assumption 3(i) the positive definiteness of both these matrices can be ensured by choosing a, b sufficiently small and c sufficiently large.

Minorization condition

A simple calculation shows that

Σ (q)

having rank n for all

q \in Ω_{q}

immediately implies that the SDE (1) and (2) satisfies the parabolic Hörmander condition. It therefore only remains to show that for arbitrary

T > 0

and

(q^{-}, p^{-}), (q^{+}, p^{+}) \in R^{2 n}

, the control problem

\begin{matrix} \frac{d q}{d t} & = p, \\ \frac{d p}{d t} & = F (q) - Γ (q) p + Σ (q) u, \\ subject to (q (0), p (0)) & = (q^{-}, p^{-}), (q (T), p (T)) = (q^{+}, p^{+}), \end{matrix}

(29)

has a continuous solution

u \in L^{1} ([0, T], R^{m})

. It is easy to verify that there exists a smooth path

\tilde{q} \in C^{2} ([0, T], R^{n})

such that

(\tilde{q} (0), \dot{\tilde{q}} (0)) = (q^{-}, p^{-}), (\tilde{q} (T), \dot{\tilde{q}} (T)) = (q^{+}, p^{+}) .

Rewrite (29) as a second order differential equation:

\ddot{q} = - \nabla_{q} U (q) - Γ (q) \dot{q} + Σ (q) u .

Since

rank (Σ (q)) = n

for all

q \in Ω_{q}

, there exists

Σ^{g} \in C^{\infty} (Ω_{q}, R^{m \times n})

such that

Σ (q) Σ^{g} (q) p = p,

for all

q, p \in R^{n}

, thus,

u (t) = Σ^{g} (\tilde{q} (t)) [\ddot{\tilde{q}} + \nabla_{q} U (\tilde{q} (t)) + Γ (\tilde{q} (t)) \dot{\tilde{q}} (t)]

(30)

is a solution of (29). ☐

Note that the case where

Γ

and

Σ

are constant was already studied in Mattingly et al. [35] and the proof of the above theorem resembles the structure of the proof therein.

In what follows we provide three examples of models whose ergodic properties can be characterised by the above Theorem 3.

3.2. Single-Particle System with Non-Conservative Force

As an example of a system with a non-conservative force term, which satisfies the condition of the above theorem we consider a Langevin equation of the form (1) and (2) where the force term

F

is of the form

F (q) = - \nabla_{q} U (q) + α J q,

(31)

where

J \neq 0

is a skew-symmetric matrix and

α \in R

. For

α \neq 0

the additional term

α J q

obviously does not correspond to the negative gradient of a smooth potential energy function, thus in this case the force (31) is indeed non-conservative. It is easy to verify that U satisfying the growth conditions (ii) and (iii) in Assumption 4 and

U (q) \in O ({∥ q ∥}^{2 + ϵ}),

with

ϵ > 0

as

∥ q ∥ \to \infty

implies (i) in Assumption 4. Therefore, as long as the remaining conditions on the coefficients

Γ

and

Σ

in Theorem 3 are satisfied, it follows from the same theorem that the respective (non-equilibrium) dynamics possesses an invariant measure,

μ_{α} (d q, d p) = ρ_{α} (q, p) d q d p

, to which it converges exponentially fast in

L_{K}^{\infty}

as specified above.

3.3. Multi-Particle Systems

In the remainder of this section we consider the application of Theorem 3 to two different types of particle systems, which can be seen as instances of the underdamped Langevin equation with non-constant coefficients. With some abuse of notation whenever a particle model is considered, let

q_{i} \in L T^{d}, i = 1, \dots, N

where

L > 0

and

p_{i} \in R^{d}, i = 1, \dots, N

denote the position and momentum vectors of the particles, respectively. If, on the other hand, we want to refer to the i-th entry of the vector

v

where

v = q

or

v = p

, we write

Π_{i} v

, where

Π_{i} : R^{n} \to R

with

i \in {1, 2, \dots, n}, n = N d

, denotes the operator which selects the i-th cartesian coordinate, i.e.,

Π_{i} x = e_{i} \cdot x

. Furthermore, if not stated otherwise, we will assume, that the force is conservative and corresponds to the gradient of a potential function

U (q)

, which is composed as the sum of smooth pair potentials

{\tilde{U}}_{i, j} \in C^{\infty} (R, R), 1 \leq j < i \leq N

., i.e.,

U (q) = \sum_{i = 1}^{N} \sum_{j = 1}^{i - 1} {\tilde{U}}_{i, j} (∥ q_{i} - q_{j} ∥) .

(32)

3.4. Particle System with Temperature Gradient

As a first application we consider an N-particle system with periodic boundary condition where the particles are coupled to a heat bath whose temperature varies depending on the position of the particle within the periodic simulation box. The system we consider in the following is of the form

\begin{matrix} {\dot{q}}_{i} & = - p_{i}, \\ {\dot{p}}_{i} & = - \nabla_{q_{i}} U (q) - γ p_{i} + \sqrt{2 γ {(β (q_{i}))}^{- 1}} {\dot{W}}_{i}, \end{matrix}

(33)

for

i = 1, \dots, N

, where

W_{i}, i = 1, \dots, N

, are independent white noise processes in

R^{d}

. We further assume that the friction coefficient

γ > 0

is a positive constant and

β

a smooth positive function on

L T^{d}

, i.e.,

β \in C^{\infty} (L T^{d}, R_{+})

. In light of (29) this corresponds to a constant diagonal friction matrix

\begin{matrix} Γ (q) & = γ I_{n}, \end{matrix}

and diffusion tensor of the form

Σ (q) = \sqrt{2 γ} β^{- \frac{1}{2}} (q) \otimes I_{d},

where

β^{- \frac{1}{2}} (q) : = diag {(\sqrt{β (q_{1})}, \dots, \sqrt{β (q_{N})})}^{- 1} .

The matrices

Γ

and

Σ

are clearly positive definite and invertible, hence by Theorem 3, there exists a unique invariant measure

μ_{γ, β}

to which the solution of (33) converges (exponentially fast) in law. Note that, even for this relatively simple generalization of the standard underdamped Langevin equation, the form of the invariant measure

μ_{γ, β}

depends nontrivially on

γ

and

β

, and one can in general not find an analytic solution of the corresponding stationary Fokker-Planck equation. As long as

β^{- 1}

is bounded from above and away from 0, and the potential U is modified so that Assumption 4 is satisfied (e.g., by adding a confining potential to U), it is easy to see that Theorem 3 applies also for the case

Ω_{q} = R^{n}

.

3.5. Stochastic Cucker-Smale Model

As a second application of Theorem 3, we consider a stochastic model for flocking which is a variant of the model presented in [4]. In fact, the primary model considered in that paper replaced the deterministic Cucker-Smale system [38], in which a collection of active particles interact with a configuration-dependent friction, by one in which the particles were additionally perturbed by bounded noise. An SDE model was presented there without analysis. A subsequent paper [5] provided numerical evidence for flocking states. In these papers, the SDE approach consists of a Langevin-type model with coordinate-dependent friction and additive (typically uniform) white noise, that is they take the form

\begin{matrix} \frac{d q}{d t} & = & M^{- 1} p, \end{matrix}

(34)

\begin{matrix} \frac{d p}{d t} & = & - \nabla_{q} U (q) - Γ (q) p + σ I_{N d} \dot{W} (t) . \end{matrix}

(35)

For a system of particles moving in one dimension, one has a friction matrix

Γ (q) = {[γ_{i j} (q)]}_{1 \leq i, j \leq N} \otimes I_{d}

, where

γ_{i j} (q) = - ψ (∥ q_{j} - q_{i} ∥), i \neq j

(36)

and

γ_{i i} = - \sum_{j \neq i} γ_{i j},

(37)

for some given scalar kernel function

ψ (r) \geq 0

. That is, the matrix

Γ (q)

is a (weighted) graph Laplacian which reflects the interaction structure of the problem. If

ψ (r) > 0

, this interaction graph is complete.

The inclusion of conservative forces derived from a potential energy function U is an addition to the models mentioned above and allows to incorporate direct attraction and repulsion effects. Note that since

{[γ_{i j}]}_{1 \leq i, j \leq N}

is of the form of a graph Laplacian, it is positive semi-definite with at least one eigenvalue being singular. If we assume that

ψ (r) > 0

, then it follows that

{[γ_{i j}]}_{1 \leq i, j \leq N}

possesses exactly one singular eigenvector,

1_{N} \in R^{N}

, i.e., the vector whose entries all are equal to 1. Consequently,

Γ (q)

, has exactly d singular eigenvalues, each of them being of the form

u_{l} = 1_{N} \otimes e_{l}

, where

e_{l}

denotes the l-th canonical vector in

R^{d}

. These singular eigenvectors can be associated with the mean momentum

\bar{p} = N^{- 1} \sum_{i = 1}^{N} p_{i},

of the collection of particles via the relation

{\bar{p}}_{l} = N^{- 1} u_{l}^{T} p,

for

l \in {1, \dots, d}

. There are several issues regarding the model (34) and (35). Most importantly, while the diffusion matrix in (35) has full rank, the friction tensor

Γ (q)

is only of rank

(N - 1) d

, which means that in directions of the singular vectors

u_{l}, 1 \leq l \leq d

there is no dissipation and therefore the kinetic energy of the system will be unbounded as

t \to \infty

, and hence one cannot expect (34) and (35) to possess an invariant measure. More specifically, if U is composed of pair potentials one can show that the mean momentum

\bar{p}

is a Brownian motion,

\dot{\bar{p}} = {[u_{l}]}_{1 \leq l \leq d}^{T} \dot{W} .

3.5.1. Regularized Stochastic Cucker-Smale Dynamics

A simple fix to the model (34) and (35), which ensures ergodicity of the dynamics, is to add additional dissipation, which is uniform in each component of

p

. This can be achieved by replacing the friction tensor in (35) by

Γ_{ϵ} (q)

, which for

ϵ > 0

is defined such that

Γ_{ϵ} (q) = Γ (q) + ϵ I_{N d} .

It follows directly from the Gershgorin circle theorem that

Γ_{ϵ} (q)

is positive definite for all

q \in Ω_{q}

and any choice of

ϵ > 0

.

3.5.2. Modified Stochastic Cucker-Smale Dynamics

While the above described regularized stochastic Cucker-Smale dynamics is a valid extension of the original model which ensures that the conditions of Theorem 3 are satisfied and hence geometric ergodicity for (34) and (35) holds, the form of the corresponding invariant measure does depend in a non-trivial way on

Γ, σ

and

ϵ

and unless

Γ

is constant in

q

, one cannot easily find a closed form of the invariant measure. We therefore propose another novel modification of (34) and (35), which is geometrically ergodic with an invariant measure of closed form. We construct this model as a superposition, i.e.,

x = x_{⊥} + x_{| |}

, of two independent stochastic processes,

x_{⊥} = (q_{⊥}, p_{⊥})

and

x_{| |} = (q_{| |}, p_{| |})

taking both values in

Ω_{x} \times R^{n}

, respectively. We construct these processes such that the parametrization of the former process determines the statistical properties of the stochastic inter-particle interactions and the parametrization of the latter process the collective motion of the flock, i.e., the diffusive properties of the centre of mass. More specifically, we construct

x_{⊥}

as the solution of the SDE

\begin{matrix} \frac{d q_{⊥}}{d t} & = p_{⊥}, \\ \frac{d p_{⊥}}{d t} & = - \nabla_{q} U (q_{⊥}) - γ_{⊥} Γ_{⊥} (q_{⊥}) p_{⊥} + \sqrt{2 γ_{⊥} T_{⊥}} Σ_{⊥} (q_{⊥}) \dot{W} (t) . \end{matrix}

(38)

and

x_{| |}

as the solution of

\begin{matrix} \frac{d q_{| |}}{d t} & = p_{| |}, \\ \frac{d p_{| |}}{d t} & = - γ_{| |} Γ_{| |} p_{| |} + \sqrt{2 γ_{| |} T_{| |}} Σ_{| |} \dot{W} (t) . \end{matrix}

(39)

where

γ_{⊥}, T_{⊥}, γ_{| |}, T_{| |} \geq 0

and

W

as specified in (3) and (4). We refer to (38) as the equation of the peculiar dynamics and to (39) as the equation of the consensus dynamics. This naming is motivated by the following choices of the respective friction tensor and diffusion matrix.

Peculiar dynamics:
As in (34) and (35), we choose

$Γ_{⊥} (q) = {[γ_{i j} (q)]}_{1 \leq i, j \leq N} \otimes I_{d},$

(40)

where $γ_{i j}$ are defined as above in (36) and (37). By choosing the diffusion matrix $Σ_{⊥}$ such that

$Γ_{⊥} (q) = Σ_{⊥} (q) Σ_{⊥}^{T} (q),$

we ensure that the total momentum in each dimension of the physical domain remains constant, i.e.,

$\frac{d}{d t} \sum_{j = 1} {p_{⊥}}_{j} = 0 .$

(41)

This follows directly from the fact, that

$u_{l} \in \ker (Γ_{⊥} (q)) = \ker (Σ_{⊥} (q)), l \in {1, \dots, d} .$

as well as

$u_{l} \cdot \nabla_{q} U (q_{⊥}) = 0,$

(42)

(since U is composed of pair potentials) and therefore by (38),

$u_{l} \cdot \frac{d}{d t} p_{⊥} = 0, l \in {0, \dots, d},$

hence

$\frac{d}{d t} \sum_{j = 1}^{N} {p_{⊥}}_{j} = \sum_{l = 1}^{d} e_{l} u_{l} \cdot \frac{d}{d t} p_{⊥} = 0 .$

(43)
Consensus dynamics:
We construct the matrix $Γ_{| |}$ such that the difference in the momenta of all particle pairs remains constant under the dynamics (39), i.e.,

$\frac{d}{d t} ({p_{| |}}_{j} - {p_{| |}}_{i}) = 0,$

for all $1 \leq i, j \leq N$ . We achieve that by choosing $Γ_{| |}$ as

$Γ_{| |} = I_{N} \otimes I_{d} = {[(i + j) \mod d]}_{1 \leq i, j \leq N d},$

(44)

and $Σ_{| |}$ such that

$Γ_{| |} = Σ_{| |} Σ_{| |}^{T},$

i.e.,

$Σ_{| |} = N^{- 1 / 2} I_{N} \otimes I_{d} .$
Combined dynamics:
We first observe that although the processes $x_{⊥}$ and $x_{| |}$ are driven by the same Wiener process $W$ they are indeed independent. This follows since the column vectors of $Γ_{⊥} (q)$ are orthogonal to the column vectors of $Γ_{| |}$ in the sense that

$\ker (Γ_{⊥} (q)) = span (Γ_{| |}),$

i.e.,

$Γ_{⊥} (q) Γ_{| |}^{T} = Γ_{| |} Γ_{⊥}^{T} (q) = 0,$

(45)

for all $q \in Ω_{q}$ , and hence also

$Σ_{⊥} (q) Σ_{| |}^{T} = Σ_{| |} Σ_{⊥}^{T} (q) = 0,$

(46)

so that $Σ_{⊥} (q) W$ and $Σ_{| |} W$ are independent processes, which implies that also the solution processes of the respective SDEs are independent. Moreover, since U is composed of pair-potentials, we have

$U (q_{⊥} + q_{| |}) = U (q_{⊥}) .$

(47)

Using (45)–(47) it directly follows that $x = x_{⊥} + x_{| |}$ can be identified with the solution of (1) and (2) with

$Γ (q) = γ_{⊥} Γ_{⊥} (q) + γ_{| |} Γ_{| |}, γ_{⊥} \geq 0, γ_{| |} \geq 0,$

(48)

with

$Γ_{⊥} (q) = {[γ_{i j} (q)]}_{1 \leq i, j \leq N} \otimes I_{d},$

(49)

and

$Σ (q) = \sqrt{2 γ_{⊥} T_{⊥}} Σ_{⊥} (q) + \sqrt{2 γ_{| |} T_{| |}} Σ_{| |}, T_{⊥} > 0, T_{| |} > 0 .$

(50)

or more explicitly with the solution of

$\begin{matrix} \frac{d q}{d t} & = p, \\ \frac{d p}{d t} & = - \nabla_{q} U (q) - γ_{⊥} Γ_{⊥} (q) p - γ_{| |} Γ_{| |} p + [\sqrt{2 γ_{⊥} T_{⊥}} Σ_{⊥} (q) + \sqrt{2 γ_{| |} T_{| |}} Σ_{| |}] \dot{W} (t) . \end{matrix}$

(51)

Remark 1.

We note that the above choice of

Γ_{⊥} (q)

and

Σ_{⊥} (q)

is very similar to the friction tensor and diffusion tensor in dissipative particle dynamics. In fact (38) would exactly correspond to a DPD system, if instead of (40) one constructs the friction tensor such that dissipation is aligned with the relative orientation of particle pairs

Γ_{⊥} (q) = {[γ_{i j} (q) {\hat{q}}_{i, j} \otimes {\hat{q}}_{i, j}]}_{1 \leq i, j \leq N} .

with

{\hat{q}}_{i, j} = (q_{j} - q_{i}) / ∥ q_{j} - q_{i} ∥ .

Proposition 1.

Let

Γ (q)

and

Σ (q)

defined as above in (49) and (50), with

T_{⊥} > 0, T_{| |} > 0

. Let further

Ω_{q} = T^{N \times d}

and U be of the form (32), then the SDE (51) possesses an invariant measure

μ_{T_{⊥}, T_{| |}} (d q d p) = ρ_{T_{⊥}, T_{| |}} (q, p) d q d p

of the form

ρ_{T_{⊥}, T_{| |}} (q, p) = \frac{1}{Z} e^{- [T_{⊥}^{- 1} U (q) + \frac{1}{2} p^{T} C^{- 1} p]}

(52)

with covariance matrix

C = \frac{1}{N} [T_{| |} 1_{N} + T_{⊥} L] \otimes I_{d}

where

1_{N} \in R^{N \times N}

is the all-ones matrix, i.e., every entry of

1_{N}

is equal to 1. The matrix

L \in R^{N \times N}

with

L_{i, j} = \{\begin{matrix} N - 1, & i = j, \\ - 1, & i \neq j \end{matrix}

denotes the graph Laplacian of a fully connected graph. In particular,

E_{μ} [(p_{j} - p_{i}) \otimes (p_{j} - p_{i})] = 2 T_{⊥} I_{d}

(53)

where

1 \leq i < j \leq N

, and for

\bar{p} : = N^{- 1} \sum_{i = 1}^{N} p_{i} \in R^{d}

,

E_{μ} [\bar{p} \otimes \bar{p}] = T_{| |} I_{d}

(54)

where

1 \leq j \leq d

.

Proof.

We show that (52) is a stationary solution of the Fokker-Planck equation associated with the SDE (51), i.e.,

L^{†} ρ_{T_{⊥}, T_{| |}} = 0,

(55)

with

L^{†} = L_{H}^{†} + γ_{⊥} L_{O ⊥}^{†} + γ_{| |} L_{O | |},

where the action of each of these operators applied to

ρ \in C^{2} (Ω_{q}, R)

is given as

\begin{matrix} L_{H}^{†} ρ & = [- p \cdot \nabla_{q} + \nabla_{p} \cdot \nabla_{q} U (q)] ρ, \\ L_{O ⊥}^{†} ρ & = \nabla_{p} \cdot (Γ_{⊥} p ρ) + \nabla_{p}^{2} : T_{⊥} Γ_{⊥} (q) ρ, \\ L_{O | |}^{†} ρ & = \nabla_{p} \cdot (Γ_{| |} p ρ) + \nabla_{p}^{2} : T_{| |} Γ_{| |} ρ . \end{matrix}

Before we show (55), we first note that since

\ker (Γ_{⊥} (q)) = span (Γ_{| |}),

for all

q \in Ω_{q}

, we have

Σ_{⊥} (q) Σ_{| |}^{T} = Σ_{| |} Σ_{⊥}^{T} (q) = 0,

hence

\begin{matrix} Σ (q) Σ^{T} (q) & = γ_{⊥} T_{⊥} Σ_{⊥} (q) Σ_{⊥}^{T} (q) + γ_{| |} T_{| |} Σ_{| |} Σ_{| |}^{T} \\ = γ_{⊥} T_{⊥} Γ_{⊥} (q) + γ_{| |} T_{| |} Γ_{| |} . \end{matrix}

(56)

Furthermore,

C^{- 1} = N^{- 1} [T_{| |}^{- 1} 1_{N} + T_{⊥}^{- 1} L] \otimes I_{d},

which implies

(i): $C^{- 1} p = T_{| |}^{- 1} p,$ for $p \in span (Γ_{| |})$ ,
(ii): $C^{- 1} p = T_{⊥}^{- 1} p,$ for $p \in span (Γ_{⊥})$ .

Finally, since

U (q)

is composed of pair potential functions, we have

U (q) = U ([c 1_{N} + N^{- 1} L] \otimes I_{d} q),

for all

c \in R

, hence in particular

\begin{matrix} T_{⊥}^{- 1} \nabla_{q} U (q) & = T_{⊥}^{- 1} \nabla_{q} U ([\frac{T_{⊥}}{T_{| |}} 1_{N} + N^{- 1} L] \otimes I_{d} q) \end{matrix}

(57)

Given the above identities, we conclude

\begin{matrix} L_{H} ρ_{T_{⊥}, T_{| |}} & = [T_{⊥}^{- 1} p \cdot \nabla_{q} U (q) - \nabla_{q} U (q) \cdot C^{- 1} p] ρ_{T_{⊥}, T_{| |}} & = 0 \end{matrix}

due to (57), and

\begin{matrix} L_{O ⊥}^{†} ρ_{T_{⊥}, T_{| |}} & = 0, \\ L_{O | |}^{†} ρ_{T_{⊥}, T_{| |}} & = 0, \end{matrix}

due to (ii) and (i), respectively. ☐

Corollary 1.

Let

ψ (r) > 0

in the definition (36) of

γ_{i j}

, furthermore

γ_{⊥} > 0

and

γ_{| |} > 0

. The invariant measure

μ_{T_{⊥}, T_{| |}}

specified in Proposition 1 is unique and the law of (51) converges exponentially fast towards

μ_{T_{⊥}, T_{| |}}

in the sense of (26) with

K_{l}, l \in N

as constructed in the proof of Theorem 3.

4. Numerical Discretization

We next describe the construction of numerical methods for the Langevin system. The discretization schemes described here are based on a general splitting framework as explained in the introduction, adapted for the variable coefficient structure.

Following [15], we break the Langevin system (1) and (2) into three parts: A and B corresponding to the deterministic flow:

L_{A} f = p \cdot M^{- 1} \nabla_{q} f

and

L_{B} f = F \cdot \nabla_{p} f,

and O, which is associated to the isolated momenta diffusion process defined by an Ornstein-Uhlenbeck type equation, i.e., the stochastic system

\dot{q} = 0, \dot{p} = - Γ (q) p + Σ (q) \dot{W} (t) .

(58)

A stochastic splitting method is then obtained by concatenating the (stochastic) flow maps corresponding to the B, A and O part. Although other decompositions can be used for splitting, experience in the constant coefficient case has shown that it is beneficial to maintain the same balance of noise and dissipation of the original equations by keeping these terms together. Among stochastic splitting schemes based on this decomposition, the symmetric integration sequence BAOAB was observed to yield a lower discretization bias for ergodic averages in comparison to other integration sequences requiring only one evaluation of the gradient

\nabla_{q} U

[16]; moreover, in the case of constant coefficients, this integration sequence has been shown to yield a superconvergent (4th order) error in configurational (

q

-dependent) observables [15].

In the case of non-constant coefficients

Γ (q), Σ (q)

, exact solution of the O-step can be computationally costly. More specifically, bearing in mind that

q (t) \equiv q

is fixed during the isolated Ornstein Uhlenbeck process (58), a time-step

Δ t > 0

can be written:

p (t + Δ t) = G_{t} p (t) + S_{t} R,

(59)

where

R \sim N (0, I_{N d})

, and

G_{t} = e^{- Δ t Γ (q)} .

(60)

The matrix

S_{t} \in R^{N d \times N d}

is related to

G_{t}

as

S_{t} S_{t}^{T} = C_{t} - G_{t} C_{t} G_{t}^{T},

(61)

where

C_{t}

is the solution of the Lyapunov equation

Γ (q) C_{t} + C_{t} Γ^{T} (q) = Σ (q) Σ^{T} (q) .

(62)

This means that in order to compute an exact solution of the O-part as an (59), first the Lyapunov Equation (62) has to solved, followed by the computation of the Cholesky decomposition (61) and matrix exponential (60). Each of these operations is without any additional assumptions on the structure of the matrices

Γ (q)

and

Σ (q)

of computational complexity

O (N^{3} d^{3})

. We circumvent these computations by instead integrating the O-part using a numerical method. We construct a symmetric splitting method based on the decomposition

\dot{p} = \underset{= : D}{\underset{︸}{- Γ (q) p}} + \underset{= : F}{\underset{︸}{Σ (q) \dot{W}}},

and use the integration sequence DFD, which results in an update of the form

p \leftarrow G^{2} p + \sqrt{Δ t} G Σ (q) R, R \sim N (0, I_{n}),

(63)

with

G = \exp (- \frac{Δ t}{2} Γ (q)) .

In order to avoid large numerical errors induced by this approximation we employ a multiple timestepping approach [17], meaning that instead of executing a single integration step of (63) for a full time step, we repeat for

K \in N

, the integrations step (63) K-times using a step size

Δ t / K

.

We refer this variant of the BAOAB integration scheme using a multiple time stepping solution of the O-part as “multiple time stepping BAOAB” (m-BAOAB). We provide the implementation of this method in Algorithm 1. The performance evaluation of schemes such as this can be demanding as there is a trade off between computational costs and accuracy of the approximation of the solution of the Ornstein-Uhlenbeck process, which in turn affects the accuracy of the invariant measure of the numerical method. The parameter K in the m-BAOAB algorithm allows control of the accuracy with which the OU process is resolved. The method was tested in the example of Section 5.3, below, but the detailed numerical analysis of this method will be taken up in a forthcoming article.

Algorithm 1 m-BAOAB

1:: INPUT $(q, p), Γ, Σ, Δ t, K$
2:: $q \leftarrow q + (Δ t / 2) p$
3:: $p \leftarrow p + (Δ t / 2) F (q)$
4:: $G \leftarrow \exp (- \frac{Δ t}{2 K} Γ (q))$
5:: for $i = 1$ to K do
6:: $p \leftarrow G^{2} p + G \sqrt{Δ t / K} Σ (q) R_{i}, R_{i} \sim N (0, I_{n})$
7:: end for
8:: $p \leftarrow p + (Δ t / 2) F (q)$
9:: $q \leftarrow q + (Δ t / 2) p$
10:: return $(q, p)$

5. Numerical Experiments

In this section we consider three systems that fit the form of (1) and (2). In the first, we consider a system of particles diffusing in a temperature gradient. The second model is a two-dimensional example incorporating a nonconservative stirring force. The third example consists of a particle flocking model of the stochastic Cucker-Smale type, as discussed in Section 3. In all of these examples, the model is sufficiently complicated that we do not possess an analytical solution for the nonequilibrium steady state. For this reason, we are only able to argue for the correctness of the numerical results based on qualitative features (or from our understanding of an unperturbed equilibrium model, in the second example). We are also able to explore the robustness of the numerical solution obtained with respect to variation of the initial conditions, which illustrates the ergodic property. Ultimately comparisons need to be performed with respect to laboratory experiments which can assess the efficacy of the modelling approach in particular situations.

The simulation of the stochastic Cucker-Smale model required the above introduced m-BAOAB scheme for efficient computation. The second model relies on constant (scalar) friction and diffusion coefficient. Due to the diagonal structure of

Γ

and

Σ

in the first model, the additional computational costs incurred for the computation of the coefficients in (59) are minimal.

5.1. Particle System with Temperature Gradient

We first study a simple particle system with a position dependent temperature parameter as described in Section 3.4, comprising

N = 64

particles on a two dimension torus, i.e.,

q_{i} \in L T^{2}

with

L = 5

, which are heated by a source at the center of the simulation box. The heat source is modeled by choosing the position-dependent heat bath temperature as

{(β (q_{i}))}^{- 1} = Ψ (| q_{i} - c_{L} |),

where

c_{L} = \frac{1}{2} (L, L)

is the center point of the simulation box and

Ψ

is a smooth bump function of the form

Ψ (r) = \{\begin{matrix} T_{\min} + (T_{\max} - T_{\min}) \exp (- \frac{1}{1 - {(r / r_{\max})}^{2}}), & r \leq r_{\max} \\ T_{\min}, & r > r_{\max} \end{matrix}

(64)

The constant

T_{\max} > 0

corresponds to the maximum heat bath temperature at the center point

c_{L}

of the simulation box and

T_{\min} > 0

describes the heat bath temperature outside of the disk

B_{r_{\max}} (c_{L}) : = {q \in L T^{2} : | q - c_{L} | \leq r_{\max}} .

The potential function U is modeled as the sum of pairwise potentials, i.e.,

U (q) = \sum_{i = 1}^{N - 1} \sum_{j = 1}^{i - 1} ω (r_{i, j})

with

r_{i, j} : = | q_{j} - q_{i} |

and

ω

is a simple repulsive soft potential of the form

ω (r_{i, j}) = \{\begin{matrix} \frac{k}{2} {(r_{i, j} - c_{r})}^{2}, & r_{i, j} < c_{r} \\ 0, & r_{i, j} \geq c_{r} . \end{matrix}

where k and

c_{r}

are positive constants. The pair interaction described by

ω

corresponds to a harmonic spring of stiffness k and rest length

c_{r}

. Particle systems involving this type of pair potential are commonly used as benchmark systems in the context of dissipative particle dynamics [39,40]. Due to the isolated jump discontinuity in the second derivative of the potential Theorem 3 does not strictly speaking, apply in this case. Although it would be easy to modify the potential to have any desired level of smoothness, we expect, based on our experience with molecular dynamics problems where similar such issues arise, that the results will be very similar to those obtained with the potential given here. The simulation results reported in the remainder of this section are obtained for a parameterization of the model where

k = 25, c_{r} = 1

and

r_{\max} = 1

with

T_{\min} = 1 / 10

and

T_{\max} = 4

. The particle positions and momenta were initialized on an equidistant grid such that

q_{i} = \frac{L}{N} (⌊ \frac{i}{N} ⌋, i \mod N)

and

p_{i} = (0, 0)

, respectively.

N_{t} = 10^{5}

timesteps of stepsize

h = 2 \times 10^{- 2}

were simulated with varying values

γ \in {10^{i}, i = - 1, 0, 1, 2}

of the friction coefficient. Define by

β_{γ}^{- 1} (r) : = E (p_{i}^{2} | | q_{i} - c_{L} | = r)

the effective temperature of the system at distance

r \in [0, L / 2)

from the center

c_{L}

of the simulation box. Figure 1B, shows estimates of this function for different values of

γ

calculated at points

r_{i} = Δ_{r} / 2 + i Δ_{r}, i = 0, \dots, ⌊ \frac{L}{2 Δ_{r}} ⌋

, as

{\hat{β}}_{γ}^{- 1} (r_{i}) = \frac{c_{i}}{N_{i}} \sum_{j = 1}^{N} \sum_{k = 1}^{N_{t}} 𝟙_{[r_{i} - Δ_{r} / 2, r_{i} + Δ_{r} / 2)} (| q_{j}^{(k)} - c_{L} |) \frac{1}{2} ({(p_{j, 1}^{(k)})}^{2} + {(p_{j, 2}^{(k)})}^{2}),

(65)

with

c_{i} : = \frac{1}{[{(r_{i} + Δ_{r} / 2)}^{2} - {(r_{i} - Δ_{r} / 2)}^{2}]} {(\sum_{i = 1}^{⌊ \frac{L}{2 Δ_{r}} ⌋} \frac{1}{{(r_{i} + Δ_{r} / 2)}^{2} - {(r_{i} - Δ_{r} / 2)}^{2}})}^{- 1},

and

N_{i} = \sum_{j = 1}^{N} \sum_{k = 1}^{N_{t}} 𝟙_{[r_{i} - Δ_{r} / 2, r_{i} + Δ_{r} / 2)} (| q_{j}^{(k)} - c_{L} |) .

A larger value of

γ

leads to a tighter local coupling of the particle system with the heat bath and hence for large friction

γ = 10^{2}

, the estimates of the effective temperature coincide very well with the heat bath temperature given by

Ψ

at the respective distances from the center of the simulation box. For smaller friction values the position dependence of the effective temperature is less apparent and for very small friction values, e.g.,

γ = 10^{- 2}

, it is nearly lost entirely. At the same time the decay of the temperature gradient for smaller friction values leads to an increase in the temperature outside the heat source region. Indication of this effect can be also found in the estimated radial density function (see Figure 1A). For small values of the friction coefficient the increased temperature leads to a stronger fluctuation of inter-particle distances outside the heat source area, and hence minima and maxima in the density of radial distribution are less apparent in comparison to the radial distributions for systems with a high friction value.

Due to the relatively small size of the heat source area the number of particles in this area is small in comparison to the number of particles outside of the heat source area and hence the shape of the radial density is mainly determined by the statistical properties of the particles outside the heat source area. The empirical particle densities shown in Figure 2 provide further insight into how the value of the friction coefficient affects the properties of

μ_{γ, β}

. For a small friction coefficient, i.e.,

γ = 10^{- 2}

the particle density is very close to uniform on

L T^{2}

. If the heat bath temperature was constant in

L T^{2}

one would expect a uniform density as the potential function U is solely composed of pair potentials and thus translation-invariant in the coordinates of the particle density. More precisely, let

q_{i, 1}

and

q_{i, 2}

denote the first and second coordinate value of the i-th particle, then

U (q) = U ({(q_{i, 1} + a_{1}, q_{i, 2} + a_{2})}_{1 \leq i \leq N}),

for all

a_{1}, a_{2} \in R

. Therefore, the observed uniform particle density for

γ = 10^{- 2}

is consistent with the observation that the position dependency of the effective temperature vanishes for this friction value. While for the considered trajectory length/sample size the plot of the invariant for a friction coefficient

γ = 10^{2}

is very noisy, it still strongly suggests that the translation invariance of the particle density, i.e., the uniformity of the particle density, is broken in this regime. Within the interior of the heat source region particles are distributed approximately uniformly whereas close to the boundary of the heat source region the particle density is increased. Outside the heat source region the particle density is concentrated around positions close to energy minima of the potential energy function U.

5.2. System with Non-Conservative Force

We next consider an instance of a non-equilibrium system with a non-conservative force of a form as outlined in Section 3.2. More specifically, we let

Ω_{q} = R^{n}

with

n = 2

and let the force

F

be of the form (31), i.e.,

F (q) = - \nabla_{q} U (q) + α J q,

with

U (q) = \sum_{i = 1}^{2} {(q_{i}^{2} - 1)}^{2},

and

J : = \frac{3}{2} (\begin{matrix} 0 & 1 \\ - 1 & 0 \end{matrix}) .

We furthermore assume that the system is driven by a standard Langevin diffusion, i.e.,

Γ = γ I_{n},

and

Σ = \sqrt{2 γ β^{- 1}} I_{n},

where we choose

γ = 1

, and

β = 1

. This means, that without the non-conservative force part, i.e., in the case

α = 0

, the system considered resembles to a particle moving in a 4-well potential driven by a standard Langevin equation at equilibrium at unit temperature. The non-conservative force part

α J q

, corresponds to a stirring force, which pushes the system radially and in clockwise direction around the origin. The effect of the stirring force can be seen in Figure 3. In the absence of the stirring force (see Figure 3A) the invariant distribution is exactly the canonical distribution, i.e.,

ρ_{0} (q, p) \propto \exp (- U (q) - \frac{1}{2} p \cdot p) .

The modes of the marginal density in

q

are exactly positioned at the energy minima of U at

(\pm 1, \pm 1)

. Estimates of the mean momentum vectorfield

\bar{p} (q) : = \int_{Ω_{q}} p ρ_{α} (q, p) d p,

(66)

vanish at each point in the configurational phase space. In the presence of the stirring force (see Figure 3B), the invariant density is rotated slightly and smeared out over the energy barriers. The mean momentum resembles a vector field spiralling clockwise around the origin.

5.3. Stochastic Cucker-Smale Model

In this section we present simulation results for a modified stochastic Cucker-Smale model as described in Section 3.5.2. In particular, we demonstrate that the peculiar and consensus temperature as well as the consensus diffusion rate can be controlled independently and match in simulation with the analytically derived values, if the friction tensor

Γ

and the diffusion tensor

Σ

are suitably parametrized. Apart from these macroscopic quantities, we also demonstrate the effect of different parameter values of

γ_{⊥}, γ_{| |}, T_{⊥}, T_{| |}

on other macroscopic quantities which in general do not possess a closed form, such as the radial density of particles and peculiar velocity autocorrelation function. Several animations of the particle motion, which effect of parameter choices, may be viewed at the following web location: https://github.com/MatthiasSachs/StochasticCuckerSmale.git.

5.3.1. Model Parametrization

If not stated otherwise the results presented in this section were obtained from a system of 64 particles in a periodic simulation box of edge length 5 and dimension 2, i.e.,

Ω_{q} = L T^{N d}

with

d = 2, L = 5

and

N = 64

. We assume the particle pair-potentials

{\tilde{U}}_{i, j}

in the definition of U to be identical Morse potentials, i.e., for

i \neq j

{\tilde{U}}_{i, j} (r) = D {(1 - e^{- a (r - c_{r})})}^{2}, r > 0,

with

D = 1, a = 1, c_{r} = 1 / 2

. Following [4] we choose the functions

ψ

in the definition of

γ_{i j}

in (36) to be of the form

ψ (r) = \frac{K}{1 + {∥ r ∥}^{α}},

with

K = 1 / 10

, and

α = 6

.

5.3.2. Independent Control of Peculiar and Consensus Temperature

We first demonstrate that as stated in Proposition 1, we can indeed control the peculiar and the consensus temperature independently, such that these quantities coincide with the values of the model parameters

T_{| |}

and

T_{⊥}

, respectively, i.e., we show that the identities (53) and (54) are reproduced in simulations. For this purpose we compute an estimate of the peculiar temperature at time step

n \in N

as

φ_{T_{⊥}} (p^{(k)}) = \frac{1}{N (N - 1) d} \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} \sum_{l = 1}^{d} {(p_{j, l}^{(k)} - p_{i, l}^{(k)})}^{2}

(67)

and an estimate of the consensus temperature at time step

k \in N

as

φ_{T_{| |}} (p^{(k)}) = \frac{1}{d} \sum_{l = 1}^{d} {({\bar{p}}_{l}^{(k)})}^{2} .

(68)

In Figure 4 we show the results for a system with

T_{⊥} = 1 / 2

and

T_{| |} = 5

. We see that the cumulative average of the respective estimates converges, after a short equilibration period, to the target values.

5.3.3. Properties of the Flock

We first demonstrate the effect of the parameterization of the peculiar dynamics (the values of the peculiar friction

γ_{⊥}

and the peculiar temperature

T_{⊥}

) on the flock size, flock formation, and inter-flock diffusion. While we vary the values of the peculiar temperature parameter and peculiar friction parameter in order to demonstrate the effect of these parameters on the above quantities, we leave the parameter values of the consensus dynamics unchanged in all simulations as

γ_{| |} = 1

and

T_{| |} = 1

. In the first series of simulations we vary the value of

T_{⊥}

, while fixing the value of the peculiar friction parameter to

γ_{⊥} = 1

. Figure 5 shows the radial density for different values of

T_{⊥}

. As one would expect, the probability mass is concentrated around the mean rest-length of the Morse-potential for low values of

T_{⊥}

and the distribution spreads out for higher values of

T_{⊥}

. If we measure flock size in terms of the mean distance between particles in the flock, this implies that the flock size grows for increased temperature.

We next explore the effects of the peculiar friction parameter

γ_{⊥}

. We study the effect of the value of

γ_{⊥}

on the flock formation using the mean distance between particles,

φ_{md} (q) = \frac{2}{N (N - 1)} \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} ∥ q_{j} - q_{j} ∥,

as a measure of the progression of the flock formation. We initialize the particle position out of equilibrium on a equidistant square grid covering a square with side length

5 / 2

. Figure 6A shows the time evolution of observed mean distance

φ_{md} (q)

in simulations using different values of

γ_{⊥}

. We find that for small value of

γ_{⊥}

flock formation comes with strong oscillation in the flock size (measured in terms of the mean particle distance), which only slowly decays. With increased values of

γ_{⊥}

these oscillations are more strongly damped so that the flock size quickly approaches its equilibrium value.

We next explore the effect of the value of the model parameter

γ_{⊥}

on the mobility of particles within the flock. In order to measure the strength of diffusion within the flock we consider the mean distance of the particle i to the center of mass, i.e.,

φ_{mc}^{i} (q) = ∥ q_{i} - \bar{q} ∥ .

The autocorrelation of this observable can be used as a measure of mobility within the flock. Figure 6B, shows the autocorrelation function for

φ_{mc}^{i}

calculated from a trajectory in equilibrium, i.e., after the initial transitional flock-formation phase described in the previous paragraph. We can see that the decay of the autocorrelation function becomes slower for increasing values of

γ_{⊥}

, which indicates that for an increased value of

γ_{⊥}

the mobility of particles within the flock is reduced.

5.3.4. Collective Motion

We next explore the effects of the values of

γ_{| |}

and

T_{| |}

on the collective motion of the flock, i.e., the diffusive behaviour of the center of mass. Since the motion of the center of mass is described by the consensus dynamics, which is driven by an Ornstein-Uhlenbeck process, we find the diffusion constant D of the center of mass (viewed as a free particle in space) to be

D = \frac{T_{| |}}{N γ_{| |}} .

Table 1 shows the estimated diffusion coefficients for various values of

γ_{| |}

and

T_{| |}

. We find a good match between theoretically predicted values and observed values.

6. Conclusions

In this article we have provided a general treatment of the convergence of Langevin dynamics to a stationary state, including for systems with configuration-dependent friction and noise, as well as nonconservative forces. We have demonstrated the concepts in applications to systems with temperature gradients and stochastic models of active particle systems. Our approach does not assume the usual fluctuation-dissipation rule, so it can be applied to a wide range of nonequilibrium molecular and particle systems where the form of the stationary distribution is a priori unknown. Future work might look at the use of configuration-dependent memory kernels (within a generalized Langevin equation setting) and proofs of ergodicity for degenerate thermostats, for example the pairwise adaptive thermostats [40], which provide an alternative method for controlling observables.

Acknowledgments

The research of all three authors was supported by the ERC project RULE (grant number 320823). The collaboration was initiated during a residency of the first two authors at the Institut Henri Poincaré and its program on “Stochastic dynamics out of equilibrium.” The work of M. Sachs was further supported by the Statistical and Applied Mathematical Sciences Institute (North Carolina).

Author Contributions

The article was conceived in joint discussions of all three authors during May of 2017. All three authors contributed to the writing. The numerical experiments were proposed and discussed by all three authors but were performed by M.S.

Conflicts of Interest

The authors declare no conflict of interest.

References

Lancon, P.; Batrouni, G.; Lobry, L.; Ostrowsky, N. Drift without flux: Brownian walker with a space-dependent diffusion coefficient. Europhys. Lett. 2001, 54, 28–34. [Google Scholar] [CrossRef]
Regev, S.; Grønbech-Jensen, N.; Farago, O. Isothermal Langevin dynamics in systems with power-law spatially dependent friction. Phys. Rev. E 2016, 94, 012116. [Google Scholar] [CrossRef] [PubMed]
Becton, M.; Wang, X. Thermal gradients on graphene to drive nanoflake motion. J. Chem. Theory Comput. 2014, 10, 722–730. [Google Scholar] [CrossRef] [PubMed]
Cucker, F.; Mordecki, E. Flocking in noisy environments. J. Math. Pures Appl. 2008, 89, 278–296. [Google Scholar] [CrossRef]
Ha, S.Y.; Lee, K.; Levy, D. Emergence of time-asymptotic flocking in a stochastic Cucker-Smale system. Commun. Math. Sci. 2009, 7, 453–469. [Google Scholar] [CrossRef]
Gachelin, J.; Rousselet, A.; Lindner, A.; Clement, E. Collective motion in an active suspension of Escherichia coli bacteria. New J. Phys. 2014, 16, 025003. [Google Scholar] [CrossRef]
Best, R.; Hummer, G. Coordinate-dependent diffusion in protein folding. Proc. Natl. Acad. Sci. USA 2010, 107, 1088–1093. [Google Scholar] [CrossRef] [PubMed]
Chen, C.; Liu, S.; Shi, X.Q.; Chaté, H.; Wu, Y. Weak synchronization and large-scale collective oscillation in dense bacterial suspensions. Nature 2017, 542, 210–214. [Google Scholar] [CrossRef] [PubMed]
Hoogerbrugge, P.; Koelman, J. Simulating microscopic hydrodynamic phenomena with dissipative particle dynamics. Europhys. Lett. 1992, 19, 155–160. [Google Scholar] [CrossRef]
Español, P. Dissipative particle dynamics. In Handbook of Materials Modeling; Springer: Berlin, Germany, 2005; pp. 2503–2512. [Google Scholar]
Shardlow, T.; Yan, Y. Geometric ergodicity for dissipative particle dynamics. Stoch. Dyn. 2006, 6, 123–154. [Google Scholar] [CrossRef]
Ahn, S.; Ha, S.Y. Stochastic flocking dynamics of the Cucker-Smale model with multiplicative white noises. J. Math. Phys. 2010, 51, 103301. [Google Scholar] [CrossRef]
Ton, T.; Linh, N.; Yagi, A. Flocking and non-flocking behavior in a stochastic Cucker-Smale System. Anal. Appl. 2014, 12, 63–73. [Google Scholar] [CrossRef]
Erban, R.; Haskovec, J.; Sun, Y. A Cucker-Smale model with noise and delay. SIAM J. Appl. Math. 2016, 76, 535–1557. [Google Scholar] [CrossRef]
Leimkuhler, B.; Matthews, C.; Stoltz, G. The computation of averages from equilibrium and nonequilibrium Langevin molecular dynamics. IMA J. Numer. Anal. 2016, 36, 13–79. [Google Scholar] [CrossRef]
Leimkuhler, B.; Matthews, C. Rational construction of numerical methods for stochastic molecular dynamics. Appl. Math. Res. Exp. 2013, 2013, 34–56. [Google Scholar]
Tuckerman, M.; Berne, B.J.; Martyna, G.J. Reversible multiple time scale molecular dynamics. J. Chem. Phys. 1992, 97, 1990–2001. [Google Scholar] [CrossRef]
Hummer, G. Position-dependent diffusion coefficients and free energies from Bayesian analysis of equilibrium and replica molecular dynamics simulations. New J. Phys. 2005, 7, 34. [Google Scholar] [CrossRef]
Burada, P.; Schmid, G.; Reguera, D.; Rubi, J.; Hänggi, P. Biased diffusion in confined media: Test of the Fick-Jacobs approximation and validity criteria. Phys. Rev. E 2007, 75, 051111. [Google Scholar] [CrossRef] [PubMed]
Berezhkovskii, A.; Szabo, A. Time scale separation leads to position-dependent diffusion along a slow coordinate. J. Chem. Phys. 2011, 135, 074108. [Google Scholar] [CrossRef] [PubMed]
Wilmer, O.R.; Pedro, C.; Lopez, F. Direct evaluation of the position dependent diffusion coefficient and persistence time from the equilibrium density profile in anisotropic fluids. J. Chem. Phys. 2013, 139, 074103. [Google Scholar]
Lelièvre, T.; Stoltz, G. Partial differential equations and stochastic methods in molecular dynamics. Acta Numer. 2016, 25, 681–880. [Google Scholar] [CrossRef]
Bellet, L.R. Ergodic properties of Markov processes. In Open Quantum Systems II; Springer: Berlin, Germany, 2006; pp. 1–39. [Google Scholar]
Gardiner, C. Handbook of stochastic methods for physics, chemistry and the natural sciences. Appl. Opt. 1986, 25, 3145. [Google Scholar] [CrossRef]
Guionnet, A.; Zegarlinksi, B. Lectures on logarithmic Sobolev inequalities. In Séminaire de Probabilités XXXVI; Springer: Berlin, Germany, 2003; pp. 1–134. [Google Scholar]
Hörmander, L. The Analysis of Linear Partial Differential Operators. III, Volume 274 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]; Springer: Berlin, Germany, 1985. [Google Scholar]
Kliemann, W. Recurrence and invariant measures for degenerate diffusions. Ann. Probab. 1987, 15, 690–707. [Google Scholar] [CrossRef]
Bhattacharya, R.N. On the functional central limit theorem and the law of the iterated logarithm for Markov processes. Zeitschrift Für Wahrscheinlichkeitstheorie Und Verwandte Gebiete 1982, 60, 185–201. [Google Scholar] [CrossRef]
Villani, C. Hypocoercivity; Number 949-951; American Mathematical Society: Providence, RI, USA, 2009. [Google Scholar]
Dolbeault, J.; Mouhot, C.; Schmeiser, C. Hypocoercivity for linear kinetic equations conserving mass. Trans. Am. Math. Soc. 2015, 367, 3807–3828. [Google Scholar] [CrossRef]
Harris, T.E. The existence of stationary measures for certain Markov processes. In Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 2: Contributions to Probability Theory; University of California Press: Oakland, CA, USA, 1956; pp. 113–124. [Google Scholar]
Meyn, S.P.; Tweedie, R.L. Stability of Markovian processes I: Criteria for discrete-time chains. Adv. Appl. Probab. 1992, 24, 542–574. [Google Scholar] [CrossRef]
Hairer, M.; Mattingly, J.C. Yet another look at Harris’ ergodic theorem for Markov chains. In Seminar on Stochastic Analysis, Random Fields and Applications VI; Springer: Berlin, Germany, 2011; Volume 63, pp. 109–117. [Google Scholar]
Meyn, S.P.; Tweedie, R.L. Stability of Markovian processes III: Foster–Lyapunov criteria for continuous-time processes. Adv. Appl. Probab. 1993, 25, 518–548. [Google Scholar]
Mattingly, J.C.; Stuart, A.M.; Higham, D.J. Ergodicity for SDEs and approximations: Locally Lipschitz vector fields and degenerate noise. Stoch. Process. Their Appl. 2002, 101, 185–232. [Google Scholar] [CrossRef]
Meyn, S.P.; Tweedie, R.L. Markov Chains and Stochastic Stability; Springer Science & Business Media: Berlin, Germany, 2012. [Google Scholar]
Zhang, F. The Schur Complement and Its Applications; Springer Science & Business Media: Berlin, Germany, 2006; Volume 4. [Google Scholar]
Cucker, F.; Smale, S. Emergent behavior in flocks. IEEE Trans. Autom. Control 2007, 52, 852–862. [Google Scholar] [CrossRef]
Groot, R.D.; Warren, P.B. Dissipative particle dynamics: Bridging the gap between atomistic and mesoscopic simulation. J. Chem. Phys. 1997, 107, 4423–4435. [Google Scholar] [CrossRef]
Leimkuhler, B.; Shang, X. Pairwise adaptive thermostats for improved accuracy and stability in dissipative particle dynamics. J. Comput. Phys. 2016, 324, 174–193. [Google Scholar] [CrossRef]

Figure 1. (A) Radial density function for the particle system described in Section 5.1 for different values of the friction coefficient

γ

; (B) Distance r from heat source center

c_{L}

vs. effective temperature estimated according to (65). The black curve corresponds to the heat bath temperature

Ψ (r)

, with

Ψ

as defined in (64).

Figure 1. (A) Radial density function for the particle system described in Section 5.1 for different values of the friction coefficient

γ

; (B) Distance r from heat source center

c_{L}

vs. effective temperature estimated according to (65). The black curve corresponds to the heat bath temperature

Ψ (r)

, with

Ψ

as defined in (64).

Figure 2. Empirical particle density calculated as a cumulative average over the simulation time, for two values of

γ

. The area inside the black circle corresponds to the heat source area.

Figure 2. Empirical particle density calculated as a cumulative average over the simulation time, for two values of

γ

. The area inside the black circle corresponds to the heat source area.

Figure 3. Marginal of

q

of the invariant density of the system described in Section 5.2 in the absence (

α = 0

) of the non-conservative force (A), and in the presence (

α = 1

) of the non-conservative force (B). The black arrows correspond to estimates of the mean momentum vector field (66) in the invariant density at the respective points in configurational space.

Figure 3. Marginal of

q

of the invariant density of the system described in Section 5.2 in the absence (

α = 0

) of the non-conservative force (A), and in the presence (

α = 1

) of the non-conservative force (B). The black arrows correspond to estimates of the mean momentum vector field (66) in the invariant density at the respective points in configurational space.

Figure 4. Time vs. observed peculiar temperature (upper figure) and consensus temperature (lower figure). The blue trajectory shows the estimates

φ_{T_{⊥}} (p^{(k)})

and

φ_{T_{| |}} (p^{(k)})

at timestep k for the peculiar and the consensus temperature, respectively. The respective cumulative averages are shown in red.

Figure 4. Time vs. observed peculiar temperature (upper figure) and consensus temperature (lower figure). The blue trajectory shows the estimates

φ_{T_{⊥}} (p^{(k)})

and

φ_{T_{| |}} (p^{(k)})

at timestep k for the peculiar and the consensus temperature, respectively. The respective cumulative averages are shown in red.

Figure 5. Radial density function for varying values of the peculiar temperature parameter

T_{⊥}

.

Figure 5. Radial density function for varying values of the peculiar temperature parameter

T_{⊥}

.

Figure 6. Flock formation and inter-flock diffusion for different values of the peculiar friction parameter

γ_{⊥}

. (A) Time vs. mean distance

φ_{md} (q)

; (B) Lag time vs. time-autocorrelation of the mean-centre distance

φ_{mc}^{i} (q)

. The plotted curve is computed as an average over all particles indices

i = 1, \dots, N

.

Figure 6. Flock formation and inter-flock diffusion for different values of the peculiar friction parameter

γ_{⊥}

. (A) Time vs. mean distance

φ_{md} (q)

; (B) Lag time vs. time-autocorrelation of the mean-centre distance

φ_{mc}^{i} (q)

. The plotted curve is computed as an average over all particles indices

i = 1, \dots, N

.

Table 1. Estimates of

N D

for various values of

γ_{| |}

and

T_{| |}

.

N = 16

denotes the number of particles and D the diffusion coefficient of the diffusive motion of the center of mass. Values in parentheses show the exact values.

Table 1. Estimates of

N D

for various values of

γ_{| |}

and

T_{| |}

.

N = 16

denotes the number of particles and D the diffusion coefficient of the diffusive motion of the center of mass. Values in parentheses show the exact values.

$T_{\| \|}$	1	10	100
$γ_{\| \|}^{- 1}$	1	10	100
1	0.987, (1)	9.775, (10)	101.8, (100)
3	3.059, (3)	29.445, (30)	308.1, (300)
9	8.476, (9)	91.994, (90)	861.4, (900)

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sachs, M.; Leimkuhler, B.; Danos, V. Langevin Dynamics with Variable Coefficients and Nonconservative Forces: From Stationary States to Numerical Methods. Entropy 2017, 19, 647. https://doi.org/10.3390/e19120647

AMA Style

Sachs M, Leimkuhler B, Danos V. Langevin Dynamics with Variable Coefficients and Nonconservative Forces: From Stationary States to Numerical Methods. Entropy. 2017; 19(12):647. https://doi.org/10.3390/e19120647

Chicago/Turabian Style

Sachs, Matthias, Benedict Leimkuhler, and Vincent Danos. 2017. "Langevin Dynamics with Variable Coefficients and Nonconservative Forces: From Stationary States to Numerical Methods" Entropy 19, no. 12: 647. https://doi.org/10.3390/e19120647

APA Style

Sachs, M., Leimkuhler, B., & Danos, V. (2017). Langevin Dynamics with Variable Coefficients and Nonconservative Forces: From Stationary States to Numerical Methods. Entropy, 19(12), 647. https://doi.org/10.3390/e19120647

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Langevin Dynamics with Variable Coefficients and Nonconservative Forces: From Stationary States to Numerical Methods

Abstract

1. Introduction

2. Stationary States of SDEs and Their Stability

2.1. The Associated Semigroup of Evolution Operators and Their Adjoints

2.2. Hypoellipticity and Existence of a Smooth Transition Kernel

2.3. Ergodicity and Convergence in Law

2.4. Finite-Time Averages and the Central Limit Theorem

2.5. Exponential Convergence in Weighted $L^{\infty}$ Spaces

3. Langevin Dynamics with Configuration-Dependent Diffusion

3.1. Geometric Ergodicity of Langevin Dynamics With Space-Dependent Coefficients

3.2. Single-Particle System with Non-Conservative Force

3.3. Multi-Particle Systems

3.4. Particle System with Temperature Gradient

3.5. Stochastic Cucker-Smale Model

3.5.1. Regularized Stochastic Cucker-Smale Dynamics

3.5.2. Modified Stochastic Cucker-Smale Dynamics

4. Numerical Discretization

5. Numerical Experiments

5.1. Particle System with Temperature Gradient

5.2. System with Non-Conservative Force

5.3. Stochastic Cucker-Smale Model

5.3.1. Model Parametrization

5.3.2. Independent Control of Peculiar and Consensus Temperature

5.3.3. Properties of the Flock

5.3.4. Collective Motion

6. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Langevin Dynamics with Variable Coefficients and Nonconservative Forces: From Stationary States to Numerical Methods

Abstract

1. Introduction

2. Stationary States of SDEs and Their Stability

2.1. The Associated Semigroup of Evolution Operators and Their Adjoints

2.2. Hypoellipticity and Existence of a Smooth Transition Kernel

2.3. Ergodicity and Convergence in Law

2.4. Finite-Time Averages and the Central Limit Theorem

2.5. Exponential Convergence in Weighted L ∞ Spaces

3. Langevin Dynamics with Configuration-Dependent Diffusion

3.1. Geometric Ergodicity of Langevin Dynamics With Space-Dependent Coefficients

3.2. Single-Particle System with Non-Conservative Force

3.3. Multi-Particle Systems

3.4. Particle System with Temperature Gradient

3.5. Stochastic Cucker-Smale Model

3.5.1. Regularized Stochastic Cucker-Smale Dynamics

3.5.2. Modified Stochastic Cucker-Smale Dynamics

4. Numerical Discretization

5. Numerical Experiments

5.1. Particle System with Temperature Gradient

5.2. System with Non-Conservative Force

5.3. Stochastic Cucker-Smale Model

5.3.1. Model Parametrization

5.3.2. Independent Control of Peculiar and Consensus Temperature

5.3.3. Properties of the Flock

5.3.4. Collective Motion

6. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.5. Exponential Convergence in Weighted $L^{\infty}$ Spaces