Abstract
We introduce a new class of Wasserstein-type distances specifically designed to tackle questions concerning stability and convergence to equilibria for kinetic equations. Thanks to these new distances, we improve some classical estimates by Loeper (J Math Pures Appl (9) 86(1):68–79, 2006) and Dobrushin (Funktsional Anal i Prilozhen 13:48–58, 1979) on Vlasov-type equations, and we present an application to quasi-neutral limits.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
1.1 General overview
Monge–Kantorovich distances, also known as Wasserstein distances, play a central role in statistical mechanics, especially in the theory of propagation of chaos and studying large particle systems’ mean behavior. From the late 1970s, there have been many applications of Wasserstein distances in kinetic theory, as is beautifully described in the bibliographical notes of [60, Chapter 6]. In particular, these distances are frequently used to prove the uniqueness and stability of solutions to kinetic equations, study singular limits, and measure convergence to equilibrium.
The first celebrated result relying on Monge–Kantorovich–Wasserstein distances in non-collisional kinetic theory is the proof by Dobrushin [16] on the well-posedness for Vlasov equations with \(C^{1,1}\) potentials, where existence, uniqueness, and stability are proved via a fixed point argument in the bounded-Lipschitz or the 1-Wasserstein distance. As a consequence of this argument, one also obtains the validity of the mean-field limit for Vlasov equations with smooth potentials. The interested reader may refer to [20, Chapter 1.4] and [40, Chapter 3.3] for a detailed explanation of Dobrushin’s stability estimate, its consequences on the mean-field limit for the Vlasov equation, and of the role of Monge–Kantorovich–Wasserstein distances. Dobrushin’s estimate is at the core of several kinetic theory arguments; see, for example, [10,11,12,13, 15, 18, 22, 29] for some applications.
In recent times, Golse and Paul in [23] introduced a quantum analog of the 2-Wasserstein distance to measure the approximation of the N-body quantum dynamics by its mean-field limit. In [21] the authors prove quantitative stability estimates that are reminiscent of Dobrushin’s, and they show that, in the case of \(C^{1,1}\) potentials, the mean-field limit of the quantum mechanics of N identical particles is uniform in the classical limit.
Another fundamental stability estimate was proved by Loeper [49], who established uniqueness and stability of solutions with bounded density for the Vlasov–Poisson equation. Loeper’s argument relies on the fact that the Coulomb kernel is generated by a potential solving Poisson’s equation and exploits the strong connection between the 2-Wasserstein distance and the \(H^{-1}\)-norm. Besides providing the best-known uniqueness criterion for Vlasov–Poisson, this approach also gives a new proof of uniqueness \(\grave{a}\) la Yudovich for 2D Euler. Loeper’s result has been generalized to less singular kernels [37], and it is the cornerstone for several other stability arguments [7, 14, 36, 42, 44, 48, 58]. Also, Loeper’s uniqueness criterion for Vlasov–Poisson has been extended to solutions whose associated density belongs to some suitable Orlicz spaces [38, 51]. In what follows, we will focus our attention on some applications of Loeper’s stability estimate related to quasi-neutral limit for the Vlasov–Poisson equation [28, 30, 34, 35].
In general, extending Dobrushin’s and Loeper’s estimates is a delicate matter. A possible idea is to introduce an anisotropic metric that weights spatial and momentum coordinates differently. For example, in [43], the author considers a variant of the 2-Wasserstein distance where the cost for moving points in the x-variable is higher than for the v-variable. By suitably selecting the parameters, this allows the author to extend the validity ranges for the mean-field limit for the Vlasov–Poisson system. Also, as shown in [28, 30], an analogous method provides better convergence estimates when considering combined mean-field and quasi-neutral limits in Vlasov–Poisson-type systems. At the same time as this paper was written, another variant of this idea was introduced in [56], where the author improves the trend to equilibrium for 1-D kinetic Fokker–Planck equations via estimates measured in an analog of the 2-Wasserstein metric.
This work aims to push further the idea that, when applied to kinetic problems, Wasserstein distances should be modified to reflect the natural anisotropy between position and momentum variables. Moreover, since these metrics are used to measure the distance between PDEs’ solutions, we will introduce time-dependent counterparts that can vary along with the characteristic flow. Still, it is worth noticing that our method could be applied, beyond the kinetic framework, to equations where the evolution in one of the variables enjoys better regularity properties than the others.
Before stating our main results, let us emphasize that the idea of finding appropriate generalised Wasserstein distances has been used successfully in other contexts in the optimal transport and evolution PDE community; see, for instance, [17, 19, 45, 46, 54, 55] and references therein.
1.2 Definitions and main results
Let us recall the definition of Wasserstein distances (see for instance [1, 60]). In what follows, \({\mathcal {X}}\) will be either the d-dimensional torus \({\mathbb {T}}^d\) or the Euclidean space \(\mathbb {R}^d\).
Definition 1.1
Let \(\mu , \nu \) be two probability measures on \({\mathcal {X}}\times \mathbb {R}^d\). We denote with \(\Pi (\mu , \nu )\) the set of all probability measures on \(({\mathcal {X}}\times \mathbb {R}^d)^2\) with marginals \(\mu \) and \(\nu .\) More precisely, \(\pi \in \Pi (\mu , \nu )\) if
We shall call coupling (between \(\mu \) and \(\nu \)) an element in \(\Pi (\mu , \nu ).\)
For \(p\geqq 1\), the p-Wasserstein distance between \(\mu \) and \(\nu \) is defined as
(1) A free-flow \(W_1\)-type distance for the Vlasov equations with \(C^{1,1}\) potential. Consider two solutions \(f_1,f_2\) of the Vlasov equation on \({\mathcal {X}}\), namely
where \(\Vert D^2 K\Vert _{\infty }=:B<\infty \). The classical Dobrushin’s argument shows that
In particular, when the potential K is identically zero, this bound provides an exponential stability for \(W_1\) that is far from optimal. Indeed, since the solution is simply given by \(f(t,x,v)=f(0,x-tv,v)\), it is clear that in this case \(W_1(f_1(t),f_2(t))\sim t\) for \(t \gg 1\).
By introducing a \(W_1\)-type distance adapted to the free flow, we can prove that
(see Theorem 2.1 below).
This estimate gives the optimal bound when \(K\equiv 0\). Moreover, for \(B\leqq 1\), this provides a better estimate compared to the usual Dobrushin’s bound when \(t \in [0,T_B]\) with \(T_B\simeq B^{-1/2}\).
(2) An improved \(W_2\)-stability estimate for Vlasov–Poisson with bounded density. For this second application, we focus on the case of the torus for simplicity, but a completely similar analysis works on the whole space.
Consider two solutions \(f_1,f_2\) of the Vlasov–Poisson equation on \({\mathbb {T}}^d\), namely
As shown in [49], Loeper’s proof provides the following stability estimate whenever \(W_2(f_1(0),f_2(0))\) is sufficiently small (which is the interesting case):
Here \(c_d>0\) is a dimensional constant, while C depends on the \(L^\infty \) norm of \(\rho _{f_1}\) and \(\rho _{f_2}.\) This estimate can then be applied to prove the validity of the quasi-neutral limit for Vlasov–Poisson for initial data that are double-exponential perturbation of analytic functions [34, 35] (see Remark 3.4).
To improve this result, given \((X_i,V_i)\) the characteristics associated to \(f_i\), we consider a nonlinear \(W_2\)-type quantity of the form
where \(\pi _0\) is an optimal coupling, and \(\lambda (t)=|\log (Q(t))|.\) We then prove that Q(t) is well-defined whenever \(Q(t)\ll 1,\) and finally, comparing Q(t) to \(W_2,\) we show that
(see Theorem 3.1). To better understand the improvement of our estimate with respect to Loeper’s, one can think as follows: if \(W_2(f_1(0),f_2(0)= \theta \ll 1\), then Loeper’s estimate implies that \(W_2(f_1(t),f_2(t)) \lesssim 1\) for \(t \in [0,\log |\log \theta |]\). Instead, our bound gives \(W_2(f_1(t),f_2(t)) \lesssim 1\) for \(t \in \bigl [0,|\log \theta |^{1/2}\bigr ]\), so on a much longer time-interval.
Remark 1.2
-
Note that a standard Gronwall estimate of the form \(W_2(f_1(t),f_2(t))\leqq e^{Ct}W_2(f_1(0),f_2(0))\) would imply that \(W_2(f_1(t),f_2(t)) \lesssim 1\) for \(t \in [0,|\log \theta |]\). So, while Loeper’s bound loses an extra logarithm in terms of time-scale, our bound only loses a square root. Since the electric field for a solution with bounded density is at most log-Lipschitz, an estimate of the form \(W_2(f_1(t),f_2(t))\leqq e^{Ct}W_2(f_1(0),f_2(0))\) is not expected to hold in this setting, and we believe our bound to be essentially sharp.
-
Our improvement from \(\log |\log \theta |\) to \(|\log \theta |^{1/2}\) is similar to the one obtained for the \(W_1\) distance, see [38, Remark 1.7]. In that paper, the authors rely crucially on the second-order structure of the Vlasov equation, namely \(\ddot{X}=\nabla U(t,X)\). Our proof, instead, relies only on the fact that \(\dot{X}=a(t,X,V)\), where \(a(t,\cdot ,\cdot )\) is Lipschitz, and it can be generalized to other contexts where the second-order structure fails.
Our new stability estimate has interesting applications for what concerns some singular limits for Vlasov-type equations. In particular, by considering the Vlasov–Poisson system in appropriate dimensionless variables that take the Debye length into account, we prove the validity of the quasi-neutral limit for Vlasov–Poisson for initial data that are an exponential perturbation of analytic functions, see also Remark 3.4.
The paper is structured as follows: in the next two sections, we will present our two main results, and then in the final section of the paper, we will discuss more generally our approach and how it leads to the introduction of a new family of Wasserstein-type distances.
2 Dobrushin’s Estimate Revisited
2.1 The Vlasov equation
The Vlasov equation is a non-linear partial differential equation providing a statistical description for the collective behavior of large numbers of charged particles in mutual, long-range interaction. This model was first introduced by Jeans in the context of Newtonian stellar dynamics [41], and later by Vlasov in his work on plasma physics [61, 62]. The unknown of the Vlasov equation f(t, x, v) is the distribution function of the system at time, that is the number density of particles that are located at the position x and have instantaneous velocity v at time t. The Vlasov equation for the distribution function f reads as
where
In other words, the Vlasov equation for particle systems is a kinetic model where each particle is subject to the acceleration field F[f] created by all the other particles in the system.
The Vlasov equation is a transport equation and, for a sufficiently regular force field, it can be described by the method of characteristics. The initial distribution \(f_0\) is transported by a characteristic flow (X, V) generated by the mean-field force F[f]; if we denote
then \(f(t,X(t,x,v),V(t,x,v))=f(0,x,v)\). Also, since the vector field (v, F[f]) is divergence free, one has conservation of mass and of all \(L^p\)-norms. For an introduction to this topic we refer to the lecture notes [20].
2.2 An improved Dobrushin’s estimate
Consider the Vlasov equation with smooth kernel. More precisely,
where \(\Vert D^2 K\Vert _{\infty }=:B<\infty \). As explained in the introduction, our goal is to provide a stability estimate for solutions that is optimal in the regime as B tends to zero. Here is our result:
Theorem 2.1
Let \(f_1,f_2\) be two solution of (2.1). Then
Proof
Let \((X_i,V_i)\) denote the characteristic flow associated to \(f_i\), that is,
Note that, since \(\nabla K\) is Lipschitz, the characteristic flow is well-defined thanks to Cauchy–Lipschitz theory (see [20, Chapter 2]). To prove Theorem 2.1, we consider \(\pi _0\) an optimal \(W_1\)-coupling between \(f_{1}(0)\) and \(f_{2}(0)\), and we define the quantity
Note that
Also
Since \(\frac{d}{dt}(X_i-tV_i)=-t\dot{V}_i\), one has
We now observe that, since \(\nabla K\) is B-Lipschitz, we can bound
where the second inequality follows from (2.3). For \(T_1\), we note that
Here, similarly to Dobrushin’s argument, we use that \(W_1\) admits the following dual formulation:
Thanks to this fact, since \(z\mapsto \nabla K(X_1(t,x,v)-z)\) is B-Lipschitz, we deduce that
and therefore
Let \(\gamma _t=(X_1(t,\cdot ,\cdot ),X_2(t,\cdot ,\cdot ))_\#\pi _0 \in \Pi (\rho _{f_1},\rho _{f_2}).\) Then, by the definition of \(W_1\) (see Definition 1.1),
so using (2.3) we conclude that \(T_1 \leqq B(1+t)Q(t)\).
In conclusion, we proved that
therefore
Recalling (2.2) and (2.3), this yields
As noted in the introduction, this estimate is more powerful than the usual Dobrushin’s estimateFootnote 1
when B is small. On the other hand, for large times, the term \((1+t)^3\) in our estimate provides a worse bound (2.6). Hence, both bounds are helpful depending on the mutual sizes of B and t, and one can choose to apply whichever gives the stronger bound. In conclusion, one has
as desired.\(\square \)
3 Stability Estimates for Vlasov–Poisson and Quasi-Neutral Limits
3.1 The Vlasov–Poisson system
The Vlasov–Poisson system is the classical kinetic model describing dilute, totally ionised, unmagnetized plasma. In its most common form, f is the distribution function of the electrons moving in a self-induced electrostatic field, while the ions are assumed to act as a fixed background. In this section, we consider the phase space to be \({{\mathbb {T}}}^d\times {{\mathbb {R}}}^d,\) for reasons that will be explained later.
The well-posedness theory of this system has been extensively studied, see, for example, the survey paper [32]. Global-in-time classical solutions have been constructed under various conditions on the initial data (see for example [4, 6, 47, 53, 57, 59]), while global-in-time weak solutions were presented in [2] and [39] for \(L^p\) initial data (see also [3, 5]). In this section, we will focus on an important contribution to the uniqueness theory made by Loeper [49], who proved uniqueness for solutions of (3.1) with bounded density by means of a strong–strong stability estimate in Wasserstein.
3.2 Quasi-neutral limits
Since plasmas are excellent conductors of electricity, and any charges that develop are readily neutralized, they can be treated as being quasi-neutral. On the other hand, at small spatial and time scales, the quasi-neutrality is no longer verified. The distance over which quasi-neutrality may break down can be described in terms of the Debye length \(\lambda _D\), and varies according to the physical characteristics of the plasma. The Debye length is usually much shorter than the typical observation scale. Therefore, we can define the parameter \(\varepsilon := \lambda _D/L\) and consider the limit as \(\varepsilon \) tends to zero. This procedure is known as quasi-neutral limit.
When we take the Debye length into account, in appropriate dimensionless variables, the Vlasov–Poisson system becomes
and the energy of the rescaled system is the following:
The quasi-neutral limit corresponds to a singular limit for the rescaled system (3.2), in which the formal limiting system is the Kinetic Isothermal Euler system
The force \(E=-\nabla _x U\) is defined implicitly through the incompressibility constraint \(\rho =1\), and may be thought of as a Lagrange multiplier associated to this constraint. In other words, electrons move under the effect of a gradient in such a way that their density remains equal to 1 everywhere. Thus (KIE) is a “kinetic” version of the incompressible Euler equations. As shown in [8], the potential U formally satisfies the Laplace equation
As discussed in [31], the justification of this limit is very delicate. In particular, it can fail even for smooth initial data. Still, a series of positive results are available. In particular, as shown in [34, 35], a way to get the validity of the quasi-neutral limit for a large class of data can be achieved if one can prove some quantitative strong-strong stability at the level of the \((VP)_{\varepsilon }\) system. Also, the stronger the stability estimate, the larger the class of initial data for which the quasi-neutral limit hold. In [34, 35] the authors prove that the quasi-neutral limit holds for initial data that are an extremely small perturbation of an analytic function. Here, by introducing a suitable non-linear version of the Wasserstein distance, we can considerably improve that results.
Here is our main theorem, which provides us with a new \(W_2\) stability estimate. We prove the result with a general parameter \(\varepsilon \leqq 1\) as this is necessary for the study of the quasi-neutral limit. The reader interested in the Vlasov–Poisson case can simply apply our estimate with \(\varepsilon =1\).
Theorem 3.1
Let \(\varepsilon \leqq 1\), and let \(f_1, f_2\) be two weak solutions of the \((VP)_{\varepsilon }\) system (3.2), and set
Define the function
and assume that \(A(t) \in L^1([0,T])\) for some \(T>0\). There exist a dimensional constant \(C_d>0\) and a universal constant \(c_0>0\) such that the following holds: if \(W_2(f_1(0),f_2(0))\) is sufficiently small so that \(W_2(f_1(0),f_2(0))\leqq c_0 \varepsilon \) and
then
Remark 3.2
The assumption (3.4) depends on the time interval [0, T]. If T is very small so that
then (3.4) corresponds to \(W_2(f_1(0),f_2(0))\le \varepsilon ^3.\) Of course this is not the relevant regime since the time interval is usually at least of size 1. In particular, since \(A(t)\ge 2\),Footnote 2
Therefore (3.4) corresponds to asking \(W_2(f_1(0),f_2(0))\) being bounded by \(e^{-C\varepsilon ^{-2}}.\) This requirement is very natural in this context, as also discussed in Remark 3.4.
As in [34, 35], Theorem 3.1 yields the validity of the quasi-neutral limit for \(W_2\)-perturbations of analytic data. However, our estimate is stronger with respect to the previous results and provides an almost optimal rate in the quasi-neutral limit. More broadly, we believe that our approach for proving Theorem 3.1 has its own interest and could be used in other settings.
To state our application to the quasi-neutral limit, we need to recall some notation introduced by Grenier [26] in one of the first mathematical works on this topic. In [26] the author relies on an interpretation of the plasma as a superposition of a—possibly uncountable—collection of fluids and he shows that the quasi-neutral limit holds when the sequence of initial data \(f_{0,\varepsilon }\) enjoys uniform analytic regularity with respect to the space variable. As explained in [34] (see the discussion after Definition 1.4), this decomposition is purely a technical tool and it does not impose any restriction on the initial datum. This result has been improved by Brenier [9], who gives a rigorous justification of the quasi-neutral limit in the so called “cold electron” case, i.e. when the initial distribution \(f_{0,\varepsilon }\) converges to a monokinetic profile
where \(\delta _v\) denotes the Dirac measure in velocity, see also [9, 25, 50].
Let us define a suitable analytic norm, as in [26]: given \(\delta >0\) and a function \(g:\mathbb {T}^d \rightarrow {\mathbb {R}}\), we define
where \(\widehat{g}(k)\) is the k-th Fourier coefficient of g. We define \(B_\delta \) as the space of functions g such that \(\Vert g \Vert _{B_\delta }<+\infty \).
Corollary 3.3
Let \(d=2, 3\), and let \(\gamma \), \(\delta _0\), and \(C_0\) be positive constants. Consider a sequence \((f_{0,\varepsilon })\) of non-negative initial data in \(L^1\) for (3.2) such that for all \(\varepsilon \in (0,1)\), and all \(x \in \mathbb {T}^d\),
-
(uniform estimates)
$$\begin{aligned} \Vert f_{0,\varepsilon }\Vert _\infty \leqq C_0, \quad \mathcal {E}(f_{0,\varepsilon }) \leqq C_0, \end{aligned}$$ -
(compact support in velocity)
$$\begin{aligned} f_{0,\varepsilon }(x,v)= 0 \quad \text {if } |v| > \frac{1}{\varepsilon ^\gamma }, \end{aligned}$$ -
(analytic + perturbation) Assume the following decomposition:
$$\begin{aligned} f_{0,\varepsilon } = g_{0,\varepsilon } + h_{0,\varepsilon }, \end{aligned}$$where \((g_{0,\varepsilon })\) is a sequence of continuous functions satisfying
$$\begin{aligned} \sup _{\varepsilon \in (0,1)}\sup _{v \in \mathbb {R}^d} \, (1+|v|^2) \Vert g_{0,\varepsilon } (\cdot ,v)\Vert _{B_{\delta _0}} \leqq C_0, \end{aligned}$$admitting a limit \(g_0\) in the sense of distributions. Furthermore, \((h_{0,\varepsilon })\) is a sequence of functions satisfying for all \(\varepsilon >0\)
$$\begin{aligned} W_2(f_{0,\varepsilon },g_{0,\varepsilon }) \leqq e^{-K \varepsilon ^{-2\zeta }}\qquad \text {with }\zeta =\left\{ \begin{array}{ll} 1+2\max \{\beta ,\gamma \}) &{} \text {if }d=2,\\ 1+\max \{38,3\gamma \}) &{} \text {if }d=3,\\ \end{array} \right. \end{aligned}$$(3.5)for some constants \(K>0\) and \(\beta >2\).
For all \(\varepsilon \in (0,1)\), consider \(f_\varepsilon (t)\) a global weak solution of (3.2) with initial condition \(f_{0,\varepsilon }\), and define the filtered distribution function
where \((d_\pm )\) are the correctors are defined as the solution of
Then there exist \(T>0\), and g(t) a weak solution on [0, T] of (3.3) with initial condition \(g_0,\) such that
Remark 3.4
Already in the one dimensional case, there is a negative result stating that an initial rate of convergence of the form \(W_2(f_{0,\varepsilon },g_{0,\varepsilon }) \leqq \varepsilon ^k\) for some \(k>0\) is not sufficient to ensure the validity of the quasi-neutral limit for positive times. This is the consequence of instability mechanisms described in [27] and [33]. Hence, our assumption on the size of \(W_2(f_{0,\varepsilon },g_{0,\varepsilon }) \) considerably improves the results in [34, 35], where a double exponential \(\exp \big (-\exp ({K \varepsilon ^{-\zeta }})\big )\) was required.
Remark 3.5
In Corollary 3.3 we consider sequences of initial conditions with compact support in velocity (yet, we allow the support to grow polynomially as \(\varepsilon \) goes to zero). The reason is that we need \(L^\infty \) bounds on the density \(\rho _{f_\varepsilon }(t)=\int f_\varepsilon (t)\,\mathrm {d}v\), so a control on the support in velocity is needed. We have decided to put these assumptions because they are the same as in [6] and so we can rely on some estimates proved in that paper. However, using the argument in [52] (see also [29]) one could relax the assumptions and require only a moment condition on \(f_{0,\varepsilon }\). Providing this extension is not difficult, but it would require some work that would go beyond the main goal of this paper.
Before proving Proof of Theorem 3.1, we first show how it implies Corollary 3.3.
Proof of Corollary 3.3
Let \(g_\varepsilon (t)\) denote the solution of (3.2) starting from \(g_{0,\varepsilon }\). As shown in [34, Sect. 4], under the assumptions in the statement, the following bounds hold:
where \(\zeta =\zeta (d)\) is as in the statement. Hence Theorem 3.1 and (3.5) yield
provided that \(C_d^2{\bar{C}}^2T^2<K\) (which can be guaranteed by taking T smaller if necessary). This implies that \(\sup _{[0,T]}W_2(f_\varepsilon (t),g_\varepsilon (t)) \rightarrow 0\) as \(\varepsilon \rightarrow 0,\) and we can now conclude as in [34, Proof of Theorem 1.7]. \(\quad \square \)
3.3 Proof of Theorem 3.1
Before starting the proof we recall [34, Lemma 3.2], see also [29, Lemma 3.3].
Lemma 3.6
Let \(\Psi _i:{\mathbb {T}}^d\rightarrow {\mathbb {R}}\) solve
Then
Let \((X_i,V_i)\) denote the characteristic flow associated to \(f_i\), that is
To prove Theorem 3.1, we consider \(\pi _0\) an optimal \(W_2\)-coupling between \(f_{1}(0)\) and \(f_{2}(0)\), and we define the quantity Q(t) defined as the unique constant (assuming it exists) such that
In other words, we are considering a quantity of the form
with \(\lambda (t)\) depending on time, and we are assuming that actually \(\lambda (t)\) is a function of Q(t) itself. The particular choice \(\lambda (t)=\varepsilon ^{-2}|\log Q(t)|\) is specific to this problem: the logarithm will help to compensate for the log-Lipschitz regularity of the electric fields, while \(\varepsilon ^{-2}\) is the natural scaling in the current setting.
Note that a priori is not clear that Q(t) is well-defined. This will be proved in Lemma 3.7 below. However, assuming for now that Q(t) is well-defined, we show how this quantity allows us to prove the result. We have
By Cauchy–Schwarz inequality and recalling the definition of Q(t) we have,
Adding and subtracting \(-E_2(t,X_1)\) we obtain,
where
Thanks to Lemma 3.6 and by the very same argument in [34] we can bound \(T_1\) and \(T_2\) as follows:Footnote 3
where we have
We now recall that \(\lambda (t)=\varepsilon ^{-2}|\log (Q(t))|\) and we substitute this expression in the derivative of Q(t). Notice that in this estimate we are interested in small values of Q(t) and in particular, as we will show below, we will always be in the regime \(\varepsilon ^2 Q(t)/|\log (Q(t))|\in (0,1/e).\) Therefore we have
so, by equation (3.6), we have,
We now consider two cases, depending on the sign of \(Q'(t)\). If \(Q'(t)\leqq 0\), then we do not do anything. If instead \(Q'(t)>0\), then the first term in the right-hand side above is negative, and therefore
Since the right-hand side above is nonnegative, independently of the sign of \(Q'(t)\) we know that the bound above holds. We now observe that, as long as \(Q(t)\leqq \varepsilon \), then
Thus,
Since \(A(t)\geqq 1\)Footnote 4, provided \(|\log (Q(t))|\geqq 1\), the above bound reduces to
where \(C_d\) is a dimensional constant. Note that the two conditions \(Q(t)\leqq \varepsilon \) and \(|\log (Q(t))|\geqq 1\) are guaranteed if \(Q(t)\leqq \frac{\varepsilon }{e}\) (recall that \(\varepsilon \leqq 1\) by assumption).
Hence, provided that we are in the regime \(Q(s)\leqq \frac{\varepsilon }{e}\) on [0, t], this implies that
We observe that the bound (3.7) guarantees that
In particular, (3.7) holds if
We now compare the quantity Q to the Wasserstein distance. First of all, since \(Q(t)\leqq \frac{\varepsilon }{e}\), \(\varepsilon ^{-2}|\log (Q(t))| \geqq 1\), therefore
On the other hand, since \(\varepsilon ^{-2}|\log (Q(0))| \geqq 1\) and \(\pi _0\) is an optimal plan,
or equivalently
We now observe that, near the origin, the inverse of the function \(s\mapsto \frac{s}{|\log s|}\) behaves like \(\tau \mapsto \tau |\log \tau |\). In particular, there exists a universal small constant \(c_0>0\) such that
Hence, if \( \frac{1}{2}\varepsilon ^{-2}W_2(f_1(0),f_2(0))^2 \leqq c_0\), we deduce that
Combining these bounds with (3.7), and recalling (3.8), this implies
provided \( \frac{1}{2}\varepsilon ^{-2}W_2(f_1(0),f_2(0))^2 \leqq c_0\) and
Finally, to complete the proof, we show the following:
Lemma 3.7
With the notation and assumptions of the theorem, the quantity Q(t) is well defined and it is locally Lipschitz continuous where \(Q(t)>0\). In particular it is differentiable a.e.
Proof
Set
We can assume that D(t) and E(t) are nonzero, otherwise we are in the “degenerate” situation where \(f_1\equiv f_2\), in which case Q(t) is trivially 0. Also, since D(t) and E(t) are written in terms of the characteristic flow, it is standard to check that they are differentiable.Footnote 5
We note that the quantity Q(t) is implicitly defined via the relation
or equivalently, for each fixed time t, Q(t) is the solution of the equation
Since the function \(q\mapsto q+\varepsilon ^{-2}\log q\, D(t)\) is strictly increasing on (0, 1) and its image covers the interval (0, 1), we deduce that the equation above has a unique solution provided \(E(t)< 1\). Hence, this proves that \(Q(t) \in (0,1)\) is well defined provided \(E(t)< 1\). In addition, thanks to the implicit function theorem applied to the function \(F\in C^1_{\mathrm{loc}}((0,1)\times {{\mathbb {R}}}\times {{\mathbb {R}}})\), we deduce the existence of a \(C^1_{\mathrm{loc}}\) function G such that \(Q(t)=G(D(t),E(t))\). Now, differentiating the relation (3.9) with respect to t we obtain
Hence, since D and E are uniformly bounded and Lipschitz, for any \(\delta >0\) we deduce that
This proves that, for any \(\delta >0\), the function \(t\mapsto Q(t)\) is uniformly Lipschitz continuous inside the set \(\{Q(t)>\delta \}\). This proves that Q(t) is locally Lipschitz continuous inside the region \(\{Q(t)>0\}\).
So, to conclude the proof, we need to ensure that \(E(t)< 1\). Note that, since by assumption \(E(0)\leqq \frac{1}{2} W_2(f_1(0),f_2(0))^2 \ll 1\), by continuity we have that \(E(t)<1\) for \(t>0\) small. So Q(t) is well defined for \(t>0\) small. Also, as long as Q(t) is well defined, we have that \(Q(t)\geqq E(t)\). Hence, as long as Q(t) is well defined, we have that
Since, by our smallness assumption on \(W_2(f_1(0),f_2(0))\), the right hand side above remain small on [0, T], the bound above guarantees that \(E(t)\ll 1\) for all \(t \in [0,T]\). This proves that Q(t) is well-defined on [0, T], which concludes the proof. \(\quad \square \)
Remark 3.8
In the previous proof we considered \(\lambda (t)=\varepsilon ^{-2}|\log (Q(t))|\) and in Lemma 3.7 we proved that Q(t) is well-defined provided it is small enough. This restriction is due to the fact that the function \({{\mathbb {R}}}^+ \ni s\mapsto \varepsilon ^{-2}|\log s|\) is decreasing only for \(s\in (0,1).\) An alternative choice could have been to define
and \(\lambda (t):= \varepsilon ^{-2}\Phi (Q(t)).\) With this choice, since \({{\mathbb {R}}}^+ \ni s\mapsto \Phi (s)\) is decreasing and of class \(C^1\), one can define Q(t) as the unique solution of
With this definition, the proof of Lemma 3.7 shows that Q(t) is always well defined (without any restriction on the size of E(t)), and it is locally Lipschitz continuous where \(Q(t)>0\).
Since in our setting we are interested in the case \(Q(t)\ll 1,\) there is no advantage in using this latter definition of \(\lambda .\) However this observation could be useful in other situations, see also Section 4 below.
4 Summary, Generalizations, and Perspectives
As we have seen in the last two sections, suitably modifying Wasserstein distances can be particularly useful in a kinetic setting to take advantage of the asymmetry between x and v. More precisely, let \(\mathcal {X}={\mathbb {T}}^d\) or \(\mathcal {X}=\mathbb R^d\), and let \(\mu \) and \(\nu \) be two probability measures on \(\mathcal {X}\times \mathbb {R}^d\). Also, let \(\Pi (\mu ,\nu )\) denote the collection of all measures on \((\mathcal {X} \times \mathbb {R}^d)^2\) with marginals \(\mu \) and \(\nu \) on the first and second factors respectively.
The first natural generalization, given \(p\geqq 1\) and \(\lambda \in \mathbb {R}^+\), is to consider
An alternative way, introduced in [56] for \(p=2\), would be to consider three parameters \(a,b,c>0\) such that \(\sqrt{ac}>b\) and define
In this paper, we have introduced two different generalizations.
-
(i)
First, we considered the nonlinear version of the \(KW_{\lambda ,\, p} (\mu , \nu )\) by choosing \(\lambda \) depending on the distance itself. We defined this along a flow, but that can be also be defined in a general setting as follows: given \(p\geqq 1\) and a decreasing function \(\Phi :{{\mathbb {R}}}^+\rightarrow {{\mathbb {R}}}^+\), for every \(\pi \in \Pi (\mu ,\nu )\) and \(\lambda \) we define \(D_p(\pi ,\Phi )\) as the unique number s such that
$$\begin{aligned}&s- \Phi (s)\int _{(\mathcal {X} \times \mathbb {R}^d)^2} |x-y|^{p}\mathrm {d} \pi (x,v,y,w) \\&\quad = \int _{(\mathcal {X} \times \mathbb {R}^d)^2} |v-w|^p\mathrm {d} \pi (x,v,y,w) \end{aligned}$$(arguing as in the proof of Lemma 3.7 it is easy to check that \(D(\pi ,\Phi )\) is well defined, see also Remark 3.8). Then, we set
$$\begin{aligned} KW_{\Phi ,p}(\mu ,\nu ):=\left( \inf _{\pi \in \Pi (\mu ,\nu )} D_p(\pi ,\Phi ) \right) ^{1/p}. \end{aligned}$$This definition with \(\Phi (s)=\varepsilon ^{-2}|\log s|\) for \(s \in (0,1/e)\) and \(p=2\) essentially corresponds to the quantity used in the proof of Theorem 3.1, although there we considered the quantity D(t) where we did not take the infimum over couplings \(\pi \in \Pi (\mu ,\nu ),\) since it was not needed for our purpose.
-
(ii)
In a different direction, we modified the \(W_1\) distance by introducing a shift in position. Note that this second quantity cannot be defined as a “static” distance since the shift \(x -tv\) depends on the time t. Hence, one can generalize it only as a time dependent quantity as follows:
$$\begin{aligned}&{\widetilde{KW}}_{t,\, p} (\mu , \nu )\\&\quad :=\left( \inf _{\pi \in \Pi (\mu ,\nu )} \int _{(\mathcal {X} \times \mathbb {R}^d)^2} \big (|(x-tv)-(y-tw)|^{p} +|v-w|^p\big )\right. \\&\left. \mathrm {d} \pi (x,v,y,w) \right) ^{1/p}. \end{aligned}$$
Of course, these approaches can be further combined by mixing the different quantities defined above. Note that there is no universal “best” choice, and each problem requires its adaptation. Still, we believe, as this paper shows, that this approach can lead to an improvement to several existing results, as well as to prove new estimates. In addition, the approach is very general and can be useful in any situation where there is an asymmetry between the variables involved.
To mention some concrete applications, our ideas could also be applied in the setting of quantum systems by suitably modifying the quantum Wasserstein distances introduced in [21, 24]. Also, our new Loeper-type estimate may be helpful to obtain stability estimates in \(W_2\) when the density belongs to some suitable Orlicz spaces, in analogy to [38] where stability estimates have been proved for \(W_1\).
Notes
Dobrushin’s argument is performed considering the so-called bounded-Lipschitz distance on probability measures, which is defined by duality against bounded Lipschitz functions. However, the same proof where one replaces the bounded-Lipschitz distance with the \(W_1\) distance (which can be defined by duality against Lipschitz functions, as shown in (2.4)), provides this bound.
We recall that \(\int _{\mathbb {T}^d}\rho _i(x,t)\mathrm {d}x=1,\) which implies \(\Vert \rho _i(\cdot , t) \Vert _{L^\infty (\mathbb {T}^d)}\ge 1,\) \(i=1,2.\) Therefore \(A(t)\ge 2.\)
Note that, since \(\rho _i\ge 0\) and \(\Vert \rho _i(\cdot , t) \Vert _{L^\infty (\mathbb {T}^d)}\ge 1,\) then \(\Vert \rho _i(\cdot , t)-1 \Vert _{L^\infty (\mathbb {T}^d)}\le \Vert \rho _i(\cdot , t) \Vert _{L^\infty (\mathbb {T}^d)}\le A(t)\) for \(i=1,2.\)
We recall that \(\int _{\mathbb {T}^d}\rho (x,t)\mathrm {d}x=1,\) which implies \(\Vert \rho (\cdot , t) \Vert _{L^\infty (\mathbb {T}^d)}\ge 1.\) Therefore \(A(t)\ge 1.\)
There is no novelty here, as these are the quantities that appear also in [49], where Loeper computes their derivatives and show that they can be controlled in terms of D(t) and E(t) themselves. In particular, the quantities D(t) and E(t) are also uniformly Lipschitz.
References
Ambrosio, L., Gigli, N., Savaré, G.: Gradient Flows in Metric Spaces and in the Space of Probability Measures. Lectures in Mathematics ETH Zürich. Birkhäuser Verlag, Basel, 2005
Arsenev, A.: Existence in the large of a weak solution to the Vlasov system of equations. Zh. Vychisl. Mat. i Mat. Fiz. 15, 136–147, 1975
Bardos, C., Degond, P.: Existence globale des solutions des équations de Vlasov-Poisson. In: Nonlinear Partial Differential Equations and Their Applications. Collège de France Seminar, Vol. VII (Paris, 1983–1984). Research Notes in Mathematics, Vol. 122, 1–3, 35–58. Pitman, Boston, MA, 1985
Bardos, C., Degond, P.: Global existence for the Vlasov–Poisson equation in 3 space variables with small initial data. Ann. Inst. H. Poincaré Anal. Non Linéaire 2(2), 101–118, 1985
Bardos, C., Degond, P., Golse, F.: A priori estimates and existence results for the Vlasov and Boltzmann equations. In Nonlinear Systems of Partial Differential Equations in Applied Mathematics, Part 2 (Santa Fe, N.M., 1984). Lectures in Applied Mathematics, Vol. 23, 189–207. American Mathematical Society, Providence, RI, 1986
Batt, J., Rein, G.: Global classical solutions of the periodic Vlasov–Poisson system in three dimensions. C. R. Acad. Sci. Paris Sér. I Math. 313(6), 411–416, 1991
Bertozzi, A.L., Laurent, T., Rosado, J.: \(L^p\) theory for the multidimensional aggregation equation. Commun. Pure Appl. Math. 64(1), 45–83, 2011
Brenier, Y.: Une formulation de type Vlasov–Poisson pour les équations d’Euler des fluides parfaits incompressibles. Rapport de recherche, RR-1070, INRIA, 1989
Brenier, Y.: Convergence of the Vlasov–Poisson system to the incompressible Euler equations. Commun. Partial Differ. Equ. 25(3–4), 737–754, 2000
Cañizo, J.A., Carrillo, J.A., Rosado, J.: A well-posedness theory in measures for some kinetic models of collective motion. Math. Models Methods Appl. Sci. 21(3), 515–539, 2011
Caglioti, E., Rousset, F.: Long time estimates in the mean field limit. Arch. Ration. Mech. Anal. 190(3), 517–547, 2008
Carrillo, J.A., Choi, Y.-P., Hauray, M.: The derivation of swarming models: mean-field limit and Wasserstein distances. In: Collective Dynamics from Bacteria to Crowds. CISM Courses and Lectures, vol. 553, 1–46. Springer, Vienna, 2014
Carrillo, J.A., Choi, Y.-P., Salem, S.: Propagation of chaos for the Vlasov–Poisson–Fokker–Planck equation with a polynomial cut-off. Commun. Contemp. Math. 21(4), 1850039, 2019
Carrillo, J. A., Rosado, J.: Uniqueness of bounded solutions to aggregation equations by optimal transport methods. In European Congress of Mathematics, 3–16. European Mathematical Society, Zürich, 2010
De Bièvre, S., Goudon, T., Vavasseur, A.: Particles interacting with a vibrating medium: existence of solutions and convergence to the Vlasov–Poisson system. SIAM J. Math. Anal. 48(6), 3984–4020, 2016
Dobrushin, R.L.: Vlasov equations. Funktsional Anal. i Prilozhen. 13, 48–58, 1979
Dolbeault, J., Nazaret, B., Savaré, G.: A new class of transport distances between measures. Calc. Var. Partial Differ. Equ. 34(2), 193–231, 2009
Fernández-Real, X.: The Lagrangian structure of the Vlasov–Poisson system in domains with specular reflection. Commun. Math. Phys. 364(3), 1327–1406, 2018
Figalli, A., Gigli, N.: A new transportation distance between non-negative measures, with applications to gradients flows with Dirichlet boundary conditions. J. Math. Pures Appl. (9) 94(2), 107–130, 2010
Golse, F.: On the dynamics of large particle systems in the mean field limit. In: Macroscopic and Large Scale Phenomena: Coarse Graining, Mean Field Limits and Ergodicity. Lecture Notes in Applied Mathematics and Mechanics, Vol. 3, 1–144. Springer, Cham, 2016
Golse, F., Mouhot, C., Paul, T.: On the mean field and classical limits of quantum mechanics. Commun. Math. Phys. 343(1), 165–205, 2016
Golse, F., Mouhot, C., Ricci, V.: Empirical measures and Vlasov hierarchies. Kinet. Relat. Models 6(4), 919–943, 2013
Golse, F., Paul, T.: The Schrödinger equation in the mean-field and semiclassical regime. Arch. Ration. Mech. Anal. 223(1), 57–94, 2017
Golse, F., Paul, T.: Empirical measures and quantum mechanics: applications to the mean-field limit. Commun. Math. Phys. 369(3), 1021–1053, 2019
Golse, F., Saint-Raymond, L.: The Vlasov–Poisson system with strong magnetic field in quasi-neutral regime. Math. Models Methods Appl. Sci. 13(5), 661–714, 2003
Grenier, E.: Oscillations in quasi-neutral plasmas. Commun. Partial Differ. Equ. 21(3–4), 363–394, 1996
Grenier, E.: Limite quasineutre en dimension 1. In: Journées “Équations aux Dérivées Partielles” (Saint-Jean-de-Monts, 1999), pages Exp. No. II, 8. Univ. Nantes, Nantes, 1999
Griffin-Pickering, M., Iacobelli, M.: A mean field approach to the quasi-neutral limit for the Vlasov–Poisson equation. SIAM J. Math. Anal. 50(5), 5502–5536, 2018
Griffin-Pickering, M., Iacobelli, M.: Global well-posedness for the Vlasov–Poisson system with massless electrons in the 3-dimensional torus. Preprint, 2020
Griffin-Pickering, M., Iacobelli, M.: Singular limits for plasmas with thermalised electrons. Journal de Mathématiques Pures et Appliquées 135, 199–255, 2020
Griffin-Pickering, M., Iacobelli, M.: Recent developments on quasi-neutral limits for Vlasov-type equations. In: Recent Advances in Kinetic Equations and Applications. Springer INdAM Series. Preprint, 2021
Griffin-Pickering, M., Iacobelli, M.: Recent developments on the well-posedness theory for Vlasov-type equations. In: Proceedings of the conference Particle Systems and Partial Differential Equations editions VI, VII and VIII. Preprint, 2021
Han-Kwan, D., Hauray, M.: Stability issues in the quasi-neutral limit of the one-dimensional Vlasov–Poisson equation. Commun. Math. Phys. 334(2), 1101–1152, 2015
Han-Kwan, D., Iacobelli, M.: Quasi-neutral limit for Vlasov–Poisson via Wasserstein stability estimates in higher dimension. J. Differ. Equ. 263(1), 1–25, 2017
Han-Kwan, D., Iacobelli, M.: The quasi-neutral limit of the Vlasov–Poisson equation in Wasserstein metric. Commun. Math. Sci. 15(2), 481–509, 2017
Han-Kwan, D., Miot, E., Moussa, A., Moyano, I.: Uniqueness of the solution to the 2D Vlasov–Navier–Stokes system. Rev. Mat. Iberoam. 36(1), 37–60, 2020
Hauray, M.: Wasserstein distances for vortices approximation of Euler-type equations. Math. Models Methods Appl. Sci. 19(8), 1357–1384, 2009
Holding, T., Miot, E.: Uniqueness and stability for the Vlasov–Poisson system with spatial density in Orlicz spaces. In Mathematical Analysis in Fluid Mechanics—Selected Recent Results. Contemporary Mathematics, Vol. 710, 145–162. American Mathematical Society, Providence, RI, 2018
Horst, E., Hunze, R.: Weak solutions of the initial value problem for the unmodified non-linear Vlasov equation. Math. Methods Appl. Sci. 6(2), 262–279, 1984
Jabin, P.-E.: A review of the mean field limits for Vlasov equations. Kinet. Relat. Models 7(4), 661–711, 2014
Jeans, J.H.: On the theory of star-streaming and the structure of the universe. Mon. Not. R. Astron. Soc. 76, 70–84, 1915
Lafleche, L.: Propagation of moments and semiclassical limit from Hartree to Vlasov equation. J. Stat. Phys. 177(1), 20–60, 2019
Lazarovici, D.: The Vlasov–Poisson dynamics as the mean field limit of extended charges. Commun. Math. Phys. 347(1), 271–289, 2016
Lewin, M., Sabin, J.: The Hartree and Vlasov equations at positive density. Commun. Partial Differ. Equ. 45(12), 1702–1754, 2020
Liero, M., Mielke, A., Savaré, G.: Optimal transport in competition with reaction: the Hellinger–Kantorovich distance and geodesic curves. SIAM J. Math. Anal. 48(4), 2869–2911, 2016
Liero, M., Mielke, A., Savaré, G.: Optimal entropy-transport problems and a new Hellinger–Kantorovich distance between positive measures. Invent. Math. 211(3), 969–1117, 2018
Lions, P.L., Perthame, B.: Propagation of moments and regularity for the 3-dimensional Vlasov–Poisson system. Invent. Math. 105(2), 415–430, 1991
Loeper, G.: A fully nonlinear version of the incompressible Euler equations: the semigeostrophic system. SIAM J. Math. Anal. 38(3), 795–823, 2006
Loeper, G.: Uniqueness of the solution to the Vlasov–Poisson system with bounded density. J. Math. Pures Appl. (9) 86(1), 68–79, 2006
Masmoudi, N.: From Vlasov–Poisson system to the incompressible Euler system. Commun. Partial Differ. Equ. 26(9–10), 1913–1928, 2001
Miot, E.: A uniqueness criterion for unbounded solutions to the Vlasov–Poisson system. Commun. Math. Phys. 346(2), 469–482, 2016
Pallard, C.: Moment propagation for weak solutions to the Vlasov–Poisson system. Commun. Partial Differ. Equ. 37(7), 1273–1285, 2012
Pfaffelmoser, K.: Global classical solutions of the Vlasov–Poisson system in three dimensions for general initial data. J. Differ. Equ. 95(2), 281–303, 1992
Piccoli, B., Rossi, F.: Generalized Wasserstein distance and its application to transport equations with source. Arch. Ration. Mech. Anal. 211(1), 335–358, 2014
Piccoli, B., Rossi, F.: On properties of the generalized Wasserstein distance. Arch. Ration. Mech. Anal. 222(3), 1339–1365, 2016
Salem, S.: An optimal transport approach of hypocoercivity for the 1d kinetic Fokker-Plank equation. Preprint, 2021
Schaeffer, J.: Global existence of smooth solutions to the Vlasov–Poisson system in three dimensions. Commun. Partial Differ. Equ. 16(8–9), 1313–1335, 1991
Serfaty, S., Vázquez, J.L.: A mean field equation as limit of nonlinear diffusions with fractional Laplacian operators. Calc. Var. Partial Differ. Equ. 49(3–4), 1091–1120, 2014
Ukai, S., Okabe, T.: On classical solutions in the large in time of two-dimensional Vlasov’s equation. Osaka J. Math. 15(2), 245–261, 1978
Villani, C.: Optimal Transport. Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Old and new, Vol. 338. Springer-Verlag, Berlin, 2009
Vlasov, A.: On high-frequency properties of electron gas. Zh. Eskper. Teor. Fiz. 8, 291, 1938
Vlasov, A.: Vlasov equation and plasma dispersion relation. J. Phys. USSR 9, 25, 1945
Acknowledgements
We are grateful to Megan Griffin-Pickering and Evelyne Miot for their valuable comments on a preliminary version of this paper. We also thank the anonymous referees for their useful comments and observations.
Funding
Open access funding provided by Swiss Federal Institute of Technology Zurich.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by D. Kinderlehrer.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Iacobelli, M. A New Perspective on Wasserstein Distances for Kinetic Problems. Arch Rational Mech Anal 244, 27–50 (2022). https://doi.org/10.1007/s00205-021-01705-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00205-021-01705-9