A sparse approximation of the Lieb functional with moment constraints
Abstract
The aim of this paper is to present new sparsity results about the so-called Lieb functional, which is a key quantity in Density Functional Theory for electronic structure calculations of molecules. The Lieb functional was actually shown by Lieb to be a convexification of the so-called Lévy-Lieb functional. Given an electronic density for a system of electrons, which may be seen as a probability density on , the value of the Lieb functional for this density is defined as the solution of a quantum multi-marginal optimal transport problem, which reads as a minimization problem defined on the set of trace-class operators acting on the space of electronic wave-functions that are anti-symmetric functions of , with partial trace equal to the prescribed electronic density. We introduce a relaxation of this quantum optimal transport problem where the full partial trace constraint is replaced by a finite number of moment constraints on the partial trace of the set of operators. We show that, under mild assumptions on the electronic density, there exist sparse minimizers to the resulting moment constrained approximation of the Lieb (MCAL) functional that read as operators with rank at most equal to the number of moment constraints. We also prove under appropriate assumptions on the set of moment functions that the value of the MCAL functional converges to the value of the exact Lieb functional as the number of moments go to infinity. We also prove some rates of convergence on the associated approximation of the ground state energy. We finally study the mathematical properties of the associated dual problem and introduce a suitable numerical algorithm in order to solve some simple toy models.
Conflict of interest statement:
The authors have no competing interests to declare that are relevant to the content of this article.
Data availability statement:
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
Contents
1 Introduction
The so-called Hohenberg-Kohn or Lévy-Lieb functional plays a fundamental role in Density Functional Theory for electronic structure calculations. For the sake of simplicity, we use here atomic units and neglect the effect of spin in this work. For a given electronic density , which we assume here to be of integral equal to for the sake of simplicity, and a given number of electrons , the Lévy-Lieb functional reads as the solution of the following a minimization problem of the form:
where
-
(i)
is the set of admissible electronic wavefunctions for a system of electrons with finite kinetic energy, that is the set of antisymmetric functions of ;
-
(ii)
for any and , is the electronic density associated to the wavefunction ;
-
(iii)
the function is the electron-electron Coulomb interaction potential.
There is a wide zoology of electronic structure calculation models which rely on various types of approximations of this Lévy-Lieb functional. Recently, Strictly Correlated Electrons (SCE) based approximation of this functional have drawn an increasing interest from mathematicians because it gives rise to a symmetric multi-marginal optimal transport problem with Coulomb cost, with the number of marginal constraints being equal to the number of electrons in the system. The literature about the SCE approximation (namely the multi-marginal optimal transport with Coulomb cost) is growing considerably. Recent developments include results on the existence and non-existence of Monge-type solutions (e.g., [CD15, CDD15, CFK13, Fri19, BDGG12, CS16, BDPK20]), structural properties of Kantorovich potentials (e.g., [CDMS19, DGN17, GKR19, BCD17]), grand-canonical optimal transport [DMLN22], efficient computational algorithms (e.g., [BCN17, FSV22, CEL+19, MG19, KLLY19]) and the design of new density functionals (e.g., [GGGG19, CF15, MUMIGG14, LDMG+16]).
Moreover, recent works indicate that the solution of this symmetric Coulomb cost multi-marginal problem (MMOT), which is a probability measure on , is actually a sparse object at least in discrete settings. Two types of discrete settings have been considered so far where such sparsity results have been obtained. On the one hand, the most classical discrete approximation consists in introducing a discrete grid of . The discrete optimal transport plan is then defined as a discrete probability measure defined on the cartesian product grid . Actually, it was proved in [FV18, Vög21] that the discrete optimal transport plan does not charge all the points of the discrete cartesian product grid (of cardinality ) but only a number of points in this grid which scales at most linearly with . Finding the few points of which are actually charged by the discrete optimal transport plan is not a trivial task though, and the GenCol algorithm is a numerical procedure which aims at achieving this task. It has been first proposed in [FSV22], then extended in [FP22] and its convergence has been analyzed for two-marginal problems in [FP23]. On the other hand, an alternative approach which was first considered in [ACEL21] consists in introducing an approximation of the exact multi-marginal transport problems where the marginal constraints are replaced by a finite number of moment constraints associated to a finite number of ”moment functions” which are real-valued functions defined on . Under some natural assumptions, this approximate problem is then equivalent to approximating the solution of the dual problem associated to the exact optimal transport problem, namely the so-called Kantorovich potential, as a linear combination of these moment functions. The solution of this moment-contrained optimal transport problem is still a probability measure defined on but is also a sparse object in the sense that it can be written as a discrete measure charging a number of points belonging to which scales at most linearly with the number of moment constraints. Finding the location of these points then reads as a non-convex optimization problem defined on a continuous (and not a discrete set) set, and stochastic gradient algorithms have been proposed in [ACE22] in order to find such optimal points, and numerically tested on three-dimensional settings involving electrons. We also refer the reader to the works [CFM14, BCN17, NP22, Lel22, HCL23] where alternative numerical methods have been proposed for the computation of the SCE limit of the Lévy-Lieb functional, which do not rely on sparsity arguments.
The objective of this work is to prove similar sparsity results for the so-called Lieb functional, which is a convex relaxation of the Lévy-Lieb functional, the expression of which is given under the following form:
(1) |
where , denotes the set of non-negative trace-class self-adjoint operators on and where is the electronic density associated to , the precise definition of which will be given below. Actually, problem (1) is a particular instance of quantum optimal transport problem. We refer the reader to [GMP16, GP17] for references on earlier works on closely related types of problems. Notice that problem (1) can be understood as a quantum version of a multi-marginal optimal transport problem. Moreover, it still enjoys the nice property, as the original problem, of being a linear programming problem. Our aim here is to prove that solutions of approximations of problems (1) where the partial trace constraint is relaxed by a finite number of moment constraints enjoy similar sparsity properties than solutions of moment constrained multi-marginal symmetric classical optimal transport problems, such as those which were established in [ACEL21]. More precisely, we prove, using the so-called Tchakaloff’s theorem (notice that for the usual entropic regularization of MMOT we cannot use this kind of approach), that the solutions of moment constrained approximations of (1) can be written under the form , where scales at most linearly with the number of moment constraints, and where for all , , and is the orthogonal projector of onto the vectorial space spanned by (using bra-ket notation). We will, finally, exploit this sparsity structure in order to propose some numerical scheme in order to approximate the solution of (1). Notice that solving (5) for a small systems can be exploited (as done in some recent works for the Levy-Lieb functional [SPTP23, BVMTG22]) in order to build approximations of the Lieb functional for larger systems by means of machine learning techniques. Let us finally mention here that particular moment-constrained approximations of the Lieb functional have already been considered in [Gar22] for the construction of Kohn-Sham potentials. The novel results brought by the present contribution in comparison to the latter work is (i) the extension of existence and convergence results to more general moment constraints that the one considered in [Gar22]; (ii) the results on the sparsity structure of associated minimizers; (iii) convergence rate of the approximate ground state energy; and (iv) study of the mathematical properties of the associated dual problem.
The outline of the article is the following. In Section 2, we recall some fundamental results about the exact Lieb functional. The moment-constrained approximation we consider here and the associated sparsity result on their minimizers is presented in Section 3. Convergence results of the moment-constrained approximation towards the exact Lieb functional are presented in Section 4.1. In Section 4.2, we also prove some rates of convergence of the associated approximation of the ground state energy to the exact one. Section 5 is devoted to present some results about the dual formulation of the moment-constrained problem in the case of electronic density with support included in bounded domains. We, finally, introduce a numerical method in Section 6 exploiting the sparsity result and the convenient dual formulation as an SDP problem. Some numerical experiments for small systems are then predented.
2 The exact Lieb functional
Let us first introduce some notation together with the problem we consider in this work. We use here atomic units and neglect the influence of spin for the sake of simplicity.
Let denote the number of electrons in the molecule of interest. Let us assume that there are nuclei in the molecule, the positions and electric charges of which are denoted by and . For all , let us denote by
the Coulomb electric potential generated at by the nuclei.
Let and . For any , we denote by its norm and by the electronic density associated to the wavefunction , namely the real-valued function defined over as follows:
For a given set of nuclei positions and charges , one can compute the ground state energy as a minimization over a density , that is
(2) |
where and
(3) |
is called the Levy-Lieb functional. In ((3)), the function is defined as follows: for all ,
(4) |
The Levy-Lieb functional is the central object in Density Functional Theory and its knowledge would allow the computation the electronic ground state energy of any molecule. However, it turns out that is not convex, it is therefore convenient to look at a convexification proposed by Lieb [Lie83a] where the minimization is performed over the set of mixed states instead of the set of pure ones as in (3). More precisely, we consider here the alternative minimization problem
(5) |
where is a self-adjoint operator on with domain , denotes the set of trace-class self-adjoint non-negative operators on . For all , there exists an orthonormal basis of and a non-increasing sequence of non-negative numbers such that
(6) |
using so-called bra-ket notation. Then, the associated electronic density is defined as follows: for all ,
We know that there exist positive constants such that (in the sense of self- adjoint operators on ). We also denote by the set of self-adjoint operators on with finite kinetic energy, i.e. such that .
Remark 1.
It can then be easily checked that, if and only if and . Then, if admits an eigendecomposition of the form (6), necessarily as soon as .
Remark 2 (Convexification).
It is worth highlighting that is indeed the convexification of in the sense that
It is useful noticing that admits a dual problem.
Theorem 3 ([Lie83a]).
Duality holds in the sense that
(7) |
where
The constraint in (7) has to be understood in the sense of self-adjoint operators, namely for all , .
3 Moment-constrained approximation and sparsity result
We focus now on a first approximation of (5) by using the moment constraint approach which has previously been studied in the framework of classical optimal transport [ACEL21, ACE22]. We also refer to [Gar22] where a particular instance of moment-constrained approximation of the Lieb functional has been considered for the computation of Kohn-Sham potentials.
We begin by introducing here some notation. From now on, we fix an electronic density . Let us recall that we have . For any , we denote by
Let , given a collection of functions , the main idea of the moment-constrained approximation consists in replacing the density constraint in (5) with the scalar moment constraints associated to the functions , that is
(8) |
Notice that (8) is equivalent to
(9) |
We denote by the set of satisfying constraints (8) (or equivalently (9)).
In the following, we show that there exists at least one solution to the corresponding moment-constrained Lieb optimization problem admits a sparse solution , such that there exists an integer , weights and wavefunctions such that
(10) |
In other words, we will show that there exists a finite-rank minimizer the rank of which is at most .
3.1 Tchakaloff’s theorem on Hilbert spaces
Let us first recall the following proposition which is an immediate consequence of Tchakaloff’s theorem, see [BT06]. For any Hilbert space , we denote by the Borel -algebra of .
Proposition 5.
Let be a Borelian measure on a Hilbert space concentrated on a Borel set . Let and a Borel measurable map. Assume that the first moments of exists, that is
where denotes the Euclidean norm of . Then there exists an integer , elements and weights such that
where is the th component of , and .
The main idea of the proof of the sparsity result announced above is to define a measure associated to an operator . Assume that the operator can be written as
(11) |
for some sequence of normalized functions of and non-negative real numbers such that . Then we can define a Borelian measure associated to the decomposition (11) of the operator as
Naturally, there is no unique such measure associated with an operator since it heavily depends on the decomposition (11). However, we will see in the following that this is not a problem for our purpose here.
3.2 Existence of sparse minimizers for Moment Constrained Approximation of Lieb (MCAL) functional
In the following, we denote by the function defined over which is identically equal to .
We then have the following theorem, the proof of which is postponed to Section 7.1.
Theorem 6.
Let , and such that . Let us assume in addition that
-
(A)
there exists a non-negative non-decreasing continuous function such that and .
For all , let us introduce the Moment-Constrained Approximation of the Lieb functional (MCAL)
(12) |
where for all . Then, for all , is finite and a minimum. Moreover, for all , there exists a minimizer to (12) such that , for some , with and for all .
Remark 7.
Let us remark that the existence of a minimizer to a moment-constraint approximation of the Lieb functional has been investigated in [Gar22][Theorem 3.1]. More precisely, in the latter work, the author considers moment functions , where is a countable subset of , which forms a partition of unity of i.e. such that
In particular, . Note that in Theorem 6, assumption (A) can be seen as an additional condition on which enables to obtain tightness of minimizing sequences. Instead, the author of [Gar22] does not require additional conditions on but considers a tightness condition on the set which reads as
(13) |
where for all , denotes the open ball of of radius centered at . Note that our existence result, up to the cost of assuming that satisfies (A), allows to treat moment constraints for which the tightness condition (13) does not hold. For instance, one can consider a family of moment functions where are the characteristic functions of cells of a mesh associated to a bounded subdomain and ). It can then be easily checked that such a family does not satisfy condition (13).
Proposition 8 (Lower semi-continuity).
Suppose such that in then .
Proof.
The proof is a straightforward adaptation of the proof of Theorem 6. Assume that exists then up to the extraction of a subsequence, there exists a trace-class operator such that
weakly converges in the sense of trace-class operators to as goes to infinity. Moreover, we have that
In particular satisfies the right moment constraints associated to as well as . Then by using the same arguments as in step 2 of the proof above we deduce that is admissible for . It follows then
∎
Remark 9.
We see from the proof of Theorem 6 that assumption (A) is needed in order to obtain tightness of the sequence of kernel functions . This is needed because we are considering operators defined on the space . Notice that such a technical assumption is not needed in the case when one considers operators acting on functions acting on a finite domain with Dirichlet boundary conditions. We state such a result below without giving its proof since it follows exactly the same lines as the proof of Theorem 6.
Let be a bounded subdomain of . We then denote by , , and . The operator is then a self-adjoint bounded from below operator acting on with domain . We also denote by the set of non-negative self-adjoint trace-class operators on . We also define the set of function with support included in . For any and any and , we introduce the set of such that
Then, the following theorem holds:
Theorem 10.
Let , and such that . Let us introduce
(14) |
Then, is finite and there exists a minimizer to (14) such that , for some , with and for all . Moreover, suppose such that in then .
In view of the sparsity results we have just proved, it is natural to consider an approximate MCAL problem, where the set of minimizers is restricted to the set of finite-rank operators satisfying moment constraints. More precisely, for a given , we consider the following set
The approximate MCAL functional then reads as follows
(15) |
where
Remark 11.
Notice that as soon as then we have that .
Remark 12.
Since then the set is no empty. Moreover it can be shown, by standard arguments, that there exists a minimizer to (15).
As in the case of moment constrained optimal transport [ACE22] we can state some interesting mathematical properties on the set of minimizers of the approximate problem (15). First, consider two elements of , then there exists a continuous path in connecting these two elements and such that varies monotonically along it.
Theorem 13.
Let us assume that . Let . Then, there exists a continuous application made of polygonal chain such that , and such that the application is monotone.
Since the proof is a straightforward adaptation of the one for [ACE22][Theorem 1], we refer the reader to it. We only highlight that, as we did in the previous sections, given a couple one can always associate a measure , then by Thchakaloff’s theorem the result follows. An interesting consequence of theorem 13 concerns the minimizers of MCAL: first, as soon as any local minimizer of MCAL (or of problem (15)) is a global minimizer. Secondly, the set of minimizers forms a polygonally connected set.
4 Some convergence results
The aim of this section is to gather some convergence results on the MCAL approximation towards solutions of the exact problem.
4.1 Convergence of the MCAL functional to the exact Lieb functional
The aim of this section is to prove that, under some appropriate assumptions, the MCAL functional converges to the exact Lieb functional as the number of moment constraints go to infinity. Let us denote here by the set of real-valued functions defined on with compact support.
More precisely, let such that there exists a function satisfying assumption (A). Let and let .
For all , let and be a sequence of functions belonging to and which satisfies for all together with the following density conditions:
-
(A)
for all ,
Then, we have the following useful lemma that we will use in the sequel.
Lemma 15.
Let such that and such that for all ,
Then, converges in the sense of distributions to as goes to infinity.
Proof.
The proof uses the same lines as the proof of [Gar22][Theorem 3.2]. We rewrite it here for the sake of completeness. Let and let be a sequence of functions such that for all and . Then, it holds that
Hence the desired result. ∎
Remark 16.
One example of sequence satisfying (A) is the following: for all , let and let (with ) be a regular conforming triangular mesh of , the elements of which have a maximal diameter size such that . Let . Denoting by for and by and by for all , one can easily check that the sequence satisfies (A).
We then have the following convergence result, which may be seen as an extension of [Gar22][Theorem 3.2] to more general set of moment functions, up to the additional tightness assumption (A), the proof of which is postponed to Section 7.2.
Theorem 17.
Let such that there exists a function satisfying assumption (A). Let and . For all , let and such that assumption (A) holds. We assume in addition that there exists such that for all . Then, for all , there exists at least one sparse minimizer to (12) with in the sense of Theorem 6. Besides, it holds that
(16) |
Moreover, from any sequence such that is a minimizer for (12) with , one can extract a subsequence which strongly converges in to , where is a minimizer of (5).
Like in Section 3.2, we can state a similar result with less technical assumptions in the case when we consider operators acting on functions defined on a bounded subdomain with Dirichlet boundary conditions. We state such a result here, using the same notation as in Section 3.2, since it follows exactly the same lines of proof as Theorem 17. To this aim, for all , we introduce the exact Lieb functional defined on the domain as
(17) |
Let us point out here that there exists also such that
where refers here to the self-adjoint bounded from below operator on with domain (Laplacian with Dirichlet boundary conditions in ). We also denote by the set of operators such that .
Theorem 18.
Let . For all , let and such that for all ,
We assume in addition that there exists such that for all . Then, for all , there exists at least one sparse minimizer to (12) with in the sense of Theorem 6. Besides, it holds that
(18) |
Moreover, from any sequence such that is a minimizer for (12) with , one can extract a subsequence which strongly converges in to , where is a minimizer of (5).
4.2 Convergence rate of the ground state energy in the bounded domain case
In this section, we restrict ourselves to the case of a bounded subdomain . Let , be a set of moment functions. For all , let us introduce the ground state energy associated to the potential :
where
Rewriting the minimization over as an external minimization over and then as an internal one over all such that , it can easily be checked that
(19) |
Let us also define by
(20) |
Similarly, let us point out that, if , rewriting the minimization over as an external minimization over and then as an internal one over all , it holds that
We then prove the following approximation result.
Proposition 19.
Let us assume that and that . Then, it holds that
(21) |
Proof.
Let . Let arbitrarily small. Let , , and be -minimizers of , , and respectively. It then holds that
Using similar calculations, we obtain that
As a consequence, we obtain that
Since can be chosen arbitrarily small, it actually holds that
(22) |
Using similar arguments, we also obtain that
(23) |
Collecting (22) and (23) and using the fact that yields the desired result. ∎
Proposition 19 then enables to quantify the rate of convergence of as goes to infinity for some particular sequences of moment functions provided that is regular enough. As an illustration, we analyze here the rate of convergence of a numerical method inspired from the external dual charge approach recently proposed in [Lel22].
Corollary 20.
Let and be a bounded regular subdomain of . Let be an external density of charge and define as the unique solution to
Let be a sequence of triangular regular meshes of such that
Let and be the subspace of continuous finite element functions associated to the mesh . We denote by the subspace of containing all functions solution to
for some . Let be a basis of . Then, asuming that , there exists a constant such that for all ,
Proof.
Corollary 20 easily follows for the compact embedding and standard interpolation error results associated with finite element approximations. ∎
Remark 21.
Denoting by the dimension of , it holds that . As a consequence, the above result implies that the rate of convergence of to decays like where is the number of moment constraints in the MCAL approximation.
5 Duality results for the MCAL functional
Let us begin by recalling some classical results about semi-definite programming problems and introduce some notation.
5.1 Semi-definite positive programming problems
Let . We denote by the set of symmetric matrices of . For any , the notation (respectively ) is used to mean that is a semi-definite non-negative (respectively definite positive) matrix. We also denote by and by . For all , we denote by the Frobenius scalar product between and .
Let , , a linear application and . We consider here the following (primal) semi-definite positive programming problem:
(24) |
The dual problem associated to (24) then reads as follows:
(25) |
where is the adjoint of .
We introduce the following sets:
We also denote by and the set of solutions to (24) and (25). Then, we recall the following classical result [AL11, WSV12]:
Theorem 22.
-
(i)
If , is non-empty and bounded and ;
-
(ii)
If and surjective, then is non-empty and bounded and ;
-
(iii)
If and surjective, then and are non-empty and bounded and .
5.2 Dual MCAL problem
In this section we study the dual problem in the bounded domain case. We know that the dual variable associated to the density is a one-body interaction potential of the form for a given .
We then consider the following natural dual problem
(26) |
where
If we take any satisfying the above constraints and any then we have
which proves that . We would like to prove that this inequality is actually an equality. Let us introduce the ground state energy associated to the potential :
We rewrite now the minimization over as an external minimization over and then as an internal one over all in (we are considering the ground state for a potential ):
Notice that is nothing but the Legendre-Fenchel transform of . On the other hand, we rewrite (26) in the form
(27) |
Thus, is the Legendre transform of . From Proposition 8 and Fenchel duality theorem for convex lower semi-continuous functions we conclude the following
Theorem 23.
Under the assumptions of Theorem 10, we have .
We now have the following result which, taking into account the sparsity result of Theorem 10, gives a more convenient formulation of .
6 Numerical scheme
The aim of this section is to propose a new numerical scheme using the sparsity of minimizers of the MCAL functional to compute approximations of the Lieb functional. The scheme proposed here requires the resolution of eigenvalue problems for operators acting on , which leads to high-dimensional problems when the number of electrons is large. The combination of the algorithm proposed here with numerical methods dedicated to overcome the curse of dimensionality will be the object of a future work.
We propose here an iterative scheme which shares some common features with the well-known Column Algorithm used for classical optimal transport problems (see for instance [FP22, FSV22]). The aim is to construct at each iteration a finite set of -normalized wavefunctions which will be used to enforce the inequality constraints in the resolution of the MCAL dual problems. More precisely, inequality constraints in small-dimensional dual problems are enforced to hold on the space spanned by the wavefunctions belonging to the set . As a consequence, in our present quantum optimal transprt framework, semi-definite programming problems have to be solved at each iteration instead of linear programming problems for classical optimal transport problems.
6.1 MCAL iterative scheme
We describe the MCAL algorithm in this section. The algorithm takes as input data:
-
•
set of moment functions;
-
•
with support included in ;
-
•
for all .
-
•
initial finite set of -normalized wavefunctions.
As an output, after iterations, the algorithm yields which is an approximation of the quantity .
We make here the following assumption on the initial set .
Assumption (A0): let and an orthonormal basis of . We assume that there exists such that for all ,
In the case when , a way to find such an initial set is given in [Lie83b]. In this case, can be chosen to be equal to and can be chosen as follows: for all and , define
where for all ,
The family then forms an orthonormal family of and one may define as the normalized Slater determinant associated to the family .
6.1.1 Initialization step
Compute solution to
(28) |
Then, it holds that where are the eigenvalues of (assumed to be ranked in non-increasing order) and for all , is a normalized eigenvector associated with so that forms an orthonormal basis of .
Let . For all , let and . We also denote by .
Remark 25.
Remark 26.
Notice also that this initialization step is useless in the case when .
6.1.2 Iteration
Step 1: Let and be an orthonormal basis of . Let be defined by
Let and
Compute solution to
(30) |
Remark 27.
Step 2: Compute a -normalized solution to
(31) |
where is the smallest eigenvalue of .
Step 3: We now distinguish two different cases.
-
•
Case 1:
Define . Let and let be an orthonormal basis of .
Compute solution to
(32) Then, it holds that where are the eigenvalues of (assumed to be ranked in non-increasing order) and for all , is a normalized eigenvector associated with so that forms an orthonormal basis of .
Let . For all , let and . We then denote by .
Define and proceed with the next iteration.
-
•
Case 2:
Stop the algorithm.
6.2 Property of the MCAL iterative scheme
We prove the following lemma, which states that the sequence of approximations yielded by the MCAL algorithm is non-increasing. Note however that we do not prove here that the sequence converges indeed to .
Lemma 28.
For all , it holds that
Proof.
The second equality comes from the fact that
In addition, we know, by definition of and of , …, that there exists at least one minimizer to the second minimization problem which is a positive definite matrix, that is the diagonal matrix with entries . Using standard results of semi-definite positive programming, it holds that the dual problem associated to the second minimization problem introduced in the last line of the calculations above is precisely
Hence the desired result. ∎
6.3 Numerical results
The numerical tests presented in this section were performed using Julia. In particular, the finite element code developped in [QC23] was used to solve the eigenvalue problems (31), and the ProxSDP library was used for the resolution of the semi-definite programming problems. The associated code can be found on ZENODO with the DOI 10.5281/zenodo.11669900.
We present in this section some preliminary numerical results on a toy numerical test case with , and with . More precisely, for a given value , the solution of problems (31) is approximated using a Galerkin approximation in the finite element () discretization space
where
and
The moment functions are chosen to be hat functions associated to a uniform discretization of so that
The electronic density of choice is constructed as follows: we define, for all ,
Then, we define and
We then apply the MCAL algorithm starting from .
Let us first highlight the influence of the parameter on the performance of the algorithm in terms of the number of iterations required to achieve numerical convergence. We first conduct a first series of tests with and .
Figure 1 highlights the behaviour of the numerical scheme with respect to .
The upper (respectively lower) figure shows the values of (respectively ) as a function of for different values of . As predicted by our theoretical results, for any value of , the sequence is non-increasing and we also checked numerically that for all . In constrast, the sequence is not monotonous. We also observe that for any tested value of , the sequence converges to the same limit value. It seems that for greater values of , the number of iterations needed for the algorithm to converge is lower.
Figure 2 highlights the behaviour of the numerical scheme with respect to the number of moment constraints. In these tests, and .
Again, the upper (respectively lower) figure shows the values of (respectively ) as a function of for different values of . As before, we observe that the sequence is non-increasing and we also checked numerically that for all . In constrast, the sequence is not monotonous. We also observe that for any tested value of , the sequence converges to some limit value denoted here by which depends on . We observe again that the value of does not increase monotonically with , which stems from the fact that the spaces do not form an increasing family of vector spaces for the inclusion. However, is still holds that , and we indeed observe that , which is coherent with the variational structure of the moment constraint approach studied here. We also observe that the value of seems to stagnate in most of the numerical tests (except the one corresponding to ) to a value close to .
Lastly, Figure 3 shows the plots of the potential obtained after running iterations of the MCAL algorithm for various values of (). We observe that the potential value seems to converge to some limit value of increases. However, the number of moment constraints should definitely be higher to obtain a better accuracy, which was not possible with our current implementation. More evolved versions of the present MCAL algorothm should be designed to alleviate this bottleneck, which will be the object of a future work.
7 Proofs
We gather in this section the proofs of our main theoretical results.
7.1 Proof of Theorem 6
Proof of Theorem 6.
Step 1: (Finiteness) Since , there exists at least one element such that . Denoting by , it can then be easily seen that and that . Thus, we immediately obtain that for all , .
Step 2: (Existence of minimizer) Let be a minimizing sequence associated to (12). Then, we know from the proof of Theorem 4.4 of [Lie83a] that, up to the extraction of a subsequence, there exists a trace-class operator such that weakly converges in the sense of trace-class operators to as goes to infinity. To prove that is a minimizer to (12), it is sufficient to prove that satisfies
For all , let us denote by the kernel of and by the kernel of . Let us also denote for all ,
and by
for all . Let us prove that is a tight sequence. Indeed, let and be the ball of radius of . Then, denoting by the characteristic function of the set , it holds that for all ,
Let us denote by the multiplication operator by any function bounded with compact support on . We then know from the proof of Theorem 4.4 of [Lie83a] that
This, together with the tightness result above, yields that weakly converges to in . It then easily follows that for all
and that
The operator is thus a minimizer of (12). In particular, since , it holds that .
Step 3: (Existence of a sparse minimizer)
Let us now introduce the function such that for all
and
It can then be easily seen that is a continuous map on .
Let be a minimizer of (12). Then, there exists a countable index set , an orthonormal family of and a family of positive numbers such that (this comes from the fact that ) and
In addition, it can be easily checked that for all . We then define which is a Borel measure on since is finite and . It can then be easily checked that
Thus, by Proposition 5, there exist , and such that
Denoting by , it can then be easily checked that is also a minimizer to (12). Hence the desired result.
∎
7.2 Proof of Theorem 17
Proof of Theorem 17.
The first assertion of the theorem is a direct consequence of Theorem 6. Using the same arguments as in the proof of Theorem 6, one can easily obtain that the sequence is compact in . Thus, up to the extraction of a subsequence there exists such that and such that weakly converges to in the sense of trace-class operators of .
Moreover, following again the same lines of proof, we obtain that the sequence weakly converges in to . As a consequence, it holds that . Moreover, since for all , , using Lemma 15, we then obtain that, necessarily, . This makes admissible for (5) so that we have that . Notice now that for all , . Thus for any converging subsequence of to some limit , it holds that . For this subsequence, still denoted by for the sake of simplicity, it holds that , and we then have that
Thus, necessarily, is a minimizer of (5). Moreover, for any extracted subsequence so that . Using the compactness of the Fock space of bounded particle number for the geometric convergence [Lew11][Lemma 2.2, Lemma 2.3], we thus obtain the desired result. ∎
7.3 Proof of Theorem 24
Proof of Theorem 24.
Step 1: Let us first prove that there exists a maximizer to the optimization problem
(33) |
We denote here by the set of symmetric matrices of . For any , let us consider the linear form defined as follows:
where
Let us now consider the vectorial space
The space is a finite-dimensional subspace of the set of linear forms on , and its dimension is lower or equal to the dimension of . Let be a basis of . By construction, there exists such that for all . Let us then denote by . It can then be easily checked that any element of can be rewritten as
where and such that . In particular, this implies that since for all ,
Thus, proving that there exists a maximizer to (33) is equivalent to proving that there exists a maximizer to
(34) |
Now, by definition of , it holds that the application defined so that for all and all ,
is surjective. Indeed, this comes from the fact that . It can then be easily checked that (34) is then equivalent to the dual semi-definite programming problem:
(35) |
where is such that for all and with
Indeed, if is a maximizer to (35), it holds that is a maximizer to (34), and thus to (33).
The primal problem associated to (35) reads as
(36) |
Let us also remark that for all if and only if . Thus, this implies that there exists at least one minimizer to (36) which is given by and is positive definite. Using Theorem 22, we then obtain the existence of at least one maximizer to (35), and hence to (33) and (34).
Step 2: To conclude the proof of the desired result, it only remains to show that
On the one hand, it holds from Theorem 23, that . On the other hand, using similar arguments as in the proof of Theorem 23, it holds that
where . Since, by definition of , …, , it holds that , we obtain the desired result. ∎
Acknowledgements
This publication is part of a project that has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme – Grant Agreement 810367 L.N. is partially on academic leave at Inria (team Matherials) for the year 2022-2023 and 2023-2024 and acknowledges the hospitality if this institution during this period. His work benefited from the support of the FMJH Program PGMO, from H-Code, Université Paris-Saclay and from the ANR project GOTA (ANR-23-CE46-0001).
References
- [ACE22] Aurélien Alfonsi, Rafaël Coyaud, and Virginie Ehrlacher. Constrained overdamped Langevin dynamics for symmetric multimarginal optimal transportation. Mathematical Models and Methods in Applied Sciences, 32(03):403–455, 2022.
- [ACEL21] Aurélien Alfonsi, Rafaël Coyaud, Virginie Ehrlacher, and Damiano Lombardi. Approximation of optimal transport problems with marginal moments constraints. Mathematics of Computation, 90(328):689–737, 2021.
- [AL11] Miguel F Anjos and Jean B Lasserre. Handbook on semidefinite, conic and polynomial optimization, volume 166. Springer Science & Business Media, 2011.
- [BCD17] G. Buttazzo, T. Champion, and L. De Pascale. Continuity and estimates for multimarginal optimal transportation problems with singular costs. Appl. Math. Optim., August 2017.
- [BCN17] Jean-David Benamou, Guillaume Carlier, and Luca Nenna. A numerical method to solve multi-marginal optimal transport problems with Coulomb cost. In Splitting Methods in Communication, Imaging, Science, and Engineering, pages 577–601. Springer, 2017.
- [BDGG12] Giuseppe Buttazzo, Luigi De Pascale, and Paola Gori-Giorgi. Optimal-transport formulation of electronic density-functional theory. Phys. Rev. A, 85:062502, Jun 2012.
- [BDPK20] Ugo Bindini, Luigi De Pascale, and Anna Kausamo. On Seidl-type maps for multi-marginal optimal transport with Coulomb cost. arXiv preprint arXiv:2011.05063, 2020.
- [BT06] Christian Bayer and Josef Teichmann. The proof of Tchakaloff’s theorem. Proceedings of the American mathematical society, 134(10):3035–3040, 2006.
- [BVMTG22] Yuanming Bai, Leslie Vogt-Maranto, Mark E Tuckerman, and William J Glover. Machine learning the hohenberg-kohn map for molecular excited states. Nature communications, 13(1):7044, 2022.
- [CD15] Maria Colombo and Simone Di Marino. Equality between Monge and Kantorovich multimarginal problems with Coulomb cost. Ann. Mat. Pura Appl. (4), 194(2):307–320, 2015.
- [CDD15] Maria Colombo, Luigi De Pascale, and Simone Di Marino. Multimarginal optimal transport maps for one-dimensional repulsive costs. Canad. J. Math., 67:350–368, 2015.
- [CDMS19] Maria Colombo, Simone Di Marino, and Federico Stra. Continuity of multimarginal optimal transport with repulsive cost. SIAM J. Math. Anal., 51(4):2903–2926, 2019.
- [CEL+19] Rafael Coyaud, Virginie Ehrlacher, Damiano Lombardi, et al. Approximation of optimal transport problems with marginal moments constraints. Technical report, 2019.
- [CF15] Huajie Chen and Gero Friesecke. Pair densities in density functional theory. Multiscale Modeling & Simulation, 13(4):1259–1289, 2015.
- [CFK13] Codina Cotar, Gero Friesecke, and Claudia Klüppelberg. Density functional theory and optimal transportation with Coulomb cost. Comm. Pure Appl. Math., 66(4):548–599, 2013.
- [CFM14] Huajie Chen, Gero Friesecke, and Christian B Mendl. Numerical methods for a Kohn-Sham density functional model based on optimal transport. Journal of chemical theory and computation, 10(10):4360–4368, 2014.
- [CS16] Maria Colombo and Federico Stra. Counterexamples in multimarginal optimal transport with Coulomb cost and spherically symmetric data. Mathematical Models and Methods in Applied Sciences, 26(06):1025–1049, 2016.
- [DGN17] Simone Di Marino, Augusto Gerolin, and Luca Nenna. Optimal Transportation Theory with Repulsive Costs, volume “Topological Optimization and Optimal Transport in the Applied Sciences” of Radon Series on Computational and Applied Mathematics, chapter 9, pages 204–256. De Gruyter, June 2017.
- [DMLN22] Simone Di Marino, Mathieu Lewin, and Luca Nenna. Grand-canonical optimal transport. arXiv preprint arXiv:2201.06859, 2022.
- [FP22] Gero Friesecke and Maximilian Penka. The GenCol algorithm for high-dimensional optimal transport: general formulation and application to barycenters and Wasserstein splines. arXiv preprint arXiv:2209.09081, 2022.
- [FP23] Gero Friesecke and Maximilian Penka. Convergence proof for the GenCol algorithm in the case of two-marginal optimal transport. arXiv preprint arXiv:2303.07137, 2023.
- [Fri19] Gero Friesecke. A simple counterexample to the Monge ansatz in multimarginal optimal transport, convex geometry of the set of Kantorovich plans, and the Frenkel–Kontorova model. SIAM Journal on Mathematical Analysis, 51(6):4332–4355, 2019.
- [FSV22] Gero Friesecke, Andreas S Schulz, and Daniela Vögler. Genetic column generation: Fast computation of high-dimensional multimarginal optimal transport problems. SIAM Journal on Scientific Computing, 44(3):A1632–A1654, 2022.
- [FV18] Gero Friesecke and Daniela Vögler. Breaking the curse of dimension in multi-marginal Kantorovich optimal transport on finite state spaces. SIAM Journal on Mathematical Analysis, 50(4):3996–4019, 2018.
- [Gar22] Louis Garrigue. Building Kohn-Sham potentials for ground and excited states. Archive for Rational Mechanics and Analysis, 245(2):949–1003, 2022.
- [GGGG19] Augusto Gerolin, Juri Grossi, and Paola Gori-Giorgi. Kinetic correlation functionals from the entropic regularisation of the strictly-correlated electrons problem. Journal of Chemical Theory and Computation, 16(1):488–498, 2019.
- [GKR19] Augusto Gerolin, Anna Kausamo, and Tapio Rajala. Duality theory for multi-marginal optimal transport with repulsive costs in metric spaces. ESAIM: Control, Optimisation and Calculus of Variations, 25:62, 2019.
- [GMP16] François Golse, Clément Mouhot, and Thierry Paul. On the mean field and classical limits of quantum mechanics. Communications in Mathematical Physics, 343:165–205, 2016.
- [GP17] François Golse and Thierry Paul. The Schrödinger equation in the mean-field and semiclassical regime. Archive for Rational Mechanics and Analysis, 223:57–94, 2017.
- [HCL23] Yukuan Hu, Huajie Chen, and Xin Liu. A global optimization approach for multimarginal optimal transport problems with Coulomb cost. SIAM Journal on Scientific Computing, 45(3):A1214–A1238, 2023.
- [KLLY19] Yuehaw Khoo, Lin Lin, Michael Lindsey, and Lexing Ying. Semidefinite relaxation of multi-marginal optimal transport for strictly correlated electrons in second quantization, 2019.
- [LDMG+16] G. Lani, S. Di Marino, A. Gerolin, R. van Leeuwen, and P. Gori-Giorgi. The adiabatic strictly-correlated-electrons functional: kernel and exact properties. Phys. Chem. Chem. Phys., 18:21092–21101, 2016.
- [Lel22] Rodrigue Lelotte. An external dual charge approach to the optimal transport with Coulomb cost. arXiv preprint arXiv:2208.14762, 2022.
- [Lew11] Mathieu Lewin. Geometric methods for nonlinear many-body quantum systems. Journal of Functional Analysis, 260(12):3535–3595, 2011.
- [Lie83a] Elliott H. Lieb. Density functionals for Coulomb systems. Int. J. Quantum Chem., 24:243–277, 1983.
- [Lie83b] Elliott H. Lieb. On the lowest eigenvalue of the Laplacian for the intersection of two domains. Invent. Math., 74(3):441–448, 1983.
- [LLS19] Mathieu Lewin, Elliott H Lieb, and Robert Seiringer. Universal functionals in density functional theory. arXiv preprint arXiv:1912.10424, 2019.
- [MG19] Simone Di Marino and Augusto Gerolin. An optimal transport approach for the Schrödinger bridge problem and convergence of Sinkhorn algorithm, 2019.
- [MUMIGG14] A. Mirtschink, C. J. Umrigar, J. D. Morgan III, and P. Gori-Giorgi. Energy density functionals from the strong-coupling limit applied to the anions of the he isoelectronic series. J. Chem. Phys., 140(18):18A532, 2014.
- [NP22] Luca Nenna and Brendan Pass. An ODE characterisation of multi-marginal optimal transport with pair-wise cost functions. arXiv preprint arXiv:2212.12492, 2022.
- [QC23] Xue Quan and Huajie Chen. A finite element configuration interaction method for wigner localization. Journal of Computational Physics, 489:112251, 2023.
- [SPTP23] Xuecheng Shao, Lukas Paetow, Mark E Tuckerman, and Michele Pavanello. Machine learning electronic structure methods based on the one-electron reduced density matrix. arXiv preprint arXiv:2302.10741, 2023.
- [Vög21] Daniela Vögler. Geometry of Kantorovich polytopes and support of optimizers for repulsive multi-marginal optimal transport on finite state spaces. Journal of Mathematical Analysis and Applications, 502(1):125147, 2021.
- [WSV12] Henry Wolkowicz, Romesh Saigal, and Lieven Vandenberghe. Handbook of semidefinite programming: theory, algorithms, and applications, volume 27. Springer Science & Business Media, 2012.