Abstract
We develop new numerical schemes for Vlasov–Poisson equations with high-order accuracy. Our methods are based on a spatially monotonicity-preserving (MP) scheme and are modified suitably so that the positivity of the distribution function is also preserved. We adopt an efficient semi-Lagrangian time integration scheme that is more accurate and computationally less expensive than the three-stage TVD Runge–Kutta integration. We apply our spatially fifth- and seventh-order schemes to a suite of simulations of collisionless self-gravitating systems and electrostatic plasma simulations, including linear and nonlinear Landau damping in one dimension and Vlasov–Poisson simulations in a six-dimensional phase space. The high-order schemes achieve a significantly improved accuracy in comparison with the third-order positive-flux-conserved scheme adopted in our previous study. With the semi-Lagrangian time integration, the computational cost of our high-order schemes does not significantly increase, but remains roughly the same as that of the third-order scheme. Vlasov–Poisson simulations on mesh grids have been successfully performed on a massively parallel computer.
Export citation and abstract BibTeX RIS
1. Introduction
Numerical simulations of collisionless self-gravitating systems are one of the indispensable tools in the study of galaxies, galaxy clusters, and the large-scale structure of the universe. Gravitational N-body simulations have been widely adopted, and significant scientific achievements have been made for the past four decades. Particles in N-body simulations represent the structure of the distribution function in the phase space in a statistical manner. Similarly, particle-in-cell (PIC) simulations (Birdsall & Langdon 1991) are widely used for studying the dynamics of an astrophysical plasma such as particle acceleration in collisionless shock waves (Matsumoto et al. 2015) and magnetic reconnection (Fujimoto 2014).
Particle-based N-body simulations have several shortcomings, most notably the intrinsic shot noise in physical quantities such as mass density and velocity field. The discreteness of the mass distribution sampled by a finite number of point mass elements (particles) affects the gravitational dynamics considerably in regions with a small number of particles, in other words, where the phase space density is low. For example, gravitational N-body simulations cannot treat collisionless damping of density fluctuations and the two-stream instability in a fluid/plasma with a large velocity dispersion because of poor sampling of particles in the velocity space. Interestingly, physically equivalent situations are observed in PIC simulations (Kates-Harbeck et al. 2016). In PIC simulations, the spacing of grids in the physical space, on which electric and magnetic fields are calculated by solving the Maxwell equations, is constrained not to be larger than the Debye length. Hence, the simulation volume is also restricted for a given number of grids.
To overcome these shortcomings, a number of alternative methods to the N-body approach have been proposed. Hernquist & Ostriker (1992) and Hozumi (1997) use the self-consistent field (SCF) method, in which particles move on trajectories on a smooth gravitational potential computed by a basis-function expansion of the density field and gravitational potential. Unlike in conventional N-body simulations, particle motions do not suffer from numerical effects owing to discrete gravitational potential. However, a drawback or complication of the SCF method is that the basis functions need to be suitably chosen for a given configuration of the self-gravitating systems.
Abel et al. (2012), Shandarin et al. (2012), and Hahn et al. (2013) develop a novel method to simulate cosmic structure formation by following the motion of dark matter "sheets" in the phase space volume, while retaining the smooth, continuous structure. The method suppresses significantly artificial discreteness effects in N-body simulations such as a spurious filament fragmentation. The method, however, can only be applied to "cold" matter with sufficiently small velocity dispersions compared with its bulk velocity. Vorobyov & Theis (2006) and Mitchell et al. (2013) solve moment equations of the collisionless Boltzmann equation in a finite-volume manner by adopting a certain closure relation. The closure relation needs to be chosen in a problem-dependent manner, and often the validity of the adopted relation remains unclear or its applicability is limited.
A straightforward method to simulate a collisionless plasma or a self-gravitating system is directly solving the collisionless Boltzmann equation, or the Vlasov equation. Such an approach was originally explored by Janin (1971) and Fujiwara (1981) for one-dimensional self-gravitating systems and by Nishida et al. (1981) for two-dimensional stellar disks. It is found to be better in following kinetic phenomena such as collisionless damping and two-stream instability than conventional N-body simulations.
The Vlasov–Poisson and Vlasov–Maxwell equations are solved to simulate the dynamics of electrostatic and magnetized plasma, respectively. An important advantage of this approach is that there is no constraint on the mesh spacing, unlike in PIC simulations. In principle, direct Vlasov simulations enable us to simulate the kinematics of an astrophysical plasma with a wide dynamic range, which cannot be easily achieved with PIC simulations.
Despite its versatile nature, the Vlasov simulations have been applied only to problems in one or two spatial dimensions. Solving the Vlasov equation with three spatial dimensions, i.e., in a six-dimensional phase space, requires an extremely large amount of memory. Yoshikawa et al. (2013) successfully perform Vlasov–Poisson simulations of self-gravitating systems in six-dimensional phase space, achieving high accuracies in mass and energy conservation. Unfortunately, the number of grids is still limited by the amount of available memory rather than the computational cost. With currently available computers, it is not realistic to achieve a significant increase in spatial (and velocity) resolution by simply increasing the number of mesh grids. Clearly, developing high-order schemes is necessary to effectively improve the spatial resolution for a given number of mesh grids. In the present paper, we develop and test new numerical schemes for solving the Vlasov equation based on the fifth- and seventh-order monotonicity-preserving (MP) schemes (Suresh & Huynh 1997). We improve the schemes suitably so that the positivity of the distribution function is ensured. We run a suite of test simulations and compare the results with those obtained by the spatially third-order positive-flux-conservative (PFC) scheme (Filbet et al. 2001) adopted in our previous work (Yoshikawa et al. 2013, hereafter YYU13).
The rest of the paper is organized as follows. In Section 2, we describe our new numerical schemes to solve the Vlasov equation in higher-order accuracy in detail. Section 3 is devoted to presenting several test calculations to show the technical advantage of our approach over previous methods. We address the computational costs of our schemes in Section 4. Finally, in Section 5, we summarize our results and present our future prospects.
2. Vlasov–Poisson Simulation
We follow the time evolution of the distribution function according to the following Vlasov (collisionless Boltzmann) equation:
where and are the Cartesian spatial and velocity coordinates, respectively. For self-gravitating systems, the Vlasov–Poisson equation is given as
where ϕ is the gravitational potential satisfying the Poisson equation
where G is the gravitational constant and ρ is the mass density. Throughout the present paper, the distribution function is normalized such that its integration over velocity space yields the mass density. We discretize the distribution function in the same manner as in YYU13; we employ a uniform Cartesian mesh in both the physical configuration space and the velocity (momentum) space.
2.1. Advection Solver
The Vlasov Equation (2) can be broken down into six one-dimensional advection equations along each dimension of the phase space: three for advection in the physical space
and another three in the velocity space
We solve these six equations sequentially. The gravitational potential ϕ in Equation (5) is updated after advection Equation (4) in the physical space is solved and the mass density distribution in the physical space is updated. Clearly, an accurate numerical scheme for a one-dimensional advection scheme is a crucial ingredient of our Vlasov solver.
Let us consider a numerical scheme of the advection equation
on a one-dimensional uniform mesh grid. In a finite-volume manner, we define an averaged value of over the ith grid centered at with the interval of at a time as
where are the coordinates of the boundaries of the ith mesh grid. Without loss of generality, we can restrict ourselves to the case with a positive advection velocity .
Physical and mathematical considerations impose the following requirements on numerical solutions of the advection equation: (i) Monotonicity—since the solution of the advection Equation (6) preserves monotonicity, so should its numerical solution. (ii) Positivity—the distribution function must be positive by definition, and thus the numerical solution should be non-negative. Note that monotonicity is not sufficient to ensure positivity of the numerical solution.
2.1.1. MP Schemes with the Positivity-preserving Limiter
We first present the spatially fifth-order scheme for the advection Equation (6) based on the MP5 scheme of Suresh & Huynh (1997), in which the value of at the interface between the th and ith mesh grids is reconstructed with five stencils , , , fni and as
In order to ensure monotonicity of the numerical solution, we impose the constraint
The detailed prescriptions of the constraints are given in Appendix A. The MP5 scheme adopts the three-stage TVD Runge–Kutta (TVD–RK) time integration as
where is the CFL number and the operator Li(f) is defined as
Hereafter, this scheme is referred to as RK–MP5.
As already noted above, the MP scheme does not ensure positivity of the numerical solution. We thus introduce a positivity-preserving (PP) limiter based on the method proposed by Hu et al. (2013). By denoting and , the single-stage Eulerian time integration can be written as
Here, we consider a linear combination of the interface value obtained with the MP scheme and that with the spatially first-order upwind scheme as
where is expressed as
and is a parameter with a condition of . Note that the spatially first-order upwind scheme preserves positivity of the numerical solution if the CFL condition is satisfied. Then, the time integration of the combined interface values (15) can be written as
where and . Therefore, if both of the terms on the right-hand side of Equation (19) are positive, is also positive. A simple calculation shows that it can be realized by setting the parameter as
where
and
In addition, the condition of can be met by setting and as a sufficient condition, which leads to a constraint on the CFL parameter as . Let us denote the above procedures to evaluate the positive interface value (15) as
It can be shown that positivity of the numerical solution is preserved even with the three-stage TVD–RK time integration by setting the CFL parameter to . In the following, we call this scheme the RK–MPP5 scheme (MPP for "monotonicity and positivity preserving").
It is tedious but straightforward to construct the spatially seventh-order schemes, RK–MP7 and RK–MPP7, by simply replacing in Equation (9) with the seventh-order interpolation
2.1.2. Semi-Lagrangian Time Integration
The MP schemes described above adopt the three-stage TVD–RK time integration (10), which needs explicit Euler integration three times per single time step. It can be a source of significant computational cost. To overcome this drawback, we devise an alternative method based on the semi-Lagrangian (SL) time integration.
The averaged value of over the ith mesh grid at a time of is written as
where is the position of the characteristic curve at a time of t1 originating from the phase space coordinate , and for the advection Equation (6), ; see Figure 1. Denoting
and
we can rewrite Equation (25) in terms of numerical flux as
By interpolating the discrete values of fnj, we can compute the numerical fluxes and . Actually, the PFC scheme also adopts essentially the same approach. It adopts the third-order reconstruction to interpolate fnj with a slope limiter and suppresses numerical oscillation while preserving positivity of the numerical solutions. For more detailed description of the PFC scheme, see Filbet et al. (2001) and the Appendix of YYU13.
Our new schemes are based on the conservative semi-Lagrangian (CSL) scheme (Qiu & Christlieb 2010; Qiu & Shu 2011), which is originally coupled with a finite-difference scheme but is also applicable to the finite-volume schemes. In the fifth-order CSL scheme, can be expressed with five stencil values as
where
and
The seventh-order CSL scheme is given by
where
and
However, the numerical solution obtained with the CSL scheme does not preserve the monotonicity. Thus, we apply the MP constraint to and as
and
The numerical solution with the monotonicity preservation at is then computed as
which is referred to as the SL–MP5 and SL–MP7 schemes (SL for "semi-Lagrangian") for the fifth- and seventh-order accuracies, respectively.
The PP limiter described in the previous subsection can be applied as
and
Then, the numerical solution at is given by
We refer to these schemes as the SL–MPP5 and SL–MPP7 schemes hereafter.
2.2. Poisson Solver
The Poisson Equation (3) is numerically solved with the convolution method (Hockney & Eastwood 1981) using the fast Fourier transform (FFT). Under periodic boundary conditions, the discrete Fourier transform (DFT) of the gravitational potential can be written as
where is the DFT of the density field, is the discrete wavenumber vector, and is the Fourier-transformed Green function of the discretized Poisson equation. The second-order discretization of the Green function, , is given by
with . Since we adopt advection solvers with spatially fifth- and seventh-order accuracy, it is reasonable to adopt a Poisson solver with the same or similar order of accuracy, which can be simply constructed by high-order discretization of the Poisson equation. As shown in Appendix C, the DFTs of the Green functions with spatially fourth- and sixth-order accuracy are given by
and
respectively. The gravitational potential is computed with the inverse DFT of .
For isolated boundary conditions, the doubling-up method (Hockney & Eastwood 1981) is adopted, where the number of mesh grids is doubled for each spatial dimension and the mass density in the extended grid points is zeroed out. The Green function is set up in the real space as
and is Fourier-transformed to obtain . Then, the gravitational potential can be calculated in the same manner as in the periodic boundary condition.
After obtaining the gravitational potential, we compute gravitational forces at each of the spatial grids with the finite-difference approximation (FDA). Again we adopt FDA schemes with the same order of accuracy as that of the gravitational potential calculation. The two-, four- and six-point FDAs are described in Appendix C. In what follows, we adopt the sixth-order calculation of the gravitational forces and potentials irrespective of adopted advection schemes because we are primarily interested in the accuracy of our advection equation solver.
2.3. Parallelization and Domain Decomposition
We parallelize our Vlasov–Poisson solvers by decomposing the six-dimensional phase space into distributed memory spaces. We decompose the computational domain in the phase space in the same manner as YYU13. The physical space is divided into rectangular subdomains along x-, y-, and z-directions, while the velocity space is not decomposed; each spatial point has an entire set of velocity mesh grids. This decomposition is convenient when computing the velocity moments of the distribution function such as the mass density, mean velocities, and velocity dispersions.
In solving the advection equations in the physical space (4) across the decomposed subdomains, the values of the distribution function adjacent to the boundaries of the subdomains are transferred using the Message Passing Interface (MPI). The number of adjacent mesh grids to be transferred is two for the spatially third-order PFC scheme and three and four for spatially fifth- and seventh-order MP schemes, respectively.
2.4. Time Integration
We advance the distribution function from to using the directional splitting method on the order of
where denotes an operator for advection along the ℓ-direction with time step . It should be noted that the Poisson equation is solved after the advection operations in the physical space () to update the gravitational potential and forces for the updated density field. We adopt the directional splitting method rather than the unsplit method because the former is the combination of one-dimensional advection schemes and hence rigorously assures the monotonicity and the positivity even in the multidimensional problems and also because the latter requires a larger amount of memory space to temporally store the numerical fluxes or the updated distribution function. Furthermore, the unsplit method requires a relatively smaller time step than the directional splitting method to achieve the same level of numerical accuracy. See Section 3.1.4 for the comparison of the dimensional splitting and unsplit methods in the two-dimensional advection problems. The temporal accuracy of the directional splitting method is in the second order at best. However, we find that the errors in the numerical solutions presented in the following numerical test are mostly dominated by ones that originate from the spatial discretization, and the spatial resolution is more important than the temporal accuracy in terms of the quality of the numerical solution.
As for time step width, we have several constraints on the CFL parameter. The MP schemes are subject to the constraint with if the parameter α in the MP scheme is set to (see Appendix A) and the PP limiter requires . As shown in Appendix B, all of our new schemes work well for and . In practice, the condition is desirable, and thus we set and throughout the present paper unless otherwise stated.5
We set the time step for integrating the Vlasov equation as
where and are the time steps for advection equations in physical and velocity space, respectively. is given by
where denotes the spacing of mesh grids along the ℓ-direction and is the maximum extent of the velocity space along the vℓ-direction. Similarly, we compute as
where is the gravitational acceleration along the ℓ-direction at the ith mesh grids in the physical space and the minimization is taken over all the spatial mesh grids.
3. Test Calculations
In this section, we present the results of several test simulations with our new schemes. The test suite includes simulations of one- and two-dimensional advection and Vlasov–Poisson simulations in two- and six-dimensional phase space.
3.1. Test-1: Advection Equation
As the first of the test suite of our new schemes, we perform numerical simulations of the advection equation. Here, we solve the one- and two-dimensional advection equations. In the one-dimensional tests, we examine the fundamental properties of our new schemes such as monotonicity, positivity, and accuracy. In the two-dimensional tests, we present the applicability of our one-dimensional schemes to multidimensional problems.
The first test run we perform is the one-dimensional advection problem. We solve
with our new schemes. Since the accuracy of the numerical solutions depends on the shape of , we consider three initial conditions and examine critically the numerical solutions.
3.1.1. Test-1a: Rectangular-shaped Function
We configure the rectangular-shaped initial condition given by
in the simulation domain over and integrate Equation (59) using PFC, RK–MPP5, RK–MPP7, SL–MPP5, and SL–MPP7 under the periodic boundary conditions. Figure 2 shows the numerical solutions at t = 8 with the number of mesh grids . The numerical solutions with RK–MPP5, SL–MPP5, and SL–MPP7 are less diffusive compared with that of PFC. Note that RK–MPP7 yields undesirable asymmetric features around and 0.7. Actually these features are persistent irrespective of and ν and are likely caused by the lack of accuracy in the time integration. In order to avoid these features, we would need the fourth- or higher-order TVD–RK integration schemes, which are computationally quite expensive (see Section 4). Thus, we exclude RK–MP7 and RK–MPP7 in the rest of our test simulations. Unlike RK–MPP7, SL–MPP7 does not exhibit such features, indicating that the SL time integration method is able to be combined with spatially seventh- or higher-order schemes. We confirm that the spatially ninth-order CSL scheme combined with the SL method works well at least in one-dimensional advection problems. Figure 3 shows the relative error of the numerical solutions to the analytic solution defined as
Since there are discontinuities in the initial condition, the numerical fluxes computed with PFC and our new schemes are effectively of the first order, avoiding numerical oscillations. This is why the relative errors scale with the first-order power law.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution image3.1.2. Test-1b: Sinusoidal Wave
As a good example of advection of a smooth function, we consider the initial condition given by
in the periodic domain of . Figure 4 shows the relative errors of numerical solutions obtained with RK–MPP5, SL–MPP5, and SL–MPP7 in the left panel and with RK–MP5, SL–MP5, and SL–MP7 without the PP limiter in the right panel as a function of . For comparison, the result obtained with PFC is also presented in both panels. The numerical solutions obtained with RK–MPP5 have nearly third-order accuracy for , whereas those with RK–MP5 have fifth-order accuracy. This is because the PP limiter operates at the minima of numerical solutions at each stage of the TVD–RK time integration and degrades overall accuracy. Contrastingly, the accuracies of the SL schemes are not affected by the PP limiter, i.e., the results of SL–MPP5 and SL–MPP7 show fifth- and seventh-order scaling.
Download figure:
Standard image High-resolution image3.1.3. Test-1c: Quartic Sine Function
The final example is the initial condition given by
This case serves as a good example to examine how well the PP limiter works. Figure 5 shows the numerical solutions with at t = 8 computed with PFC, RK–MP5, and RK–MPP5 (left panel) and those obtained with SL–MP5, SL–MPP5, and SL–MPP7 (right panel). The PFC solution is significantly smeared near the maxima and minima, although it preserves positivity. The numerical solutions obtained with RK–MP5 and SL–MP5 without the PP limiter exhibit noticeable negative values around the minima. On the other hand, RK–MPP5, SL–MPP5, and SL–MPP7 successfully preserve positivity and reproduce the analytic solution very well. Note that SL–MPP7 is better at reproducing the profiles around the maxima than SL–MPP5 and RK–MPP5, showing the importance of high-order accuracy. As can be seen in Figure 6, the order of accuracy for RK–MPP5, SL–MPP5, and SL–MPP7 is actually close to the fifth, and the result with SL–MPP7 is more accurate.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution image3.1.4. Test-1d: Two-dimensional Advection
Here we present numerical simulations of two-dimensional advection problems with our new schemes to test their numerical accuracy in multidimensional problems. We solve a two-dimensional advection equation
where cx and cy are advection speed along the x- and y-direction, respectively. We set up a two-dimensional domain with with a periodic boundary condition and consider three initial conditions similar to those presented in the one-dimensional tests: a rectangular-shaped wave given by
a sinusoidal wave
and a quartic sine wave
In all three runs, we set the advection velocities to . The simulation domain is discretized into the Cartesian mesh grids with and mesh grids along the x- and y-directions, respectively.
We compute the relative error of numerical solutions to the analytic solution defined by
where is a numerical solution at and xi and yj are the x and y coordinates of the ith and jth mesh grids in the x- and y-direction, respectively. Figure 7 shows the relative errors of numerical solutions obtained with PFC, SL–MPP5, and SL–MPP7 combined with the directional splitting or the unsplit methods, in which the numbers of mesh grids along each dimension are set to , 64, 128, 256, and 512.
Download figure:
Standard image High-resolution imageFor the advection of a rectangular-shaped function, the scaling of the relative errors is nearly first order in space irrespective of adopted numerical schemes, while the results obtained with SL–MPP7 and SL–MPP5 are less diffusive than those obtained with PFC. On the other hand, numerical solutions of the advection of a two-dimensional sinusoidal wave obtained with PFC, SL–MPP5, and SL–MPP7 give the third-, fifth-, and seventh-order scalings of the relative error, respectively, in which the sinusoidal shape is so smooth that the MP constraint and PP limiter do not affect the overall accuracy of the schemes. In the case of a quartic sine function, the PP limiter compromises the numerical accuracy around the minima of the wave, and the numerical solutions with SL–MPP5 and SL–MPP7 exhibit nearly fifth-order accuracy, while the scaling of the relative errors obtained with the PFC scheme is not better than the third order. All these results are consistent with what we have in the one-dimensional Test-1a, Test-1b, and Test-1c, indicating that our new schemes also exhibit spatially high-order accuracies in the multidimensional problems.
As described in Section 2.4, we do not adopt the unsplit method for the time integration, due to its several disadvantages against the directional splitting method. It is, however, useful to make a comparison of numerical accuracy obtained with these two methods. The relative errors obtained with the unsplit method combined with PFC, SL–MPP5, and SL–MPP7 are also shown in each panel of Figure 7. We heuristically find that the CFL parameter must be less than 0.1 in the runs with the unsplit method to keep the monotonicity and positivity of the numerical solutions, and we set in computing the results with the unsplit method. It is clearly seen that the unsplit method yields almost the same level of numerical errors as the directional splitting method. We also find that the computational cost in computing with the unsplit method is larger than that of the directional splitting method by % in terms of wall-clock time per step.
3.2. Test-2: Vlasov–Poisson Simulation of One-dimensional Gravitational Instability and Collisionless Damping
Next, we perform Vlasov–Poisson simulations of spatially one-dimensional self-gravitating systems in two-dimensional phase space volume. We consider the periodic simulation domain with and set the initial condition to be
where σ is the velocity dispersion, is the mean density of the system, and A is the amplitude of density inhomogeneity. This distribution function yields the initial mass density as
and the density inhomogeneity develops through the gravitational instability when the wavenumber k satisfies the condition , where is the Jeans wavenumber defined by
In this test, we set the wavenumber k to , the velocity dispersion σ so as to be , and the amplitude to A = 0.01. The velocity space is defined over , where V = L/T and T is the dynamical timescale defined by . We adopt outflow boundary conditions in the velocity space. Figure 8 shows the maps of the phase space density computed with PFC, RK–MPP5, SL–MPP5, and SL–MPP7 at , , and . The numbers of mesh grids in physical and velocity space are set to . The phase space density computed with PFC with third-order spatial accuracy is significantly smeared especially at later epochs compared to the results with other spatially higher-order schemes. The results with RK–MPP5 and SL–MPP5 schemes, both of which have the spatially fifth-order accuracy, appear almost identical to each other. SL–MPP7 is able to resolve finer spiral-arm structure of matter distribution in the phase space located at the turnaround position ( and ).
Download figure:
Standard image High-resolution imageFigure 9 compares the results computed with RK–MPP5, SL–MPP5, and SL–MPP7 with the PP limiter (top panels) and with RK–MP5, SL–MP5, and SL–MP7 without it (bottom panels). The regions with negative phase space density depicted by white dots in the bottom panels are not present in the results computed with the PP limiter.
Download figure:
Standard image High-resolution imageTo measure the accuracy of the numerical solutions obtained with various schemes, we compute the following quantity:
where is a reference solution to be compared with. Since we do not have any analytical solution for this test, we construct the reference solution in the following manner: First, we conduct runs with using SL–MPP7 and apply the fifth-order spline interpolation to the results so that the values of the numerical distribution function can be directly compared between different numbers of mesh grids. The left panel of Figure 10 shows the relative errors of the numerical solutions at computed using PFC, RK–MPP5, SL–MPP5, and SL–MPP7 with –2048 mesh grids. The scalings of the relative errors of our new schemes and the PFC scheme are nearly in second and first order, respectively, and somewhat compromised compared with their formal orders of accuracies, because the functional shape of the distribution function is not so smooth that the MP constraint and the PP limiter effectively work. The accuracies of RK–MPP5, SL–MPP5, and SL–MPP7 for a given number of mesh grids are significantly better than PFC with the same number of mesh grids and are similar to or even better than that with four times as many mesh grids.
Download figure:
Standard image High-resolution imageWe also measure the accuracies of our new schemes in the runs with , in which the density fluctuation damps through the collisionless damping or the free streaming. In the case with , we set the amplitude of density fluctuation to A = 0.1 and extend the velocity space as since the velocity dispersion is larger than the case with . We compute the relative errors of the distribution function at in the same manner as done in the case with . The right panel of Figure 10 shows the relative errors of numerical solutions computed with our new schemes as well as PFC for –2048 mesh grids. In this test, the shape of the distribution function is relatively smoother than the runs with , and the numerical solutions are not as strongly affected by the MP constraint and the PP limiter as the runs with . Accordingly, the scalings of the relative errors are better than those in the case with and are nearly third and second order for our new schemes and PFC, respectively. Note that the accuracies of RK–MPP5 and SL–MPP5 are almost the same as each other in both cases with and 1.1, indicating that the RK schemes and SL schemes yield almost the same accuracy in these tests.
3.3. Test-3: Landau Damping in Electrostatic Plasma
We perform simulations of Landau damping in an electrostatic plasma, in which the time evolution of the distribution function of electrons is described by the following Vlasov equation:
where is the electrostatic potential and e and m are the charge and mass of an electron, respectively. The electrostatic potential satisfies the Poisson equation
where n is the number density of ions. In this formulation, the distribution function is normalized such that its integration over the velocity space gives the fraction of electron number density relative to ions.
We consider a two-dimensional phase space with and , where and is the electron plasma frequency and the periodic boundary condition is imposed on the physical space. The initial condition is given by
where the wavenumber is set to and . With this initial condition, it is expected that the electric field damps exponentially as with .
First, we show results of linear Landau damping in which we set the amplitude of the initial perturbation to A = 0.01. We set . Figure 11 shows time evolution of the amplitude of the electric field with a mode of obtained with PFC and SL–MPP5. The results obtained with RK–MPP5 and SL–MPP7 are almost identical to that of SL–MPP5 and are not presented in this figure. Although the numerical solution with PFC ceases to damp at , the one obtained with the SL–MPP5 scheme continues to damp to , just before the recurrence time , indicating that SL–MPP5 is less dissipative than PFC.
Download figure:
Standard image High-resolution imageNext, we perform simulations of nonlinear Landau damping with A = 0.5. This problem has been investigated in a number of previous studies (e.g., Klimas 1987; Manfredi 1997; Nakamura & Yabe 1999). In this problem, the initial perturbations damp (in space) quickly, but a highly stratified distribution in the phase space develops with time through rapid phase mixing (see Figure 12). The amplitude of the electric field initially decreases, but particle trapping by the electric field prevents complete damping. Figure 12 shows the phase space density of electrons at , , , and computed with PFC, RK–MPP5, SL–MPP5, and SL–MPP7. All the results are obtained with and . Rapid phase space mixing generates a number of peaks along the velocity axis. The PFC solution exhibits a smeared distribution at , while those computed with other higher-order schemes preserve features of phase mixing even at . This is also seen in Figure 13, in which we show the velocity distribution function
at . There are a number of peaks in the velocity distribution function, which represent electrons trapped by electrostatic waves around their phase velocities. The peaks in the distribution function are more prominent in the results with fifth- and seventh-order schemes than in PFC, again showing the less dissipative nature of our schemes.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution image3.4. Test-4: Spherical Collapse of a Uniform-density Sphere
Finally, we study gravitational collapse of a uniform-density sphere. The mass and the initial radius of the sphere are M and , respectively, and the initial distribution function is given by
where is the initial density of the sphere and is the initial velocity dispersion of the matter inside the sphere. We set the initial virial ratio to be 0.5. The extent of the simulation volume is in the physical space and , where V is given by V = R/T and is the dynamical timescale of the system. In this test, we adopt the out-going boundary condition in both the physical and velocity spaces. We perform the runs with for the physical space and or mesh grids for the velocity space.
We plot the phase space density in the -plane with and at , , , and in Figure 14. We set . Initially, the sphere starts to collapse toward the center and reaches the maximum contraction around . Then the system expands again, and a part of the matter passes through the center of the system to reach the outer boundary of the physical space defined at , , and . The results computed with PFC show a smeared distribution compared with those obtained with RK–MPP5, SL–MPP5, and SL–MPP7. We note that RK–MPP5 and SL–MPP5 yield almost identical results, although the computational costs for the two are significantly different (see Section 4).
Download figure:
Standard image High-resolution imageFigure 15 is the same plot as Figure 14 but shows the results with . It is interesting that the results obtained by SL–MPP7 with and (bottom panels in Figure 15) are similar to or even better than those obtained with PFC with (top panels in Figure 14), despite the eight times smaller number of mesh grids.
Download figure:
Standard image High-resolution imageFor this test, we compare our new schemes with another independent scheme. To this end, we utilize a high-resolution N-body simulation in the following manner. First, we perform an N-body simulation starting from the initial condition equivalent to Equation (77). In this N-body simulation, the gravitational potential field is dumped and stored at all the time steps. Then, we compute the phase space density at a given phase space coordinate by tracing back the coordinate to its initial coordinate using the previously stored gravitational potentials. Since the phase space density is constant along each trajectory, we can, in principle, compute the phase space density from the initial phase space coordinate using Equation (77). However, trajectories in conventional N-body simulations are subject to rather grainy gravitational potential compared to what it should be in the actual physical system owing to their superparticle approximation. Thus, we adopt the self-consistent field (SCF) method (Hernquist & Ostriker 1992; Hozumi 1997), in which the gravitational potential is computed by expanding the density field into a set of basis functions, and particle trajectories are calculated for the smooth gravitational potential. The initial condition is constructed with 107 equal-mass particles, and the particle orbits are computed using the basis set constructed by Clutton-Brock (1972), where the numbers of the expansion terms are 64 and 144 in the radial and angular directions, respectively. Figure 16 shows the phase space density obtained with this procedure. Comparing with Figures 14 and 15 manifests that our spatially higher-order schemes (RK–MPP5, SL–MPP5, and SL–MPP7) yield results much closer to those obtained with the SCF approach. It should be noted that, in practice, the SCF method is applicable only to self-gravitating systems with a certain spatial symmetry, such as in the present test, to which the lowest-order functions of the basis set suffice to describe the overall structure of the system. Also, storing the gravitational potential (or its expansion) at all the time steps is very costly. Therefore, the SCF approach cannot be applied to a wide range of self-gravitating systems. For further comparison with the results obtained by the SCF methods, we perform a Vlasov–Poisson run with the largest numbers of mesh grids, and using SL–MPP5, and show the phase space density at , , , and in the -plane in Figure 17. With such high resolution, the phase space density is consistent with that obtained with the SCF method in detail.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageWe further examine the mass and energy conservation for this spherical collapse test. We show the time evolution of the total mass, kinetic energy, and gravitational potential energy of the results simulated with PFC and SL–MPP5 in Figure 18, where the results obtained with and are shown in the left and right panels, respectively. Note that the results simulated with RK–MPP5 and SL–MPP7 are almost the same as those of SL–MPP5 and hence are not shown in this figure. For comparison, we also plot the results obtained with an N-body simulation, in which we employ 1283 particles, and adopt the Particle-Mesh (PM) method with the number of mesh grids in solving the Poisson equation equal to that of the Vlasov–Poisson simulation, i.e., 64 mesh grids in each dimension, in order to realize the same spatial resolution of the gravitational field and force. The total mass in the Vlasov simulation starts to decrease around because part of the matter reaches the boundary of the simulation volume at that time, as can be seen in Figures 16 and 17. In the runs with PFC, such a decrease of the total mass begins earlier and the amount of the escaped mass is larger than that in the runs with SL–MPP5 and other higher-order schemes owing to larger numerical diffusion, irrespective of . The most prominent difference can be seen around the maximum contraction epoch , when the kinetic and gravitational potential energies have their maximum and minimum values, respectively. As can be seen in Figures 14 and 15, the large numerical diffusion of PFC, in both the physical and velocity spaces, smears the "S"-shaped phase space distribution and enlarges the physical extent of the system at , yielding small kinetic and shallow gravitational potential energies seen in Figure 18. The overall accuracy can be improved for PFC by simply increasing the number of mesh grids from to 643. On the other hand, the result of SL–MPP5 is close to the N-body simulation, irrespective of , and in the runs with both of the results with PFC and SL–MPP5 are in good agreement with the N-body simulation. This suggests that SL–MPP5 (as well as RK–MPP5 and SL–MPP7) with provides roughly the same level of accuracy as PFC with in terms of mechanical energy.
Download figure:
Standard image High-resolution image4. Computational Cost
In addition to the computational accuracy, we need to address the computational speed of our new schemes. We measure the wall-clock time to advance each of Equations (4) and (5) by a single time step using PFC, RK–MPP5, SL–MPP5, and SL–MPP7 for various combinations of and ranging from 323 to 1283 on 64 nodes of the HA-PACS system at the Center for Computational Sciences, University of Tsukuba. The largest number of mesh grids adopted in this measurement is . Figure 19 shows the measured wall-clock time per mesh grid in six-dimensional Vlasov–Poisson simulations of the growth of gravitational instability from a nearly uniform-density distribution that is identical to Test-4 in YYU13. We show the wall-clock time to advance the advection equations in the physical space (Equation (4)) and those in the velocity space (Equation (5)) in the left and right panels, respectively. One can see that RK–MPP5 is the most computationally expensive among all the advection schemes described in this work. This is largely because it performs three-stage RK time integration (Equation (10)) and computes numerical fluxes three times per step while other schemes need only single-stage time integration (Equation (49)). Since the physical space is divided into subdomains and stored in independent memory spaces, solving advection Equation (4) in the physical space requires internode communications to transfer phase space densities to/from adjacent computational nodes and thus takes a longer wall-clock time than solving Equation (5) in the velocity space. The computational costs of SL–MPP5 and SL–MPP7 are significantly smaller than that of RK–MPP5 and are almost the same as that of PFC.
Download figure:
Standard image High-resolution image5. Summary and Conclusion
In this work, we present a set of new spatially fifth- and seventh-order advection schemes developed for Vlasov simulations that preserve both monotonicity and positivity of numerical solutions. The schemes are based on the spatially fifth-order MP scheme proposed by Suresh & Huynh (1997) combined with the three-stage TVD–RK scheme and are extended so that they also preserve positivity by using a flux limiter. We also devise an alternative approach to construct MP and PP schemes by adopting the SL time integration scheme, for the substitution of the three-stage TVD–RK scheme. With this approach, we are able to adopt spatially seventh- or higher-order schemes without using computationally expensive fourth- or higher-order TVD–RK time integration schemes and hence to reduce the computational cost.
We perform an extensive set of numerical tests to verify the improvement of numerical solutions using our newly developed schemes. The test suite includes one- and two-dimensional linear advection problems, one-dimensional collisionless self-gravitating and electrostatic plasma systems in two-dimensional phase space, and a three-dimensional self-gravitating system in six-dimensional phase space. The results are compared with those obtained with the spatially third-order PFC scheme adopted in our previous work (Yoshikawa et al. 2013) and are summarized as follows.
In Test-1, we present calculations of one-dimensional linear advection problems and confirm that numerical solutions computed with our new schemes have better accuracy than PFC. The PP limiter works well in every advection test, although it introduces a slight degradation of numerical accuracy in terms of the error scaling in combination with the TVD Runge–Kutta time integration scheme. It is found that the SL time integration provides us with better numerical accuracy than the TVD–RK scheme and that the combination of the PP limiter and the SL time integration does not introduce any expense of numerical accuracy unlike the TVD–RK time integration. It should be noted that the spatially seventh-order RK–MPP7 exhibits undesirable features in the advection of a rectangular-shaped function originating from insufficient accuracy of the three-stage RK time integration scheme, and that the numerical solutions obtained with SL–MPP7 contrastingly do not have such features, indicating that the SL approach is crucial for constructing schemes with spatially seventh- or higher-order accuracy. Numerical solutions of two-dimensional linear advection problems computed with our new semi-Lagrangian schemes are consistent with those in the one-dimensional ones, proving that our new schemes work well also in multidimensional problems.
In Test-2 and Test-3, we perform Vlasov–Poisson simulations of one-dimensional self-gravitating and electrostatic plasma systems in two-dimensional phase space, respectively. In the self-gravitating system, we found that the phase space density distribution simulated with the PFC scheme significantly suffered from numerical diffusion compared with those simulated with our new schemes. In terms of the L1 error of the distribution function, the fifth-order schemes (RK–MPP5 and SL–MPP5) yield almost identical results and achieve almost the same level of numerical accuracy as the third-order scheme with four times refined mesh grids, while the spatially seventh-order SL–MPP7 is proved to be the most accurate scheme among all the schemes presented in this work. In addition, the PP limiter also works well in the Vlasov simulations. In the simulations of electrostatic plasma, we perform the linear and nonlinear Landau damping problems. In the former, we found that numerical solution computed with the third-order scheme ceases damping sufficiently before the recurrence time, due to its large numerical diffusion. On the other hand, the numerical solutions with our new schemes continue to damp just before the recurrence time. As for the nonlinear Landau damping, the highly stratified distribution formed through rapid phase mixing is well solved with our new schemes, while it is strongly smeared in the numerical solution computed with PFC, due to its large numerical diffusion.
Finally, in Test-4 we perform Vlasov–Poisson simulations of a fully three-dimensional self-gravitating system in a six-dimensional phase space, in which runs with two different sets of numerical resolution and are conducted. By comparing the distribution functions obtained with the Vlasov–Poisson simulation with the reference solution obtained by exploiting the SCF method, one can see that our new schemes yield better numerical solutions than the third-order scheme, and the advantages of these schemes are confirmed also in Vlasov simulations in six-dimensional phase space. Comparison of the results between the runs with two different numerical resolutions tell us that our new schemes yield numerical solutions equivalent to those obtained by the third-order PFC scheme with eight times refined mesh grids. We also perform a Vlasov–Poisson simulation with 1283 mesh grids each for the spatial and velocity space, 1286 mesh grids in total, which successfully produce the apparently same phase space distribution as that obtained by exploiting the SCF method.
As for the computational costs of our new schemes, RK–MPP5 is the most expensive and requires two to three times longer wall-clock time to advance a single time step than other schemes. This is mainly because it adopts the three-stage TVD–RK time integration scheme and operates the Eulerian time integration three times per step. On the other hand, the SL schemes perform only a single Eulerian time integration per step; hence, their computational costs are significantly less expensive than RK–MPP5 and are nearly the same as that of PFC, indicating that the SL schemes achieve the improvement of effective spatial resolution without a significant increase of computational costs.
Among all the new schemes described in this paper, the spatially seventh-order SL scheme (SL–MPP7) is the most favored one in terms of numerical accuracy and computational cost, although the spatially fifth-order SL scheme can be a viable option when the capacity of internode communication is not sufficient. As a similar scheme to SL schemes, Qiu & Christlieb (2010) and Qiu & Shu (2011) develop conservative semi-Lagrangian WENO schemes to solve the Vlasov equation that adopts weighted essentially non-oscillatory (WENO) reconstruction to compute the interface values instead of the MP reconstruction adopted in our schemes. The WENO reconstruction is, however, known to be more diffusive and more computationally expensive compared with the MP scheme (Suresh & Huynh 1997).
Since Vlasov simulations require a huge amount of memory to store the distribution function in six-dimensional phase space volume, the size of Vlasov simulations is mainly limited by the amount of total available memory rather than the computational cost. Adoption of spatially higher-order advection schemes in Vlasov simulations enables us to effectively improve the numerical resolution without increasing the number of mesh grids. In this sense, our new spatially higher-order advection schemes are of crucial importance in numerically solving Vlasov equations. We implement our new advection schemes to Vlasov–Poisson simulations that can be applied to collisionless self-gravitating and electrostatic plasma systems, and we confirmed that our schemes really improve the numerical accuracies of Vlasov–Poisson simulations for a given number of mesh grids. Though we have applied our schemes only to Vlasov–Poisson simulations, we are going to apply our schemes to Vlasov–Maxwell simulations for magnetized plasma systems in a future work, in which difficulties in following gyro motion around magnetic field lines or an advection by rigid rotation in velocity space have been reported by many authors (Minoshima et al. 2011, 2013, 2015).
We thank Professor Shunsuke Hozumi for providing distribution function data of Test-4 computed with his SCF code and for his valuable comments on this work. This research is supported by MEXT as Priority Issue on Post-K computer (Elucidation of the Fundamental Laws and Evolution of the Universe) and JICFuS and uses computational resources of the K computer provided by the RIKEN Advanced Institute for Computational Science through the HPCI System Research project (project ID:hp160212 and hp170231). This research used computational resources of the HPCI system provided by Oakforest–PACS through the HPCI System Research Project (project ID:hp170123). Part of the numerical simulations are performed using HA-PACS and COMA systems at the Center for Computational Sciences, University of Tsukuba, and Cray XC30 at the Center for Computational Astrophysics, National Astronomical Observatory of Japan.
Appendix A: Monotonicity-preserving Constraint
The MP scheme devised by Suresh & Huynh (1997) computes the interface value at a mesh boundary between the ith and th mesh grids with five stencil values, three in the upwind side and two in the downwind side, in the following procedures:
- 1.
- 2.Evaluate , where a parameter α controls the upper bound of the interface value and should hold the condition . Throughout in this work, we set , unless otherwise stated.
- 3.If the condition holds, the resultant interface value is given by . If not, we should proceed to the next step.
- 4.Compute , and , where .
- 5.Compute , , , and .
- 6.Compute , , and , .
- 7.Finally, the interface value is given by median .
It should be noted that the time step constraint to keep the monotonicity of numerical solutions depends on the parameter α and that the CFL parameter should satisfy the condition (Suresh & Huynh 1997)
if we set the parameter α to as recommended in Suresh & Huynh (1997), although works well in practice. (see Appendix B for further argument on a viable range of the CFL parameter).
Appendix B: CFL Condition
As noted in Appendix A, the choice of α and ν is rather heuristic in the MP scheme. Therefore, we also optimize the combination of α and ν in our new schemes RK–MPP5, SL–MPP5, and SL–MPP7 with the PP limiter in a heuristic manner. We perform Test-1c (linear advection of ) using RK–MPP5, SL–MPP5, and SL–MPP7 with various combinations of α and ν. The number of mesh grids is set to 128. Figure 20 shows maps of relative L1 errors of Test-1c obtained with the three schemes for and . Comparison of RK–MPP5 and SL–MPP5 shows that SL–MPP5 allows us to adopt a larger CFL parameter ν than RK–MPP5 to achieve a certain accuracy. Furthermore, errors obtained with SL–MPP5 are almost unchanged irrespective of the adopted CFL parameter as long as and , probably in virtue of its semi-Lagrangian nature. The SL–MPP7 scheme exhibits more accurate results than the SL–MPP5 scheme for and , and the parameter space that yields the most accurate results is located in the vicinity of the relation .
Download figure:
Standard image High-resolution imageAppendix C: The High-order Poisson solver
We derive the Green function of the discretized Poisson equation in spatially second-, fourth-, and sixth-order accuracy. Here, for simplicity, we only consider the one-dimensional Poisson equation. The extension to the multidimensional one is quite straightforward. Given the discrete density field with , where is the number of mesh grids, approximating the Poisson equation with the second-order discretization leads to
Discrete Fourier transformation (DFT) of Equation (79) yields
and can be expressed as
where the "hat" denotes the DFT of physical quantities expressed as
is the discrete wavenumber, and the periodic boundary condition is assumed. The discrete Poisson equations in fourth- and sixth-order accuracy are given by
and
respectively. Therefore, the corresponding Fourier-transformed potentials can be derived as
and
respectively.
Gravitational force is numerically obtained with the FDA of the gradient of the gravitational potential. Use of the higher-order Poisson solvers motivates us to adopt higher-order FDAs for the gravitational force. The gradient of the gravitational potential at is approximated as
and
with the two-, four-, and six-point FDAs, respectively.
To check the numerical accuracy of the gravitational potential and force obtained with the numerical schemes described above, we solve the one-dimensional Poisson equation with the periodic boundary condition under the density field given by
in the domain , where the wavenumber k is set to and we have 16 waves in the domain. The gravitational potential and force can be analytically solved as
and
respectively. Figure 21 shows the relative errors of gravitational potentials and forces defined by
and
respectively. Note that the gravitational forces are computed with an FDA scheme with the same order of accuracy as the Poisson solver, i.e., the gravitational forces with second-order accuracy are computed with the gravitational potentials calculated with the second-order Poisson solver through the second-order FDA scheme. It can be seen that both the gravitational potentials and forces have the expected order of accuracy.
Download figure:
Standard image High-resolution imageFootnotes
- 5
The PFC scheme is free from any explicit constraints on time step thanks to its semi-Lagrangian nature in solving the one-dimensional advection equation. Nonetheless, it is better to adopt a certain constraint in domain-decomposed (and hence parallelized) Vlasov simulations in multidimensional phase space. With large time steps, the trajectories of characteristic lines originating from the boundaries of decomposed domains reach further from the boundaries, and the number of grids to be passed to the adjacent domains becomes also larger, resulting in the increase of the amount of data exchange between parallel processes.