
1 Introduction

Predicting hydrological phenomena is of great importance for various fields such as climate change impact studies and flood prediction. An outline of the principles and structure of physically-based hydrological models is given in [8]. Experimental hydrology often provides the scientific basis for hydrological catchment models. However, the great complexity, size and uniqueness of hydrological catchments make a methodology involving physical experiments unfeasible, if not impossible, for catchments exceeding a few hectares. Hydrologists remedy this by performing in silico experiments, simulating the hydrological processes in the basins.

Different classes of hydrological models exist. In this study, we focus on distributed land-surface models (dLSMs). dLSMs emphasize the detailed modelling of vertical processes and the correct simulation of the mass and energy balance at the land surface. To this end, they solve the hydrological water balance equation globally for the entire domain and locally for the subdomains dictated by the discretization.

A key characteristic of dLSMs is their focus on vertical surface processes, such as evapotranspiration, snow processes, infiltration and plant growth. Lateral processes such as discharge concentration and lateral subsurface flow are simulated in a simplified way, if they are simulated at all. For this investigation we consider vertical and lateral processes in a decoupled way. The reason for this is that vertical processes can, in general, be computed concurrently for all points in the domain. Lateral processes, in contrast, require information from the surrounding area and therefore cannot be computed concurrently. They consist mainly of simple fluid dynamics simulations and are computationally less demanding than the vertical processes. However, the structure of communication is dictated by the lateral processes, so a scalable, parallel dLSM requires an integrated approach that considers both aspects.

Various computer codes exist that implement different aspects of the hydrological theory behind dLSMs. WaSiM [22], for example, is a hydrological model that solves the one-dimensional Richards equation to simulate vertical soil water movement. PROMET [16] features a detailed simulation of plant growth. The open-source code WRF-Hydro [9] was designed to link multi-scale process models of the atmosphere and terrestrial hydrology. These models are the basis for a number of different applications, ranging from flood prediction [10] and climate change impact studies [17] to land use scenario evaluation [19].

In contrast to dLSMs, which focus on surface processes, integrated hydrological models such as Parflow [18] or HydroGeoSphere [6] focus on the simulation of subsurface flow. They solve the three-dimensional Richards equation fully coupled to the surface runoff. Parallel implementations of integrated hydrological models that scale to supercomputer capabilities are already available [7]. However, for a number of reasons, integrated hydrological models cannot be used as a substitute for dLSMs.

The hydrological modelling community using dLSMs is currently moving from desktop models to small-size distributed clusters. Currently, dLSMs do not scale well beyond 64 processors [24]. The authors of [4] list four examples related to global environmental change that require high-resolution models of terrestrial water on a global scale.

Advanced methods for Bayesian inference and uncertainty quantification, such as [3], are often based on Markov chain Monte Carlo methods, which require a large number of sequential model evaluations. In order to obtain results for dLSM-based hydrological analyses in a reasonable amount of time, the execution time of a single model evaluation needs to be reduced. Parallelization and deployment on supercomputers is one possibility to achieve this. In summary, better parallelization schemes and dLSMs that scale to the abilities of modern supercomputers would greatly advance the capabilities of hydrologists.

Parallelization efforts have been undertaken for a number of hydrological models. However, the employed strategies rarely aim for ideal load balancing and minimized communication effort. In [14], a hydrological model is parallelized under the assumption that downstream cells require information from upstream cells. The catchment is interpreted as a binary tree, which is partitioned and distributed among processors using a master-slave approach. With this approach, the authors decrease the execution time of the serial algorithm by a factor of five on multiple processors. WRF-Hydro partitions the domain into rectangular blocks [9], yet we are not aware of any scaling experiments for this implementation. In WaSiM [22], the domain is partitioned by distributing different rows to different processors. The parallelization strategies of WRF-Hydro and WaSiM both disregard the communication structure that is necessary to compute the flow equations in parallel, which likely restricts their scaling potential to a small number of processors. In [24], the TIN-based hydrological model tRIBS is parallelized. A graph-based domain decomposition method is employed to produce a communication-efficient partitioning. Scaling experiments indicate scalability up to about 64 processors. No contribution except [24] attempts to minimize the required point-to-point communication. Therefore, the hydrological community still lacks a parallelized dLSM which efficiently scales to the capabilities of modern supercomputers.

In this paper, we re-visit the graph-based domain decomposition method suggested by [24], adapt it to a regular grid and conduct a more thorough analysis of its communication patterns. We compare different coordinate-based domain decomposition methods with a graph-based method and investigate the impact on the necessary point-to-point communication. Furthermore, we perform an artificial strong scaling experiment and compute theoretical peak values for parallel speed-up for an example application on up to 8,192 processors. As we want to conduct a methodological study of the suitability of different domain decomposition methods for dLSMs, we limit the evaluation to a simplified communication and work-balance model and do not measure the performance of an actual application.

We start by giving an overview of the functionality of dLSMs. Subsequently, we introduce the governing equations of the lateral processes and show how they dictate a specific communication structure. We then introduce the different domain decomposition methods. Finally, we present theoretical peak values of speed-up and efficiency and evaluate the potential for minimizing point-to-point communication.

2 Theory

2.1 The Computational Domain of dLSMs

The focus of this study lies on the efficient decomposition of the computational domain of dLSMs. The computational domain commonly comprises one hydrological catchment as defined in the introduction, is denoted by \(\varOmega \subset \mathbb {R}^2\) and fulfills the following assumptions:

  1. Any point \(x \in \varOmega \) has an elevation h(x).

  2. There is exactly one outlet \(O_{\varOmega }\) located on the boundary \(\partial \varOmega \) of the domain \(\varOmega \), where the minimum \(h_{0}\) of h is located.

  3. \(\varOmega \) does not contain any sinks, i.e., there is a monotonically decreasing path from any point \(x \in \varOmega \) to the outlet \(O_{\varOmega }\).

In practice, the domain is commonly derived from a digital elevation model of the basin and a given outlet \(O_{\varOmega }\) with a gauging station where discharge measurements are taken.

For \(\varOmega \) as well as for any subdomain \(\omega \) of \(\varOmega \), the hydrological water balance equation holds. Commonly, the domain is discretized into a regular grid, although other discretization methods also exist. The water budget equation is then solved for each cell of the grid individually. This step usually requires the greatest computational effort. The structure of point-to-point communication is dictated by the lateral processes. In dLSMs, these processes are commonly simulated under the following assumptions:

  1. The flow direction is dependent on the topography given by h.

  2. Flow follows the steepest gradient of h on the domain \(\varOmega \).

  3. Flow is one-dimensional along the deterministic flow paths derived under the previous two assumptions.

The exact drainage network is commonly derived from the digital elevation model of the domain by the D8 algorithm [20]. This algorithm assigns a flow direction to each cell by considering the altitude h(x) of all eight neighbouring cells. The flow is always directed towards the neighbouring cell with the smallest altitude (a sketch of this assignment is given below). In Fig. 1, a simple example of a domain for a dLSM is given.

Fig. 1. Typical domain for the solution of the Saint-Venant equations in hydrological land surface models. White cells show cells for which the hydrological water balance equation is solved. Black lines indicate the flow structure, derived by the D8 algorithm, on which the Saint-Venant equations are solved. Grey cells are not part of the catchment to be simulated and thus not part of the domain \(\varOmega \).
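To make the derivation of the flow structure concrete, the following sketch assigns D8 flow directions on a small elevation raster, following the simplified rule stated above (flow towards the lowest of the eight neighbours). It is a minimal illustration under assumed conventions (NaN marks cells outside the catchment; -1 marks a cell without a lower neighbour, e.g. the outlet), not the implementation of any of the cited models.

```python
import numpy as np

# Offsets (row, col) of the eight neighbours considered by the D8 scheme.
D8_OFFSETS = [(-1, -1), (-1, 0), (-1, 1),
              ( 0, -1),          ( 0, 1),
              ( 1, -1), ( 1, 0), ( 1, 1)]

def d8_flow_directions(h):
    """For every cell, return the index (0-7) of the lowest neighbour,
    or -1 if no neighbour is lower (e.g. at the outlet)."""
    rows, cols = h.shape
    fdir = np.full((rows, cols), -1, dtype=int)
    for r in range(rows):
        for c in range(cols):
            if np.isnan(h[r, c]):              # cell outside the domain
                continue
            best, best_h = -1, h[r, c]
            for k, (dr, dc) in enumerate(D8_OFFSETS):
                rr, cc = r + dr, c + dc
                inside = 0 <= rr < rows and 0 <= cc < cols
                if inside and not np.isnan(h[rr, cc]) and h[rr, cc] < best_h:
                    best, best_h = k, h[rr, cc]  # descend towards the lowest altitude
            fdir[r, c] = best
    return fdir
```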

2.2 Governing Equations

A number of lateral processes occur in hydrological catchments. These include, but are not limited to, channel flow, surface runoff and subsurface flow. In this paper, we focus on channel flow, but the method can be extended to include other lateral processes. Given the assumption of one-dimensional flow, the governing equations of channel flow are the Saint-Venant equations, with spatial dimension x and time t:

$$\begin{aligned} \frac{\partial Q(x, t)}{\partial x} + \frac{\partial A(x,t)}{\partial t}&= f (x, t) \end{aligned}$$
(1)
$$\begin{aligned} \frac{1}{A(x, t)} \frac{\partial Q(x, t)}{\partial t} + \frac{1}{A(x, t)} \frac{\partial }{\partial x} \left( \frac{Q(x, t)^{2}}{A(x, t)}\right) + g\frac{\partial y}{\partial x} - g(S_{0} - S_{f})&= 0 \end{aligned}$$
(2)

Here, Q is the discharge measured in \([m^{3}/s]\), A represents the cross-sectional area given in \([m^{2}]\), g is the gravitational acceleration and y is the water level in [m]. Finally, \(S_{0}\) represents the slope of the channel, \(S_{f}\) is the friction slope and f(x, t) is a source term describing the runoff generated at every point x for every time step t. The source term f(x, t) represents the result of the hydrological water budget equation, and as such its computation comprises the simulation of all vertical processes. The one-dimensional Saint-Venant equations are solved on the flow structure derived by the D8 algorithm. For the flow structure graph, the following holds:

  1. All cells of the discretized domain \(\varOmega _{h}\) are nodes of the graph.

  2. A cell can only be connected by an edge to its neighbouring cells.

  3. The root of the flow structure is located at the outlet \(O_{\varOmega }\).

Note that whenever the flow structure (i.e. the graph) is cut during domain decomposition, point-to-point communication is induced. Hence, for efficient communication the graph should be cut as little as possible.

Solutions of the Saint-Venant equations in dLSMs are commonly obtained by kinematic or diffusive wave approximations. Solution methods for the kinematic wave usually involve looping over the cells in upstream-to-downstream order to compute a discharge for each cell. For these kinematic wave methods, only the discharge of upstream cells is required, as backwater effects cannot be simulated with this approximation [23]. Diffusive wave methods can be used to simulate backwater effects. Thus, they also require additional information from downstream cells, which induces additional communication. The exact parallel implementation of these methods is not part of this study, but a rough overview is necessary in order to comprehend the method proposed in the following section.
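To illustrate the data dependency that matters for parallelization, the following sketch performs one kinematic-wave sweep in upstream-to-downstream order over the D8 flow structure. The routing rule `route_through_cell` is a placeholder for whatever discretization a particular model uses; only the traversal order, and the fact that each cell needs nothing but its upstream inflow and its local source term, are the point here.

```python
def kinematic_wave_step(flow_to, source, route_through_cell):
    """One kinematic-wave sweep over the flow structure.

    flow_to: dict mapping each cell id to the cell it drains into (None at the outlet).
    source:  dict with the local runoff f(x, t) per cell for this time step.
    route_through_cell: placeholder routing rule (inflow, local source) -> outflow.
    """
    # Number of upstream cells that have not been processed yet.
    pending = {cid: 0 for cid in flow_to}
    for cid, downstream in flow_to.items():
        if downstream is not None:
            pending[downstream] += 1

    inflow = {cid: 0.0 for cid in flow_to}
    discharge = {}
    ready = [cid for cid, n in pending.items() if n == 0]   # headwater cells
    while ready:
        cid = ready.pop()
        # Only upstream inflow and the local source term are required.
        discharge[cid] = route_through_cell(inflow[cid], source[cid])
        downstream = flow_to[cid]
        if downstream is not None:
            inflow[downstream] += discharge[cid]
            pending[downstream] -= 1
            if pending[downstream] == 0:
                ready.append(downstream)
    return discharge
```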

2.3 Domain Decomposition

At the core of most parallelization strategies lies the division of computational work among computational resources. We investigate an approach which involves the decomposition of the domain \(\varOmega \) into subdomains \(\omega \) to be computed on different processors. For the domain decomposition, we consider four different algorithms, three coordinate-based and one graph-based. For all algorithms we use the implementation provided by the Zoltan library [5].

The first algorithm considered is called “Block-Partitioning”. The algorithm considers the unique ID of each cell and assigns each processor a block of IDs. Therefore, this algorithm is heavily dependent on the cell ID, which is application dependent. We include this algorithm in the analysis, as it is the most commonly used domain decomposition method in current parallel implementations of dLSMs. WaSiM [22], for example, uses an altered version of this algorithm.
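A minimal sketch of this strategy, assuming only that each cell carries a unique integer ID (the names are illustrative and not taken from any of the cited codes):

```python
def block_partition(cell_ids, num_procs):
    """Assign contiguous blocks of (sorted) cell IDs to processors."""
    ids = sorted(cell_ids)
    n = len(ids)
    part = {}
    for rank in range(num_procs):
        lo = rank * n // num_procs
        hi = (rank + 1) * n // num_procs
        for cid in ids[lo:hi]:
            part[cid] = rank
    return part  # cell ID -> processor rank
```

Because the block boundaries follow the numbering rather than the flow structure, the quality of the resulting partitioning depends entirely on how the application assigns its cell IDs.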

Secondly, we consider Recursive Coordinate Bisection [2]. This algorithm determines partitions by recursively dividing the domain along its longest dimension. This method reduces communication by minimizing the length of partition borders.
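A simplified, unweighted sketch of the idea (the Zoltan implementation additionally supports cell weights and various tuning parameters not shown here):

```python
import numpy as np

def recursive_coordinate_bisection(cell_coords, num_parts):
    """cell_coords: array-like of (x, y) cell coordinates.
    Returns a list of index arrays, one per part."""
    coords = np.asarray(cell_coords, dtype=float)

    def split(indices, parts):
        if parts == 1:
            return [indices]
        sub = coords[indices]
        extent = sub.max(axis=0) - sub.min(axis=0)
        dim = int(np.argmax(extent))                    # cut along the longest dimension
        order = indices[np.argsort(sub[:, dim])]
        left_parts = parts // 2
        cut = len(order) * left_parts // parts          # balanced split of the cell count
        return (split(order[:cut], left_parts) +
                split(order[cut:], parts - left_parts))

    return split(np.arange(len(coords)), num_parts)
```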

Additionally, a method employing the Hilbert curve was considered. Here, discrete iterates of the space-filling Hilbert curve are constructed. Partitioning is done by cutting the preimage of the discrete iterates of the Hilbert curve into parts of equal size and assigning the resulting 2D subdomains to different processors [5].
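A sketch of this idea, using the standard bit-manipulation formulation of the Hilbert index; the curve side length n is an assumed power of two that covers the raster extent.

```python
def hilbert_index(n, x, y):
    """Position of grid cell (x, y) along a Hilbert curve covering an n-by-n grid."""
    d = 0
    s = n // 2
    while s > 0:
        rx = 1 if (x & s) > 0 else 0
        ry = 1 if (y & s) > 0 else 0
        d += s * s * ((3 * rx) ^ ry)
        if ry == 0:                      # rotate/reflect the quadrant
            if rx == 1:
                x, y = n - 1 - x, n - 1 - y
            x, y = y, x
        s //= 2
    return d

def hilbert_partition(cells, num_procs, n=1024):
    """cells: list of (col, row) grid coordinates. Cut the 1D Hilbert ordering
    into num_procs chunks of (nearly) equal size."""
    order = sorted(range(len(cells)),
                   key=lambda i: hilbert_index(n, cells[i][0], cells[i][1]))
    m = len(order)
    part = [0] * m
    for rank in range(num_procs):
        for i in order[rank * m // num_procs:(rank + 1) * m // num_procs]:
            part[i] = rank
    return part  # cell index -> processor rank
```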

Finally, we investigate the method also used by [24]. Here, the flow direction graph is considered in order to minimize the necessary communication. We employ graph-partitioning methods, which attempt to partition the given graph into subgraphs of almost equal size while minimizing the number of edges cut. We use the parallel graph-partitioning algorithm described in [11, 13] and parallelized in [12, 21].
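Graph partitioners typically expect the graph in a compressed adjacency (CSR-like) format; the sketch below only assembles that structure from the D8 flow directions and treats the partitioning call itself as a black box provided by the library.

```python
def flow_graph_csr(flow_to):
    """Build undirected CSR-style adjacency arrays from the D8 flow structure.

    flow_to: dict mapping each cell id to the cell it drains into (None at the
    outlet). Returns (xadj, adjncy), the usual input format of graph partitioners."""
    cells = sorted(flow_to)
    index = {cid: i for i, cid in enumerate(cells)}
    neighbours = [set() for _ in cells]
    for cid, downstream in flow_to.items():
        if downstream is not None:
            i, j = index[cid], index[downstream]
            neighbours[i].add(j)
            neighbours[j].add(i)      # make the flow edge undirected
    xadj, adjncy = [0], []
    for nbrs in neighbours:
        adjncy.extend(sorted(nbrs))
        xadj.append(len(adjncy))
    return xadj, adjncy
```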

3 Application Example

In order to avoid the overhead of parallelizing an entire dLSM, we first measured the execution time per cell and time step of a PROMET model of the Upper Danube catchment with 76,215 cells. These measurements were then used to perform an artificial strong scaling experiment from one processor to 8,192 processors. The outlet of this model is located at the gauging station in Passau, Germany. A more detailed description of the catchment, the model and its validation is given in [15]. The catchment was chosen for its heterogeneous domain, which includes cells in alpine regions as well as cells with agricultural use in the Alpine Foreland. These characteristics suggest a heterogeneous load behaviour. Furthermore, it represents a typical model size of current dLSMs.

Fig. 2. Measured execution times for individual cells for one time step (left) and mean execution time over one year (right) of the hydrological model PROMET [16]. The individual time step demonstrates the heterogeneity of the computational load in one time step. The one-year mean shows the homogeneity of the total computational effort over the entire simulation period.

The measured cell execution times of the Upper Danube model are displayed in Fig. 2. While the total computational effort is homogeneously distributed, the computational effort for individual time steps can be quite heterogeneous.

The results of the domain decomposition algorithms introduced in the previous section, applied to a small head catchment, are displayed in Fig. 3.

Fig. 3. Resulting partitionings of the different algorithms applied to a small head catchment. Cells with identical colour are computed on the same processor. Flow paths in red correspond to cut edges in the graph.

The illustration of the block-wise domain decomposition (top left) shows PROMET's scheme for determining the cell ID by sequential assignment from the top-left to the bottom-right corner of the domain.

The scaling experiments in Sect. 4 were performed using the measured cell execution times. Rather than performing dummy calculations to simulate the load generated by the source term f(x, t) of Eq. (1), we decided to estimate the runtime by computing the sum of execution times on each processor. With the exception of the single-cell measurements, no real application code is executed. This allowed us to consider more methods without the overhead of implementing them in a dLSM and wasting CPU-hours on dummy calculations. Consequently, the reported metrics are theoretical peak values, which only take the load balance into account.

In order to evaluate the scaling experiments, we computed theoretical peak values of parallel speed-up \(S_{p}\) and the parallel efficiency \(E_{p}\) based on the estimated runtimes. These are:

$$\begin{aligned} S_{p} = \frac{T_{1}}{T_{p}} \end{aligned}$$
(3)
$$\begin{aligned} E_{p} = \frac{S_{p}}{p}, \end{aligned}$$
(4)

where \(T_{1}\) is the total runtime on a single processor and \(T_{p}\) is the runtime on p processors. Furthermore, the edge-cut count \(E_C\) was computed, which is the number of edges spanning two processors. We use the edge-cut count as an estimate for the communication overhead introduced by a parallel implementation of the lateral processes.
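A minimal sketch of the edge-cut count \(E_C\) for a given assignment of cells to processors (the dictionary layout is an assumption for illustration):

```python
def edge_cut(flow_to, part):
    """Count flow-structure edges whose endpoints lie on different processors;
    each such edge requires point-to-point communication when the lateral
    processes are simulated in parallel.

    flow_to: cell id -> downstream cell id (None at the outlet)
    part:    cell id -> processor rank
    """
    return sum(1 for cid, downstream in flow_to.items()
               if downstream is not None and part[cid] != part[downstream])
```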

We considered two synchronization scenarios, which represent the lower and upper bounds in terms of synchronization requirements. The first scenario (unsynchronized) assumes no synchronization during the simulation and represents the lower bound. The second scenario (synchronized) assumes that all cells need to be synchronized at the end of every time step; it represents the upper bound and would be required for a diffusive wave model of the lateral processes.
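Given the measured per-cell execution times, the estimated runtime \(T_{p}\) for both scenarios can be computed directly, without executing any model code. The sketch below assumes the measurements are stored as an array of shape (time steps, cells), which is an assumed data layout.

```python
import numpy as np

def estimated_runtime(times, part, num_procs, synchronized):
    """times: array of shape (n_timesteps, n_cells) with measured execution times;
    part: integer array of length n_cells with processor ranks."""
    # Per-processor, per-time-step load.
    load = np.zeros((times.shape[0], num_procs))
    for rank in range(num_procs):
        load[:, rank] = times[:, part == rank].sum(axis=1)
    if synchronized:
        # Barrier after every time step: the slowest processor sets the pace.
        return load.max(axis=1).sum()
    # No synchronization: imbalances can dissipate over the simulation,
    # so only the total load of the busiest processor matters.
    return load.sum(axis=0).max()

def speedup(times, part, num_procs, synchronized):
    T1 = times.sum()                                       # serial runtime, Eq. (3)
    Tp = estimated_runtime(times, part, num_procs, synchronized)
    return T1 / Tp
```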

4 Results

The theoretical speed-up values computed during the strong scaling experiment are displayed in Fig. 4.

Fig. 4. Theoretical peak values of parallel speed-up computed for the four different domain decomposition algorithms for the synchronized and unsynchronized scenario. Runtime was estimated by summing the execution times of all cells on a given processor. At 8,192 processors, each partition contains approximately nine cells.

Figure 4 shows that, for synchronized simulations, all methods solve the load-balancing task equally well and produce a partitioning which yields a theoretical speed-up of approximately 5,740. This corresponds to a parallel efficiency of 0.7. For unsynchronized simulations, the coordinate-based methods outperform the graph-based method by \(6\%\), with theoretical speed-up values of approximately 7,200 and 6,722, respectively. This can be explained by the more uniform partition sizes produced by the coordinate-based methods. Without a synchronization barrier at the end of each time step, the total computational load over the whole simulation time needs to be balanced, rather than the more heterogeneous load at each individual time step. The distribution of partition sizes is less relevant for the synchronized case, because load imbalances are realized at the end of every time step without the possibility of being damped out over the remainder of the simulation.

Fig. 5. Total number of flow paths cut and number of paths cut per partition for the four different domain decomposition algorithms on different numbers of processors. The edge-cut count is determined by counting the edges that connect nodes on different partitions. It serves as a proxy for the necessary point-to-point communication. At 8,192 processors, each subdomain contains approximately nine cells.

Much more pronounced differences between the methods can be found in the communication overhead potentially introduced by a parallel implementation of the lateral processes. In Fig. 5, the total number of flow paths cut is displayed. Here, the graph-based method shows significant advantages over the coordinate-based methods. The superiority of graph partitioning becomes even more apparent if the flow paths cut per partition are considered. Here, the graph-based method is capable of producing partitionings with approximately 2.1 flow path cuts per partition, which mostly corresponds to one inlet and one outlet per partition. This holds true over the entire range of processors considered. For coordinate-based methods, the number of flow paths cut is strongly dependent on the length of the partition border. Thus, for decreasing partition sizes, the methods approach each other again. However, for future problems with significantly more than 76,000 cells, and therefore bigger partitions, the superiority of the graph-based method will be even more pronounced.

5 Discussion

For unsynchronized simulations, there is a strong connection between the number of cells computed on a processor and the workload of this processor. This is caused by the homogeneous distribution of the total computational effort of the simulation (see Fig. 2, right). For synchronized simulations, this connection is significantly weaker, because load imbalances that are present in a given time step are realized immediately. In unsynchronized simulations, these imbalances can dissipate over the remaining simulation. Therefore, the disadvantage of the graph-based method seen for unsynchronized simulations is not relevant for synchronized simulations.

Furthermore, the large drop in speed-up between unsynchronized and synchronized simulations indicates that speed-up could be further improved by relaxing the synchronization requirements of the algorithms used to simulate the lateral processes. Of course, this has to be done under consideration of the underlying physical processes.

Moreover, the superiority of the graph-based domain decomposition in terms of the number of flow paths cut per partition, over the entire range of processors, needs to be noted. This result most convincingly emphasizes the advantages of using graph-based domain decomposition for parallel implementations of the lateral processes in dLSMs.

The artificial strong scaling experiment was conducted with up to 8,192 processors and yielded parallel efficiencies between 0.70 and 0.88. In comparison to integrated hydrological models such as Parflow, these are rather poor efficiencies on a small number of processors. The catchment model of the Upper Danube at the considered resolution is simply not big enough to be scaled to more processors. Models with more cells need to be considered for scaling experiments with more processors. It is assumed that hydrologists will want to compute such models in the near future. Therefore, larger test scenarios with the possibility to be scaled up to the capabilities of current supercomputers will become available soon.

Furthermore, these results have to be seen in the context of current parallel dLSMs. The implementation of [24] reaches a parallel efficiency of approximately 0.3 on 64 processors.

Other codes scale even worse. For hyper-resolution global modelling as described in [4] this will not suffice. Also, for catchment studies at higher spatial resolutions, scalability beyond 64 processors is advantageous and would support scientific progress in hydrology.

6 Conclusion

We investigated several domain decomposition strategies for dLSMs. Strategies based purely on the coordinates of cells as well as a strategy acknowledging the special nature of the domain of the lateral processes were considered. We performed simplified scaling experiments to evaluate the suitability of these methods and computed theoretical peak values for speed-up and parallel efficiency. For synchronized simulations, graph-based and coordinate-based domain decomposition yield similar speed-up values; since the graph-based method additionally cuts far fewer flow paths, this suggests that its advantageous communication structure would lead to a more scalable solution. For unsynchronized simulations, coordinate-based methods scale about \(6\%\) better. It remains to be demonstrated that the advantageous communication structure makes up for this gap.

One of the shortcomings of this study, the disregard of the induced communication effort, will be addressed by a parallel implementation of the methods used to solve the Saint-Venant equations in dLSMs. Furthermore, for dLSMs inducing greater and more heterogeneous computational effort, dynamic load-balancing strategies should be researched. Examples of such dLSMs include WaSiM, where the vertical soil water movement is simulated by the one-dimensional Richards equation.