Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/582034.582084acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
Article

A ghost cell expansion method for reducing communications in solving PDE problems

Published: 10 November 2001 Publication History

Abstract

In solving Partial Differential Equations, such as the Barotropic equations in ocean models, on Distributed Memory Computers, finite difference methods are commonly used. Most often, processor subdomain boundaries must be updated at each time step. This boundary update process involves many messages of small sizes, therefore large communication overhead. Here we propose a new approach which expands the ghost cell layers and thus updates boundaries much less frequently --- reducing total message volume and groupping small messages into bigger ones. Together with a technique for eliminating diagonal communications, the method speedup communication substantially, upto 170%. We explain the method and implementation in details, provide systematic timing results and performance analysis on the Cray T3E and IBM SP.

References

[1]
V. Balaji, 2000. "Abstract Parallel Dynamical Kernels for Flexible Climate Models." Talk presented at ECMWF TeraComputing Workshop, Reading, England, Nov, 2000.
[2]
X.-C. Cai, D. E. Keyes and V. Venkatakrishnan. 1997. "Newton-Krylov-Schwarz: An Implicit Solver for CFD", Proc. 8th Int'l Conf. on Domain Decomposition Methods (R. Glowinski et al., eds.), Wiley, New York, pp.387-400.
[3]
T.F. Chan and J. Zou. 1994. Additive Schwarz Domain Decomposition Methods For Elliptic Problems On Unstructured Meshes. Numerical Algorithms, 8, pp.329-346.
[4]
C.H.Q. Ding, 1991. "Simulating Lattice QCD on a Caltech/JPL Hypercube," Int'l J. Supercomp. Appl., 5, pp:73-80.
[5]
C.H.Q. Ding and Y. He. "Data Organization and I/O in a parallel ocean circulation model", Proc. Supercomputing '99, Nov 1999.
[6]
C.H.Q. Ding. 2001. An Optimal Index Reshuffle Algorithm for Multidimensional Arrays and Its Applications for Parallel Architectures. IEEE Trans. Para. Distr. Sys., 12, pp.306-315.
[7]
J.Drake, I.Foster, J.Michalakes, B.Toonen and P. Worley, "Design and performance of a scalable parallel community climate model", Parallel Computing, v.21, pp.1571-1581, 1995.
[8]
J. Drake and P. Worley. 2000. Unpublished results.
[9]
J.J. Hack, J.M.Rosinski, D.L.Williamson, B.A.Boville and J.E. Truesdale, "Computational Design of NCAR community climate model", Parallel Computing, v.21, pp.1545-1555, 1995.
[10]
S.M. Griffies, R.C. Pacanowski, M.Schmidt, V. Balaji. "Tracer Conservation with an Explicit Free Surface Method for Z-coordinate Ocean Models." Monthly Weather Review, 2001: 129, pp.1081-1098. See also http://www.gfdl.gov/~smg/MOM/MOM.html.
[11]
A.A. Mirin, D. Shumaker, M.F. Wehner. "Efficient Filtering Techniques for Finite-Difference Atmospheric General Circulation Models on Parallel Processors." Parallel Computing, v.24, pp.729-740, 1998.
[12]
R.D. Smith, J.K. Dukowicz, and R.C. Malone. Parallel ocean general circulation modeling. Physica, D60, 38, 1992. See also http://gnarly.lanl.gov/Pop/Pop.html.
[13]
S. Atlas, S. Banerjee, J.C. Cummings, P.J. Hinker, M. Srikant, J.V.W. Reynders, and M. Tholburn. "POOMA: A high performance distributed simulation environment for scientific applications." Proc. Supercomputing '95, Nov 1995. see also http://www.acl.lanl.gov/pooma/.
[14]
D.L. Brown, W.D. Henshaw and D.J. Quinlan. "Overture: An Object-Oriented Framework for Solving Partial Differential Equations on Overlapping Grids." SIAM conference on Object Oriented Methods for Scientfic Computing, 1999. See also http://www.llnl.gov/CASC/Overture/.

Cited By

View all
  • (2024)Bricks: A high-performance portability layer for computations on block-structured gridsThe International Journal of High Performance Computing Applications10.1177/10943420241268288Online publication date: 19-Aug-2024
  • (2022)Simulating the Euler equations on multiple GPUs using PythonFrontiers in Physics10.3389/fphy.2022.98544010Online publication date: 3-Oct-2022
  • (2021)Inter-loop optimization in RAJA using loop chainsProceedings of the 35th ACM International Conference on Supercomputing10.1145/3447818.3461665(1-12)Online publication date: 3-Jun-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SC '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing
November 2001
756 pages
ISBN:158113293X
DOI:10.1145/582034
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 November 2001

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. PDE
  2. bandwidth
  3. ghost cells
  4. latency
  5. near neighbor communication

Qualifiers

  • Article

Conference

SC '01
Sponsor:

Acceptance Rates

SC '01 Paper Acceptance Rate 60 of 240 submissions, 25%;
Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)1
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Bricks: A high-performance portability layer for computations on block-structured gridsThe International Journal of High Performance Computing Applications10.1177/10943420241268288Online publication date: 19-Aug-2024
  • (2022)Simulating the Euler equations on multiple GPUs using PythonFrontiers in Physics10.3389/fphy.2022.98544010Online publication date: 3-Oct-2022
  • (2021)Inter-loop optimization in RAJA using loop chainsProceedings of the 35th ACM International Conference on Supercomputing10.1145/3447818.3461665(1-12)Online publication date: 3-Jun-2021
  • (2021)Improving communication by optimizing on-node data movement with data layoutProceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming10.1145/3437801.3441598(304-317)Online publication date: 17-Feb-2021
  • (2021)Intercomparison of Subgrid Scale Models in Large-Eddy Simulation of Sunset Atmospheric Boundary Layer Turbulence: Computational AspectsLobachevskii Journal of Mathematics10.1134/S199508022107023442:7(1580-1595)Online publication date: 9-Aug-2021
  • (2020)Pencil: A Pipelined Algorithm for Distributed StencilsSC20: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC41405.2020.00089(1-16)Online publication date: Nov-2020
  • (2020)Communication Optimization Strategy for Molecular Dynamics Simulation on Sunway TaihuLight2020 IEEE 22nd International Conference on High Performance Computing and Communications; IEEE 18th International Conference on Smart City; IEEE 6th International Conference on Data Science and Systems (HPCC/SmartCity/DSS)10.1109/HPCC-SmartCity-DSS50907.2020.00072(571-578)Online publication date: Dec-2020
  • (2019)Tessellating Star StencilsProceedings of the 48th International Conference on Parallel Processing10.1145/3337821.3337835(1-10)Online publication date: 5-Aug-2019
  • (2019)A Survey on Agent-based Simulation Using Hardware AcceleratorsACM Computing Surveys10.1145/329104851:6(1-35)Online publication date: 28-Jan-2019
  • (2017)Tessellating stencilsProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3126908.3126920(1-13)Online publication date: 12-Nov-2017
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media