DOI: 10.5555/370049
SC '00: Proceedings of the 2000 ACM/IEEE conference on Supercomputing
2000 Proceeding
Publisher:
  • IEEE Computer Society
  • 1730 Massachusetts Ave., NW Washington, DC
  • United States
Conference:
SC '00: International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, Texas, USA, November 4-10, 2000
ISBN:
978-0-7803-9802-3
Published:
01 November 2000
Sponsors:
SIGARCH, IEEE-CS
In-Cooperation:
SIAM
Article
Free
The implementation of MPI-2 one-sided communication for the NEC SX-5
Pages 1–es

We describe the MPI/SX implementation of the MPI-2 standard for one-sided communication (Remote Memory Access) for the NEC SX-5 vector supercomputer. MPI/SX is a non-threaded implementation of the full MPI-2 standard. Essential features of the ...

Article
Free
Single sided MPI implementations for SUN MPI
Pages 2–es

This paper describes an implementation of generic MPI-2 single-sided communications for SUN-MPI. Our implementation is layered on top of point-to-point MPI communications and therefore can be adapted to other MPI implementations.

The code is ...

Article
Free
Automatically tuned collective communications
Pages 3–es

The performance of the MPI's collective communications is critical in most MPI-based applications. A general algorithm for a given collective communication operation may not give good performance on all systems due to the differences in architectures, ...

Article
Free
Landing CG on EARTH: a case study of fine-grained multithreading on an evolutionary path
Pages 4–es

We report on our work in developing a fine-grained multithreaded solution for the communication-intensive Conjugate Gradient (CG) problem. In our recent work, we have developed a simple, yet very efficient, solution to executing matrix-vector multiply ...

Article
Free
Parallel smoothed aggregation multigrid: aggregation strategies on massively parallel machines
Pages 5–es

Algebraic multigrid methods offer the hope that multigrid convergence can be achieved (for at least some important applications) without a great deal of effort from engineers and scientists wishing to solve linear systems. In this paper we consider ...

Article
Free
Scalable algorithms for adaptive statistical designs
Pages 6–es

We present a scalable, high-performance solution to multidimensional recurrences that arise in adaptive statistical designs. Adaptive designs are an important class of learning algorithms for a stochastic environment, and we focus on the problem of ...

Article
Free
Randomization, speculation, and adaptation in batch schedulers
Pages 7–es

This paper proposes extensions to the backfilling job-scheduling algorithm that significantly improve its performance. We introduce variations that sort the "backfilling order" in priority-based and randomized fashions. We examine the effectiveness of ...

Article
Free
An object-oriented job execution environment
Pages 8–es

This is a project for developing a distributed job execution environment for highly iterative jobs. An iterative job is one where the same binary code is run hundreds of times with incremental changes in the input values for each run. An execution ...

Article
Free
Towards an integrated, web-executable parallel programming tool environment
Pages 9–es

We present a new parallel programming tool environment that is (1) accessible and executable “anytime, anywhere,” through standard Web browsers and (2) integrated in that it provides tools which adhere to a common underlying methodology for parallel ...

Article
Free
Performance of hybrid message-passing and shared-memory parallelism for discrete element modeling
Pages 10–es

The current trend in HPC hardware is towards clusters of shared-memory (SMP) compute nodes. For applications developers the major question is how best to program these SMP clusters. To address this we study an algorithm from Discrete Element Modeling, ...

Article
Free
A comparison of three programming models for adaptive applications on the Origin2000
Pages 11–es

Adaptive applications have computational workloads and communication patterns which change unpredictably at runtime, requiring load balancing to achieve scalable performance on parallel machines. Efficient parallel implementation of such adaptive ...

Article
Free
MPI versus MPI+OpenMP on IBM SP for the NAS benchmarks
Pages 12–es

The hybrid memory model of clusters of multiprocessors raises two issues: programming model and performance. Many parallel programs have been written by using the MPI standard. To evaluate the pertinence of hybrid models for existing MPI codes, we ...

Article
Free
A wrapper generator for wrapping high performance legacy codes as Java/CORBA components
Pages 13–es

This paper describes a Wrapper Generator for wrapping high performance legacy codes as Java/CORBA components for use in a distributed component-based problem-solving environment. Using the Wrapper Generator we have automatically wrapped an MPI-based ...

Article
Free
A scalable SNMP-based distributed monitoring system for heterogeneous network computing
Pages 14–es

Traditional centralized monitoring systems do not scale to present-day large, complex, network-computing systems. Based on recent SNMP standards for distributed management, this paper addresses the scalability problem through distribution of monitoring ...

Article
Free
ESP: a system utilization benchmark
Pages 15–es

This article describes a new benchmark, called the Effective System Performance (ESP) test, which is designed to measure system-level performance, including such factors as job scheduling efficiency, handling of large jobs and shutdown-reboot times. In ...

Article
Free
PM2: a high performance communication middleware for heterogeneous network environments
Pages 16–es

This paper introduces a high performance communication middle layer, called PM2, for heterogeneous network environments. PM2 currently supports Myrinet, Ethernet, and SMP. Binary code written in PM2 or written in a communication library, such as MPICH-...

Article
Free
Performance and interoperability issues in incorporating cluster management systems within a wide-area network-computing environment
Pages 17–es

This paper describes the performance and interoperability issues that arise in the process of integrating cluster management systems into a wide-area network-computing environment, and provides solutions in the context of the Purdue University Network ...

Article
Free
Architectural and performance evaluation of GigaNet and Myrinet interconnects on clusters of small-scale SMP servers
Pages 18–es

GigaNet and Myrinet are two of the leading interconnects for clusters of commodity computer systems. Both provide memory-protected user-level network interface access, and deliver low-latency and high-bandwidth communication to applications. GigaNet is ...

Article
Free
MPICH-GQ: quality-of-service for message passing programs
Pages 19–es

Parallel programmers typically assume that all resources required for a program's execution are dedicated to that purpose. However, in local and wide area networks, contention for shared networks, CPUs, and I/O systems can result in significant ...

Article
Free
Scalable fault-tolerant distributed shared memory
Pages 20–es

This paper shows how a state-of-the-art software distributed shared-memory (DSM) protocol can be efficiently extended to tolerate single-node failures. In particular, we extend a home-based lazy release consistency (HLRC) DSM system with independent ...

Article
Free
Realizing fault resilience in Web-server cluster
Pages 21–es

Today, a successful Internet service is absolutely critical to be up 100 percent of the time. Server clustering is the most promising approach to meet this requirement. However, the existing Web server-clustering solutions merely can provide high ...

Article
Free
Data access performance in a large and dynamic pharmaceutical drug candidate database
Pages 22–es

An explosion in the amount of data generated through chemical and biological experimentation has been observed in recent years. This rapid proliferation of vast amounts of data has led to a set of cheminformatics and bioinformatics applications that ...

Article
Free
Real-time biomechanical simulation of volumetric brain deformation for image guided neurosurgery
Pages 23–es

We aimed to study the performance of a parallel implementation of an intraoperative nonrigid registration algorithm that accurately simulates the biomechanical properties of the brain and its deformations during surgery. The algorithm was designed to ...

Article
Free
Computer simulations of cardiac electrophysiology
Pages 24–es

CardioWave is a modular system for simulating wavefront conduction in the heart. These simulations may be used to investigate the factors that generate and sustain life-threatening arrhythmias such as ventricular fibrillation. The user selects a set of ...

Article
Free
Parallel algorithms for radiation transport on unstructured grids
Pages 25–es

The method of discrete ordinates is commonly used to solve the Boltzmann radiation transport equation for applications ranging from simulations of fires to weapons effects. The equations are most efficiently solved by sweeping the radiation flux across ...

Article
Free
A parallel dynamic-mesh Lagrangian method for simulation of flows with dynamic interfaces
Pages 26–es

Many important phenomena in science and engineering, including our motivating problem of microstructural blood flow, can be modeled as flows with dynamic interfaces. The major challenge faced in simulating such flows is resolving the interfacial motion. ...

Article
Free
Self-consistent Langevin simulation of Coulomb collisions in charged-particle beams
Pages 27–es

In many plasma physics and charged-particle beam dynamics problems, Coulomb collisions are modeled by a Fokker-Planck equation. In order to incorporate these collisions, we present a three-dimensional parallel Langevin simulation method using a Particle-...

Article
Free
Using high-speed WANs and network data caches to enable remote and distributed visualization
Pages 28–es

Visapult is a prototype application and framework for remote visualization of large scientific datasets. We approach the technical challenges of tera-scale visualization with a unique architecture which employs high speed WANs and network data caches ...

Article
Free
High performance visualization of time-varying volume data over a wide-area network
Pages 29–es

This paper presents an end-to-end, low-cost solution for visualizing time-varying volume data rendered on a parallel computer located at a remote site. Pipelining and careful grouping of processors are used to hide I/O time and to maximize processor ...

Article
Free
Distributed rendering for scalable displays
Pages 30–es

We describe a novel distributed graphics system that allows an application to render to a large tiled display. Our system, called WireGL, uses a cluster of off-the-shelf PCs connected with a high-speed network. WireGL allows an unmodified existing ...

Contributors
  • The University of Alabama

Acceptance Rates

SC '00 Paper Acceptance Rate: 62 of 179 submissions, 35%
Overall Acceptance Rate: 1,516 of 6,373 submissions, 24%

Year                Submitted  Accepted  Rate
SC '17                    327        61   19%
SC '16                    442        81   18%
SC '15                    358        79   22%
SC '14                    394        83   21%
SC '13                    449        91   20%
SC '12                    461       100   22%
SC '11                    352        74   21%
SC '10                    253        51   20%
SC '09                    261        59   23%
SC '08                    277        59   21%
SC '07                    268        54   20%
SC '06                    239        54   23%
SC '05                    260        62   24%
SC '04                    200        60   30%
SC '03                    207        60   29%
SC '02                    230        67   29%
SC '01                    240        60   25%
SC '00                    179        62   35%
Supercomputing '95        241        69   29%
Supercomputing '93        300        72   24%
Supercomputing '92        220        75   34%
Supercomputing '91        215        83   39%
Overall                 6,373     1,516   24%