Scalable Parallel Matrix Multiplication on Distributed Memory Parallel Computers
Consider any known sequential algorithm for matrix multiplication over an arbitrary ring with time complexity O(N^α), where 2 < α ≤ 3. We show that such an algorithm can be parallelized on a distributed memory parallel computer (DMPC) in O(log N) time by using ...
Achieving Efficiency and Accuracy in Simulation for I/O-Intensive Applications
This paper presents a family of simulators for data-intensive applications, and a methodology to select the most efficient simulator based on a user-supplied requirement for accuracy. The methodology consists of a series of tests that select an ...
A Parallel Implementation of a Fast Multipole-Based 3-D Capacitance Extraction Program on Distributed Memory Multicomputers
Very fast and accurate 3-D capacitance extraction is essential for interconnect optimization in VLSI ultra-deep sub-micron designs (UDSM). Parallel processing provides an approach to reducing the simulation turn-around time. This paper examines the ...
Efficient Integration of Compiler-Directed Cache Coherence and Data Prefetching
Cache coherence enforcement and memory latency reduction and hiding are very important and challenging problems in the design of large-scale distributed shared-memory (DSM) multiprocessors. We propose an integrated approach to solve these problems ...
Telescoping Languages
- Ken Kennedy
- Bradley Broom
- Keith Cooper
- Jack Dongarra
- Rob Fowler
- Dennis Gannon
- Lennart Johnsson
- John Mellor-Crummey
- Linda Torczon
As machines and programs have become more complex, the process of programming applications that can exploit the power of high-performance systems has become more difficult and correspondingly more labor-intensive. This has substantially widened the ...
What Are the Top Ten Most Influential Parallel and Distributed Processing Concepts of the Past Millenium?
- Mitchell D. Theys
- Shoukat Ali
- Howard Jay Siegel
- Mani Chandy
- Kai Hwang
- Ken Kennedy
- Lui Sha
- Kang G. Shin
- Marc Snir
- Larry Snyder
- Thomas Sterling
This is a report on a panel titled “What are the top ten most influential parallel and distributed processing concepts of the last millennium?” that was held at the IEEE Computer Society sponsored “14th International Parallel and Distributed Processing ...