Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleJune 1997
LoPC: modeling contention in parallel algorithms
PPOPP '97: Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programmingPages 276–287https://doi.org/10.1145/263764.263803Parallel algorithm designers need computational models that take first order system costs into account, but are also simple enough to use in practice. This paper introduces the LoPC model, which is inspired by the LogP model but accounts for contention ...
Also Published in:
ACM SIGPLAN Notices: Volume 32 Issue 7 - ArticleJune 1997
An effective garbage collection strategy for parallel programming languages on large scale distributed-memory machines
PPOPP '97: Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programmingPages 264–275https://doi.org/10.1145/263764.263801This paper describes the design and implementation of a garbage collection scheme on large-scale distributed-memory computers and reports various experimental results. The collector is based on the conservative GC library by Boehm & Weiser. Each ...
Also Published in:
ACM SIGPLAN Notices: Volume 32 Issue 7 - ArticleJune 1997
Performance implications of communication mechanisms in all-software global address space systems
PPOPP '97: Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programmingPages 230–239https://doi.org/10.1145/263764.263794Global addressing of shared data simplifies parallel programming and complements message passing models commonly found in distributed memory machines. A number of programming systems have been designed that synthesize global addressing purely in ...
Also Published in:
ACM SIGPLAN Notices: Volume 32 Issue 7 - ArticleJune 1997
Auto-blocking matrix-multiplication or tracking BLAS3 performance from source code
PPOPP '97: Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programmingPages 206–216https://doi.org/10.1145/263764.263789An elementary, machine-independent, recursive algorithm for matrix multiplication C+=A*B provides implicit blocking at every level of the memory hierarchy and tests out faster than classically optimrd code, tracking hand-coded BLAS3 routines. Proof of ...
Also Published in:
ACM SIGPLAN Notices: Volume 32 Issue 7 - ArticleJune 1997
Experiences with non-numeric applications on multithreaded architectures
PPOPP '97: Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programmingPages 124–135https://doi.org/10.1145/263764.263782Distributed-memory machines have proved successful for many challenging numerical programs that can be split into largely independent computation-intensive subtasks requiring little data exchange (although the amount of exchanged data may be large). ...
Also Published in:
ACM SIGPLAN Notices: Volume 32 Issue 7 - ArticleJune 1997
Effective fine-grain synchronization for automatically parallelized programs using optimistic synchronization primitives
PPOPP '97: Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programmingPages 112–123https://doi.org/10.1145/263764.263781As shared-memory multiprocessors become the dominant commodity source of computation, parallelizing compilers must support mainstream computations that manipulate irregular, pointer-based data structures such as lists, trees and graphs, Our experience ...
Also Published in:
ACM SIGPLAN Notices: Volume 32 Issue 7 - ArticleJune 1997
Optimizing communication in HPF programs on fine-grain distributed shared memory
PPOPP '97: Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programmingPages 100–111https://doi.org/10.1145/263764.263780Unlike compiler-generated message-passing code, the coherence mechanisms in shared-memory systems work equally well for regular and irregular programs. In many programs, however compile-time information about data accesses would permit data to be ...
Also Published in:
ACM SIGPLAN Notices: Volume 32 Issue 7