Language features

Applied Filters

People

Publications

Publication Date

Searched The ACM Guide to Computing Literature (3,790,159 records)|Limit your search to The ACM Full-Text Collection (766,444 records)

Showing 1 - 20of37 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

article
December 2014
Design patterns percolating to parallel programming framework implementation
International Journal of Parallel Programming (IJPP), Volume 42, Issue 6Pages 1012–1031https://doi.org/10.1007/s10766-013-0273-6

Structured parallel programming is recognised as a viable and effective means of tackling parallel programming problems. Recently, a set of simple and powerful parallel building blocks ( $$\mathsf{RISC\text{- }pb^2l}$$ RISC - pb 2 l ) has been proposed to support modelling and implementation of ...
9
Metrics
Total Citations9
article
April 2006
A platform-independent distributed runtime for standard multithreaded Java
International Journal of Parallel Programming (IJPP), Volume 34, Issue 2Pages 113–142https://doi.org/10.1007/s10766-006-0007-0

JavaSplit is a portable runtime environment for distributed execution of standard multithreaded Java programs. It gains augmented computational power and increased memory capacity by distributing the threads and objects of an application among the ...
1
Metrics
Total Citations1
article
February 2004
Alias analysis in Java with reference-set representation for high-performance computing
International Journal of Parallel Programming (IJPP), Volume 32, Issue 1Pages 39–76https://doi.org/10.1023/B:IJPP.0000015564.82048.f3

In this paper, a flow-sensitive, context-insensitive alias analysis in Java is proposed. It is more efficient and precise than previous analyses for C++, and it does not negatively affect the safety of aliased references. To this end, we first present a ...
1
Metrics
Total Citations1
article
August 2003
Restructuring computations for temporal data cache locality
International Journal of Parallel Programming (IJPP), Volume 31, Issue 4Pages 305–338https://doi.org/10.1023/A:1024556711058

Data access costs contribute significantly to the execution time of applications with complex data structures. A the latency of memory accesses becomes high relative to processor cycle times, application performance is increasingly limited by memory ...
10
Metrics
Total Citations10
article
June 2002
Control Flow Regeneration for Software Pipelined Loops with Conditions
- Dragan Milicev,
- Zoran Jovanovic
International Journal of Parallel Programming (IJPP), Volume 30, Issue 3Pages 149–179https://doi.org/10.1023/A:1015453520790

A new intermediate representation for software pipelined loops with conditions is proposed in the paper. The representation allows separation of operations from different paths and their conditional, as well as speculative scheduling, including ...
3
Metrics
Total Citations3
article
October 2000
Data Dependence Analysis of Assembly Code
International Journal of Parallel Programming (IJPP), Volume 28, Issue 5Pages 431–467https://doi.org/10.1023/A:1007588710878

Determination of data dependences is a task typically performed with high-level language source code in today's optimizing and parallelizing compilers. Very little work has been done in the field of data dependence analysis on assembly language code, ...
13
Metrics
Total Citations13
article
October 2000
Loop Shifting for Loop Compaction
- Alain Darte,
- Guillaume Huard
International Journal of Parallel Programming (IJPP), Volume 28, Issue 5Pages 499–534https://doi.org/10.1023/A:1007506711786

The idea of decomposed software pipelining is to decouple the software pipelining problem into a cyclic scheduling problem without resource constraints and an acyclic scheduling problem with resource constraints. In terms of loop transformation and code ...
10
Metrics
Total Citations10
article
June 1999
Nonsingular Data Transformations: Definition, Validity, and Applications
- Michael F. P. O'Boyle,
- Peter M. W. Knijnenburg
International Journal of Parallel Programming (IJPP), Volume 27, Issue 3Pages 131–159https://doi.org/10.1023/A:1018744411700

This paper describes a unifying framework for nonsingular data transformations. It shows that a wide class of existing transformations may be expressed in this framework, allowing compound transformations to be performed in one step. Validity conditions ...
5
Metrics
Total Citations5
article
December 1998
Reuse-Driven Tiling for Improving Data Locality
- Jingling Xue,
- Chua-Huang Huang
International Journal of Parallel Programming (IJPP), Volume 26, Issue 6Pages 671–696

This paper applies unimodular transformations and tiling to improve data locality of a loop nest. Due to data dependences and reuse information, not all dimensions of the iteration space will and can be tiled. By using cones to represent data ...
10
Metrics
Total Citations10
article
August 1998
Combining Loop Transformations Considering Caches and Scheduling
International Journal of Parallel Programming (IJPP), Volume 26, Issue 4Pages 479–503

The performance of modern microprocessors is greatly affected by cache behavior, instruction scheduling, register allocation and loop overhead. High-level loop transformations such as fission, fusion, tiling, interchanging and outer loop unrolling (e.g.,...
13
Metrics
Total Citations13
article
August 1998
Meld Scheduling: A Technique for Relaxing Scheduling Constraints
International Journal of Parallel Programming (IJPP), Volume 26, Issue 4Pages 349–381

Meld scheduling melds the schedules of neighboring scheduling regions to respect latencies of operations issued in one region but completing after control transfers to the other. In contrast, conventional schedulers ignore latency constraints from other ...
1
Metrics
Total Citations1
article
April 1998
Quantitative Evaluation of Register Pressure on Software Pipelined Loops
International Journal of Parallel Programming (IJPP), Volume 26, Issue 2Pages 121–142https://doi.org/10.1023/A:1018743102645

Software Pipelining is a loop scheduling technique that extracts loop parallelism by overlapping the execution of several consecutive iterations. One of the drawbacks of software pipelining is its high register requirements, which increase with the ...
8
Metrics
Total Citations8
article
December 1997
Affine dependence classificatinon for communications minimization
- Catherine Mongenet
International Journal of Parallel Programming (IJPP), Volume 25, Issue 6Pages 497–524https://doi.org/10.1023/A:1025165407063
1
Metrics
Total Citations1
article
December 1997
Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs
- Alain Darte,
- Frédéric Vivien
International Journal of Parallel Programming (IJPP), Volume 25, Issue 6Pages 447–496https://doi.org/10.1023/A:1025168022993
17
Metrics
Total Citations17
article
December 1997
Parameterized polyhedra and their vertices
- Vincent Loechner,
- Doran K. Wilde
International Journal of Parallel Programming (IJPP), Volume 25, Issue 6Pages 525–549https://doi.org/10.1023/A:1025117523902
29
Metrics
Total Citations29
article
December 1996
Connection Analysis: A Practical Interprocedural Heap Analysis for C
- Rakesh Ghiya,
- Laurie J. Hendren
International Journal of Parallel Programming (IJPP), Volume 24, Issue 6Pages 547–578

This paper presents a practical heap analysis technique, connection analysis, that can be used to disambiguate heap accesses in C programs. The technique is designed for analyzing programs that allocate many disjoint objects in the heap such as ...
24
Metrics
Total Citations24
article
August 1996
A Study of the EARTH-MANNA Multithreaded System
International Journal of Parallel Programming (IJPP), Volume 24, Issue 4Pages 319–348

Multithreaded architectures have been proposed for future multiprocessor systems. However, some open issues remain. Can multithreading be supported in a multiprocessor so that it can tolerate synchronization and communication latencies, with little ...
26
Metrics
Total Citations26
article
August 1996
A Partitioning-Independent Paradigm for Nested Data Parallelism
- Dean Engelhardt,
- Andrew Wendelborn
International Journal of Parallel Programming (IJPP), Volume 24, Issue 4Pages 291–317

A generalization of the data parallel model has been proposed by Blelloch which permits the nesting of data parallel operators to specify parallel computation across nested and irregular data structures. In this paper we consider the costs of supporting ...
0
Metrics
Total Citations0
article
April 1996
Minimizing Register Requirements of a Modulo Schedule via Optimum Stage Scheduling
International Journal of Parallel Programming (IJPP), Volume 24, Issue 2Pages 103–132

Modulo scheduling is an efficient technique for exploiting instruction level parallelism in a variety of loops, resulting in high performance code but increased register requirements. We present an approach that schedules the loop operations for minimum ...
8
Metrics
Total Citations8
article
April 1996
Hardware-Based Profiling: An Effective Technique for Profile-Driven Optimization
International Journal of Parallel Programming (IJPP), Volume 24, Issue 2Pages 187–206

Profile-based optimization can be used for instruction scheduling, loop scheduling, data preloading, function in-lining, and instruction cache performance enhancement. However, these techniques have not been embraced by software vendors because programs ...
11
Metrics
Total Citations11

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

All Publications

Publisher

Publication Date

Design patterns percolating to parallel programming framework implementation

A platform-independent distributed runtime for standard multithreaded Java

Alias analysis in Java with reference-set representation for high-performance computing

Restructuring computations for temporal data cache locality

Control Flow Regeneration for Software Pipelined Loops with Conditions

Data Dependence Analysis of Assembly Code

Loop Shifting for Loop Compaction

Nonsingular Data Transformations: Definition, Validity, and Applications

Reuse-Driven Tiling for Improving Data Locality

Combining Loop Transformations Considering Caches and Scheduling

Meld Scheduling: A Technique for Relaxing Scheduling Constraints

Quantitative Evaluation of Register Pressure on Software Pipelined Loops

Affine dependence classificatinon for communications minimization

Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs

Parameterized polyhedra and their vertices

Connection Analysis: A Practical Interprocedural Heap Analysis for C

A Study of the EARTH-MANNA Multithreaded System

A Partitioning-Independent Paradigm for Nested Data Parallelism

Minimizing Register Requirements of a Modulo Schedule via Optimum Stage Scheduling

Hardware-Based Profiling: An Effective Technique for Profile-Driven Optimization