The use of multiprocessor architectures, and of compilation for such architectures, to speed up the execution of numerical programs is investigated. Three types of parallelism in programs are exploited: (1) fine-grain: parallelism at the level of individual machine operations; (2) loop: parallelism between different iterations of the same loop; (3) coarse-grain: parallelism between different parts of a program.
Several multiprocessor architectures that exploit these types of parallelism are defined; each belongs to either the shared-memory multiprocessor class or the multiple-array-processor class. Parallel programs for each architecture are generated automatically from serial code by a Fortran compiler. Simulation is used to study the performance of each architecture under different compilation methods for each type of parallelism, and under the best combined use of all types together.
In addition, serial loops in parallel programs are studied to determine why they remain serial, how they affect performance, and whether the compiler or a programmer can make them parallel.
Cited By
- Wang C and Wang S (1992). Efficient Processor Assignment Algorithms and Loop Transformations for Executing Nested Parallel Loops on Multiprocessors, IEEE Transactions on Parallel and Distributed Systems, 3:1, (71-82), Online publication date: 1-Jan-1992.
- Chen D, Su H and Yew P (1990). The impact of synchronization and granularity on parallel systems, ACM SIGARCH Computer Architecture News, 18:2SI, (239-248), Online publication date: 1-Jun-1990.
- Chen D, Su H and Yew P The impact of synchronization and granularity on parallel systems Proceedings of the 17th annual international symposium on Computer Architecture, (239-248)
- Polychronopoulos C, Kuck D and Padua D (1989). Utilizing Multidimensional Loop Parallelism on Large Scale Parallel Processor Systems, IEEE Transactions on Computers, 38:9, (1285-1296), Online publication date: 1-Sep-1989.
- Yamana H, Marushima T, Hagiwara T and Muraoka Y System architecture of parallel processing system -Harry- Proceedings of the 2nd international conference on Supercomputing, (76-89)
- Arafeh B Vectorization and parallelization interactive assistant Proceedings of the 1988 ACM sixteenth annual conference on Computer science, (573-577)
- Padua D and Wolfe M (1986). Advanced compiler optimizations for supercomputers, Communications of the ACM, 29:12, (1184-1201), Online publication date: 1-Dec-1986.