Proceedings of the 14th annual international symposium on Computer architecture

ISCA '87: Proceedings of the 14th annual international symposium on Computer architecture

June 1987

1987 Proceeding

Editor:
D. St. Clair

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

ISCA87: The 14th Annual International Symposium on Computer Architecture Pittsburgh Pennsylvania USA June 2 - 5, 1987

ISBN:

978-0-8186-0776-9

Published:

01 June 1987

Sponsors:

SIGARCH

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Next Conference

ISCA '25

Sponsor:
sigarch

The 52nd Annual International Symposium on Computer Architecture

June 21 - 25, 2025

Tokyo , Japan

ISCA '25 website

Reflects downloads up to 10 Oct 2024Bibliometrics

Citation Count

1,203

Downloads (6 weeks)

320

Downloads (12 months)

2,312

Downloads (cumulative)

20,824

Sections

ISCA '87: Proceedings of the 14th annual international symposium on Computer architecture

1987

Previous Next

Abstract

No abstract available.

Select All

Export Citations Save to Binder

Article

Free

Branch folding in the CRISP microprocessor: reducing branch delay to zero

D. R. Ditzel,
H. R. McLellan

Pages 2–8https://doi.org/10.1145/30350.30351

A new method of implementing branch instructions is presented. This technique has been implemented in the CRISP Microprocessor. With a combination of hardware and software techniques the execution time cost for many branches can be effectively reduced ...

- 129
- 1,195
Metrics
Total Citations129
Total Downloads1,195
Last 12 Months324
Last 6 weeks19

Abstract
View online with eReader
PDF

Article

Free

An evaluation of branch architectures

J. A. DeRosa,
H. M. Levy

Pages 10–16https://doi.org/10.1145/30350.30352

Branch instructions form a significant fraction of executed instructions, and their design is thus a crucial component of any architecture. This paper examines three alternatives in the design of branch instructions: delayed vs. non-delayed branches, ...

- 52
- 760
Metrics
Total Citations52
Total Downloads760
Last 12 Months179
Last 6 weeks17

Abstract
View online with eReader
PDF

Article

Free

Checkpoint repair for out-of-order execution machines

W. W. Hwu,
Y. N. Patt

Pages 18–26https://doi.org/10.1145/30350.30353

Out-of-order execution and branch prediction are two mechanisms that can be used profitably in the design of Supercomputers to increase performance. Unfortunately this means there must be some kind of repair mechanism, since situations do occur that ...

- 93
- 1,198
Metrics
Total Citations93
Total Downloads1,198
Last 12 Months115
Last 6 weeks17

Abstract
View online with eReader
PDF

Article

Free

Instruction issue logic for high-performance, interruptable pipelined processors

G. S. Sohi,
S. Vajapeyam

Pages 27–34https://doi.org/10.1145/30350.30354

The performance of pipelined processors is severely limited by data dependencies. In order to achieve high performance, a mechanism to alleviate the effects of data dependencies must exist. If a pipelined CPU with multiple functional units is to be used ...

- 92
- 1,344
Metrics
Total Citations92
Total Downloads1,344
Last 12 Months61
Last 6 weeks18

Abstract
View online with eReader
PDF

Article

Free

Fast temporary storage for serial and parallel execution

J. Swensen,
Y. Patt

Pages 35–43https://doi.org/10.1145/30350.30355

There is an apparent conflict between the hardware requirements for fast parallel execution and the hardware requirements for fast serial execution. For example, fast vector execution is achieved by maintaining high execution concurrency over extended ...

- 2
- 350
Metrics
Total Citations2
Total Downloads350
Last 12 Months8
Last 6 weeks1

Abstract
View online with eReader
PDF

Article

Free

Performance analysis and design of a logic simulation machine

K. Wong,
M. A. Franklin

Pages 46–55https://doi.org/10.1145/30350.30356

The high costs associated with logic simulation of large VLSI circuits has led to the need for new computer architectures tailored to the simulation task. Such architectures have the potential for significant speed-ups over software-based logic ...

- 12
- 293
Metrics
Total Citations12
Total Downloads293
Last 12 Months39
Last 6 weeks6

Abstract
View online with eReader
PDF

Article

Free

A modular systolic architecture for image convolutions

K. Doshi,
P. Varman

Pages 56–63https://doi.org/10.1145/30350.30357

This paper describes a modular, systolic design for two-dimensional convolution which is a frequent and computationally intensive operation in low-level image processing. The design consists of a one-dimensional array of homogeneous cells, each with a ...

- 5
- 414
Metrics
Total Citations5
Total Downloads414
Last 12 Months16
Last 6 weeks4

Abstract
View online with eReader
PDF

Article

Free

A template matching algorithm using optically-connected 3-D VLSI architecture

S. Fujita,
R. Aibara,
M. Yamashita,
T. Ae

Pages 64–70https://doi.org/10.1145/30350.30358

Three-dimensional VLSI (in short, 3-D VLSI) is a new device technology that is expected to realize high performance systems. In this paper, we propose an image processing architecture based on 3-D VLSI consisting of optically-connected layers. Since the ...

- 5
- 292
Metrics
Total Citations5
Total Downloads292
Last 12 Months26
Last 6 weeks6

Abstract
View online with eReader
PDF

Article

Free

Mapping data flow programs on a VLSI array of processors

B. Mendelson,
G. M. Silberman

Pages 72–80https://doi.org/10.1145/30350.30359

With the advent of VLSI, relatively large processing arrays may be realized in a single VLSI chip. Such regularly structured arrays take considerably less time to design and test, and fault-tolerance can easily be introduced into them. However, only a ...

- 15
- 453
Metrics
Total Citations15
Total Downloads453
Last 12 Months49
Last 6 weeks10

Abstract
View online with eReader
PDF

Article

Free

Analytical modeling and architectural modifications of a dataflow computer

D. Ghosal,
L. N. Bhuyan

Pages 81–89https://doi.org/10.1145/30350.30360

Dataflow computers are an alternative to the von Neumann architectures and are capable of exploiting large amount of parallelism inherent in many computer applications. This paper deals with the performance analysis of the Manchester dataflow computer ...

- 6
- 346
Metrics
Total Citations6
Total Downloads346
Last 12 Months33
Last 6 weeks11

Abstract
View online with eReader
PDF

Article

Free

A unified resource management and execution control mechanism for data flow machines

M. Takesue

Pages 90–97https://doi.org/10.1145/30350.30361

This paper presents a unified resource management and execution control mechanism for data flow machines. The mechanism integrates load control, depth-first execution control, cache memory control and a load balancing mechanism. All of these mechanisms ...

- 15
- 286
Metrics
Total Citations15
Total Downloads286
Last 12 Months44
Last 6 weeks10

Abstract
View online with eReader
PDF

Article

Free

High performance integrated Prolog processor IPP

S. Abe,
T. Bandoh,
S. Yamaguchi,
K. Kurosawa,
K. Kiriyama

Pages 100–107https://doi.org/10.1145/30350.30362

To realize the highest performance possible for a sequential processor, and to realize utilization of a large amount of existing software, an integrated Prolog processor (IPP) and its optimized compiler are now being developed.

A tagged architecture ...

- 17
- 292
Metrics
Total Citations17
Total Downloads292
Last 12 Months28
Last 6 weeks4

Abstract
View online with eReader
PDF

Article

Free

Performance studies of a parallel Prolog architecture

B. S. Fagin,
A. M. Despain

Pages 108–116https://doi.org/10.1145/30350.30363

This paper presents a new multiprocessor architecture for the parallel execution of logic programs, developed as part of the Aquarius Project. This architecture is designed to support AND-parallelism, OR-parallelism, and intelligent backtracking. We ...

- 10
- 245
Metrics
Total Citations10
Total Downloads245
Last 12 Months29
Last 6 weeks3

Abstract
View online with eReader
PDF

Article

Free

An experimental VLSI Prolog interpreter: preliminary measurements and results

P. L. Civera,
F. Maddaleno,
G. L. Piccinini,
M. Zamboni

Pages 117–126https://doi.org/10.1145/30350.30364

This work presents the preliminary results of a project oriented to the design and VLSI implementation of a Prolog interpreter. Even if the interpretative approach is being considered an inefficient way to execute high level languages when compared to ...

- 4
- 275
Metrics
Total Citations4
Total Downloads275
Last 12 Months46
Last 6 weeks6

Abstract
View online with eReader
PDF

Article

Free

Deterministic and stochastic modeling of parallel garbage collection: towards real-time criteria

O. Ridoux

Pages 128–136https://doi.org/10.1145/30350.30365

The study of garbage collection for a logic programming language machine has exhibited fundamental differences with the more popular functional programming garbage collection. These differences yield behaviours that cannot be observed with classical ...

- 0
- 245
Metrics
Total Citations0
Total Downloads245
Last 12 Months12
Last 6 weeks3

Abstract
View online with eReader
PDF

Article

The sharing of environment in AND-OR-parallel execution of logic programs

C. Sun,
Y. Tsu

Pages 137–144https://doi.org/10.1145/30350.30366

- 4
Metrics
Total Citations4

Article

Free

Architectural issues in designing symbolic processors in optics

A. Guha,
R. Ramnarayan,
M. Derstine

Pages 145–151https://doi.org/10.1145/30350.30367

This paper analyzes potential optical architectures for AI applications (such as knowledge-based systems). Our goal was to investigate architectures most suitable for implementation completely in optics. While optical computing appears to hold much ...

- 1
- 1,190
Metrics
Total Citations1
Total Downloads1,190
Last 12 Months41
Last 6 weeks12

Abstract
View online with eReader
PDF

Article

Free

Rearrangeability of multistage shuffle/exchange networks

A. Varma,
C. S. Raghavendra

Pages 154–162https://doi.org/10.1145/30350.30368

In this paper we study the rearrangeability of multistage shuffle/exchange networks. Although a theoretical lower bound of (2 log₂N - 1) stages for rearrangeability of a network with N = 2ⁿ inputs and outputs has been known, the sufficiency of (2 log₂N -...

- 4
- 430
Metrics
Total Citations4
Total Downloads430
Last 12 Months48
Last 6 weeks12

Abstract
View online with eReader
PDF

Article

Free

Optimized mesh-connected networks for SIMD and MIMD architectures

R. Beivide,
E. Herrada,
J. L. Balcazar,
J. Labarta

Pages 163–170https://doi.org/10.1145/30350.30369

A class of mesh networks with wrap-around links is obtained from a class of circulant graphs by means of a graph isomorphism. We demonstrate how to obtain, from the adjacency pattern of the graph, simple parameters that serve to construct a planar ...

- 16
- 604
Metrics
Total Citations16
Total Downloads604
Last 12 Months43
Last 6 weeks6

Abstract
View online with eReader
PDF

Article

Free

Performance evaluation of reduced bandwidth multistage interconnection networks

D. T. Harper,
J. R. Jump

Pages 171–175https://doi.org/10.1145/30350.30370

This paper presents and evaluates a class of buffered interconnection networks which provide performance and cost levels intermediate to a bus and a delta network. These networks, referred to as hybrid networks, are formed by beginning with a delta ...

- 2
- 279
Metrics
Total Citations2
Total Downloads279
Last 12 Months36
Last 6 weeks6

Abstract
View online with eReader
PDF

Article

Free

Hardware support for interprocess communication

U. Ramachandran,
M. Solomon,
M. Vernon

Pages 178–188https://doi.org/10.1145/30350.30371

In recent years there has been increasing interest in message-based operating systems, particularly in distributed environments. Such systems consist of a small message-passing kernel supporting a collection of system server processes that provide such ...

- 7
- 940
Metrics
Total Citations7
Total Downloads940
Last 12 Months85
Last 6 weeks8

Abstract
View online with eReader
PDF

Article

Free

Architecture of a message-driven processor

W. J. Dally,
L. Chao,
A. Chien,
S. Hassoun,
W. Horwat,
J. Kaplan,
P. Song,
B. Totty,
S. Wills

Pages 189–196https://doi.org/10.1145/30350.30372

We propose a machine architecture for a high-performance processing node for a message-passing, MIMD concurrent computer. The principal mechanisms for attaining this goal are the direct execution and buffering of messages and a memory-based architecture ...

- 136
- 540
Metrics
Total Citations136
Total Downloads540
Last 12 Months34
Last 6 weeks9

Abstract
View online with eReader
PDF

Article

Free

Effect of storage allocation/reclamation methods on parallelism and storage requirements

M. Kumar

Pages 197–205https://doi.org/10.1145/30350.30373

The write after read/write synchronizations (the anti- and output-dependence constraints) inhibit the parallelism exhibited by Fortran programs. These constraints can be avoided by allocating storage for the values generated in a program dynamically, so ...

- 12
- 218
Metrics
Total Citations12
Total Downloads218
Last 12 Months23
Last 6 weeks7

Abstract
View online with eReader
PDF

Article

Free

Cache design of a sub-micron CMOS system/370

J. H. Chang,
H. Chao,
K. So

Pages 208–213https://doi.org/10.1145/30350.30374

An innovative cache accessing scheme based on high MRU (most recently used) hit ratio [1] is proposed for the design of a one-cycle cache in a CMOS implementation of System/370. It is shown that with this scheme the cache access time is reduced by 30 ~ ...

- 59
- 789
Metrics
Total Citations59
Total Downloads789
Last 12 Months63
Last 6 weeks13

Abstract
View online with eReader
PDF

Article

Free

An architectural perspective on a memory access controller

M. Freeman

Pages 214–223https://doi.org/10.1145/30350.30375

In this paper a CMOS memory access controller chip is described that provides the basis for achieving high-performance 68020-based (68030-based) systems. This controller matches the speed of the memory system to that of the microprocessor by providing a ...

- 2
- 473
Metrics
Total Citations2
Total Downloads473
Last 12 Months38
Last 6 weeks5

Abstract
View online with eReader
PDF

Article

Free

Organization and analysis of a gracefully-degrading interleaved memory system

K. Cheung,
G. Sohi,
K. Saluja,
D. Pradhan

Pages 224–231https://doi.org/10.1145/30350.30376

A hardware mechanism has been proposed to reconfigure an interleaved memory system. The reconfiguration scheme is such that, at any instant all fault-free memory banks in the memory system are utilized in interleaved manner. A performance metric is ...

- 5
- 272
Metrics
Total Citations5
Total Downloads272
Last 12 Months42
Last 6 weeks5

Abstract
View online with eReader
PDF

Article

Free

Correct memory operation of cache-based multiprocessors

C. Scheurich,
M. Dubois

Pages 234–243https://doi.org/10.1145/30350.30377

This paper shows that cache coherence protocols can implement indivisible synchronization primitives reliably and can also enforce sequential consistency. Sequential consistency provides a commonly accepted model of behavior of multiprocessors. We ...

- 111
- 1,015
Metrics
Total Citations111
Total Downloads1,015
Last 12 Months89
Last 6 weeks11

Abstract
View online with eReader
PDF

Article

Free

Hierarchical cache/bus architecture for shared memory multiprocessors

A. W. Wilson

Pages 244–252https://doi.org/10.1145/30350.30378

A new, large scale multiprocessor architecture is presented in this paper. The architecture consists of hierarchies of shared buses and caches. Extended versions of shared bus multicache coherency protocols are used to maintain coherency among all ...

- 183
- 1,773
Metrics
Total Citations183
Total Downloads1,773
Last 12 Months125
Last 6 weeks18

Abstract
View online with eReader
PDF

Article

Free

Multiprocessor cache design considerations

R. L. Lee,
P. C. Yew,
D. H. Lawrie

Pages 253–262https://doi.org/10.1145/30350.30379

In this paper, cache design is explored for large high-performance multiprocessors with hundreds or thousands of processors and memory modules interconnected by a pipe-lined multi-stage network. The majority of the multiprocessor cache studies in the ...

- 44
- 1,204
Metrics
Total Citations44
Total Downloads1,204
Last 12 Months89
Last 6 weeks9

Abstract
View online with eReader
PDF

Article

Free

Performance evaluation of multiple register sets

R. J. Eickemeyer,
J. H. Patel

Pages 264–271https://doi.org/10.1145/30350.30380

In this paper a DEC VAX with multiple register sets is evaluated under many differently sized register sets. Both the number of register sets and the number of registers per set were varied. Performance, measured in terms of memory traffic, is compared ...

- 11
- 346
Metrics
Total Citations11
Total Downloads346
Last 12 Months85
Last 6 weeks5

Abstract
View online with eReader
PDF

Save to Binder

Create a New Binder

Name

Contributors

D St. Clair
- Publication Years
- Publication counts0
- Citation count0
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article0
View Full Profile

Index Terms

Proceedings of the 14th annual international symposium on Computer architecture
1. Computer systems organization
2. Hardware

Comments

Recommendations

CompSysTech '13: Proceedings of the 14th International Conference on Computer Systems and Technologies
CSL-LICS '14: Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS)
ISCA '21: Proceedings of the 48th Annual International Symposium on Computer Architecture

Acceptance Rates

Overall Acceptance Rate 543 of 3,203 submissions, 17%

Year	Submitted	Accepted	Rate
ISCA '22	400	67	17%
ISCA '19	365	62	17%
ISCA '17	322	54	17%
ISCA '13	288	56	19%
ISCA '12	262	47	18%
ISCA '08	259	37	14%
ISCA '06	234	31	13%
ISCA '05	194	45	23%
ISCA '04	217	31	14%
ISCA '03	184	36	20%
ISCA '02	180	27	15%
ISCA '01	163	24	15%
ISCA '99	135	26	19%
Overall	3,203	543	17%

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Save to Binder

Index Terms

Recommendations

CompSysTech '13: Proceedings of the 14th International Conference on Computer Systems and Technologies

CSL-LICS '14: Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS)

ISCA '21: Proceedings of the 48th Annual International Symposium on Computer Architecture

Acceptance Rates