General and reference

Applied Filters

People

Publications

Conferences

Publication Date

32 Results for: Book/Issue: MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitectureEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,801,766 records)|Limit your search to The ACM Full-Text Collection (771,395 records)

Showing 1 - 20of32 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

Article
Free
December 1997
Resource-sensitive profile-directed data flow analysis for code optimization
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 358–368

Instruction schedulers employ code motion as a means of instruction reordering to enable scheduling of instructions at points where the resources required for their execution are available. In addition, driven by the profiling data, schedulers take ...
20
381
Metrics
Total Citations20
Total Downloads381
Last 12 Months44
Last 6 weeks5
View online with eReader
PDF
Article
Free
December 1997
Cache sensitive modulo scheduling
- F. Jesús Sánchez,
- Antonio González
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 338–348

This paper focuses on the interaction between software prefetching (both binding and nonbinding) and software pipelining for VLIW machines. First, it is shown that evaluating software pipelined schedules without considering memory effects can be rather ...
14
314
Metrics
Total Citations14
Total Downloads314
Last 12 Months22
Last 6 weeks6
View online with eReader
PDF
Article
Free
December 1997
MediaBench: a tool for evaluating and synthesizing multimedia and communicatons systems
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 330–335

Significant advances have been made in compilation technology for capitalizing on instruction-level parallelism (ILP). The vast majority of ILP compilation research has been conducted in the context of general-purpose computing, and more specifically ...
808
17,970
Metrics
Total Citations808
Total Downloads17,970
Last 12 Months53
Last 6 weeks9
View online with eReader
PDF
Article
Free
December 1997
Available paralellism in video applications
- Heng Liao,
- Andrew Wolfe
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 321–329

Most recent research in instruction-level parallelism has focused on general-purpose applications such as the SPEC benchmarks. Many quantitative experiments have been performed over the years measuring the impact of different execution models and ...
9
434
Metrics
Total Citations9
Total Downloads434
Last 12 Months67
Last 6 weeks24
View online with eReader
PDF
Article
Free
December 1997
Predicting data cache misses in non-numeric applications through correlation profiling
- Todd C. Mowry,
- Chi-Keung Luk
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 314–320

To maximize the benefit and minimize the overhead of software-based latency tolerance techniques, we would like to apply them precisely to the set of dynamic references that suffer cache misses. Unfortunately, the information provided by the state-of-...
25
371
Metrics
Total Citations25
Total Downloads371
Last 12 Months20
Last 6 weeks2
View online with eReader
PDF
Article
Free
December 1997
Procedure placement using temporal ordering information
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 303–313

Instruction cache performance is very important to instruction fetch efficiency and overall processor performance. The layout of an executable has a substantial effect on the cache miss rate during execution. This means that the performance of an ...
41
348
Metrics
Total Citations41
Total Downloads348
Last 12 Months27
Last 6 weeks6
View online with eReader
PDF
Article
Free
December 1997
ProfileMe: hardware support for instruction-level profiling on out-of-order processors
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 292–302

Profile data is valuable for identifying performance bottlenecks and guiding optimizations. Periodic sampling of a processor's performance monitoring hardware is an effective, unobtrusive way to obtain detailed profiles. Unfortunately, existing hardware ...
130
688
Metrics
Total Citations130
Total Downloads688
Last 12 Months28
Last 6 weeks6
View online with eReader
PDF
Article
Free
December 1997
Highly accurate data value prediction using hybrid predictors
- Kai Wang,
- Manoj Franklin
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 281–290

Data dependences (data flow constraints) present a major hurdle to the amount of instruction-level parallelism that can be exploited from a program. Recent work has suggested that the limits imposed by data dependences can be overcome to some extent ...
109
932
Metrics
Total Citations109
Total Downloads932
Last 12 Months43
Last 6 weeks5
View online with eReader
PDF
Article
Free
December 1997
Can program profiling support value prediction?
- Freddy Gabbay,
- Avi Mendelson
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 270–280

This paper explores the possibility of using program profiling to enhance the efficiency of value prediction. Value prediction attempts to eliminate true-data dependencies by predicting the outcome values of instructions at run-time and executing true-...
42
409
Metrics
Total Citations42
Total Downloads409
Last 12 Months63
Last 6 weeks8
View online with eReader
PDF
Article
Free
December 1997
Value profiling
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 259–269

Identifying variables as invariant or constant at compile-time allows the compiler to perform optimizations including constant folding, code specialization, and partial evaluation. Some variables, which cannot be labeled as constants, may exhibit semi-...
104
711
Metrics
Total Citations104
Total Downloads711
Last 12 Months43
Last 6 weeks19
View online with eReader
PDF
Article
Free
December 1997
The predictability of data values
- Yiannakis Sazeides,
- James E. Smith
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 248–258

The predictability of data values is studied at a fundamental level. Two basic predictor models are defined: Computational predictors perform an operation on previous values to yield predicted next values. Examples we study are stride value prediction (...
158
784
Metrics
Total Citations158
Total Downloads784
Last 12 Months50
Last 6 weeks11
View online with eReader
PDF
Article
Free
December 1997
Streamlining inter-operation memory communication via data dependence prediction
- Andreas Moshovos,
- Gurindar S. Sohi
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 235–245

We revisit memory hierarchy design viewing memory as an inter-operation communication agent. This perspective leads to the development of novel methods of performing inter-operation memory communication. We use data dependence prediction to identify and ...
72
439
Metrics
Total Citations72
Total Downloads439
Last 12 Months38
Last 6 weeks5
View online with eReader
PDF
Article
Free
December 1997
Microarchitecture support for improving the performance of load target prediction
- Chung-Ho Chen,
- Akida Wu
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 228–234

Presents a load target prediction scheme that mitigates the impact of load latency for modern microprocessors. The scheme uses a cache-like buffer to provide the base address, offset and operand size at the instruction fetching stage of a pipeline so ...
5
346
Metrics
Total Citations5
Total Downloads346
Last 12 Months15
Last 6 weeks1
View online with eReader
PDF
Article
Free
December 1997
Procedure based program compression
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 204–213

Cost and power consumption are two of the most important design factors for many embedded systems, particularly consumer devices. Products such as personal digital assistants, pagers with integrated data services and smart phones have fixed performance ...
30
299
Metrics
Total Citations30
Total Downloads299
Last 12 Months30
Last 6 weeks3
View online with eReader
PDF
Article
Free
December 1997
Improving code density using compression techniques
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 194–203

We propose a method for compressing programs in embedded processors where instruction memory size dominates cost. A post-compilation analyzer examines a program and replaces common sequences of instructions with a single instruction codeword. A ...
90
582
Metrics
Total Citations90
Total Downloads582
Last 12 Months27
Last 6 weeks3
View online with eReader
PDF
Article
Free
December 1997
The filter cache: an energy efficient memory structure
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 184–193

Most modern microprocessors employ one or two levels of on-chip caches in order to improve performance. These caches are typically implemented with static RAM cells and often occupy a large portion of the chip area. Not surprisingly, these caches often ...
191
1,336
Metrics
Total Citations191
Total Downloads1,336
Last 12 Months59
Last 6 weeks6
View online with eReader
PDF
Article
Free
December 1997
Initial results on the performance and cost of vector microprocessors
- Corinna G. Lee,
- Derek J. DeVries
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 171–182

Increasingly wider superscalar processors are experiencing diminishing performance returns while requiring larger portions of die area dedicated to control rather than datapath. As an alternative to using these processors to exploit parallelism ...
13
190
Metrics
Total Citations13
Total Downloads190
Last 12 Months46
Last 6 weeks7
View online with eReader
PDF
Article
Free
December 1997
Out-of-order vector architectures
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 160–170

Register renaming and out-of-order instruction issue are now commonly used in superscalar processors. These techniques can also be used to significant advantage in vector processors, as this paper shows. Performance is improved and available memory ...
22
594
Metrics
Total Citations22
Total Downloads594
Last 12 Months90
Last 6 weeks17
View online with eReader
PDF
Article
Free
December 1997
The multicluster architecture: reducing cycle time through partitioning
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 149–159

The multicluster architecture that we introduce offers a decentralized, dynamically-scheduled architecture, in which the register files, dispatch queue, and functional units of the architecture are distributed across multiple clusters, and each cluster ...
99
586
Metrics
Total Citations99
Total Downloads586
Last 12 Months46
Last 6 weeks3
View online with eReader
PDF
Article
Free
December 1997
Trace processors
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on MicroarchitecturePages 138–148

Traces are dynamic instruction sequences constructed and cached by hardware. A microarchitecture organized around traces is presented as a means for efficiently executing many instructions per cycle. Trace processors exploit both control flow and data ...
140
925
Metrics
Total Citations140
Total Downloads925
Last 12 Months71
Last 6 weeks8
View online with eReader
PDF