Integrated circuits

Applied Filters

People

Publications

Conferences

Publication Date

17 Results for: Book/Issue: ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitectureEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,802,176 records)|Limit your search to The ACM Full-Text Collection (771,782 records)

Showing 1 - 17of17 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
June 2019
A stochastic-computing based deep learning framework using adiabatic quantum-flux-parametron superconducting technology
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 567–578https://doi.org/10.1145/3307650.3322270

The Adiabatic Quantum-Flux-Parametron (AQFP) superconducting technology has been recently developed, which achieves the highest energy efficiency among superconducting logic families, potentially 10^4--10⁵ gain compared with state-of-the-art CMOS. In 2016,...
32
769
Metrics
Total Citations32
Total Downloads769
Last 12 Months65
Last 6 weeks15
Get Access
research-article
Open Access
June 2019
CoNDA: efficient cache coherence support for near-data accelerators
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 629–642https://doi.org/10.1145/3307650.3322266

Specialized on-chip accelerators are widely used to improve the energy efficiency of computing systems. Recent advances in memory technology have enabled near-data accelerators (NDAs), which reside off-chip close to main memory and can yield further ...
64
2,497
Metrics
Total Citations64
Total Downloads2,497
Last 12 Months410
Last 6 weeks58
View online with eReader
PDF
research-article
June 2019
Energy-efficient video processing for virtual reality
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 91–103https://doi.org/10.1145/3307650.3322264

Virtual reality (VR) has huge potential to enable radically new applications, behind which spherical panoramic video processing is one of the backbone techniques. However, current VR systems reuse the techniques designed for processing conventional ...
23
1,080
Metrics
Total Citations23
Total Downloads1,080
Last 12 Months105
Last 6 weeks15
Get Access
research-article
Public Access
June 2019
Duality cache for data parallel acceleration
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 397–410https://doi.org/10.1145/3307650.3322257

Duality Cache is an in-cache computation architecture that enables general purpose data parallel applications to run on caches. This paper presents a holistic approach of building Duality Cache system stack with techniques of performing in-cache floating ...
72
2,902
Metrics
Total Citations72
Total Downloads2,902
Last 12 Months376
Last 6 weeks52
View online with eReader
PDF
research-article
June 2019
Laconic deep learning inference acceleration
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 304–317https://doi.org/10.1145/3307650.3322255

We present a method for transparently identifying ineffectual computations during inference with Deep Learning models. Specifically, by decomposing multiplications down to the bit level, the amount of work needed by multiplications during inference can ...
61
2,026
Metrics
Total Citations61
Total Downloads2,026
Last 12 Months139
Last 6 weeks13
Get Access
research-article
June 2019
SCU: a GPU stream compaction unit for graph processing
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 424–435https://doi.org/10.1145/3307650.3322254

Graph processing algorithms are key in many emerging applications in areas such as machine learning and data analytics. Although the processing of large scale graphs exhibits a high degree of parallelism, the memory access pattern tend to be highly ...
15
583
Metrics
Total Citations15
Total Downloads583
Last 12 Months44
Last 6 weeks7
Get Access
research-article
June 2019
Scalable interconnects for reconfigurable spatial architectures
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 615–628https://doi.org/10.1145/3307650.3322249

Recent years have seen the increased adoption of Coarse-Grained Reconfigurable Architectures (CGRAs) as flexible, energy-efficient compute accelerators. Obtaining performance using spatial architectures while supporting diverse applications requires a ...
19
970
Metrics
Total Citations19
Total Downloads970
Last 12 Months97
Last 6 weeks16
Get Access
research-article
Open Access
June 2019
AsmDB: understanding and mitigating front-end stalls in warehouse-scale computers
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 462–473https://doi.org/10.1145/3307650.3322234

The large instruction working sets of private and public cloud workloads lead to frequent instruction cache misses and costs in the millions of dollars. While prior work has identified the growing importance of this problem, to date, there has been ...
52
2,661
Metrics
Total Citations52
Total Downloads2,661
Last 12 Months571
Last 6 weeks107
View online with eReader
PDF
research-article
June 2019
Designing vertical processors in monolithic 3D
- Bhargava Gopireddy,
- Josep Torrellas
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 643–656https://doi.org/10.1145/3307650.3322233

A processor laid out vertically in stacked layers can benefit from reduced wire delays, low energy consumption, and a small footprint. Such a design can be enabled by Monolithic 3D (M3D), a technology that provides short wire lengths, good thermal ...
26
671
Metrics
Total Citations26
Total Downloads671
Last 12 Months135
Last 6 weeks12
Get Access
research-article
June 2019
TWiCe: preventing row-hammering by exploiting time window counters
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 385–396https://doi.org/10.1145/3307650.3322232

Computer systems using DRAM are exposed to row-hammer (RH) attacks, which can flip data in a DRAM row without directly accessing a row but by frequently activating its adjacent ones. There have been a number of proposals to prevent RH, but they either ...
64
862
Metrics
Total Citations64
Total Downloads862
Last 12 Months113
Last 6 weeks10
Get Access
research-article
June 2019
CROW: a low-cost substrate for improving DRAM performance, energy efficiency, and reliability
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 129–142https://doi.org/10.1145/3307650.3322231

DRAM has been the dominant technology for architecting main memory for decades. Recent trends in multi-core system design and large-dataset applications have amplified the role of DRAM as a critical system bottleneck. We propose Copy-Row DRAM (CROW), a ...
53
970
Metrics
Total Citations53
Total Downloads970
Last 12 Months76
Last 6 weeks14
Get Access
research-article
June 2019
Cambricon-F: machine learning computers with fractal von neumann architecture
- Yongwei Zhao,
- Zidong Du,
- Qi Guo,
- Shaoli Liu,
- Ling Li,
- Zhiwei Xu,
- Tianshi Chen,
- Yunji Chen
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 788–801https://doi.org/10.1145/3307650.3322226

Machine learning techniques are pervasive tools for emerging commercial applications and many dedicated machine learning computers on different scales have been deployed in embedded devices, servers, and data centers. Currently, most machine learning ...
13
1,542
Metrics
Total Citations13
Total Downloads1,542
Last 12 Months86
Last 6 weeks11
Get Access
research-article
Public Access
June 2019
Efficient metadata management for irregular data prefetching
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 449–461https://doi.org/10.1145/3307650.3322225

Temporal prefetchers have the potential to prefetch arbitrary memory access patterns, but they require large amounts of metadata that must typically be stored in DRAM. In 2013, the Irregular Stream Buffer (ISB), showed how this metadata could be cached ...
30
1,383
Metrics
Total Citations30
Total Downloads1,383
Last 12 Months282
Last 6 weeks33
View online with eReader
PDF
research-article
June 2019
Linebacker: preserving victim cache lines in idle register files of GPUs
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 183–196https://doi.org/10.1145/3307650.3322222

Modern GPUs suffer from cache contention due to the limited cache size that is shared across tens of concurrently running warps. To increase the per-warp cache size prior techniques proposed warp throttling which limits the number of active warps. Warp ...
12
939
Metrics
Total Citations12
Total Downloads939
Last 12 Months67
Last 6 weeks7
Get Access
research-article
June 2019
Cryogenic computer architecture modeling with memory-side case studies
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 774–787https://doi.org/10.1145/3307650.3322219

Modern computer architectures suffer from lack of architectural innovations, mainly due to the power wall and the memory wall. That is, architectural innovations become infeasible because they can prohibitively increase power consumption and their ...
21
1,368
Metrics
Total Citations21
Total Downloads1,368
Last 12 Months151
Last 6 weeks17
Get Access
research-article
June 2019
MnnFast: a fast and scalable system architecture for memory-augmented neural networks
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 250–263https://doi.org/10.1145/3307650.3322214

Memory-augmented neural networks are getting more attention from many researchers as they can make an inference with the previous history stored in memory. Especially, among these memory-augmented neural networks, memory networks are known for their huge ...
34
2,361
Metrics
Total Citations34
Total Downloads2,361
Last 12 Months197
Last 6 weeks19
Get Access
research-article
June 2019
Perceptron-based prefetch filtering
ISCA '19: Proceedings of the 46th International Symposium on Computer ArchitecturePages 1–13https://doi.org/10.1145/3307650.3322207

Hardware prefetching is an effective technique for hiding cache miss latencies in modern processor designs. Prefetcher performance can be characterized by two main metrics that are generally at odds with one another: coverage, the fraction of baseline ...
52
2,912
Metrics
Total Citations52
Total Downloads2,912
Last 12 Months193
Last 6 weeks27
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

A stochastic-computing based deep learning framework using adiabatic quantum-flux-parametron superconducting technology

CoNDA: efficient cache coherence support for near-data accelerators

Energy-efficient video processing for virtual reality

Duality cache for data parallel acceleration

Laconic deep learning inference acceleration

SCU: a GPU stream compaction unit for graph processing

Scalable interconnects for reconfigurable spatial architectures

AsmDB: understanding and mitigating front-end stalls in warehouse-scale computers

Designing vertical processors in monolithic 3D

TWiCe: preventing row-hammering by exploiting time window counters

CROW: a low-cost substrate for improving DRAM performance, energy efficiency, and reliability

Cambricon-F: machine learning computers with fractal von neumann architecture

Efficient metadata management for irregular data prefetching

Linebacker: preserving victim cache lines in idle register files of GPUs

Cryogenic computer architecture modeling with memory-side case studies

MnnFast: a fast and scalable system architecture for memory-augmented neural networks

Perceptron-based prefetch filtering