Book/Issue: SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis : Search

Applied Filters

People

Publications

Conferences

Publication Date

80 Results for: Book/Issue: SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,765,812 records)|Limit your search to The ACM Full-Text Collection (758,626 records)

Showing 1 - 20of80 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
November 2014
Microbank: architecting through-silicon interposer-based main memory systems
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 1059–1070https://doi.org/10.1109/SC.2014.91

Through-Silicon Interposer (TSI) has recently been proposed to provide high memory bandwidth and improve energy efficiency of the main memory system. However, the impact of TSI on main memory system architecture has not been well explored. While TSI ...
10
359
Metrics
Total Citations10
Total Downloads359
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2014
Using an adaptive HPC runtime system to reconfigure the cache hierarchy
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 1047–1058https://doi.org/10.1109/SC.2014.90

The cache hierarchy often consumes a large portion of a processor's energy. To save energy in HPC environments, this paper proposes software-controlled reconfiguration of the cache hierarchy with an adaptive runtime system. Our approach addresses the ...
3
166
Metrics
Total Citations3
Total Downloads166
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2014
Anton 2: raising the bar for performance and programmability in a special-purpose molecular dynamics supercomputer
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 41–53https://doi.org/10.1109/SC.2014.9

Anton 2 is a second-generation special-purpose supercomputer for molecular dynamics simulations that achieves significant gains in performance, programmability, and capacity compared to its predecessor, Anton 1. The architecture of Anton 2 is tailored ...
16
2,932
Metrics
Total Citations16
Total Downloads2,932
Last 12 Months22
Last 6 weeks0
Get Access
research-article
November 2014
ECC parity: a technique for efficient memory error resilience for multi-channel memory systems
- Xun Jian,
- Rakesh Kumar
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 1035–1046https://doi.org/10.1109/SC.2014.89

Servers and HPC systems often use a strong memory error correction code, or ECC, to meet their reliability and availability requirements. However, these ECCs often require significant capacity and/or power overheads. We observe that since memory ...
5
143
Metrics
Total Citations5
Total Downloads143
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2014
In-situ feature extraction of large scale combustion simulations using segmented merge trees
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 1020–1031https://doi.org/10.1109/SC.2014.88

The ever increasing amount of data generated by scientific simulations coupled with system I/O constraints are fueling a need for in-situ analysis techniques. Of particular interest are approaches that produce reduced data representations while ...
15
175
Metrics
Total Citations15
Total Downloads175
Last 12 Months1
Last 6 weeks0
Get Access
research-article
November 2014
Scalable computation of stream surfaces on large scale vector fields
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 1008–1019https://doi.org/10.1109/SC.2014.87

Stream surfaces and streamlines are two popular methods for visualizing three-dimensional flow fields. While several parallel streamline computation algorithms exist, relatively little research has been done to parallelize stream surface generation. ...
1
85
Metrics
Total Citations1
Total Downloads85
Last 12 Months2
Last 6 weeks1
Get Access
research-article
November 2014
High-performance computation of distributed-memory parallel 3D voronoi and delaunay tessellation
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 997–1007https://doi.org/10.1109/SC.2014.86

Computing a Voronoi or Delaunay tessellation from a set of points is a core part of the analysis of many simulated and measured datasets: N-body simulations, molecular dynamics codes, and LIDAR point clouds are just a few examples. Such computational ...
4
278
Metrics
Total Citations4
Total Downloads278
Last 12 Months4
Last 6 weeks3
Get Access
research-article
November 2014
Finding constant from change: revisiting network performance aware optimizations on IaaS clouds
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 982–993https://doi.org/10.1109/SC.2014.85

Network performance aware optimizations have long been an effective approach to optimizing distributed applications on traditional network environments. However, the assumptions of network topology or direct use of several measurements of pair-wise ...
7
125
Metrics
Total Citations7
Total Downloads125
Last 12 Months1
Last 6 weeks0
Get Access
research-article
November 2014
Reciprocal resource fairness: towards cooperative multiple-resource fair sharing in IaaS clouds
- Haikun Liu,
- Bingsheng He
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 970–981https://doi.org/10.1109/SC.2014.84

Resource sharing in virtualized environments have been demonstrated significant benefits to improve application performance and resource/energy efficiency. However, resource sharing, especially for multiple resource types, poses several severe and ...
11
252
Metrics
Total Citations11
Total Downloads252
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2014
FlexSlot: moving hadoop into the cloud with flexible slot management
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 959–969https://doi.org/10.1109/SC.2014.83

Load imbalance is a major source of overhead in Hadoop where the uneven distribution of input data among tasks can significantly delays the job completion. Running Hadoop in a private cloud opens up opportunities for mitigating data skew with elastic ...
6
276
Metrics
Total Citations6
Total Downloads276
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2014
Efficient shared-memory implementation of high-performance conjugate gradient benchmark and its application to unstructured matrices
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 945–955https://doi.org/10.1109/SC.2014.82

A new sparse high performance conjugate gradient benchmark (HPCG) has been recently released to address challenges in the design of sparse linear solvers for the next generation extreme-scale computing systems. Key computation, data access, and ...
19
311
Metrics
Total Citations19
Total Downloads311
Last 12 Months7
Last 6 weeks0
Get Access
research-article
November 2014
Domain decomposition preconditioners for communication-avoiding krylov methods on a hybrid CPU/GPU cluster
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 933–944https://doi.org/10.1109/SC.2014.81

Krylov subspace projection methods are widely used iterative methods for solving large-scale linear systems of equations. Researchers have demonstrated that communication-avoiding (CA) techniques can improve Krylov methods' performance on modern ...
7
213
Metrics
Total Citations7
Total Downloads213
Last 12 Months4
Last 6 weeks1
Get Access
research-article
November 2014
Parallelization of reordering algorithms for bandwidth and wavefront reduction
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 921–932https://doi.org/10.1109/SC.2014.80

Many sparse matrix computations can be speeded up if the matrix is first reordered. Reordering was originally developed for direct methods but it has recently become popular for improving the cache locality of parallel iterative solvers since reordering ...
8
199
Metrics
Total Citations8
Total Downloads199
Last 12 Months5
Last 6 weeks0
Get Access
research-article
November 2014
Real-time scalable cortical computing at 46 giga-synaptic OPS/watt with ~100× speedup in time-to-solution and ~100,000× reduction in energy-to-solution
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 27–38https://doi.org/10.1109/SC.2014.8

Drawing on neuroscience, we have developed a parallel, event-driven kernel for neurosynaptic computation, that is efficient with respect to computation, memory, and communication. Building on the previously demonstrated highly-optimized software ...
12
535
Metrics
Total Citations12
Total Downloads535
Last 12 Months7
Last 6 weeks0
Get Access
research-article
November 2014
Optimization of a multilevel checkpoint model with uncertain execution scales
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 907–918https://doi.org/10.1109/SC.2014.79

Future extreme-scale systems are expected to experience different types of failures affecting applications with different failure scales, from transient uncorrectable memory errors in processes to massive system outages. In this paper, we propose a ...
5
147
Metrics
Total Citations5
Total Downloads147
Last 12 Months1
Last 6 weeks0
Get Access
research-article
November 2014
Exploring automatic, online failure recovery for scientific applications at extreme scales
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 895–906https://doi.org/10.1109/SC.2014.78

Application resilience is a key challenge that must be addressed in order to realize the exascale vision. Process/node failures, an important class of failures, are typically handled today by terminating the job and restarting it from the last stored ...
24
371
Metrics
Total Citations24
Total Downloads371
Last 12 Months2
Last 6 weeks0
Get Access
research-article
November 2014
Understanding the effects of communication and coordination on checkpointing at scale
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 883–894https://doi.org/10.1109/SC.2014.77

Fault-tolerance poses a major challenge for future large-scale systems. Active research into coordinated, uncoordinated, and hybrid checkpointing systems has explored how the introduction of asynchrony can address anticipated scalability issues. However,...
12
198
Metrics
Total Citations12
Total Downloads198
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2014
DISC: a domain-interaction based programming model with support for heterogeneous execution
- Mehmet Can Kurt,
- Gagan Agrawal
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 869–880https://doi.org/10.1109/SC.2014.76

Several emerging trends are pointing to increasing heterogeneity among nodes and/or cores in HPC systems. Existing programming models, especially for distributed memory execution, typically have been designed to facilitate high performance on ...
0
160
Metrics
Total Citations0
Total Downloads160
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2014
Optimizing data locality for fork/join programs using constrained work stealing
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 857–868https://doi.org/10.1109/SC.2014.75

We present an approach to improving data locality across different phases of fork/join programs scheduled using work stealing. The approach consists of: (1) user-specified and automated approaches to constructing a steal tree, the schedule of steal ...
12
211
Metrics
Total Citations12
Total Downloads211
Last 12 Months1
Last 6 weeks0
Get Access
research-article
November 2014
Structure slicing: extending logical regions with fields
SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisPages 845–856https://doi.org/10.1109/SC.2014.74

Applications on modern supercomputers are increasingly limited by the cost of data movement, but mainstream programming systems have few abstractions for describing the structure of a program's data. Consequently, the burden of managing data movement, ...
11
122
Metrics
Total Citations11
Total Downloads122
Last 12 Months0
Last 6 weeks0
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Microbank: architecting through-silicon interposer-based main memory systems

Using an adaptive HPC runtime system to reconfigure the cache hierarchy

Anton 2: raising the bar for performance and programmability in a special-purpose molecular dynamics supercomputer

ECC parity: a technique for efficient memory error resilience for multi-channel memory systems

In-situ feature extraction of large scale combustion simulations using segmented merge trees

Scalable computation of stream surfaces on large scale vector fields

High-performance computation of distributed-memory parallel 3D voronoi and delaunay tessellation

Finding constant from change: revisiting network performance aware optimizations on IaaS clouds

Reciprocal resource fairness: towards cooperative multiple-resource fair sharing in IaaS clouds

FlexSlot: moving hadoop into the cloud with flexible slot management

Efficient shared-memory implementation of high-performance conjugate gradient benchmark and its application to unstructured matrices

Domain decomposition preconditioners for communication-avoiding krylov methods on a hybrid CPU/GPU cluster

Parallelization of reordering algorithms for bandwidth and wavefront reduction

Real-time scalable cortical computing at 46 giga-synaptic OPS/watt with ~100× speedup in time-to-solution and ~100,000× reduction in energy-to-solution

Optimization of a multilevel checkpoint model with uncertain execution scales

Exploring automatic, online failure recovery for scientific applications at extreme scales

Understanding the effects of communication and coordination on checkpointing at scale

DISC: a domain-interaction based programming model with support for heterogeneous execution

Optimizing data locality for fork/join programs using constrained work stealing

Structure slicing: extending logical regions with fields