- poster, September 2016
POSTER: hVISC: A Portable Abstraction for Heterogeneous Parallel Systems
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 443–445, https://doi.org/10.1145/2967938.2976039
Programming heterogeneous parallel systems can be extremely complex because a single system may include multiple different parallelism models, instruction sets, and memory hierarchies, and different systems use different combinations of these features. ...
- poster, September 2016
POSTER: Exploiting Asymmetric Multi-Core Processors with Flexible System Software
- Kallia Chronaki,
- Miquel Moretó,
- Marc Casas,
- Alejandro Rico,
- Rosa M. Badia,
- Eduard Ayguadé,
- Jesus Labarta,
- Mateo Valero
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 415–417, https://doi.org/10.1145/2967938.2976038
Energy efficiency has become the main challenge for high performance computing (HPC). The use of mobile asymmetric multi-core architectures to build future multi-core systems is an approach towards energy savings while keeping high performance. However, ...
- poster, September 2016
POSTER: Hybrid Data Dependence Analysis for Loop Transformations
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 439–440, https://doi.org/10.1145/2967938.2974059
Loop optimizations span from vectorization, scalar promotion, loop invariant code motion, software pipelining to loop fusion, skewing, tiling and loop parallelization. These transformations are essential in the quest for automated high-performance code ...
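As a rough illustration of the kind of question such a dependence analysis must answer (this sketch is not from the paper, and the arrays are hypothetical), consider two loops: one with no loop-carried dependence and one whose iterations depend on each other.

```cpp
#include <cstddef>
#include <vector>

// Minimal sketch: two loops a dependence analysis must distinguish before
// applying transformations such as parallelization, fusion, or tiling.
void independent(std::vector<double>& a, const std::vector<double>& b) {
    // No loop-carried dependence: each iteration reads only b[i] and writes
    // only a[i], so the loop can be parallelized or vectorized freely.
    for (std::size_t i = 0; i < a.size(); ++i)
        a[i] = 2.0 * b[i];
}

void carried(std::vector<double>& a) {
    // Loop-carried dependence: iteration i reads a[i - 1] written by the
    // previous iteration, so naive parallelization would be unsafe.
    for (std::size_t i = 1; i < a.size(); ++i)
        a[i] = a[i] + a[i - 1];
}
```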
- poster, September 2016
POSTER: Collective Dynamic Parallelism for Directive Based GPU Programming Languages and Compilers
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 423–424, https://doi.org/10.1145/2967938.2974056
Early programs for GPU (Graphics Processing Units) acceleration were based on a flat, bulk parallel programming model, in which programs had to perform a sequence of kernel launches from the host CPU. In the latest releases of these devices, dynamic (or ...
- poster, September 2016
POSTER: Pagoda: A Runtime System to Maximize GPU Utilization in Data Parallel Tasks with Limited Parallelism
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 449–450, https://doi.org/10.1145/2967938.2974055
Massively multithreaded GPUs achieve high throughput by running thousands of threads in parallel. To fully utilize the hardware, contemporary workloads spawn work to the GPU in bulk by launching large tasks, where each task is a kernel that contains ...
- poster, September 2016
POSTER: An Optimization of Dataflow Architectures for Scientific Applications
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 441–442, https://doi.org/10.1145/2967938.2974054
Dataflow computing has proved to be promising in high-performance computing. However, traditional dataflow architectures are general-purpose and not efficient enough when dealing with typical scientific applications due to low utilization of function ...
- abstract, September 2016
Student Research Poster: Software Out-of-Order Execution for In-Order Architectures
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Page 458, https://doi.org/10.1145/2967938.2971466
Processor cores are divided into two categories: fast and power-hungry out-of-order processors, and efficient, but slower in-order processors. To achieve high performance with low-energy budgets, this proposal aims to deliver out-of-order processing by ...
- research-article, September 2016
A DSL Compiler for Accelerating Image Processing Pipelines on FPGAs
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 327–338, https://doi.org/10.1145/2967938.2967969
This paper describes an automatic approach to accelerate image processing pipelines using FPGAs. An image processing pipeline can be viewed as a graph of interconnected stages that processes images successively. Each stage typically performs a point-...
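For readers unfamiliar with the pipeline model the abstract describes, here is a minimal sketch (hypothetical stages, not the paper's DSL) of an image pipeline as a chain of point-wise stages, each consuming the previous stage's output.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Minimal sketch: an image pipeline as a graph (here a simple chain) of
// point-wise stages. Stage names and constants are illustrative only.
using Image = std::vector<std::uint8_t>;

Image brighten(const Image& in) {   // point-wise stage 1
    Image out(in.size());
    for (std::size_t i = 0; i < in.size(); ++i)
        out[i] = static_cast<std::uint8_t>(std::min(255, in[i] + 20));
    return out;
}

Image threshold(const Image& in) {  // point-wise stage 2
    Image out(in.size());
    for (std::size_t i = 0; i < in.size(); ++i)
        out[i] = in[i] > 128 ? 255 : 0;
    return out;
}

Image pipeline(const Image& in) {   // stages composed back to back
    return threshold(brighten(in));
}
```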
- research-article, September 2016
A Static Cut-off for Task Parallel Programs
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 139–150, https://doi.org/10.1145/2967938.2967968
Task parallel models supporting dynamic and hierarchical parallelism are believed to offer a promising direction to achieving higher performance and programmability. Divide-and-conquer is the most frequently used idiom in task parallel models, which ...
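A minimal sketch of the cut-off idiom in divide-and-conquer task parallelism (illustrative only, not the paper's technique; the threshold value and OpenMP usage are assumptions): below a chosen size, recursion stops spawning tasks and falls back to a serial loop so that task-creation overhead does not dominate small subproblems.

```cpp
#include <cstddef>

// Assumed cut-off threshold; in practice it is tuned per machine or, as the
// paper's title suggests, derived statically.
constexpr std::size_t CUTOFF = 1024;

// Divide-and-conquer sum with a cut-off. Assumes it is invoked from inside an
// OpenMP parallel/single region; without OpenMP the pragmas are ignored and
// the code runs serially with the same result.
long long sum(const long long* a, std::size_t n) {
    if (n <= CUTOFF) {                  // serial leaf: no task overhead
        long long s = 0;
        for (std::size_t i = 0; i < n; ++i) s += a[i];
        return s;
    }
    long long left = 0, right = 0;
    #pragma omp task shared(left)       // spawn a child task for one half
    left = sum(a, n / 2);
    right = sum(a + n / 2, n - n / 2);  // compute the other half here
    #pragma omp taskwait                // join before combining
    return left + right;
}
```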
- research-article, September 2016
Resource Conscious Reuse-Driven Tiling for GPUs
- Prashant Singh Rawat,
- Changwan Hong,
- Mahesh Ravishankar,
- Vinod Grover,
- Louis-Noel Pouchet,
- Atanas Rountev,
- P. Sadayappan
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 99–111, https://doi.org/10.1145/2967938.2967967
Computations involving successive application of 3D stencil operators are widely used in many application domains, such as image processing, computational electromagnetics, seismic processing, and climate modeling. Enhancement of temporal and spatial ...
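As background for the abstract above, here is a minimal, untiled sketch of a 7-point 3D stencil sweep (illustrative only, not the paper's code). Reuse-driven tiling would block the i/j/k loops so that neighbouring points stay resident in cache or shared memory across iterations.

```cpp
#include <vector>

// One sweep of a 7-point 3D stencil on an n x n x n grid stored row-major.
// Assumes in and out both have n*n*n elements; coefficients are arbitrary.
void stencil7(const std::vector<float>& in, std::vector<float>& out, int n) {
    auto idx = [n](int i, int j, int k) { return (i * n + j) * n + k; };
    for (int i = 1; i < n - 1; ++i)
        for (int j = 1; j < n - 1; ++j)
            for (int k = 1; k < n - 1; ++k)
                out[idx(i, j, k)] = 0.4f * in[idx(i, j, k)]
                    + 0.1f * (in[idx(i - 1, j, k)] + in[idx(i + 1, j, k)]
                            + in[idx(i, j - 1, k)] + in[idx(i, j + 1, k)]
                            + in[idx(i, j, k - 1)] + in[idx(i, j, k + 1)]);
}
```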
- research-article, September 2016
Reducing Cache Coherence Traffic with Hierarchical Directory Cache and NUMA-Aware Runtime Scheduling
- Paul Caheny,
- Marc Casas,
- Miquel Moretó,
- Hervé Gloaguen,
- Maxime Saintes,
- Eduard Ayguadé,
- Jesús Labarta,
- Mateo Valero
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 275–286, https://doi.org/10.1145/2967938.2967962
Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provide for scaling core count and memory capacity. Also, the flat memory address space they offer considerably improves programmability. However, ccNUMA ...
- research-article, September 2016
Speculatively Exploiting Cross-Invocation Parallelism
- Jialu Huang,
- Prakash Prabhu,
- Thomas B. Jablin,
- Soumyadeep Ghosh,
- Sotiris Apostolakis,
- Jae W. Lee,
- David I. August
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 207–221, https://doi.org/10.1145/2967938.2967959
Automatic parallelization has shown promise in producing scalable multi-threaded programs for multi-core architectures. Most existing automatic techniques parallelize independent loops and insert global synchronization between loop invocations. For ...
- research-article, September 2016
Automatically Exploiting Implicit Pipeline Parallelism from Multiple Dependent Kernels for GPUs
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 341–352, https://doi.org/10.1145/2967938.2967952
Execution of GPGPU workloads consists of different stages including data I/O on the CPU, memory copy between the CPU and GPU, and kernel execution. While the GPU can remain idle during I/O and memory copy, prior work has shown that overlapping data movement ...
- research-article, September 2016
Reduction Drawing: Language Constructs and Polyhedral Compilation for Reductions on GPU
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 87–97, https://doi.org/10.1145/2967938.2967950
Reductions are common in scientific and data-crunching codes, and a typical source of bottlenecks on massively parallel architectures such as GPUs. Reductions are memory-bound, and achieving peak performance involves sophisticated optimizations. There ...
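A minimal CPU-side sketch of a reduction (not the paper's GPU code generation, and the OpenMP clause is an assumption used only for illustration): all elements are combined into one value with an associative operator, and each element is read once with little reuse, which is why reductions tend to be memory-bound.

```cpp
#include <cstddef>
#include <vector>

// Dot product as a reduction: combine all x[i] * y[i] into one accumulator.
// Without OpenMP the pragma is ignored and the loop runs serially.
double dot(const std::vector<double>& x, const std::vector<double>& y) {
    double acc = 0.0;
    const std::ptrdiff_t n = static_cast<std::ptrdiff_t>(x.size());
    #pragma omp parallel for reduction(+:acc)   // parallel combine with +
    for (std::ptrdiff_t i = 0; i < n; ++i)
        acc += x[i] * y[i];
    return acc;
}
```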
- research-article, September 2016
Hash Map Inlining
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 235–246, https://doi.org/10.1145/2967938.2967949
Scripting languages like Javascript and PHP are widely used to implement application logic for dynamically-generated web pages. Their popularity is due in large part to their flexible syntax and dynamic type system, which enable rapid turnaround time ...
- research-article, September 2016
Optimizing Indirect Memory References with milk
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 299–312, https://doi.org/10.1145/2967938.2967948
Modern applications such as graph and data analytics, when operating on real world data, have working sets much larger than cache capacity and are bottlenecked by DRAM. To make matters worse, DRAM bandwidth is increasing much slower than per CPU core ...
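For context on what an "indirect memory reference" is (a plain C++ sketch, not milk's syntax): the access pattern data[idx[i]] is typical of graph and data analytics, and when the index stream is effectively random and the data far exceeds cache capacity, nearly every access misses to DRAM; work in this space reorders or batches such accesses to recover locality.

```cpp
#include <cstddef>
#include <vector>

// Indirect (gather) access pattern: the index array decides where each load
// goes, so consecutive iterations touch unrelated cache lines.
double gather_sum(const std::vector<double>& data,
                  const std::vector<std::size_t>& idx) {
    double s = 0.0;
    for (std::size_t i = 0; i < idx.size(); ++i)
        s += data[idx[i]];   // data[idx[i]] jumps around memory
    return s;
}
```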
- research-article, September 2016
Fusion of Parallel Array Operations
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 71–85, https://doi.org/10.1145/2967938.2967945
We address the problem of fusing array operations based on criteria such as shape compatibility, data reuse, and minimizing for data reuse, the fusion problem has been formulated as a static weighted graph partitioning problem (known as the Weighted ...
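To make the fusion idea concrete, here is a minimal before/after sketch (illustrative only, not the paper's framework): two element-wise operations over arrays of the same shape are fused into one loop, eliminating the temporary array and reusing each element while it is still in registers or cache.

```cpp
#include <cstddef>
#include <vector>

// Unfused: two passes over the data and an intermediate array t.
std::vector<double> unfused(const std::vector<double>& a,
                            const std::vector<double>& b) {
    std::vector<double> t(a.size()), r(a.size());
    for (std::size_t i = 0; i < a.size(); ++i) t[i] = a[i] + b[i];  // op 1
    for (std::size_t i = 0; i < a.size(); ++i) r[i] = t[i] * 2.0;   // op 2
    return r;
}

// Fused: same result in one pass, no temporary, better data reuse.
std::vector<double> fused(const std::vector<double>& a,
                          const std::vector<double>& b) {
    std::vector<double> r(a.size());
    for (std::size_t i = 0; i < a.size(); ++i)
        r[i] = (a[i] + b[i]) * 2.0;
    return r;
}
```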
- research-article, September 2016
Bridging the Semantic Gaps of GPU Acceleration for Scale-out CNN-based Big Data Processing: Think Big, See Small
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 315–326, https://doi.org/10.1145/2967938.2967944
Convolutional Neural Networks (CNNs) have substantially advanced the state-of-the-art accuracies of object recognition, which is the core function of a myriad of modern multimedia processing techniques such as image/video processing, speech recognition, ...
- research-article, September 2016
Sparso: Context-driven Optimizations of Sparse Linear Algebra
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, Pages 247–259, https://doi.org/10.1145/2967938.2967943
The sparse matrix is a key data structure in various domains such as high-performance computing, machine learning, and graph analytics. To maximize performance of sparse matrix operations, it is especially important to optimize across the operations and ...
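As a point of reference for the abstract above, here is a minimal sparse matrix-vector multiply in CSR format (illustrative only, not Sparso's implementation): the kind of kernel whose performance depends on context such as the matrix's sparsity structure, which context-driven approaches exploit across a sequence of operations.

```cpp
#include <cstddef>
#include <vector>

// Compressed Sparse Row storage: only nonzeros are kept, addressed per row.
struct CsrMatrix {
    std::vector<int>    row_ptr;  // size nrows + 1; start of each row's nonzeros
    std::vector<int>    col_idx;  // column index of each nonzero
    std::vector<double> val;      // value of each nonzero
};

// y = A * x for a CSR matrix A; x is indexed indirectly through col_idx.
std::vector<double> spmv(const CsrMatrix& A, const std::vector<double>& x) {
    std::vector<double> y(A.row_ptr.size() - 1, 0.0);
    for (std::size_t row = 0; row + 1 < A.row_ptr.size(); ++row)
        for (int k = A.row_ptr[row]; k < A.row_ptr[row + 1]; ++k)
            y[row] += A.val[k] * x[A.col_idx[k]];
    return y;
}
```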
- proceeding, September 2016
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation
The International Conference on Parallel Architectures and Compilation Techniques (PACT) started as a Data Flow Workshop held in conjunction with ISCA 1989 in Israel and quickly evolved into a unique venue at the intersection of parallel architecture ...