Proceedings of the 2009 18th International Conference on Parallel Architectures and Compilation Techniques

Transactional memory is being advanced as an alternative to traditional lock-based synchronization for concurrent programming. Transactional memory simplifies the programming model and maximizes concurrency. At the same time, transactions can suffer ...

Total Citations: 32

Anaphase: A Fine-Grain Thread Decomposition Scheme for Speculative Multithreading

Carlos Madriles, Pedro Lopez, Josep Maria Codina, Enric Gibert, Fernando Latorre, Alejandro Martinez, Raul Martinez, Antonio Gonzalez

Pages 15–25 https://doi.org/10.1109/PACT.2009.27

Industry is moving towards multi-core designs as we have hit the memory and power walls. Multi-core designs are very effective to exploit thread-level parallelism (TLP) but do not provide benefits when executing serial code (applications with low TLP, ...

Total Citations: 3

Characterizing the TLB Behavior of Emerging Parallel Workloads on Chip Multiprocessors

Abhishek Bhattacharjee, Margaret Martonosi

Pages 29–40 https://doi.org/10.1109/PACT.2009.26

Translation Lookaside Buffers (TLBs) are a staple in modern computer systems and have a significant impact on overall system performance. Numerous prior studies have addressed TLB designs to lower access times and miss rates; these, however, have been ...

Total Citations: 42

Interprocedural Load Elimination for Dynamic Optimization of Parallel Programs

Rajkishore Barik, Vivek Sarkar

Pages 41–52 https://doi.org/10.1109/PACT.2009.32

Load elimination is a classical compiler transformation that is increasing in importance for multi-core and many-core architectures. The effect of the transformation is to replace a memory access, such as a read of an object field or an array element, ...

Total Citations: 19

Quantifying the Potential of Program Analysis Peripherals

Mohit Tiwari, Shashidhar Mysore, Timothy Sherwood

Pages 53–63 https://doi.org/10.1109/PACT.2009.38

Tools such as multi-threaded data race detectors, memory bounds checkers, dynamic type analyzers, data flight recorders, and various performance profilers are becoming increasingly vital aids to software developers. Rather than performing all the ...

Total Citations: 7

Algorithmic Skeletons within an Embedded Domain Specific Language for the CELL Processor

Tarik Saidani, Joel Falcou, Claude Tadonki, Lionel Lacassagne, Daniel Etiemble

Pages 67–76 https://doi.org/10.1109/PACT.2009.21

Efficiently using the hardware capabilities of the Cell processor, a heterogeneous chip multiprocessor that uses several levels of parallelism to deliver high performance, and being able to reuse legacy code are real challenges for application ...

Total Citations: 2

A Task-Centric Memory Model for Scalable Accelerator Architectures

John H. Kelm, Daniel R. Johnson, Steven S. Lumetta, Matthew I. Frank, Sanjay J. Patel

Pages 77–87 https://doi.org/10.1109/PACT.2009.16

This paper presents a task-centric memory model for 1000-core compute accelerators. Visual computing applications are emerging as an important class of workloads that can exploit 1000-core processors. In these workloads, we observe data sharing and ...

Total Citations: 4

SHIP: Scalable Hierarchical Power Control for Large-Scale Data Centers

Xiaorui Wang, Ming Chen, Charles Lefurgy, Tom W. Keller

Pages 91–100 https://doi.org/10.1109/PACT.2009.34

In today's data centers, precisely controlling server power consumption is an essential way to avoid system failures caused by power capacity overload or overheating due to increasingly high server density. While various power control strategies have ...

Total Citations: 19

Exploring Phase Change Memory and 3D Die-Stacking for Power/Thermal Friendly, Fast and Durable Memory Architectures

Wangyuan Zhang, Tao Li

Pages 101–112 https://doi.org/10.1109/PACT.2009.30

Emerging three-dimensional (3D) integration technology allows for the direct placement of DRAM on top of a microprocessor, significantly reducing the wire-delay between the two and thereby alleviating memory latency and bandwidth constraints. However, ...

Total Citations: 73

Core-Selectability in Chip Multiprocessors

Hashem Hashemi Najaf-abadi, Niket Kumar Choudhary, Eric Rotenberg

Pages 113–122 https://doi.org/10.1109/PACT.2009.44

The centralized structures necessary for the extraction of instruction-level parallelism (ILP) are consuming progressively smaller portions of the total die area of chip multiprocessors (CMP). The reason for this is that scaling these structures does ...

Total Citations: 9

Chainsaw: Using Binary Matching for Relative Instruction Mix Comparison

Tipp Moseley, Dirk Grunwald, Ramesh Peri

Pages 125–135 https://doi.org/10.1109/PACT.2009.12

With advances in hardware, instruction set architectures are undergoing continual evolution. As a result, compilers are under constant pressure to adapt and take full advantage of available features. However, current techniques for evaluating relative ...

Total Citations: 1

tm_db: A Generic Debugging Library for Transactional Programs

Maurice Herlihy, Yossi Lev

Pages 136–145 https://doi.org/10.1109/PACT.2009.23

Transactional Memory (TM) has received a lot of attention as a programming API for concurrent programs on emerging multicore architectures. If the transactional programming model is to realize its promise of simplifying the problem of writing correct and ...

Total Citations: 7

StealthTest: Low Overhead Online Software Testing Using Transactional Memory

Jayaram Bobba, Weiwei Xiong, Luke Yen, Mark D. Hill, David A. Wood

Pages 146–155 https://doi.org/10.1109/PACT.2009.15

Software testing is hard. The emergence of multicore architectures and the proliferation of bug-prone multithreaded software make testing even harder. To this end, researchers have proposed methods to continue testing software after deployment, e.g., in ...

Total Citations: 2

CPROB: Checkpoint Processing with Opportunistic Minimal Recovery

Andrew Hilton, Neeraj Eswaran, Amir Roth

Pages 159–168 https://doi.org/10.1109/PACT.2009.42

CPR (Checkpoint Processing and Recovery) is a physical register management scheme that supports a larger instruction window and higher average IPC than conventional ROB-style register management. It does so by restricting mis-speculation recovery to ...

Total Citations: 3

Architecture Support for Improving Bulk Memory Copying and Initialization Performance

Xiaowei Jiang, Yan Solihin, Li Zhao, Ravishankar Iyer

Pages 169–180 https://doi.org/10.1109/PACT.2009.31

Bulk memory copying and initialization is one of the most ubiquitous operations performed in current computer systems by both user applications and Operating Systems. While many current systems rely on a loop of loads and stores, there are proposals to ...

Total Citations: 15

Oblivious Routing in On-Chip Bandwidth-Adaptive Networks

Myong Hyon Cho, Mieszko Lis, Keun Sup Shim, Michel Kinsy, Tina Wen, Srinivas Devadas

Pages 181–190 https://doi.org/10.1109/PACT.2009.41

Oblivious routing can be implemented on simple router hardware, but network performance suffers when routes become congested. Adaptive routing attempts to avoid hot spots by re-routing flows, but requires more complex hardware to determine and configure ...

Total Citations: 7

Exploiting Parallelism with Dependence-Aware Scheduling

Xiaotong Zhuang, Alexandre E. Eichenberger, Yangchun Luo, Kevin O'Brien, Kathryn O'Brien

Pages 193–202 https://doi.org/10.1109/PACT.2009.10

It is well known that a large fraction of applications cannot be parallelized at compile time due to unpredictable data dependences such as indirect memory accesses and/or memory accesses guarded by data-dependent conditional statements. A significant ...

Total Citations: 19

ITCA: Inter-task Conflict-Aware CPU Accounting for CMPs

Carlos Luque, Miquel Moreto, Francisco J. Cazorla, Roberto Gioiosa, Alper Buyuktosunoglu, Mateo Valero

Pages 203–213 https://doi.org/10.1109/PACT.2009.33

Chip-MultiProcessor (CMP) architectures are becoming more and more popular as an alternative to the traditional processors that only extract instruction-level parallelism from an application. CMPs introduce complexities when accounting CPU utilization. ...

Total Citations: 5

Flextream: Adaptive Compilation of Streaming Applications for Heterogeneous Architectures

Amir H. Hormati, Yoonseo Choi, Manjunath Kudlur, Rodric Rabbah, Trevor Mudge, Scott Mahlke

Pages 214–223 https://doi.org/10.1109/PACT.2009.39

Increasing demand for performance and efficiency has driven the computer industry toward multicore systems. These systems have become the industry standard in almost all segments of the computer market from high-end servers to handheld devices. In order ...

Total Citations: 45

DDCache: Decoupled and Delegable Cache Data and Metadata

Hemayet Hossain, Sandhya Dwarkadas, Michael C. Huang

Pages 227–236 https://doi.org/10.1109/PACT.2009.24

In order to harness the full compute power of many-core processors, future designs must focus on effective utilization of on-chip cache and bandwidth resources. In this paper, we address the dual goals of (1) reducing on-chip communication overheads and ...

Total Citations: 3

Zero-Value Caches: Cancelling Loads that Return Zero

Mafijul Md. Islam, Per Stenstrom

Pages 237–245 https://doi.org/10.1109/PACT.2009.29

The speed gap between processor and memory continues to limit performance. To address this problem, we explore the potential of eliminating Zero Loads — loads accessing memory locations that contain the value “zero” — to improve performance and energy ...

Total Citations: 17

Soft-OLP: Improving Hardware Cache Performance through Software-Controlled Object-Level Partitioning

Qingda Lu, Jiang Lin, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan

Pages 246–257 https://doi.org/10.1109/PACT.2009.35

Performance degradation of memory-intensive programs caused by the LRU policy's inability to handle weak-locality data accesses in the last level cache is increasingly serious for two reasons. First, the last-level cache remains in the CPU's critical ...

Total Citations: 34

Acceptance Rates

Overall Acceptance Rate 121 of 471 submissions, 26%

Year	Submitted	Accepted	Rate
PACT '16	119	31	26%
PACT '14	144	54	38%
PACT '13	208	36	17%
Overall	471	121	26%
