Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction

Fast Template-Based Code Generation for MLIR

Florian Drescher,
Alexis Engelke

Pages 1–12https://doi.org/10.1145/3640537.3641567

Fast compilation is essential for JIT-compilation use cases like dynamic languages or databases as well as development productivity when compiling static languages. Template-based compilation allows fast compilation times, but in existing approaches, ...

- 0
- 782
Metrics
Total Citations0
Total Downloads782
Last 12 Months782
Last 6 weeks97

Abstract
View online with eReader
PDF

research-article

A Unified Memory Dependency Framework for Speculative High-Level Synthesis

Jean-Michel Gorius,
Simon Rokicki,
Steven Derrien

Pages 13–25https://doi.org/10.1145/3640537.3641581

Heterogeneous hardware platforms that leverage application-specific hardware accelerators are becoming increasingly popular as the demand for high-performance compute intensive applications rises. The design of such high-performance hardware accelerators ...

- 0
- 182
Metrics
Total Citations0
Total Downloads182
Last 12 Months182
Last 6 weeks7

Abstract
Get Access

SESSION: Static and Dynamic Analysis

research-article

If-Convert as Early as You Must

Dorit Nuzman,
Ayal Zaks,
Ziv Ben-Zion

Pages 26–38https://doi.org/10.1145/3640537.3641562

Optimizing compilers employ a rich set of transformations that generate highly efficient code for a variety of source languages and target architectures. These transformations typically operate on general control flow constructs which trigger a range of ...

- 0
- 243
Metrics
Total Citations0
Total Downloads243
Last 12 Months243
Last 6 weeks16

Abstract
Get Access

research-article

Open Access

Paguroidea: Fused Parser Generator with Transparent Semantic Actions

Yifan Zhu,
Quartic Cat,
Boluo Ge,
Shaotong Sun

Pages 39–48https://doi.org/10.1145/3640537.3641563

Parser generators have long been a savior for programmers, liberating them from the daunting task of crafting correct and maintainable parsers. Yet, this much-needed simplicity often comes at the expense of efficiency.

We present, Paguroidea, a parser ...

- 0
- 372
Metrics
Total Citations0
Total Downloads372
Last 12 Months372
Last 6 weeks37

Abstract
View online with eReader
PDF

research-article

Region-Based Data Layout via Data Reuse Analysis

Caio Salvador Rohwedder,
João P. L. De Carvalho,
José Nelson Amaral

Pages 49–59https://doi.org/10.1145/3640537.3641571

Data-structure splicing techniques, such as structure splitting, field reordering, and pointer inlining reorganize data structures to improve cache and translation look-aside buffer (TLB) utilization. Structure types are typically transformed globally in ...

- 0
- 258
Metrics
Total Citations0
Total Downloads258
Last 12 Months258
Last 6 weeks9

Abstract
Get Access

research-article

Artifacts Evaluated & Functional / v1.1

A Context-Sensitive Pointer Analysis Framework for Rust and Its Application to Call Graph Construction

Wei Li,
Dongjie He,
Yujiang Gui,
Wenguang Chen,
Jingling Xue

Pages 60–72https://doi.org/10.1145/3640537.3641574

Existing program analysis tools for Rust lack the ability to effectively detect security vulnerabilities due to the absence of an accurate call graph and precise points-to information. We present Rupta, the first context-sensitive pointer analysis ...

- 2
- 561
Metrics
Total Citations2
Total Downloads561
Last 12 Months561
Last 6 weeks26

Abstract
Get Access

research-article

Open Access

CoSense: Compiler Optimizations using Sensor Technical Specifications

Pei Mu,
Nikolaos Mavrogeorgis,
Christos Vasiladiotis,
Vasileios Tsoutsouras,
Orestis Kaparounakis,
Phillip Stanley-Marbell,
Antonio Barbalace

Pages 73–85https://doi.org/10.1145/3640537.3641576

Embedded systems are ubiquitous, but in order to maximize their lifetime on batteries there is a need for faster code execution – i.e., higher energy efficiency, and for reduced memory usage. The large number of sensors integrated into embedded systems ...

- 0
- 365
Metrics
Total Citations0
Total Downloads365
Last 12 Months365
Last 6 weeks59

Abstract
View online with eReader
PDF

SESSION: Runtime Techniques

research-article

Open Access

UNIFICO: Thread Migration in Heterogeneous-ISA CPUs without State Transformation

Nikolaos Mavrogeorgis,
Christos Vasiladiotis,
Pei Mu,
Amir Khordadi,
Björn Franke,
Antonio Barbalace

Pages 86–99https://doi.org/10.1145/3640537.3641565

Heterogeneous-ISA processor designs have attracted considerable research interest. However, unlike their homogeneous-ISA counterparts, explicit software support for bridging ISA heterogeneity is required. The lack of a compilation toolchain ready to ...

- 0
- 449
Metrics
Total Citations0
Total Downloads449
Last 12 Months449
Last 6 weeks60

Abstract
View online with eReader
PDF

research-article

Open Access

BLQ: Light-Weight Locality-Aware Runtime for Blocking-Less Queuing

Qinzhe Wu,
Ruihao Li,
Jonathan Beard,
Lizy John

Pages 100–112https://doi.org/10.1145/3640537.3641568

Message queues are used widely in parallel processing systems for worker thread synchronization. When there is a throughput mismatch between the upstream and downstream tasks, the message queue buffer will often exist as either empty or full. Polling on ...

- 0
- 285
Metrics
Total Citations0
Total Downloads285
Last 12 Months285
Last 6 weeks51

Abstract
View online with eReader
PDF

SESSION: Debugging, Profiling, and Parallelism

research-article

Open Access

APPy: Annotated Parallelism for Python on GPUs

Tong Zhou,
Jun Shirako,
Vivek Sarkar

Pages 113–125https://doi.org/10.1145/3640537.3641575

GPUs are increasingly being used used to speed up Python applications in the scientific computing and machine learning domains. Currently, the two common approaches to leveraging GPU acceleration in Python are 1) create a custom native GPU kernel, and ...

- 1
- 815
Metrics
Total Citations1
Total Downloads815
Last 12 Months815
Last 6 weeks66
- 1
Supplementary Material
Auxiliary Archive

Abstract
View online with eReader
PDF

research-article

Accurate Coverage Metrics for Compiler-Generated Debugging Information

J. Ryan Stinnett,
Stephen Kell

Pages 126–136https://doi.org/10.1145/3640537.3641578

Many debugging tools rely on compiler-produced metadata to present a source-language view of program states, such as variable values and source line numbers. While this tends to work for unoptimised programs, current compilers often generate only partial ...

- 0
- 138
Metrics
Total Citations0
Total Downloads138
Last 12 Months138
Last 6 weeks1

Abstract
Get Access

research-article

Open Access

FlowProf: Profiling Multi-threaded Programs using Information-Flow

Ahamed Al Nahian,
Brian Demsky

Pages 137–149https://doi.org/10.1145/3640537.3641577

Amdahl's law implies that even small sequential bottlenecks can seriously limit the scalability of multi-threaded programs. To achieve scalability, developers must painstakingly identify sequential bottlenecks in their program and eliminate these ...

- 0
- 407
Metrics
Total Citations0
Total Downloads407
Last 12 Months407
Last 6 weeks59

Abstract
View online with eReader
PDF

research-article

Reducing the Overhead of Exact Profiling by Reusing Affine Variables

Leon Frenot,
Fernando Magno Quintão Pereira

Pages 150–161https://doi.org/10.1145/3640537.3641569

An exact profiler inserts counters in a program to record how many times each edge of that program's control-flow graph has been traversed during an execution of it. It is common practice to instrument only edges in the complement of a minimum spanning ...

- 0
- 121
Metrics
Total Citations0
Total Downloads121
Last 12 Months121
Last 6 weeks8

Abstract
Get Access

research-article

Open Access

Stale Profile Matching

Amir Ayupov,
Maksim Panchenko,
Sergey Pupyrev

Pages 162–173https://doi.org/10.1145/3640537.3641573

Profile-guided optimizations rely on profile data for directing compilers to generate optimized code. To achieve the maximum performance boost, profile data needs to be collected on the same version of the binary that is being optimized. In practice ...

- 0
- 406
Metrics
Total Citations0
Total Downloads406
Last 12 Months406
Last 6 weeks42

Abstract
View online with eReader
PDF

SESSION: Safety and Correctness

research-article

From Low-Level Fault Modeling (of a Pipeline Attack) to a Proven Hardening Scheme

Sébastien Michelland,
Christophe Deleuze,
Laure Gonnord

Pages 174–185https://doi.org/10.1145/3640537.3641570

Fault attacks present unique safety and security challenges that require dedicated countermeasures, even for bug-free programs. Models of these complex attacks are made workable by approximating their effects to a suitable level of abstraction. The ...

- 1
- 112
Metrics
Total Citations1
Total Downloads112
Last 12 Months112
Last 6 weeks4

Abstract
Get Access

research-article

Open Access

Artifacts Evaluated & Functional / v1.1

Clog: A Declarative Language for C Static Code Checkers

Alexandru Dura,
Christoph Reichenbach

Pages 186–197https://doi.org/10.1145/3640537.3641579

We present Clog, a declarative language for describing static code checkers for C. Unlike other extensible state-of-the-art checker frameworks, Clog enables powerful interprocedural checkers without exposing the underlying program representation: Clog ...

- 0
- 575
Metrics
Total Citations0
Total Downloads575
Last 12 Months575
Last 6 weeks50

Abstract
View online with eReader
PDF

SESSION: Compilers and Machine Learning

research-article

Compiler-Based Memory Encryption for Machine Learning on Commodity Low-Power Devices

Kiwan Maeng,
Brandon Lucia

Pages 198–211https://doi.org/10.1145/3640537.3641564

Running machine learning (ML) on low-power IoT devices exposes unique security concerns. Attackers can easily steal or manipulate sensitive user data or proprietary ML models from the devices’ off-chip memory by leveraging their simple hardware structure ...

- 0
- 222
Metrics
Total Citations0
Total Downloads222
Last 12 Months222
Last 6 weeks8

Abstract
Get Access

research-article

Open Access

YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs

Cyrus Zhou,
Zack Hassman,
Dhirpal Shah,
Vaughn Richard,
Yanjing Li

Pages 212–226https://doi.org/10.1145/3640537.3641566

We address the challenges associated with deploying neural networks on CPUs, with a particular focus on minimizing inference time while maintaining accuracy. Our novel approach is to use the dataflow (i.e., computation order) of a neural network to ...

- 0
- 2,802
Metrics
Total Citations0
Total Downloads2,802
Last 12 Months2,802
Last 6 weeks448

Abstract
View online with eReader
PDF

research-article

Fast and Accurate Context-Aware Basic Block Timing Prediction using Transformers

Abderaouf Nassim Amalou,
Elisa Fromont,
Isabelle Puaut

Pages 227–237https://doi.org/10.1145/3640537.3641572

This paper introduces ORXESTRA, a context-aware execution time prediction model based on Transformers XL, specifically designed to accurately estimate performance in embedded system applications. Unlike traditional machine learning models that often ...

- 0
- 184
Metrics
Total Citations0
Total Downloads184
Last 12 Months184
Last 6 weeks7

Abstract
Get Access

research-article

Open Access