Proceeding Downloads
Julia Cloud Matrix Machine: Dynamic Matrix Language Acceleration on Multicore Clusters in the Cloud
- Jay Hwan Lee,
- Yeonsoo Kim,
- Yonghyun Ryu,
- Wasuwee Sodsong,
- Hyunjun Jeon,
- Jinsik Park,
- Bernd Burgstaller,
- Bernhard Scholz
Matrix computations of increasing size and complexity are widely used in scientific computing and engineering. However, current matrix language implementations lack programmer support to effectively and seamlessly utilize cloud computing resources. We ...
Distributed Cell Set: A Library for Space-Dependent Communication/Computation Overlap on Manycore Cluster
The increase in the number of cores available in modern processors makes it important for implementations to maximize core utilization within a node by overlapping communication and computation. However, when the dependencies between communication and ...
Towards Maximum Throughput of Dataflow Software Pipeline under Resource Constraints
This work proposes a novel algorithm and Integer Linear Programming (ILP) formulation to optimize the pipelined code mapping of a dataflow graph under a given budget generated by optimizing compilers. The goal of this optimization technique is to ...
Studying the expressiveness and performance of parallelization abstractions for linear pipelines
Semi-automatic parallelization provides abstractions that simplify the programming effort and allow the user to make decisions that cannot be made by tools. However, abstractions for general-purpose systems usually do not carry sufficient knowledge ...
Harmonic CUDA: Asynchronous Programming on GPUs
We introduce Harmonic CUDA, a dataflow programming model for GPUs that allows programmers to describe algorithms as a dependency graph of producers and consumers where data flows continuously through the graph for the duration of the kernel. This ...
MPI-based Remote OpenMP Offloading: A More Efficient and Easy-to-use Implementation
MPI+X is the most popular hybrid programming model for distributed computation on modern heterogeneous HPC systems. Nonetheless, for simplicity, HPC developers ideally would like to implement multi-node distributed parallel computing through a single ...
Exploring OpenMP GPU Offloading for Implementing Convolutional Neural Networks
Computing on heterogeneous architectures involving CPUs and accelerators is now a popular choice for parallel computing. As a directive-based programming model, OpenMP has become increasingly comprehensive, supporting a large variety of hardware ...