DOI: 10.1145/3582514
PMAM'23: Proceedings of the 14th International Workshop on Programming Models and Applications for Multicores and Manycores
ACM 2023 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
PMAM'23: 14th International Workshop on Programming Models and Applications for Multicores and Manycores, Montreal, QC, Canada, 25 February 2023 - 1 March 2023
ISBN:
979-8-4007-0115-3
Published:
25 February 2023

Abstract

No abstract available.

research-article
Julia Cloud Matrix Machine: Dynamic Matrix Language Acceleration on Multicore Clusters in the Cloud

Matrix computations are widely used in increasing sizes and complexity in scientific computing and engineering. But current matrix language implementations lack programmer support to effectively and seamlessly utilize cloud computing resources. We ...

research-article
Distributed Cell Set : A Library for Space-Dependent Communication/Computation Overlap on Manycore Cluster

The increase in the number of cores available in modern processors makes it important for implementations to maximize their use within a node by overlapping communication and computation. However, when the dependencies between communication and ...

research-article
Public Access
Towards Maximum Throughput of Dataflow Software Pipeline under Resource Constraints

This work proposes a novel algorithm and Integer Linear Programming (ILP) formulation to optimize the pipelined code mapping of dataflow graph under a given budget generated by optimizing compilers. The goal of this optimization technique is to ...

research-article
Studying the expressiveness and performance of parallelization abstractions for linear pipelines

Semi-automatic parallelization provides abstractions that simplify the programming effort and allow the user to make decisions that cannot be made by tools. However, abstractions for general-purpose systems usually do not carry sufficient knowledge ...

research-article
Open Access
Harmonic CUDA: Asynchronous Programming on GPUs

We introduce Harmonic CUDA, a dataflow programming model for GPUs that allows programmers to describe algorithms as a dependency graph of producers and consumers where data flows continuously through the graph for the duration of the kernel. This ...

research-article
MPI-based Remote OpenMP Offloading: A More Efficient and Easy-to-use Implementation

MPI+X is the most popular hybrid programming model for distributed computation on modern heterogeneous HPC systems. Nonetheless, for simplicity, HPC developers ideally would like to implement multi-node distributed parallel computing through a single ...

research-article
Public Access
Exploring OpenMP GPU Offloading for Implementing Convolutional Neural Networks

Computing on heterogeneous architectures involving CPUs and accelerators is now a popular choice for parallel computing. As a directive-based programming model, OpenMP has become increasingly comprehensive, supporting a large variety of hardware ...

Contributors
  • Shanghai Jiao Tong University
  • University of Otago
  • Meta

Acceptance Rates

Overall Acceptance Rate 53 of 97 submissions, 55%
Year       Submitted   Accepted   Rate
PMAM'20    15          8          53%
PMAM'19    17          10         59%
PMAM'18    17          9          53%
PMAM'17    14          7          50%
PMAM'15    34          19         56%
Overall    97          53         55%