Proceedings of the 4th International Workshop on OpenCL

SESSION: Full papers

research-article

Automatic Test Case Reduction for OpenCL

Article No.: 1, Pages 1–12, https://doi.org/10.1145/2909437.2909439

We report on an extension to the C-Reduce tool, for automatic reduction of C test cases, to handle OpenCL kernels. This enables an automated method for detecting bugs in OpenCL compilers, by generating large random kernels using the CLsmith generator, ...

research-article

The Hitchhiker's Guide to Cross-Platform OpenCL Application Development

Article No.: 2, Pages 1–12, https://doi.org/10.1145/2909437.2909440

One of the benefits to programming of OpenCL is platform portability. That is, an OpenCL program that follows the OpenCL specification should, in principle, execute reliably on any platform that supports OpenCL. To assess the current state of OpenCL ...

SESSION: Technical presentations

extended-abstract

OpenCL-Based Mobile GPGPU Benchmarking: Methods and Challenges

Article No.: 3, Pages 1–4, https://doi.org/10.1145/2909437.2909441

Benchmarking general-purpose computing on graphics processing unit (GPGPU) aims to profile and compare performance across different devices. Due to the low-level nature of most GPGPU APIs, GPGPU benchmarks are also useful for architectural exploration ...

extended-abstract

OpenCL Compiler Tools for FPGAs

Dmitry Denisenko

Article No.: 4, Pages 1–3, https://doi.org/10.1145/2909437.2909445

Compiling OpenCL kernels to FPGAs presents a new set of usability challenges. Many OpenCL users are not hardware experts but are creating state-of-the-art hardware with the help of OpenCL compilers for FPGAs. To get great performance the compiler has to ...

extended-abstract

Optimizing OpenCL applications on Xilinx FPGA

Article No.: 5, Pages 1–2, https://doi.org/10.1145/2909437.2909447

In this presentation we focus on current Xilinx FPGA (Field-Programmable Gate Array) platforms with the SDAccel OpenCL environment. FPGAs have the unique feature of a reconfigurable architecture, in contrast to CPUs, GPUs or DSPs, which have a fixed ...

extended-abstract

VisionCPP: A SYCL-based Computer Vision Framework

Mehdi Goli

Article No.: 6, Pages 1–4, https://doi.org/10.1145/2909437.2909444

Using computer vision techniques for system-on-chip (SoC) technologies raises performance portability and stringent memory and communication issues for vision applications.

Although high-level libraries like OpenCV abstract both the system-level and ...

extended-abstract

clSPARSE: A Vendor-Optimized Open-Source Sparse BLAS Library

Article No.: 7, Pages 1–4, https://doi.org/10.1145/2909437.2909442

Sparse linear algebra is a cornerstone of modern computational science. These algorithms ignore the zero-valued entries found in many domains in order to work on much larger problems at much faster rates than dense algorithms. Nonetheless, optimizing ...

extended-abstract

OpenCL caffe: Accelerating and enabling a cross platform machine learning framework

Article No.: 8, Pages 1–5, https://doi.org/10.1145/2909437.2909443

Deep neural networks (DNN) achieved a significant breakthrough in vision recognition in 2012 and quickly became the leading machine learning algorithm in Big Data based large scale object recognition applications. The successful deployment of DNN based ...

extended-abstract

Intel® Threading Building Block (Intel® TBB) flow graph as a software infrastructure layer for OpenCL™-based computations

Article No.: 9, Pages 1–3, https://doi.org/10.1145/2909437.2909446

Modern computing systems are becoming heterogeneous with a variety of programmable units: CPU, GPU, FPGA, domain-specific accelerators, etc. OpenCL™ API is a cross-platform programming model for a wide range of computing devices, but using the language ...

extended-abstract

Optimizing convolutional neural networks on embedded platforms with OpenCL

Article No.: 10, Pages 1–4, https://doi.org/10.1145/2909437.2909449

We invite the community to collaboratively design and optimize convolutional neural networks to meet the performance, accuracy and cost requirements for deployment on a range of form factors -- from sensors to self-driving cars.

extended-abstract

GPU Daemon: Road to zero cost submission

Article No.: 11, Pages 1–4, https://doi.org/10.1145/2909437.2909450

In this paper we present a novel approach of utilizing new features of OpenCL 2.0: Fine-Grained SVM and device-side enqueue that allow completely new usage models and application paradigms. We present the idea of a GPU (Graphics Processing Unit) daemon ...

extended-abstract

OpenCL™ FFT Optimizations for Intel® Processor Graphics

Article No.: 12, Pages 1–4, https://doi.org/10.1145/2909437.2909451

In this paper, we explore a number of OpenCL™ optimization strategies and show the pros and cons relative to clFFT, the leading OpenCL Fast Fourier Transform (FFT) library. We implemented a 1D, multi-kernel, mixed-radix Cooley-Tukey power of two ...

extended-abstract

The OpenCL Library Ecosystem: Current Status and Future Perspectives

Karl Rupp

Article No.: 13, Pages 1–2, https://doi.org/10.1145/2909437.2909452

OpenCL as an open standard for parallel programming of heterogeneous systems seems to be an attractive choice for software library implementations. Indeed, iwocl.org lists 83 OpenCL-enabled libraries as of February 12, 2016, suggesting a healthy ...

extended-abstract

hiCL: an OpenCL abstraction layer for scientific computing, application to depth imaging on GPU and APU

Article No.: 14, Pages 1–3, https://doi.org/10.1145/2909437.2909453

Hardware accelerators (HWAs), such as Graphics Processing Units (GPUs) have proven their potential to boost scientific applications performance and have been widely embraced by academia and industry. The OpenCL programming model ensures portability on ...

extended-abstract

Boost.Compute: A parallel computing library for C++ based on OpenCL

Jakub Szuppe

Article No.: 15, Pages 1–39, https://doi.org/10.1145/2909437.2909454

Boost.Compute is a powerful C++ header-only template library for parallel computing based on OpenCL. It has a layered architecture and acts both as a thin C++ wrapper over the OpenCL API and as a feature-rich interface to high-level constructs that ...

WORKSHOP SESSION: Workshops

tutorial

C++ for OpenCL Workshop, IWOCL 2016

Article No.: 16, Pages 1–3, https://doi.org/10.1145/2909437.2909464

OpenCL™ is an open, royalty-free standard for heterogeneous parallel programming. As the number of OpenCL™ platforms increases, requests for better programmability and adoption of modern C++ paradigms are growing. The C++ language is ...

POSTER SESSION: Posters

poster

Extending Paralldroid for the Automatic Generation of OpenCL Code

Article No.: 17, Pages 1–3, https://doi.org/10.1145/2909437.2909456

The evolution of many of today's ubiquitous technologies has been possible due to the System on Chip (SoC) technologies. This evolution has triggered an increase of the computing power of hand-held devices, that comes from heterogeneous architectures ...

poster

OpenCL meets Open Source Streaming Analytics

Article No.: 18, Pages 1–3, https://doi.org/10.1145/2909437.2909457

OpenCL is leveraged to build a flexible, scalable streaming analytics platform using FPGA. The end to end solution can demonstrate over a 2x price/performance advantage over the software baseline. All this can be customized by an application developer ...

poster

Towards Interactive Visual Exploration of Parallel Programs using a Domain-Specific Language

Article No.: 19, Pages 1–2, https://doi.org/10.1145/2909437.2909459

The use of GPUs and the massively parallel computing paradigm have become wide-spread. We describe a framework for the interactive visualization and visual analysis of the run-time behavior of massively parallel programs, especially OpenCL kernels. This ...

poster

Benchmarking, autotuning and crowdtuning OpenCL programs using the Collective Knowledge framework

Article No.: 20, Pages 1–2, https://doi.org/10.1145/2909437.2909460

Autotuning is a popular technique to ensure performance portability for important algorithms such as BLAS, FFT and DNN across the ever evolving software and hardware stack. Unfortunately, when performed on a single machine, autotuning can explore only a ...

poster

Runtime comparison solving Gray-Scott equation on different OpenCL devices

Michael Quell

Article No.: 21, Pages 1–3, https://doi.org/10.1145/2909437.2909461

An example of a reaction-diffusion equation with chaotic solutions. You can expect patterns to emerge from chaos. A uniform discretization in space and periodic boundary conditions allow the Fast Fourier Transform to be used, so that when coupled with ...

poster

C++ Classes and Templates for OpenCL Kernels with PATOS

Article No.: 22, Pages 1–3, https://doi.org/10.1145/2909437.2909462

We present PATOS, a CLANG-based source-to-source compiler to extend the OpenCL kernel language with C++ classes and template types for classes and functions. The generated code is standard conforming OpenCL-C which is usable with unmodified OpenCL ...

Acceptance Rates

Year	Submitted	Accepted	Rate
IWOCL '20	30	21	70%
IWOCL '19	33	13	39%
IWOCL '18	33	16	48%
IWOCL '17	29	15	52%
IWOCL '15	27	19	70%
Overall	152	84	55%
