The International Workshop for OpenCL (IWOCL, which is pronounced "eye-wok-ul") was conceived in a meeting between Simon McIntosh-Smith and Ben Bergen at the Los Alamos National Laboratory on May 8th 2012. McIntosh-Smith and Bergen lamented that there were no organized workshops or meetings for the rapidly growing OpenCL community. After testing this idea with colleagues over the next few months, they decided to create the kind of OpenCL conference they wanted to go to themselves, and thus IWOCL was born.
Proceeding Downloads
clMAGMA: high performance dense linear algebra with OpenCL
This paper presents the design and implementation of several fundamental dense linear algebra (DLA) algorithms in OpenCL. In particular, these are linear system solvers and eigenvalue problem solvers. Further, we give an overview of the clMAGMA library, ...
Evaluation of a performance portable lattice Boltzmann code using OpenCL
With the advent of many-core computer architectures such as GPGPUs from NVIDIA and AMD, and more recently Intel's Xeon Phi, ensuring performance portability of HPC codes is potentially becoming more complex. In this work we have focused on one important ...
Porting a commercial application to OpenCL: a case study
The use of virtual screening to find new drug hits and leads has become commonplace within the pharmaceutical industry. 2D methods have largely been replaced by 3D ligand-based methods and by structure-based methods (docking) where a reliable protein ...
Gzip on a chip: high performance lossless data compression on FPGAs using OpenCL
Hardware implementation of lossless data compression is important for optimizing the capacity/cost/power of storage devices in data centers, as well as communication channels in high-speed networks. In this work we use the Open Computing Language (...
Pattern matching in OpenCL: GPU vs CPU energy consumption on two mobile chipsets
Adaptations of the Aho-Corasick (AC) algorithm on high performance graphics processors (also called GPUs) have garnered increasing attention in recent years. However, no results have been reported regarding their implementations on mobile GPUs. In this ...
Accelerating Lagrangian particle dispersion in the atmosphere with OpenCL across multiple platforms
FLEXPART is a popular simulator that models the transport and diffusion of air pollutants, based on the Lagrangian approach. It is capable of regional and global simulation and supports both forward and backward runs. A complex model like this contains ...
KernelInterceptor: automating GPU kernel verification by intercepting kernels and their parameters
GPUVerify is a static analysis tool for verifying that GPU kernels are free from data races and barrier divergence. It is intended as an automatic tool, but its usability is impaired by the fact that the user must explicitly supply the kernel source ...
Performance portability study of linear algebra kernels in OpenCL
The performance portability of OpenCL kernel implementations for common memory bandwidth limited linear algebra operations across different hardware generations of the same vendor as well as across vendors is studied. Certain combinations of kernel ...
Generating OpenCL C kernels from OpenACC
Hardware accelerators are now a common way to improve the performances of compute nodes. This performance improvement has a cost: applications need to be rewritten to take advantage of the new hardware. OpenACC is a set of compiler directives to target ...