Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2909437acmotherconferencesBook PagePublication PagesiwoclConference Proceedingsconference-collections
IWOCL '16: Proceedings of the 4th International Workshop on OpenCL
ACM2016 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
IWOCL '16: The 4th International Workshop on OpenCL Vienna Austria April 19 - 21, 2016
ISBN:
978-1-4503-4338-1
Published:
19 April 2016
In-Cooperation:
The University of Bristol

Reflects downloads up to 09 Nov 2024Bibliometrics
Skip Abstract Section
Abstract

The International Workshop for OpenCL (IWOCL, which is pronounced "eye-wok-ul") was conceived in a meeting between Simon McIntosh-Smith and Ben Bergen at the Los Alamos National Laboratory on May 8th 2012. McIntosh-Smith and Bergen lamented that there were no organized workshops or meetings for the rapidly growing OpenCL community. After testing this idea with colleagues over the next few months, they decided to create the kind of OpenCL conference they wanted to go to themselves, and thus IWOCL was born.

Skip Table Of Content Section
SESSION: Full papers
research-article
Automatic Test Case Reduction for OpenCL
Article No.: 1, Pages 1–12https://doi.org/10.1145/2909437.2909439

We report on an extension to the C-Reduce tool, for automatic reduction of C test cases, to handle OpenCL kernels. This enables an automated method for detecting bugs in OpenCL compilers, by generating large random kernels using the CLsmith generator, ...

research-article
The Hitchhiker's Guide to Cross-Platform OpenCL Application Development
Article No.: 2, Pages 1–12https://doi.org/10.1145/2909437.2909440

One of the benefits to programming of OpenCL is platform portability. That is, an OpenCL program that follows the OpenCL specification should, in principle, execute reliably on any platform that supports OpenCL. To assess the current state of OpenCL ...

SESSION: Technical presentations
extended-abstract
OpenCL-Based Mobile GPGPU Benchmarking: Methods and Challenges
Article No.: 3, Pages 1–4https://doi.org/10.1145/2909437.2909441

Benchmarking general-purpose computing on graphics processing unit (GPGPU) aims to profile and compare performance across different devices. Due to the low-level nature of most GPGPU APIs, GPGPU benchmarks are also useful for architectural exploration ...

extended-abstract
OpenCL Compiler Tools for FPGAs
Article No.: 4, Pages 1–3https://doi.org/10.1145/2909437.2909445

Compiling OpenCL kernels to FPGAs presents a new set of usability challenges. Many OpenCL users are not hardware experts but are creating state-of-the-art hardware with the help of OpenCL compilers for FPGAs. To get great performance the compiler has to ...

extended-abstract
Optimizing OpenCL applications on Xilinx FPGA
Article No.: 5, Pages 1–2https://doi.org/10.1145/2909437.2909447

In this presentation we focus on current Xilinx FPGA (Field-Programmable Gate Array) platforms with the SDAccel OpenCL environment. FPGA have the unique feature of a reconfigurable architecture by opposition to CPU, GPU or DSP which have a fixed ...

extended-abstract
VisionCPP: A SYCL-based Computer Vision Framework
Article No.: 6, Pages 1–4https://doi.org/10.1145/2909437.2909444

Using computer vision techniques for system-on-chip (SoC) technologies raises performance portability and stringent memory and communication issues for vision applications.

Although high-level libraries like OpenCV abstract both the system-level and ...

extended-abstract
clSPARSE: A Vendor-Optimized Open-Source Sparse BLAS Library
Article No.: 7, Pages 1–4https://doi.org/10.1145/2909437.2909442

Sparse linear algebra is a cornerstone of modern computational science. These algorithms ignore the zero-valued entries found in many domains in order to work on much larger problems at much faster rates than dense algorithms. Nonetheless, optimizing ...

extended-abstract
OpenCL caffe: Accelerating and enabling a cross platform machine learning framework
Article No.: 8, Pages 1–5https://doi.org/10.1145/2909437.2909443

Deep neural networks (DNN) achieved significant breakthrough in vision recognition in 2012 and quickly became the leading machine learning algorithm in Big Data based large scale object recognition applications. The successful deployment of DNN based ...

extended-abstract
Intel® Threading Building Block (Intel® TBB) flow graph as a software infrastructure layer for OpenCL™-based computations
Article No.: 9, Pages 1–3https://doi.org/10.1145/2909437.2909446

Modern computing systems are becoming heterogeneous with a variety of programmable units: CPU, GPU, FPGA, domain-specific accelerators, etc. OpenCL™ API is a cross-platform programming model for a wide range of computing devices, but using the language ...

extended-abstract
Optimizing convolutional neural networks on embedded platforms with OpenCL
Article No.: 10, Pages 1–4https://doi.org/10.1145/2909437.2909449

We invite the community to collaboratively design and optimize convolutional neural networks to meet the performance, accuracy and cost requirements for deployment on a range of form factors -- from sensors to self-driving cars.

extended-abstract
GPU Daemon: Road to zero cost submission
Article No.: 11, Pages 1–4https://doi.org/10.1145/2909437.2909450

In this paper we present a novel approach of utilizing new features of OpenCL 2.0: Fine-Grained SVM and device-side enqueue that allow completely new usage models and application paradigms. We present the idea of a GPU (Graphics Processing Unit) daemon ...

extended-abstract
OpenCL™ FFT Optimizations for Intel® Processor Graphics
Article No.: 12, Pages 1–4https://doi.org/10.1145/2909437.2909451

In this paper, we explore a number of OpenCL™ optimization strategies and show the pros and cons relative to clFFT, the leading OpenCL Fast Fourier Transform (FFT) library. We implemented a 1D, multi-kernel, mixed-radix Cooley-Tukey power of two ...

extended-abstract
The OpenCL Library Ecosystem: Current Status and Future Perspectives
Article No.: 13, Pages 1–2https://doi.org/10.1145/2909437.2909452

OpenCL as an open standard for parallel programming of heterogeneous systems seems to be an attractive choice for software library implementations. Indeed, iwocl.org1 lists 83 OpenCL-enabled libraries as of February 12, 2016, suggesting a healthy ...

extended-abstract
hiCL: an OpenCL abstraction layer for scientific computing, application to depth imaging on GPU and APU
Article No.: 14, Pages 1–3https://doi.org/10.1145/2909437.2909453

Hardware accelerators (HWAs), such as Graphics Processing Units (GPUs) have proven their potential to boost scientific applications performance and have been widely embraced by academia and industry. The OpenCL programming model ensures portability on ...

extended-abstract
Boost.Compute: A parallel computing library for C++ based on OpenCL
Article No.: 15, Pages 1–39https://doi.org/10.1145/2909437.2909454

Boost.Compute is a powerful C++ header-only template library for parallel computing based on OpenCL. It has a layered architecture and acts both as a thin C++ wrapper over the OpenCL API and as a feature-rich interface to high-level constructs that ...

WORKSHOP SESSION: Workshops
tutorial
C++ for OpenCL Workshop, IWOCL 2016
Article No.: 16, Pages 1–3https://doi.org/10.1145/2909437.2909464

OpenCL™ is an open, royalty-free standard for heterogenous parallel programming. As the number of OpenCL™ platforms is increasing, the requests for better programmability and adoption of modern C++ paradigms is growing bigger. The C++ language is ...

POSTER SESSION: Posters
poster
Extending Paralldroid for the Automatic Generation of OpenCL Code
Article No.: 17, Pages 1–3https://doi.org/10.1145/2909437.2909456

The evolution of many of today's ubiquitous technologies has been possible due to the System on Chip (SoC) technologies. This evolution has triggered an increase of the computing power of hand-held devices, that comes from heterogeneous architectures ...

poster
OpenCL meets Open Source Streaming Analytics
Article No.: 18, Pages 1–3https://doi.org/10.1145/2909437.2909457

OpenCL is leveraged to build a flexible, scalable streaming analytics platform using FPGA. The end to end solution can demonstrate over a 2x price/performance advantage over the software baseline. All this can be customized by an application developer ...

poster
Towards Interactive Visual Exploration of Parallel Programs using a Domain-Specific Language
Article No.: 19, Pages 1–2https://doi.org/10.1145/2909437.2909459

The use of GPUs and the massively parallel computing paradigm have become wide-spread. We describe a framework for the interactive visualization and visual analysis of the run-time behavior of massively parallel programs, especially OpenCL kernels. This ...

poster
Benchmarking, autotuning and crowdtuning OpenCL programs using the Collective Knowledge framework
Article No.: 20, Pages 1–2https://doi.org/10.1145/2909437.2909460

Autotuning is a popular technique to ensure performance portability for important algorithms such as BLAS, FFT and DNN across the ever evolving software and hardware stack. Unfortunately, when performed on a single machine, autotuning can explore only a ...

poster
Runtime comparison solving Gray-Scott equation on different OpenCL devices
Article No.: 21, Pages 1–3https://doi.org/10.1145/2909437.2909461

n example of a reaction-diffusion equation with chaotic solutions. You can expect patterns to emerge from chaos. A uniformly discretization in space and periodic boundary conditions allows the Fast Fourier Transform to be used, so that when coupled with ...

poster
C++ Classes and Templates for OpenCL Kernels with PATOS
Article No.: 22, Pages 1–3https://doi.org/10.1145/2909437.2909462

We present PATOS, a CLANG-based source-to-source compiler to extend the OpenCL kernel language with C++ classes and template types for classes and functions. The generated code is standard conforming OpenCL-C which is usable with unmodified OpenCL ...

Recommendations

Acceptance Rates

Overall Acceptance Rate 84 of 152 submissions, 55%
YearSubmittedAcceptedRate
IWOCL '20302170%
IWOCL '19331339%
IWOCL '18331648%
IWOCL '17291552%
IWOCL '15271970%
Overall1528455%