CUDA Programming, GPU Computing
176 Followers
Recent papers in CUDA Programming, GPU Computing
Graphics Processing Units (GPUs) are specialized coprocessors that were initially conceived for the purpose of accelerating vector operations, such as graphics rendering. Writing and configuring efficient algorithms for GPU devices is... more
The computational epidemiology is the development and use of computational models that aims to understand the proliferation of diseases of the dynamic point of view. The computational models are capable to simulate the behavior of an... more
CUDA (Compute Unified Device Architecture) is a parallel computing platform developed by Nvidia which provides the ability of using GPUs to run computationally intensive programs. This presentation provides a brief overview of CUDA,... more
Volume rendering is an important area of study in computer graphics, due to its application in areas such as medicine, physic simulations, oil and gas industries, and others. The main used method nowadays for volume rendering is ray... more
This thesis focuses on the development, implementation and optimization of pattern-matching algorithms in two different, yet closely-related research fields: malicious code detection in intrusion detection systems and digital forensics... more
The Graphics Processing Unit (GPU) has become an integral part of mainstream computing. The advancement and evaluation of GPU has lead to a significant performance improvement of many algorithms used in day today life. Being powerful... more
A flow diagram is a graphical presentation of an energy or chemical system with its components and their interconnections through mass and energy streams. An automatic drawing algorithm of flow diagrams has been developed and presented in... more
This paper presents a parallel implementation of the hybrid BiCGStab(2) (bi-conjugate gradient stabilized) iterative method in a GPU (graphics processing unit) for solution of large and sparse linear systems. This implementation uses the... more
This GPU book teaches both CUDA and CPU Parallel Programming using pThreads.
Fluid simulation has recently become possible to do in real-time utilizing modern programmable GPUs. In this paper, a comparison is made between more traditional CFD methods which solve the Navier-Stokes PDEs and the alternative discrete... more
This short paper present a collection of GPU lightweight decompression algorithms implementations within a FOSS library, Giddy – the first to be published to offer such function-ality. As the use of compression is important in... more
Volume rendering is an important area of study in computer graphics, due to its application in areas such as medicine, physic simulations, oil and gas industries, and others. The main used method nowadays for volume rendering is ray... more
Umumnya, Transformasi Fourier Diskrit digunakan untuk mengolah citra dengan mengubah sinyal yang berdomain waktu / spasial ke bentuk sinyal berdomain frekuensi, atau sebaliknya, mengubah sinyal berdomain frekuensi ke sinyal berdomain... more
This work has the goal to study how an efficient deep packet inspection (DPI) algorithm may be implemented using the graphical processing unit (GPU) CUDA (Computer Unified Device Architecture) enabled boards existing in personal... more
Usage of multiple unmanned aerial vehicles (UAV) in a certain mission makes flight route planning more complicated and slower. In order to obtain better performance, in the literature, most of the researchers propose using evolutionary... more
The last years have seen the rise of using GPU’s not only for graph- ics computing, but also for a more global use in parallel computing. This completely new field comes with dedicated GPU’s (GPGPU, Global Purpose GPU) that are not... more
The rapid evolution of CUDA GPU architecture and the new heterogenous platforms that break the hegemony of x86 offer opportunities for performance optimizations, but also pose challenges for scalable heterogeneous parallelization of the... more
Рассмотрен мониторинг широкого частотного диапазона с использованием ДПФ-модулированных банков фильтров. Описаны равнополосные и неравнополосные реализации банков фильтров, включая прямую реализа-цию с полной модуляцией, критически... more
En este artículo se plantea una nueva paralelización del algoritmo de enjambre de partículas (PSO) haciendo uso de múltiples GPUs. Se plantean dos implementaciones basadas en la arquitectura CUDA de NVIDIA y la conexión P2P a través del... more
This paper presents a computational performance comparison between some iterative methods used for linear systems solution. The goal is to show that the use of parallel processing provided by a Graphics Processing Unit (GPU) may be more... more
The availability of Internet, line-of-sight and satellite identification and surveillance information as well as low-power, low-cost embedded systems-on-a-chip and a wide range of visible to long-wave infrared cameras prompted Embry... more
Introduction to CUDA and STREAM.
This paper presents a comparison between two parallel architectures: Compute Unified Device Architecture (CUDA) and Open Computing Language (OpenCL). Some works in the literature have presented a computational performance comparison of... more
Overview CUDA stands for the "Compute Unified Device Architecture", which is a free software platform provided by NVidia. It enables users to control GPUs by writing programs akin to C++. All CUDA software can be downloaded from CUDA... more
This paper proposes a parallel regression formulation to reduce the computational time of variable selection algorithms. The proposed strategy can be used for several forward algorithms in order to select uncorrelated variables that... more