Volume 13, Issue 2, September 2013: Special issue on application-specific processors
Publisher: Association for Computing Machinery, New York, NY, United States
ISSN: 1539-9087
EISSN: 1558-3465
introduction
research-article
Hardware architectural support for control systems and sensor processing
Article No.: 16, Pages 1–25, https://doi.org/10.1145/2514641.2514643

The field of modern control theory and the systems used to implement these controls have shown rapid development over the last 50 years. It was often the case that those developing control algorithms could assume the computing medium was solely ...

research-article
Multicore-based vector coprocessor sharing for performance and energy gains
Article No.: 17, Pages 1–25, https://doi.org/10.1145/2514641.2514644

For most of the applications that make use of a dedicated vector coprocessor, its resources are not highly utilized due to the lack of sustained data parallelism which often occurs due to vector-length variations in dynamic environments. The motivation ...

research-article
A systematic approach for optimized bypass configurations for application-specific embedded processors
Article No.: 18, Pages 1–25, https://doi.org/10.1145/2514641.2514645

The diversity of today's mobile applications requires embedded processor cores with high resource efficiency; that is, the devices should provide high performance at low area requirements and power consumption. The fine-grained parallelism ...

research-article
Custom architecture for multicore audio beamforming systems
Article No.: 19, Pages 1–26, https://doi.org/10.1145/2514641.2514646

The audio beamforming (BF) technique utilizes microphone arrays to extract acoustic sources recorded in a noisy environment. In this article, we propose a new approach for rapid development of multicore BF systems. A survey of the literature reveals that ...

research-article
Design-space exploration and runtime resource management for multicores
Article No.: 20, Pages 1–27, https://doi.org/10.1145/2514641.2514647

Application-specific multicore architectures are usually designed by using a configurable platform in which a set of parameters can be tuned to find the best trade-off in terms of the selected figures of merit (such as energy, delay, and area). This ...

research-article
Memory performance estimation of CUDA programs
Article No.: 21, Pages 1–22, https://doi.org/10.1145/2514641.2514648

CUDA has successfully popularized GPU computing, and GPGPU applications are now used in various embedded systems. The CUDA programming model provides a simple interface to program on GPUs, but tuning GPGPU applications for high performance is still ...

research-article
Parallel architectures for the kNN classifier -- design of soft IP cores and FPGA implementations
Article No.: 22, Pages 1–21, https://doi.org/10.1145/2514641.2514649

We designed a variety of k-nearest-neighbor parallel architectures for FPGAs in the form of parameterizable soft IP cores. We show that they can be used to solve large classification problems with thousands of training vectors, or thousands of vector ...

research-article
Automatic synthesis of physical system differential equation models to a custom network of general processing elements on FPGAs
Article No.: 23, Pages 1–27, https://doi.org/10.1145/2514641.2514650

Fast execution of physical system models has various uses, such as simulating physical phenomena or real-time testing of medical equipment. Physical system models commonly consist of thousands of differential equations. Solving such equations using ...

research-article
LegUp: An open-source high-level synthesis tool for FPGA-based processor/accelerator systems
Article No.: 24, Pages 1–27, https://doi.org/10.1145/2514740

It is generally accepted that a custom hardware implementation of a set of computations will provide superior speed and energy efficiency relative to a software implementation. However, the cost and difficulty of hardware design is often prohibitive, ...

research-article
Efficient compilation of CUDA kernels for high-performance computing on FPGAs
Article No.: 25, Pages 1–26, https://doi.org/10.1145/2514641.2514652

The rise of multicore architectures across all computing domains has opened the door to heterogeneous multiprocessors, where processors of different compute characteristics can be combined to effectively boost the performance per watt of different ...
