Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJuly 2009
Efficient program scheduling for heterogeneous multi-core processors
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 927–930https://doi.org/10.1145/1629911.1630149Heterogeneous multicore processors promise high execution efficiency under diverse workloads, and program scheduling is critical in exploiting this efficiency. This paper presents a novel method to leverage the inherent characteristics of a program for ...
- research-articleJuly 2009
Green data centers and hot chips
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 888–890https://doi.org/10.1145/1629911.1630139Today's computing environment is changing, with growing emphasis on reducing both operational energy costs and the high capital costs to reliably deliver power and cooling to systems in factory-sized data centers. The "greening" of data centers is ...
- research-articleJuly 2009
Sustainable data centers: enabled by supply and demand side management
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 884–887https://doi.org/10.1145/1629911.1630138The environmental impact of data centers is significant and is growing rapidly. Servers alone in the US consumed 1.2% of the nation's energy in 2005, according to the EPA. In the following year, the EPA found that the cost of energy rose by 10%. However,...
- research-articleJuly 2009
Transmuting coprocessors: dynamic loading of FPGA coprocessors
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 848–851https://doi.org/10.1145/1629911.1630127Field-programmable gates arrays (FPGAs) are increasingly used in general-purpose computing platforms to augment microprocessors, enabling runtime loading of coprocessors customized to speed up some applications. Such transmuting coprocessors create new ...
- research-articleJuly 2009
Multiprocessor System-on-Chip designs with active memory processors for higher memory efficiency
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 806–811https://doi.org/10.1145/1629911.1630118Memory access latency and memory-related operations are often the performance bottleneck in parallel applications. In this paper, we present a concept of active memory operations which is an on-chip network transaction that operates based on the ...
- research-articleJuly 2009
Software-assisted hardware reliability: abstracting circuit-level challenges to the software stack
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 788–793https://doi.org/10.1145/1629911.1630114Power constrained designs are becoming increasingly sensitive to supply voltage noise. We propose a hardware-software collaborative approach to enable aggressive operating margins: a checkpoint-recovery mechanism corrects margin violations, while a run-...
- research-articleJuly 2009
Thermal-aware data flow analysis
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 613–614https://doi.org/10.1145/1629911.1630069This paper suggests that the thermal state of a processor can be approximated using data flow analysis. The results of this analysis can be used to evaluate the efficacy of thermal-aware compilation strategies, or as input to thermal-aware optimizations ...
- research-articleJuly 2009
No cache-coherence: a single-cycle ring interconnection for multi-core L1-NUCA sharing on 3D chips
- Shu-Hsuan Chou,
- Chien-Chih Chen,
- Chi-Neng Wen,
- Yi-Chao Chan,
- Tien-Fu Chen,
- Chao-Ching Wang,
- Jinn-Shyan Wang
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 587–592https://doi.org/10.1145/1629911.1630062Consistent with the trend towards the use of many cores in SOC and 3D Chip techniques, this paper proposes a "single-cycle ring" interconnection (SC_Ring) with ultra-low latency and minimal complexity. The proposed SC_Ring allows multiple single-cycle ...
- research-articleJuly 2009
Spectrum: a hybrid nanophotonic-electric on-chip network
- Zheng Li,
- Dan Fay,
- Alan Mickelson,
- Li Shang,
- Manish Vachharajani,
- Dejan Filipovic,
- Wounjhang Park,
- Yihe Sun
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 575–580https://doi.org/10.1145/1629911.1630060On many-core chip designs, short, often-multicast, latency-critical messages, used extensively in high-level coherence and synchronization protocols, often become the bottleneck of parallel performance scaling. This paper presents Spectrum, a hybrid ...
- research-articleJuly 2009
Event-driven gate-level simulation with GP-GPUs
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 557–562https://doi.org/10.1145/1629911.1630056Logic simulation is a critical component of the design tool flow in modern hardware development efforts. It is used widely -- from high-level descriptions down to gate-level ones -- to validate several aspects of the design, particularly functional ...
- research-articleJuly 2009
Massively parallel processing: it's déjà vu all over again
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 534–538https://doi.org/10.1145/1629911.1630050In this paper we will identify those aspects of the concurrent computing landscape that have changed since the 1980's and how those changes might impact the efficacy of parallel computing as we move from single- to multi- to many- and to massive numbers ...
- research-articleJuly 2009
Dynamic thermal management via architectural adaptation
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 484–489https://doi.org/10.1145/1629911.1630038Exponentially rising cooling/packaging costs due to high power density call for architectural and software-level thermal management. Dynamic thermal management (DTM) techniques continuously monitor the on-chip processor temperature. Appropriate ...
- research-articleJuly 2009
A design flow for application specific heterogeneous pipelined multiprocessor systems
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 250–253https://doi.org/10.1145/1629911.1629979This paper describes a rapid design methodology to create a pipeline of processers to execute streaming applications. The methodology is in two separate phases: the first phase, uses a heuristic to rapidly search through a large number of processor ...
- research-articleJuly 2009
NUDA: a non-uniform debugging architecture and non-intrusive race detection for many-core
DAC '09: Proceedings of the 46th Annual Design Automation ConferencePages 148–153https://doi.org/10.1145/1629911.1629954Traditional debug methodologies are limited in their ability to provide debugging support for many-core parallel programming. Synchronization problems or bugs due to race conditions are particularly difficult to detect with software debugging tools. ...