Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Bitwidth cognizant architecture synthesis of custom hardware accelerators

Published: 01 November 2006 Publication History

Abstract

Program-in chip-out (PICO) is a system for automatically synthesizing embedded hardware accelerators from loop nests specified in the C programming language. A key issue confronted when designing such accelerators is the optimization of hardware by exploiting information that is known about the varying number of bits required to represent and process operands. In this paper, we describe the handling and exploitation of integer bitwidth in PICO. A bitwidth analysis procedure is used to determine bitwidth requirements for all integer variables and operations in a C application. Given known bitwidths for all variables, complex problems arise when determining a program schedule that specifies on which function unit (FU) and at what time each operation executes. If operations are assigned to FUs with no knowledge of bitwidth, bitwidth-related cost benefit is lost when each unit is built to accommodate the widest operation assigned. By carefully placing operations of similar width on the same unit, hardware costs are decreased. This problem is addressed using a preliminary clustering of operations that is based jointly on width and implementation cost. These clusters are then honored during resource allocation and operation scheduling to create an efficient width-conscious design. Experimental results show that exploiting integer bitwidth substantially reduces the gate count of PICO-synthesized hardware accelerators across a range of applications

Cited By

View all
  • (2021)Thread-aware area-efficient high-level synthesis compiler for embedded devicesProceedings of the 2021 IEEE/ACM International Symposium on Code Generation and Optimization10.1109/CGO51591.2021.9370341(327-339)Online publication date: 27-Feb-2021
  • (2014)SketchiLogProceedings of the conference on Design, Automation & Test in Europe10.5555/2616606.2616793(1-4)Online publication date: 24-Mar-2014
  • (2014)Rapid evaluation of custom instruction selection approaches with FPGA estimationACM Transactions on Embedded Computing Systems10.1145/256001413:4(1-29)Online publication date: 10-Mar-2014
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems  Volume 20, Issue 11
November 2006
91 pages

Publisher

IEEE Press

Publication History

Published: 01 November 2006

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Thread-aware area-efficient high-level synthesis compiler for embedded devicesProceedings of the 2021 IEEE/ACM International Symposium on Code Generation and Optimization10.1109/CGO51591.2021.9370341(327-339)Online publication date: 27-Feb-2021
  • (2014)SketchiLogProceedings of the conference on Design, Automation & Test in Europe10.5555/2616606.2616793(1-4)Online publication date: 24-Mar-2014
  • (2014)Rapid evaluation of custom instruction selection approaches with FPGA estimationACM Transactions on Embedded Computing Systems10.1145/256001413:4(1-29)Online publication date: 10-Mar-2014
  • (2013)Elastic CGRAsProceedings of the ACM/SIGDA international symposium on Field programmable gate arrays10.1145/2435264.2435296(171-180)Online publication date: 11-Feb-2013
  • (2013)The CRNS framework and its application to programmable and reconfigurable cryptographyACM Transactions on Architecture and Code Optimization10.1145/2400682.24006929:4(1-25)Online publication date: 20-Jan-2013
  • (2013)A fast and low-overhead technique to secure programs against integer overflowsProceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)10.1109/CGO.2013.6494996(1-11)Online publication date: 23-Feb-2013
  • (2011)Global productiveness propagationACM SIGPLAN Notices10.1145/2016603.196770046:5(161-170)Online publication date: 11-Apr-2011
  • (2011)Global productiveness propagationProceedings of the 2011 SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems10.1145/1967677.1967700(161-170)Online publication date: 11-Apr-2011
  • (2010)SQNR estimation of fixed-point DSP algorithmsEURASIP Journal on Advances in Signal Processing10.1155/2010/1710272010(1-12)Online publication date: 1-Feb-2010
  • (2009)Custom floating-point unit generation for embedded systemsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2009.201399928:5(638-650)Online publication date: 1-May-2009
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media