Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleMay 2019
Inside Project Brainwave's Cloud-Scale, Real-Time AI Processor
- Jeremy Fowers,
- Kalin Ovtcharov,
- Michael K. Papamichael,
- Todd Massengill,
- Ming Liu,
- Daniel Lo,
- Shlomi Alkalay,
- Michael Haselman,
- Logan Adams,
- Mahdi Ghandi,
- Stephen Heil,
- Prerak Patel,
- Adam Sapek,
- Gabriel Weisz,
- Lisa Woods,
- Sitaram Lanka,
- Steven K. Reinhardt,
- Adrian M. Caulfield,
- Eric S. Chung,
- Doug Burger
Growing computational demands from deep neural networks (DNNs), coupled with diminishing returns from general-purpose architectures, have led to a proliferation of Neural Processing Units (NPUs). This paper describes the Project Brainwave NPU (BW-NPU), ...
- research-articleJune 2018
A configurable cloud-scale DNN processor for real-time AI
- Jeremy Fowers,
- Kalin Ovtcharov,
- Michael Papamichael,
- Todd Massengill,
- Ming Liu,
- Daniel Lo,
- Shlomi Alkalay,
- Michael Haselman,
- Logan Adams,
- Mahdi Ghandi,
- Stephen Heil,
- Prerak Patel,
- Adam Sapek,
- Gabriel Weisz,
- Lisa Woods,
- Sitaram Lanka,
- Steven K. Reinhardt,
- Adrian M. Caulfield,
- Eric S. Chung,
- Doug Burger
ISCA '18: Proceedings of the 45th Annual International Symposium on Computer ArchitecturePages 1–14https://doi.org/10.1109/ISCA.2018.00012Interactive AI-powered services require low-latency evaluation of deep neural network (DNN) models---aka "realtime AI". The growing demand for computationally expensive, state-of-the-art DNNs, coupled with diminishing performance gains of general-...
- research-articleJanuary 2017
Configurable Clouds
- Adrian M. Caulfield,
- Eric S. Chung,
- Andrew Putnam,
- Hari Angepat,
- Daniel Firestone,
- Jeremy Fowers,
- Michael Haselman,
- Stephen Heil,
- Matt Humphrey,
- Puneet Kaur,
- Joo-Young Kim,
- Daniel Lo,
- Todd Massengill,
- Kalin Ovtcharov,
- Michael Papamichael,
- Lisa Woods,
- Sitaram Lanka,
- Derek Chiou,
- Doug Burger
Hyperscale datacenter providers have struggled to balance the growing need for specialized hardware with the economic benefits of homogeneity. The Configurable Cloud datacenter architecture introduces a layer of reconfigurable logic (FPGAs) between the ...
- research-articleOctober 2016
A reconfigurable fabric for accelerating large-scale datacenter services
- Andrew Putnam,
- Adrian M. Caulfield,
- Eric S. Chung,
- Derek Chiou,
- Kypros Constantinides,
- John Demme,
- Hadi Esmaeilzadeh,
- Jeremy Fowers,
- Gopi Prashanth Gopal,
- Jan Gray,
- Michael Haselman,
- Scott Hauck,
- Stephen Heil,
- Amir Hormati,
- Joo-Young Kim,
- Sitaram Lanka,
- James Larus,
- Eric Peterson,
- Simon Pope,
- Aaron Smith,
- Jason Thong,
- Phillip Yi Xiao,
- Doug Burger
Datacenter workloads demand high computational capabilities, flexibility, power efficiency, and low cost. It is challenging to improve all of these factors simultaneously. To advance datacenter capabilities beyond what commodity server designs can ...
- research-articleOctober 2016
A cloud-scale acceleration architecture
- Adrian M. Caulfield,
- Eric S. Chung,
- Andrew Putnam,
- Hari Angepat,
- Jeremy Fowers,
- Michael Haselman,
- Stephen Heil,
- Matt Humphrey,
- Puneet Kaur,
- Joo-Young Kim,
- Daniel Lo,
- Todd Massengill,
- Kalin Ovtcharov,
- Michael Papamichael,
- Lisa Woods,
- Sitaram Lanka,
- Derek Chiou,
- Doug Burger
MICRO-49: The 49th Annual IEEE/ACM International Symposium on MicroarchitectureArticle No.: 7, Pages 1–13Hyperscale datacenter providers have struggled to balance the growing need for specialized hardware (efficiency) with the economic benefits of homogeneity (manageability). In this paper we propose a new cloud architecture that uses reconfigurable logic ...
- invited-talkFebruary 2016
Agile Co-Design for a Reconfigurable Datacenter
- Shlomi Alkalay,
- Hari Angepat,
- Adrian Caulfield,
- Eric Chung,
- Oren Firestein,
- Michael Haselman,
- Stephen Heil,
- Kyle Holohan,
- Matt Humphrey,
- Tamas Juhasz,
- Puneet Kaur,
- Sitaram Lanka,
- Daniel Lo,
- Todd Massengill,
- Kalin Ovtcharov,
- Michael Papamichael,
- Andrew Putnam,
- Raja Seera,
- Rimon Tadros,
- Jason Thong,
- Lisa Woods,
- Derek Chiou,
- Doug Burger
FPGA '16: Proceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysPage 15https://doi.org/10.1145/2847263.2847287In 2015, a team of software and hardware developers at Microsoft shipped the world?s first commercial search engine accelerated using FPGAs in the datacenter. During the sprint to production, new algorithms in the Bing ranking service were ported into ...
- research-articleMay 2015
A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services
- Andrew Putnam,
- Adrian M. Caulfield,
- Eric S. Chung,
- Derek Chiou,
- Kypros Constantinides,
- John Demme,
- Hadi Esmaeilzadeh,
- Jeremy Fowers,
- Gopi Prashanth Gopal,
- Jan Gray,
- Michael Haselman,
- Scott Hauck,
- Stephen Heil,
- Amir Hormati,
- Joo-Young Kim,
- Sitaram Lanka,
- James Larus,
- Eric Peterson,
- Simon Pope,
- Aaron Smith,
- Jason Thong,
- Phillip Yi Xiao,
- Doug Burger
To advance datacenter capabilities beyond what commodity server designs can provide, the authors designed and built a composable, reconfigurable fabric to accelerate large-scale software services. Each instantiation of the fabric consists of a 6 x 8 2D ...
- research-articleJune 2014
A reconfigurable fabric for accelerating large-scale datacenter services
- Andrew Putnam,
- Adrian M. Caulfield,
- Eric S. Chung,
- Derek Chiou,
- Kypros Constantinides,
- John Demme,
- Hadi Esmaeilzadeh,
- Jeremy Fowers,
- Gopi Prashanth Gopal,
- Jan Gray,
- Michael Haselman,
- Scott Hauck,
- Stephen Heil,
- Amir Hormati,
- Joo-Young Kim,
- Sitaram Lanka,
- James Larus,
- Eric Peterson,
- Simon Pope,
- Aaron Smith,
- Jason Thong,
- Phillip Yi Xiao,
- Doug Burger
ISCA '14: Proceeding of the 41st annual international symposium on Computer architecuturePages 13–24Datacenter workloads demand high computational capabilities, flexibility, power efficiency, and low cost. It is challenging to improve all of these factors simultaneously. To advance datacenter capabilities beyond what commodity server designs can ...
Also Published in:
ACM SIGARCH Computer Architecture News: Volume 42 Issue 3 - research-articleFebruary 2009
FPGA-based front-end electronics for positron emission tomography
FPGA '09: Proceedings of the ACM/SIGDA international symposium on Field programmable gate arraysPages 93–102https://doi.org/10.1145/1508128.1508143Modern Field Programmable Gate Arrays (FPGAs) are capable of performing complex discrete signal processing algorithms with clock rates above 100MHz. This combined with FPGA's low expense, ease of use, and selected dedicated hardware make them an ideal ...
- posterFebruary 2008
Fpga-based data acquisition system for a positron emission tomography (PET) scanner
FPGA '08: Proceedings of the 16th international ACM/SIGDA symposium on Field programmable gate arraysPage 264https://doi.org/10.1145/1344671.1344727Modern Field Programmable Gate Arrays (FPGAs) are capable of performing complex discrete signal processing algorithms with clock rates of above 100MHz. This combined with FPGAs low expense, ease of use, and selected dedicated hardware make them an ideal ...
- ArticleApril 2005
A Comparison of Floating Point and Logarithmic Number Systems for FPGAs
FCCM '05: Proceedings of the 13th Annual IEEE Symposium on Field-Programmable Custom Computing MachinesPages 181–190https://doi.org/10.1109/FCCM.2005.6There have been many papers proposing the use of logarithmic numbers (LNS) as an alternative to floating point because of simpler multiplication, division and exponentiation computations[1,4-9,13]. However, this advantage comes at the cost of ...