
Data Orchestration in Deep Learning Accelerators

  • Book
  • © 2020

Overview

Part of the book series: Synthesis Lectures on Computer Architecture (SLCA)



About this book

This Synthesis Lecture focuses on techniques for efficient data orchestration within DNN accelerators. The end of Moore's Law, coupled with the rapid growth of deep learning and other AI applications, has led to the emergence of custom Deep Neural Network (DNN) accelerators for energy-efficient inference on edge devices. Modern DNNs have millions of parameters and involve billions of computations, which necessitates extensive data movement from memory to on-chip processing engines. It is well known that the cost of data movement today surpasses the cost of the actual computation; DNN accelerators therefore require careful orchestration of data across on-chip compute, network, and memory elements to minimize the number of accesses to external DRAM. The book covers DNN dataflows, data reuse, buffer hierarchies, networks-on-chip, and automated design-space exploration. It concludes with the data orchestration challenges posed by compressed and sparse DNNs, and with future trends. The target audience is students, engineers, and researchers interested in designing high-performance, low-energy accelerators for DNN inference.
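The reuse argument above can be made concrete with a toy access-count model (our own sketch, not taken from the book; all function names are hypothetical). For a matrix multiply, an "output-stationary" dataflow keeps each partial sum in an on-chip register while its dot product accumulates, roughly halving the number of external-memory accesses compared with fetching every operand from DRAM on every multiply-accumulate:

```python
# Toy access-count model for C = A (MxK) x B (KxN), counting DRAM accesses.
# This is an illustrative sketch of the dataflow idea, not the book's model.

def naive_accesses(M, N, K):
    # Every operand touches DRAM on every multiply-accumulate:
    # read A[m][k], read B[k][n], and read-modify-write C[m][n].
    return M * N * K * 4  # 3 reads + 1 write per MAC

def output_stationary_accesses(M, N, K):
    # Keep each C[m][n] in an on-chip register while its K-long
    # dot product accumulates: A and B are still streamed from DRAM,
    # but each output element is written back only once.
    return M * N * K * 2 + M * N  # A and B reads per MAC, one C write

M = N = K = 64
print(naive_accesses(M, N, K))              # 1048576
print(output_stationary_accesses(M, N, K))  # 528384
```

Adding on-chip buffers that hold tiles of A and B would cut the access count further; exploring such dataflow and tiling choices systematically is exactly the design-space exploration the book covers.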


Table of contents (8 chapters)

Authors and Affiliations

  • Georgia Institute of Technology, USA

    Tushar Krishna, Hyoukjun Kwon, Ananda Samajdar

  • NVIDIA, USA

    Angshuman Parashar, Michael Pellauer

About the authors

Tushar Krishna is an Assistant Professor in the School of Electrical and Computer Engineering at the Georgia Institute of Technology. He received a Ph.D. in Electrical Engineering and Computer Science from the Massachusetts Institute of Technology in 2014. Prior to that, he received an M.S.E. in Electrical Engineering from Princeton University in 2009 and a B.Tech. in Electrical Engineering from the Indian Institute of Technology (IIT), Delhi in 2007. Before joining Georgia Tech in 2015, he worked as a researcher in the VSSAD group at Intel in Massachusetts. Dr. Krishna's research spans computer architecture, interconnection networks, networks-on-chip (NoC), and deep learning accelerators, with a focus on optimizing data movement in modern computing systems. Three of his papers have been selected for IEEE Micro's Top Picks from Computer Architecture, one more received an honorable mention, and three have won best paper awards. He received the National Science Foundation (NSF) CRII award in 2018, and both a Google Faculty Award and a Facebook Faculty Award in 2019.

Hyoukjun Kwon is a research scientist at Facebook AR/VR. He received his Ph.D. in Computer Science from the Georgia Institute of Technology in 2020, advised by Dr. Tushar Krishna. He received B.S. degrees in Environmental Materials Science and in Computer Science and Engineering from Seoul National University in 2015. His research interests include communication-centric DNN accelerator designs, modeling of DNN accelerator architecture and mapping, NoCs for accelerators, and co-optimization of DNN models, mappings, and accelerator architectures. He is actively leading the development of multiple open-source tools and RTL designs in the DNN accelerator domain, including MAESTRO, MAERI, Microswitch NoC, and OpenSMART. One of his papers was selected for IEEE Micro's Top Picks from computer architecture in 2019, one received an honorable mention in 2018, and another won the best paper award at HPCA 2020.
Angshuman Parashar is a Senior Research Scientist at NVIDIA. His research interests are in building, evaluating, and programming spatial and data-parallel architectures, with a present focus on automated mapping of machine learning algorithms onto architectures based on explicit decoupled data orchestration. Prior to NVIDIA, he was a member of the VSSAD group at Intel, where he worked with a small team of experts in architecture, languages, workloads, and implementation to design and evaluate a new spatial architecture. Dr. Parashar received his Ph.D. in Computer Science and Engineering from the Pennsylvania State University in 2007, and his B.Tech. in Computer Science and Engineering from the Indian Institute of Technology, Delhi in 2002.
Michael Pellauer is a Senior Research Scientist at NVIDIA. His research interest is building domain-specific accelerators, with a special emphasis on deep learning and sparse tensor algebra. Prior to NVIDIA, he was a member of the VSSAD group at Intel, where he performed research and advanced development on customized spatial accelerators. Dr. Pellauer holds a Ph.D. from the Massachusetts Institute of Technology in Cambridge, Massachusetts (2010), a Master's from Chalmers University of Technology in Gothenburg, Sweden (2003), and a Bachelor's from Brown University in Providence, Rhode Island (1999).
Ananda Samajdar is a Ph.D. student in the School of Electrical and Computer Engineering (ECE) at the Georgia Institute of Technology. He completed his B.Tech. (Hons.) in Electronics and Communication Engineering from the Indian Institute of Information Technology, Allahabad, India (IIIT-A) in 2013. Before joining Georgia Tech, Anand worked as a VLSI design engineer at Qualcomm Bangalore for three years. Anand's research interests include designing custom architectures for efficient deep learning systems. He has authored a number of papers in top-tier computer architecture conferences. Two of his papers received honorable mentions in IEEE Micro's Top Picks 2019, and one won the best paper award at HPCA 2020. He is also the recipient of the silver medal in the ACM Student Research Competition at ASPLOS 2019.

Bibliographic Information
