Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/3437539.3437702acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article

Co-exploration of neural architectures and heterogeneous ASIC accelerator designs targeting multiple tasks

Published: 18 November 2020 Publication History

Abstract

Neural Architecture Search (NAS) has demonstrated its power on various AI accelerating platforms such as Field Programmable Gate Arrays (FPGAs) and Graphic Processing Units (GPUs). However, it remains an open problem how to integrate NAS with Application-Specific Integrated Circuits (ASICs), despite them being the most powerful AI accelerating platforms. The major bottleneck comes from the large design freedom associated with ASIC designs. Moreover, with the consideration that multiple DNNs will run in parallel for different workloads with diverse layer operations and sizes, integrating heterogeneous ASIC sub-accelerators for distinct DNNs in one design can significantly boost performance, and at the same time further complicate the design space. To address these challenges, in this paper we build ASIC template set based on existing successful designs, described by their unique dataflows, so that the design space is significantly reduced. Based on the templates, we further propose a framework, namely ASICNAS, which can simultaneously identify multiple DNN architectures and the associated heterogeneous ASIC accelerator design, such that the design specifications (specs) can be satisfied, while the accuracy can be maximized. Experimental results show that compared with successive NAS and ASIC design optimizations which lead to design spec violations, ASICNAS can guarantee the results to meet the design specs with 17.77%, 2.49×, and 2.32× reductions on latency, energy, and area and less than 1.6% accuracy loss. To the best of the authors' knowledge, this is the first work on neural architecture and ASIC accelerator design co-exploration.

References

[1]
Barret Zoph and Quoc V Le. Neural architecture search with reinforcement learning. In Proc. of ICLR, 2017.
[2]
Hanxiao Liu et al. Darts: Differentiable architecture search. In Proc. of ICLR, 2019.
[3]
Hieu Pham et al. Efficient neural architecture search via parameter sharing. In Proc. of ICML, pages 4092--4101, 2018.
[4]
Bichen Wu et al. Fbnet: Hardware-aware efficient convnet design via differentiable neural architecture search. In Proc. of CVPR, pages 10734--10742, 2019.
[5]
Han Cai et al. Proxylessnas: Direct neural architecture search on target task and hardware. In Proc. of ICLR, 2019.
[6]
Weiwen Jiang et al. Accuracy vs. efficiency: Achieving both through fpga-implementation aware neural architecture search. In Proc. of DAC, 2019.
[7]
Cong Hao et al. Fpga/dnn co-design: An efficient design methodology for iot intelligence on the edge. In Proc. of DAC, 2019.
[8]
Yu-Hsin Chen et al. Eyeriss: A spatial architecture for energy-efficient dataflow for convolutional neural networks. In Proc. of ISCA, pages 367--379, 2016.
[9]
Angshuman Parashar et al. Scnn: An accelerator for compressed-sparse convolutional neural networks. In Proc. of ISCA, pages 27--40, 2017.
[10]
Zidong Du et al. Shidiannao: Shifting vision processing closer to the sensor. In Proc. of ISCA, pages 92--104, 2015.
[11]
NVIDIA. Nvdla deep learning accelerator. http://nvdla.org, 2017.
[12]
Chuan Li. https://lambdalabs.com/blog/resnet9-train-to-94-cifar10-accuracy-in-100-seconds. 2019. Accessed: 2019-11-24.
[13]
Michael Abrash. https://www.oculus.com/blog/inventing-the-future/. 2019. Accessed: 2019-11-26.
[14]
Hyoukjun Kwon et al. Herald: Optimizing heterogeneous dnn accelerators for edge devices. arXiv preprint arXiv:1909.07437, 2019.
[15]
Hyoukjun Kwon et al. Understanding reuse, performance, and hardware cost of dnn dataflow: A data-centric approach. In Proc. of MICRO, pages 754--768, 2019.
[16]
Olaf Ronneberger et al. U-net: Convolutional networks for biomedical image segmentation. In Proc. of MICCAI, pages 234--241, 2015.
[17]
Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, 8(3--4):229--256, 1992.
[18]
K. Ito et al. Ilp-based cost-optimal dsp synthesis with module selection and data format conversion. IEEE Trans. TVLSI, 6(4):582--594, 1998.
[19]
Zili Shao et al. Efficient assignment and scheduling for heterogeneous dsp systems. IEEE Trans. TPDS, 16(6):516--525, 2005.
[20]
Mingxing an et al. Mnasnet: Platform-aware neural architecture search for mobile. In Proc. of CVPR, pages 2820--2828, 2019.
[21]
Michael P Perrone and Leon N Cooper. When networks disagree: Ensemble methods for hybrid neural networks. Technical report, 1992.

Cited By

View all
  • (2023)Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights GenerationACM Transactions on Design Automation of Electronic Systems10.1145/361167328:6(1-31)Online publication date: 16-Oct-2023
  • (2023) XploreNAS: Explore Adversarially Robust and Hardware-efficient Neural Architectures for Non-ideal XbarsACM Transactions on Embedded Computing Systems10.1145/359304522:4(1-17)Online publication date: 24-Jul-2023
  • (2022)A Framework for Neural Network Architecture and Compile Co-optimizationACM Transactions on Embedded Computing Systems10.1145/353325122:1(1-24)Online publication date: 29-Oct-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DAC '20: Proceedings of the 57th ACM/EDAC/IEEE Design Automation Conference
July 2020
1545 pages
ISBN:9781450367257
  • General Chair:
  • Zhuo Li

Sponsors

In-Cooperation

  • IEEE-CEDA

Publisher

IEEE Press

Publication History

Published: 18 November 2020

Check for updates

Qualifiers

  • Research-article

Conference

DAC '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

Upcoming Conference

DAC '25
62nd ACM/IEEE Design Automation Conference
June 22 - 26, 2025
San Francisco , CA , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)3
Reflects downloads up to 30 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights GenerationACM Transactions on Design Automation of Electronic Systems10.1145/361167328:6(1-31)Online publication date: 16-Oct-2023
  • (2023) XploreNAS: Explore Adversarially Robust and Hardware-efficient Neural Architectures for Non-ideal XbarsACM Transactions on Embedded Computing Systems10.1145/359304522:4(1-17)Online publication date: 24-Jul-2023
  • (2022)A Framework for Neural Network Architecture and Compile Co-optimizationACM Transactions on Embedded Computing Systems10.1145/353325122:1(1-24)Online publication date: 29-Oct-2022
  • (2022)Neural Architecture Search Survey: A Hardware PerspectiveACM Computing Surveys10.1145/352450055:4(1-36)Online publication date: 21-Nov-2022
  • (2022)A full-stack search technique for domain optimized deep learning acceleratorsProceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems10.1145/3503222.3507767(27-42)Online publication date: 28-Feb-2022
  • (2021)Automated HW/SW co-design for edge AIProceedings of the 2021 International Conference on Hardware/Software Codesign and System Synthesis10.1145/3478684.3479261(11-20)Online publication date: 30-Sep-2021
  • (2021)Intermittent-Aware Neural Architecture SearchACM Transactions on Embedded Computing Systems10.1145/347699520:5s(1-27)Online publication date: 17-Sep-2021
  • (2021)RAP: A Software Framework of Developing Convolutional Neural Networks for Resource-constrained Devices Using Environmental Monitoring as a Case StudyACM Transactions on Cyber-Physical Systems10.1145/34726125:4(1-28)Online publication date: 22-Sep-2021
  • (2021)DIANProceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design10.1109/ISLPED52811.2021.9502478(1-6)Online publication date: 26-Jul-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media