DOI: 10.1145/3289602.3293947

Overcoming Data Transfer Bottlenecks in DNN Accelerators via Layer-Conscious Memory Management

Published: 20 February 2019

Abstract

Deep Neural Networks (DNNs) are rapidly evolving to satisfy the performance and accuracy requirements of many real-world applications. This evolution renders DNNs increasingly complex in terms of network topology, data sizes, and layer types. Currently, most state-of-the-art DNN accelerators adopt a uniform memory hierarchy (UMH) design methodology, meaning that the data transfers of all convolutional and fully connected layers must go through the same memory levels. Unfortunately, the performance of some layers is always bounded by off-chip memory transfers. This is caused by the saturation of data reuse in on-chip buffers, which results in underutilization of on-chip memory. To address this issue, we propose a layer-conscious memory hierarchy (LCMH) methodology for DNN accelerators. LCMH determines the memory level of each layer according to its requirements for off-chip memory bandwidth and on-chip buffer size for its data sources. As a result, the off-chip memory footprints of memory-bound layers can be avoided by keeping their data on chip. In addition, we provide architectural support for accelerators equipped with LCMH. Experimental results show that designs with layer-conscious memory management achieve up to 36% speedup compared with designs using UMH and a 5% improvement over state-of-the-art designs.
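To make the layer-conscious idea concrete, the sketch below (in Python) shows one way a per-layer memory-level assignment could look: layers whose off-chip bandwidth demand exceeds the available DRAM bandwidth are kept in on-chip buffers as long as they fit within the buffer budget, while the remaining layers are streamed through DRAM as in a uniform memory hierarchy. This is a minimal illustrative sketch under assumed names, signatures, and numbers; it is not the paper's actual algorithm or tool flow.

from dataclasses import dataclass
from typing import Dict, List

@dataclass
class Layer:
    name: str
    data_bytes: int     # bytes the layer would need to keep resident on chip (assumed profile)
    required_bw: float  # off-chip bandwidth (GB/s) needed to sustain peak compute (assumed profile)

def assign_memory_levels(layers: List[Layer],
                         onchip_budget_bytes: int,
                         dram_bw_gbps: float) -> Dict[str, str]:
    # Greedy heuristic: consider the most bandwidth-hungry layers first, and
    # pin a layer's data on chip only if it is memory-bound under a uniform
    # hierarchy (demand > DRAM bandwidth) and still fits in the buffer budget.
    placement = {}
    remaining = onchip_budget_bytes
    for layer in sorted(layers, key=lambda l: l.required_bw, reverse=True):
        memory_bound = layer.required_bw > dram_bw_gbps
        if memory_bound and layer.data_bytes <= remaining:
            placement[layer.name] = "on-chip"
            remaining -= layer.data_bytes
        else:
            placement[layer.name] = "off-chip"  # streamed through DRAM, as in UMH
    return placement

if __name__ == "__main__":
    # Hypothetical layer profile; sizes and bandwidths are made up for illustration.
    layers = [
        Layer("conv1", data_bytes=2 * 1024 * 1024,  required_bw=8.0),
        Layer("conv5", data_bytes=6 * 1024 * 1024,  required_bw=25.0),
        Layer("fc6",   data_bytes=64 * 1024 * 1024, required_bw=40.0),
    ]
    print(assign_memory_levels(layers,
                               onchip_budget_bytes=8 * 1024 * 1024,
                               dram_bw_gbps=19.2))

Under these assumed numbers, conv5 is memory-bound and small enough to be pinned on chip, while fc6 is memory-bound but too large for the budget and therefore remains off chip, illustrating how the assignment depends on both bandwidth demand and buffer capacity.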



    Published In

    FPGA '19: Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
    February 2019
    360 pages
    ISBN:9781450361378
    DOI:10.1145/3289602
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.


    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 February 2019


    Author Tags

    1. accelerator
    2. dnns
    3. fpga
    4. memory management

    Qualifiers

    • Poster

    Funding Sources

    • Beijing Natural Science Foundation
    • Falcon Computing Solutions Inc.

    Conference

    FPGA '19

    Acceptance Rates

    Overall Acceptance Rate 125 of 627 submissions, 20%


    Article Metrics

    • Downloads (Last 12 months): 0
    • Downloads (Last 6 weeks): 0
    Reflects downloads up to 06 Oct 2024

    Cited By

    • (2024) A Survey on Neural Network Hardware Accelerators. IEEE Transactions on Artificial Intelligence 5(8), 3801-3822. https://doi.org/10.1109/TAI.2024.3377147. Online publication date: Aug-2024.
    • (2022) Memory-aware Partitioning, Scheduling, and Floorplanning for Partially Dynamically Reconfigurable Systems. ACM Transactions on Design Automation of Electronic Systems 28(1), 1-21. https://doi.org/10.1145/3534968. Online publication date: 23-May-2022.
    • (2022) Non-Structured DNN Weight Pruning—Is It Beneficial in Any Platform? IEEE Transactions on Neural Networks and Learning Systems 33(9), 4930-4944. https://doi.org/10.1109/TNNLS.2021.3063265. Online publication date: Sep-2022.
    • (2022) FCNNLib: A Flexible Convolution Algorithm Library for Deep Learning on FPGAs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 41(8), 2546-2559. https://doi.org/10.1109/TCAD.2021.3108065. Online publication date: Aug-2022.
    • (2022) Polymorphic Accelerators for Deep Neural Networks. IEEE Transactions on Computers 71(3), 534-546. https://doi.org/10.1109/TC.2020.3048624. Online publication date: 1-Mar-2022.
    • (2022) Tetris: A Heuristic Static Memory Management Framework for Uniform Memory Multicore Neural Network Accelerators. Journal of Computer Science and Technology 37(6), 1255-1270. https://doi.org/10.1007/s11390-021-1213-3. Online publication date: 30-Nov-2022.
    • (2021) OMNI: A Framework for Integrating Hardware and Software Optimizations for Sparse CNNs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 40(8), 1648-1661. https://doi.org/10.1109/TCAD.2020.3023903. Online publication date: Aug-2021.
    • (2020) Learning in the Frequency Domain. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1737-1746. https://doi.org/10.1109/CVPR42600.2020.00181. Online publication date: Jun-2020.
    • (2020) Survey on memory management techniques in heterogeneous computing systems. IET Computers & Digital Techniques 14(2), 47-60. https://doi.org/10.1049/iet-cdt.2019.0092. Online publication date: 21-Jan-2020.
