Deepa PPT 49
B.Deepa (17Q61D5708)
M.TECH (VLSI)
Contents
ABSTRACT
INTRODUCTION
LITERATURE REVIEW
EXISTING SYSTEM
PROPOSED SYSTEM
RESULTS
SYNTHESIS REPORTS
COMPARISON TABLE
IMPROVEMENTS OBSERVED
CONCLUSION
ABSTRACT
In this work we design DLAU, a scalable accelerator architecture for large-scale deep learning networks, applied here to a CNN structure and managed by a DLAU controller.
The DLAU accelerator employs three pipelined processing units to improve throughput and uses tile techniques to exploit the data locality of deep learning applications.
INTRODUCTION
In the past few years, machine learning has become pervasive in many research fields and applications and has delivered satisfactory products.
The emergence of deep learning has sped up the development of machine learning and artificial intelligence.
Consequently, deep learning has become a hot topic in research organizations.
In general, deep learning uses a multilayer neural network model to extract high-level features, which are combinations of low-level abstractions, and to discover distributed representations of the data, in order to solve complex problems in machine learning.
EXISTING SYSTEM
TMMU (Tiled Matrix Multiplication Unit), PSAU (Part Sum Accumulation Unit) & AFAU (Activation Function Acceleration Unit)
TMMU is in charge of the multiplication and accumulation operations.
PSAU is responsible for the accumulation operation: it accumulates the part sums produced by TMMU.
If the part sum is the final result, PSAU writes the value to the output buffer and sends it to AFAU in a pipelined manner.
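A minimal sketch of the accumulation step is shown below; the module name, data width, and handshake signals (part_valid, last_part, acc_clear, acc_done) are illustrative assumptions, not the actual DLAU interface.

// Hypothetical part-sum accumulator: adds each partial sum from TMMU into a
// running total and flags when the final value is ready for AFAU.
module psau_sketch #(parameter W = 16) (
    input              clk,
    input              rst,
    input              acc_clear,    // controller clears the running sum between rows
    input              part_valid,   // TMMU asserts this with each partial sum
    input      [W-1:0] part_sum,     // partial sum produced by TMMU
    input              last_part,    // high on the final partial sum of a row
    output reg [W-1:0] acc_out,      // accumulated result
    output reg         acc_done      // pulses when acc_out is the final value for AFAU
);
    always @(posedge clk) begin
        if (rst || acc_clear) begin
            acc_out  <= {W{1'b0}};
            acc_done <= 1'b0;
        end else if (part_valid) begin
            acc_out  <= acc_out + part_sum;   // accumulate the incoming part sum
            acc_done <= last_part;            // final value goes to the output buffer / AFAU
        end else begin
            acc_done <= 1'b0;
        end
    end
endmodule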
Finally, AFAU implements the activation function using piecewise linear interpolation.
This method is widely used to implement activation functions with negligible accuracy loss; in this design, the sigmoid activation function is implemented.
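The sketch below illustrates the piecewise linear idea with a coarse three-segment (hard-sigmoid style) approximation; the segment boundaries, slope, and fixed-point formats are placeholder assumptions, and a real AFAU would use more segments with coefficients stored in a table.

// Hypothetical piecewise-linear sigmoid (3 segments):
//   y = 0              for x <= -2.0
//   y = 0.25*x + 0.5   for -2.0 < x < 2.0
//   y = 1              for x >=  2.0
// Fixed point: x is signed Q8.8, y is unsigned Q1.8 (9'd256 == 1.0).
module afau_pwl_sigmoid (
    input  signed [15:0] x,   // Q8.8 input (accumulated sum from PSAU)
    output reg     [8:0] y    // Q1.8 output in [0, 1.0]
);
    // 0.25*x + 0.5 in Q8.8: arithmetic shift right by 2, then add 0.5 (= 128)
    wire signed [15:0] lin = (x >>> 2) + 16'sd128;

    always @(*) begin
        if (lin <= 16'sd0)
            y = 9'd0;          // saturate low:  sigmoid(x) ~ 0
        else if (lin >= 16'sd256)
            y = 9'd256;        // saturate high: sigmoid(x) ~ 1
        else
            y = lin[8:0];      // middle segment: linear interpolation
    end
endmodule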
PROPOSED SYSTEM
Description
In order to exploit the locality of the deep learning application, we employ tile techniques to partition the large-scale input data.
The DLAU architecture can be configured to operate on different tile sizes, trading off speedup against hardware cost.
Consequently, the FPGA-based accelerator is scalable enough to accommodate different machine learning applications.
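As an illustration of the tiling idea only, the sketch below buffers TILE_SIZE input words before the compute units start; the module name, parameters, and ports are assumptions, not the actual DLAU interface.

// Hypothetical tile input buffer: collects TILE_SIZE words from the input
// stream, then asserts tile_ready so TMMU can work on one tile at a time.
// (Read side of the buffer is omitted in this sketch.)
module tile_buffer #(
    parameter TILE_SIZE = 32,   // words per tile: larger tiles = more speedup, more BRAM
    parameter DATA_W    = 16
) (
    input                   clk,
    input                   rst,
    input                   in_valid,
    input      [DATA_W-1:0] in_data,
    output reg              tile_ready   // one-cycle pulse when a full tile is buffered
);
    reg [DATA_W-1:0] tile_mem [0:TILE_SIZE-1];  // maps to BRAM / distributed RAM
    reg [$clog2(TILE_SIZE):0] count;

    always @(posedge clk) begin
        if (rst) begin
            count      <= 0;
            tile_ready <= 1'b0;
        end else if (in_valid) begin
            tile_mem[count] <= in_data;                 // store the incoming word
            count           <= (count == TILE_SIZE-1) ? 0 : count + 1;
            tile_ready      <= (count == TILE_SIZE-1);  // full tile captured
        end else begin
            tile_ready <= 1'b0;
        end
    end
endmodule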
DLAU Controller
Description
The DLAU accelerator is composed of three fully pipelined processing units, namely TMMU, PSAU, and AFAU.
Different network topologies, such as CNN, DNN, or even emerging neural networks, can be composed from these basic modules.
Consequently, the scalability of the FPGA-based accelerator is higher than that of an ASIC-based accelerator.
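The sketch below shows one plausible way the three stages chain together, reusing the psau_sketch and afau_pwl_sigmoid modules above; the single multiplier standing in for TMMU and all port names and widths are assumptions for illustration.

// Hypothetical composition of the pipeline. The multiply stage here is a
// single MAC stand-in for TMMU, not the real tiled matrix-multiply unit.
module dlau_pipe_sketch (
    input                clk,
    input                rst,
    input                in_valid,
    input                in_last,    // marks the last input/weight pair of a neuron
    input  signed [7:0]  in_data,
    input  signed [7:0]  in_weight,
    output        [8:0]  act_out     // activated output from AFAU
);
    // Stage 1 (TMMU stand-in): one multiply per cycle
    reg signed [15:0] product;
    reg               prod_valid, prod_last;
    always @(posedge clk) begin
        if (rst) begin
            prod_valid <= 1'b0;
            prod_last  <= 1'b0;
        end else begin
            product    <= in_data * in_weight;
            prod_valid <= in_valid;
            prod_last  <= in_last;
        end
    end

    // Stage 2 (PSAU): accumulate the partial products
    wire [15:0] acc_out;
    wire        acc_done;   // would serve as the output-valid flag in a full design
    psau_sketch #(.W(16)) u_psau (
        .clk(clk), .rst(rst), .acc_clear(1'b0),  // clearing between neurons omitted
        .part_valid(prod_valid), .part_sum(product),
        .last_part(prod_last), .acc_out(acc_out), .acc_done(acc_done)
    );

    // Stage 3 (AFAU): piecewise-linear sigmoid on the accumulated sum
    afau_pwl_sigmoid u_afau (.x(acc_out), .y(act_out));
endmodule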
CNN (Convolution with Laplacian filter, stage 1)
// Body of a clocked always block (module ports and clock are not shown on the slide).
if (enable) begin
    output1 <= 0;
    output2 <= 0;
    output3 <= 0;
    output4 <= 0;
    output5 <= 0;
    done    <= 1'b0;
end
else begin
    output1 <= {1'b1, ~(input2)} + 5'b00001;  // two's-complement negation of input2
    output2 <= {1'b1, ~(input4)} + 5'b00001;  // -input4
    output3 <= {2'b00, input5} << 2;          //  4 * input5 (centre pixel)
    output4 <= {1'b1, ~(input6)} + 5'b00001;  // -input6
    output5 <= {1'b1, ~(input8)} + 5'b00001;  // -input8
    done    <= 1'b1;
end
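Each {1'b1, ~(inputN)} + 5'b00001 term is the 5-bit two's-complement negation of a neighbour pixel, and {2'b00, input5} << 2 scales the centre pixel by 4, which corresponds to the 3x3 Laplacian kernel with -1 weights on the four neighbours and +4 at the centre (the operand widths are inferred from the constants shown, not stated on the slide).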
FSM diagram for AFAU
RESULTS
SYNTHESIS REPORT
Area Utilization
Power Utilization
COMPARISON TABLE
IMPROVEMENTS OBSERVED
CONCLUSION
In the current design scenario, the authors () have proposed DLAU with a CNN architecture that uses 2x2 and 4x4 as separate models.
Our design introduces a novel CNN structure in which both the 2x2 and 4x4 configurations are handled by each internal module used in the design.
The proposed structure uses only 2 L-filters, 3 adders, and 1 max-pooling circuit, whereas the existing scenario needs at least 2 filters, 8 adders, and 1 max-pooling module.
We propose the design of a deep learning accelerator unit using an FSM, which can reduce power consumption and speed up the device.
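To make the FSM-based control idea concrete, the sketch below shows a generic three-state controller that enables the datapath only while computing, so the compute units do not toggle when idle; the states, signal names, and this power argument are illustrative assumptions, not the actual controller of this design.

// Hypothetical controller FSM: datapath_en is asserted only in the RUN state.
module dlau_ctrl_fsm (
    input      clk,
    input      rst,
    input      start,        // request to process one tile
    input      compute_done, // from the datapath (e.g. AFAU result written)
    output reg datapath_en,  // gates TMMU/PSAU/AFAU activity
    output reg busy
);
    localparam IDLE = 2'd0, RUN = 2'd1, DONE = 2'd2;
    reg [1:0] state;

    always @(posedge clk) begin
        if (rst)
            state <= IDLE;
        else begin
            case (state)
                IDLE: if (start)        state <= RUN;
                RUN:  if (compute_done) state <= DONE;
                DONE:                   state <= IDLE;
                default:                state <= IDLE;
            endcase
        end
    end

    always @(*) begin
        datapath_en = (state == RUN);
        busy        = (state != IDLE);
    end
endmodule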
THANK YOU