Fpga Tutorial
Fpga Tutorial
What is FPGA?
Field-programmable gate array (FPGA) is a device that has array of Configurable logic
gates and can be programmed on-board through dedicated Joint Test Action Group (JTAG)
or through any other serial/ Parallel non-volatile Memory. FPGA architecture are based on
static random-access memory (SRAM) Volatile memory. The Data programmed inside the
memory of an FPGA erase once the board powered off. In order to configure the data,
external EEPROM is attached to FPGA.
Any type of digital hardware circuit can be implemented in FPGA Architecture. The
Configurable logic block can be used to design basic combinational logic circuit to complex
Processor Architecture.
FPGA Architecture
In FPGA, each Configurable Logic Block consist of 2 slices. Those slices are further divided
in 2 logic elements.
Also there are various dedicated circuits are present inside FPGA. They are Digital Clock
Manager, Multiplier, Block RAM and so on.
Digital Clock Manager DCM is used to perform Clock Phase shift, De skew, Clock divider
and frequency synthesis.
Multiplier Block implement dedicated 18×18 multiplier with Signed and unsigned operation.
In advanced FPGA Architecture like zynq FPGA from xilinx, dual core arm cortex A9
Processor is inbuilt inside the FPGA Fabric for high Performance implementation.
The FPGA is made of SRAM (Volatile Memory) so the data configured inside FPGA lost at
power Off state. FPGA Configuration is the process of loading the FPGA chip with
Configuration data through external devices during power On state.
Master Mode
Slave Mode
JTAG Mode
Master Modes
In the Master Mode the Configuration data is stored in external nonvolatile memories such us
SPI FLASH, Parallel FLASH, PROM and so on. During configuration process the data is
loaded in the FPGA Configurable Logic Blocks to operate as a specific application. The
configuration clock is provided by FPGA in Master Mode operation.
FPGA Configure through External PROM
In Slave Mode, The entire configuration Process is controlled by External device. Those
External device may be of processor, Microcontroller, and so on. The Configuration can
performed serially or parallel method. The Clock input is supplied by the external device for
Slave mode.
JTAG Connection
The four-wire JTAG interface is common on board testers and debugging hardware.FPGA
mainly uses JTAG interface for prototype download and debugging. JTAG Consist of TCK,
TMS, TDI and TDO lines for communication.
Block RAM and Distributed RAM in Xilinx FPGA
Block RAM:
Distributed RAM:
In this part of tutorial we are going to have a short intro on FPGA design flow. A simplified
version of FPGA design flow is given in the flowing diagram.
FPGA_Design_FLOW
Design Entry
There are different techniques for design entry. Schematic based, Hardware Description
Language and combination of both etc. . Selection of a method depends on the design and
designer. If the designer wants to deal more with Hardware, then Schematic entry is the better
choice. When the design is complex or the designer thinks the design in an algorithmic way
then HDL is the better choice. Language based entry is faster but lag in performance and
density.
HDLs represent a level of abstraction that can isolate the designers from the details of the
hardware implementation. Schematic based entry gives designers much more visibility into
the hardware. It is the better choice for those who are hardware oriented. Another method but
rarely used is state-machines. It is the better choice for the designers who think the design as
a series of states. But the tools for state machine entry are limited. In this documentation we
are going to deal with the HDL based design entry.
Synthesis
The process which translates VHDL or Verilog code into a device netlist formate. i.e a
complete circuit with logical elements( gates, flip flops, etc…) for the design.If the design
contains more than one sub designs, ex. to implement a processor, we need a CPU as one
design element and RAM as another and so on, then the synthesis process generates netlist
for each design element
Synthesis process will check code syntax and analyze the hierarchy of the design which
ensures that the design is optimized for the design architecture, the designer has selected. The
resulting netlist(s) is saved to an NGC( Native Generic Circuit) file (for Xilinx® Synthesis
Technology (XST)).
Xilinx_Synthesis
Implementation
Translate process combines all the input netlists and constraints to a logic design file. This
information is saved as a NGD (Native Generic Database) file. This can be done using NGD
Build program. Here, defining constraints is nothing but, assigning the ports in the design to
the physical elements (ex. pins, switches, buttons etc) of the targeted device and specifying
time requirements of the design. This information is stored in a file named UCF (User
Constraints File).
Tools used to create or modify the UCF are PACE, Constraint Editor etc.
FPGA_Transulate
FPGA Translate
Map process divides the whole circuit with logical elements into sub blocks such that they
can be fit into the FPGA logic blocks. That means map process fits the logic defined by the
NGD file into the targeted FPGA elements (Combinational Logic Blocks (CLB), Input
Output Blocks (IOB)) and generates an NCD (Native Circuit Description) file which
physically represents the design mapped to the components of FPGA. MAP program is used
for this purpose.
Place and Route PAR program is used for this process. The place and route process places
the sub blocks from the map process into logic blocks according to the constraints and
connects the logic blocks. Ex. if a sub block is placed in a logic block which is very near to
IO pin, then it may save the time but it may effect some other constraint. So trade off between
all the constraints is taken account by the place and route process
The PAR tool takes the mapped NCD file as input and produces a completely routed NCD
file as output. Output NCD file consists the routing information.
FPGA place and route
FPGA Place and route
Device Programming
Now the design must be loaded on the FPGA. But the design must be converted to a format
so that the FPGA can accept it. BITGEN program deals with the conversion. The routed
NCD file is then given to the BITGEN program to generate a bit stream (a .BIT file) which
can be used to configure the target FPGA device. This can be done using a cable. Selection of
cable depends on the design.
Design Verification
Behavioral Simulation (RTL Simulation) This is first of all simulation steps; those are
encountered throughout the hierarchy of the design flow. This simulation is performed before
synthesis process to verify RTL (behavioral) code and to confirm that the design is
functioning as intended. Behavioral simulation can be performed on either VHDL or Verilog
designs. In this process, signals and variables are observed, procedures and functions are
traced and breakpoints are set. This is a very fast simulation and so allows the designer to
change the HDL code if the required functionality is not met with in a short time period.
Since the design is not yet synthesized to gate level, timing and resource usage properties are
still unknown.
Static Timing Analysis This can be done after MAP or PAR processes Post MAP timing
report lists signal path delays of the design derived from the design logic. Post Place and
Route timing report incorporates timing delay information to provide a comprehensive timing
summary of the design.