DRAM Technology
DRAM Technology
OVERVIEW
DRAM (Dynamic Random Access Memory) is the main memory used for all desktop and larger
computers. Each elementary DRAM cell is made up of a single MOS transistor and a storage
capacitor (Figure 7-1). Each storage cell contains one bit of information. This charge, however,
leaks off the capacitor due to the sub-threshold current of the cell transistor. Therefore, the charge
must be refreshed several times each second.
Bit Line
Capacitor
Plate
The memory cell is written to by placing a “1” or “0” charge into the capacitor cell. This is done
during a write cycle by opening the cell transistor (gate to power supply or VCC) and presenting
either VCC or 0V (ground) at the capacitor. The word line (gate of the transistor) is then held at
ground to isolate the capacitor charge. This capacitor will be accessed for either a new write, a
read, or a refresh.
Figure 7-2 shows a simplified DRAM diagram. The gates of the memory cells are tied to the
rows. The read (or write) of a DRAM is done in two main steps as illustrated in Figure 7-3. The
row (X) and column (Y) addresses are presented on the same pads and multiplexed. The first
step consists of validating the row addresses and the second step consists of validating the
column addresses.
Address X
Row Decode
Pads
CAS
RAS
Y
Data
Sense
Data Amplifier
Data
Sense
Data Amplifier
Y Decode
Output
Dout
Buffer Data
Sense
Data Amplifier
Input
Din
Buffer
Data
Sense
Data Amplifier
RAS
CAS
Step I Step II
Row addresses are present on address pads and are internally validated by the RAS (Row Address
Access) clock. A bar on top of the signal name means this signal is active when it is at a low level.
The X addresses select one row through the row decode, while all the other non-selected rows
remain at 0V. Each cell of the selected row is tied to a sense amplifier. A sense amplifier is a cir-
cuit that is able to recognize if a charge has been loaded into the capacitor of the memory cell, and
to translate this charge or lack of charge into a 1 or 0. There are as many sense amplifiers as there
are cells on a row. Each sense amplifier is connected to a column (Y address). In this first step all
the cells of the entire row are read by the sense amplifier. This step is long and critical because the
row has a high time constant due to the fact that it is formed by the gates of the memory cells.
Also, the sense amplifier has to read a very weak charge (approximately 30 femtoFarads or 30fF).
Following the first step, column addresses are present on the address pads and are internally val-
idated by the Column Address Access (CAS) clock. Each selected memory cell has its data vali-
dated in a sense amplifier. Column access is fast. This step consists of transferring data present
in the sense amplifier to the Dout pin through the column decode and the output buffer. On
memory data sheets, the access time from RAS is termed tRAC and the access time from CAS is
listed as tCAC. On a typical standard DRAM of 60ns access time, tRAC = 60ns and tCAC = 15ns.
Refresh
To maintain data integrity, it is necessary to refresh each DRAM memory cell. Each row of cells
is refreshed every cycle. For example, if the product specification states, “Refresh cycle = 512
cycles per 8ms,” then there are 512 rows and each individual row must be refreshed every eight
milliseconds.
As explained above, during the row access step, all the cells from the same row are read by the
sense amplifier. The sense amplifier has two roles. Since it holds information within the cell, it is
able to transmit this data to the output buffer if it is selected by the column address. The sense
amplifier is also able to re-transmit (write) the information into the memory cell. In this case, it
“refreshes” the memory cell. When one row is selected, all the cells of that row are read by the
sense amplifiers and all these cells are refreshed one at a time.
Burst or distributed refresh methods can be used. Burst refresh is done by performing a series of
refresh cycles until all rows have been accessed. For the example given above, this is done every
8ms. During the refresh, other commands are not allowed. Using the distributed method and
the above example, a refresh is done every 12.6µs (8ms divided by 512). Figure 7-4 shows these
two modes.
Distributed
Refresh
Burst
Refresh
Time
Required Time To
Complete Refresh Of All Rows
Each Pulse Represents
a Refresh Cycle
Source: Micron, "Memory 1997" 20843
For standard DRAMs there are three ways to perform refresh cycles. They are RAS-only refresh,
CAS-before-RAS refresh, and hidden refresh. To perform a RAS-only refresh, a row address is put
on the address lines and then RAS goes low. To perform a CAS-before-RAS refresh, CAS first goes
low and then a refresh cycle is performed each time RAS goes low. To perform a hidden refresh
the user does a read or write cycle and then brings RAS high and then low.
MEMORY CELL
A great deal of design effort has been made to shrink the cell area, particularly, the size of the
DRAM capacitor. As memory density increases, the cell size must decrease. Designers have man-
aged to shrink overall cell size. However, due to factors such as noise sensitivity and speed, it has
been a challenge to reduce the capacitance. The capacitance must stay in the range of 30fF.
The charge (Q) stored in a capacitor is equal to capacitance times voltage (Q = C x V). Over the
years, DRAM operating voltage has decreased (i.e., 12V to 5V to 3.3V). As voltage decreases, the
stored charge will also decrease. Design improvements allow for the decrease in the cell charge
as long as the capacitance remains in the range of 30fF.
Two main developments are used to reduce capacitor area without reducing its value. These are
the use of new capacitor shapes to fit into a minimum chip surface area and increasing the
dielectric constant.
The 1Mbit DRAM generation was the first to abandon the classical planar capacitor and replace it
with a trench or a stacked capacitor. Figure 7-5 shows the feature sizes of some of the DRAM
devices that ICE analyzed in its laboratory these two last years. Trench capacitors are not widely
used in spite of continual research and development on that type of design. As shown, the major
64Mbit DRAMs available on the market are today made of stacked capacitors.
Cross sections of 64Mbit DRAMs analyzed by ICE’s laboratory illustrate three major choices for
manufacturing DRAM memory cells. Hitachi uses a stacked, multi-layer capacitor for its 64Mbit
DRAM (Figure 7-6). The trench capacitor (Figure 7-7) is used by IBM/Siemens, and the simple
stack capacitor (Figure 7-8, and 7-9) is preferred by Samsung and NEC.
Figure 7-10 shows how size cell improvements will be necessary for the next DRAM generations.
Figure 7-11 illustrates the stacked capacitor structure evolution. The decrease of cell size without
decreasing capacitor value results in increasing complexity of memory cell technology. Most lead-
ing DRAM manufacturers are working on 1Gbit cells. Their goal is to decrease the size of the cell
without compromising the value of the capacitor.
Two types of 1Gbit cell developments are shown in the next figures. Toshiba tried to improve the
trench capacitor concept by creating a bottle-shape trench design (Figure 7-12). Hitachi tried to
improve the stack concept with a vertical and circular capacitor (Figure 7-13). NEC received a
patent from the U.S. Patent Office for its proprietary HSG (hemispherical-grain) silicon technology.
METAL 3 METAL 2
METAL 1
POLY 1 TRENCH
CAPACITORS
POLY 4
CAPACITOR NITRIDE
PLATE ETCH-STOP
LAYER
POLY 2 BIT
CONTACT PAD
POLY 3
CAPACITOR
PLATE
POLY 3
“STEM”
103 10
Chip Area
102 1.0
Chip Area (mm2), Cell Area (µm2)
Minimum
101 0.1
10-1
256K 1M 4M 16M 64M 256M 1G 4G
DRAM Generation (bits)
Source: Hitachi/ICE, "Memory 1997" 20775A
CAPACITOR DIELECTRICS
The inability to scale the capacitor value has led to the consideration of new dielectric materials for
the capacitor. It is likely that materials with higher dielectrics will see more use. Many of the mate-
rials have proven track records through their use as dielectrics in discrete capacitors. Therefore, the
main challenge is the introduction of these materials into the IC process. Figure 7-14 shows some
of the materials under consideration. Tantalum Pentoxide (Ta2O5) has been viewed in several of
the 64Mbit DRAMs analyzed by the ICE laboratory. Ta2O5 seems that it could serve as a good
dielectric since it can easily be integrated into conventional stack capacitor structure. However, it
requires higher dielectric thickness, resulting in only a marginal improvement in capacitance.
PERFORMANCE
Compared with other memory ICs, DRAMs suffer from a speed problem. The on-chip circuitry
required to read the data from each cell is inherently slow. As such, DRAM speeds have not kept
pace with the increased clock speed of CPUs.
,,,,,
,,,,,
,,,,,
,,,,,,,,
,,,,,,,,
,,,,,,,,
Planar
,,,,,,,,
,,,,,,,,
HSG Crown
,,,,,,,,
Cup
,,,,,,,,
,,,,,,,,
,,,,,,,,
,,,,,,,,
,,,,,,,,
,,,,,,,, Multilayer Fin
Simple Stack
,,,,,,,,
,,,,,,,, ,,,,,,,,
,,,,,,,,
,,,,,,,, ,,,,,,,,
Simple Crown Multilayer Crown
Source: IBM/Semiconductor International/ICE, "Memory 1997" 22435
Poly-Si Electrode
Ta2O5
,,,
,,,, ,,,,,
Bit Line
,, ,,
,,,,
,,,
,,,,
,,, ,,,
,,,,,
,,,
,,,,,
,,,
Poly-Si Plug
,,,,,,,,,,,,,,,,,
,,,,,,,,,,,,,,,,,
,,,,,,,,,,,,,,,,,
LOCOS
Si Substrate
Material εr
SiO2 3.8
SiN4 7.9
Ta2O5 20 - 30
BaTiO3 15 - 30
SrTiO3 15 - 30
TiO2 35 - 50
Polyimide 2 - 2.6
Source: ICE, "Memory 1997" 18741
Early microprocessor systems were introduced with clock speeds of 1MHz (one million cycles
per second). Today, CPUs in desktop PCs are 200MHz and faster, at least a 200x improvement.
Early DRAMs had access cycle times (the time required for the DRAM chip to supply the data
back to the CPU) of 250 nanoseconds (ns). The fastest DRAM units today are about 50ns, a five-
fold improvement.
To face this speed discrepancy, DRAMs have branched into many sub-categories. Each features a
variation of system interface circuitry with the intent of enhancing performance. Furthermore, each
design attempts to answer needs of specific applications. Current offerings are shown Figure 7-15.
DRAM
VRAM (Video RAM)
Video
WRAM (Window RAM)
RDRAM (Rambus)
SLDRAM (SyncLink)
Other Configurations 3D
nDRAM (Next Generation)
MDRAM (Multi Bank)
To access DRAM data, a row address is applied, followed by a column address. The addresses of
a DRAM are multiplexed on the same package pins. When requested data is stored in the same
row as previously requested data, merely changing the column address allows access to this new
data. Therefore, with fast page mode, the data of the same row can be accessed by changing only
the column address.
As an example, consider a 1Mbit DRAM that has an array organization of 1,024 x 1,024 bits. 1,024
bits of data belong to the same row. This data will be accessible through fast page mode (a row is
considered a page). This mode is available on all the standard DRAMs. As described previously,
this access time is as fast as the data available in the sense amplifier.
Static column mode is similar to page mode except that only the column address needs to be
changed to obtain the new data, and no CAS pulse is needed. Nibble mode groups memory cells
in “four bits per nibble” so that whenever one bit is selected, four serial bits appear. This mode is
no longer widely used.
Extended data out DRAMs (EDO DRAMs), also called hyper-page-mode DRAMs, represent a
small design change in the output buffer relative to a standard fast page mode DRAM. The old
data is latched at the output while new data is addressed. EDO shortens the effective page mode
cycle time as the valid data output time is extended.
For the same technology, a product in standard mode may have a cycle time of 110ns (access time
of 60ns). This cycle time will be reduced to 40ns in fast page mode and to 25ns in EDO mode.
Several vendors offer burst versions of the EDO DRAM. The “burst” refers to the fact that all read
and write cycles occur in bursts of four, automatically sequenced by the memory chip. To accom-
plish that, a special pipeline stage and a two-bit counter are added.
The Burst EDO DRAM read access time differs from EDO DRAM in two ways. First, as the output
latch is replaced by a special pipeline stage (register), the latency will be higher but the bandwidth
will be better. Second, as the Burst EDO DRAM includes an internal address counter, only the ini-
tial address in a burst of four needs to be provided.
Manufacturers may use the same mask set for their fast page mode EDO and burst EDO DRAMs.
A wire bond option determines which product is in the final package. Figure 7-16 shows timing
differences between the various standard DRAMs. Figure 7-17 shows speed differences.
Audio DRAMs
Due to process defects during wafer probe, some DRAMs wind up with a few failed storage cells
that cannot be effectively repaired (or replaced) by the redundancy mechanism. These cannot
be sold as good devices to PC OEMs. However, a few defective bits do not affect an audio appli-
cation where there is a tolerance for error. For this reason manufacturers may sell parts with few
bad cells for use in audio applications. These devices are called Audio DRAM (ARAM). The
client can buy ARAMs cheaper than a standard DRAM.
RAS RAS
CAS CAS
Address ,,,
,,,R C ,,,,,,
,,,,,,
Address ,,
,, R C C C C C,,,,
,,,,
Dout Dout 1 2 3 4 n
RAS RAS
CAS CAS
Address
,,
,, R C,,,,,
,,,,,,,,,,
,,,,,
C C
,,,,,
,,,,, Address ,,
,,R C
,,
,, C ,,
,, C,,,
,,,C ,,,
,,,
Dout 1 2
Dout 1 2 3
FAST PAGE MODE
EDO
RAS RAS
CAS CAS
Address ,, ,,,,,,,,,,,,,,,
,, R C,,,,,,,,,,,,,,, Address ,, ,,,,,,,,,,,,,,,
,, R C,,,,,,,,,,,,,,,
Dout 1 2 3 4 Dout 1 2 3 4
NIBBLE MODE BURST EDO
R: Row Address
C: Column Address
Source: ICE, "Memory 1997" 22436
DRAM Type
DRAM Speed
(tRAC) Fast Page Burst
EDO
Mode EDO
Cache DRAMs
Another DRAM alternative is the cache DRAM (CDRAM) developed by Mitsubishi. This device
integrates a 16Mbit DRAM and a L2 (level two) SRAM cache memory (16Kbit SRAM) on the same
chip. The transfer between the DRAM and the SRAM is performed in one clock cycle through a
buffer of 8 x 16 bits. The SRAM is a six-transistor cell. The SRAM access/cycle time is 15ns.
Currently, Mitsubishi and Samsung offer CDRAM devices. Figure 7-18 shows the chip organiza-
tion of a CDRAM.
SRAM Address
4K x 4 Fast SRAM Cache
Address Latch
Clock
Cache Hit
DRAM
CS, Write Enable 64-Bit Buffer 64-Bit Buffer Data
Control
Control
Refresh
Enhanced DRAMs
Enhanced DRAMs (EDRAMs) were developed by Ramtron Corporation. The EDRAM is also sold
by IBM (3.3V and 5V parts), as IBM and Ramtron have a second source agreement for this prod-
uct. The EDRAM architecture is similar to a standard 4Mbit page mode or static column DRAM
with the addition of an integrated L2 SRAM cache and internal control. Technically, the EDRAM
is a cache DRAM (CDRAM). Rather than integrate a separate SRAM cache, the EDRAM takes
advantage of the internal architecture of a standard fast page mode DRAM, which has sense
amplifiers that act like a 2Kbit SRAM cache when reading and accessing data.
Memory read cycles always occur from the cache. When the comparator detects a hit, only the
SRAM is addressed where data is available in 15ns. When a miss is detected, the entire cache is
updated and data is available at the output within a single 35ns access. Figure 7-19 shows the chip
organization of an EDRAM.
Synchronous DRAMs
Synchronous DRAMs represent the next step in the evolution of the industry standard DRAM
architecture. Synchronous DRAMs (SDRAMs) have their read and write cycles synchronized
with the processor clock. The SDRAM is designed with two separate banks. These two indepen-
dent banks allow each bank to have different rows active at the same time. This allows concur-
rent access/refresh and recharge operations. Figure 7-20 presents a block diagram organization
of an SDRAM.
DQMU
DQML
CAS#
RAS#
WE#
CKE
CLK
CS#
Address Bus
Control Logic and A8 A9-A0
Timing Generator
MUX
Sense Amps
Columns
Columns
Decoder
Decoder
Column
Column
256
256
CAS #
Output Latch Input Latch
Latency Control /Buffer
Output Latch
Output Buffer
Data Bus
Source: Hitachi/ICE, "Memory 1997" 22437
The SDRAM is programmed using a mode register. The programmable features include burst
length (1, 2, 4, 8, full page), wrap sequence (sequential/interleave), and CAS latency (1, 2, 3).
Figure 7-21 shows an SDRAM timing sequence. This timing illustrates the different possibilities
of CAS latency and of burst length.
CLK
tRCD
Active Read
Command
Row Column
Address
CL = 1 Out 0 Out 2
Out 1 Out 3
CL = 2 Out 1 Out 3
Dout
Out 0 Out 2
CL = 3 Out 0 Out 2
Out 1 Out 3
CL: CAS Latency
Burst Length = 4
BURST LENGTH
CLK
tRCD
Active Read
Command
Row Column
Address
BL = 1 Out 0
BL = 2 Out 0
Out 1
BL = 4 Out 0 Out 2
Dout
Out 1 Out 3
BL = 8 Out 0 Out 2 Out 4 Out 6
Out 1 Out 3 Out 5 Out 7
BL = Full Page Out 0 Out 2 Out 4 Out 6 Out 8 Out 256 Out 1
Out 1 Out 3 Out 5 Out 7 Out 0
BL: Burst Length
CAS: Latency = 2
Source: Hitachi/ICE, "Memory 1997" 22439
The size of the mode register is equivalent to the number of address pins on the device and is writ-
ten during a mode register set cycle. This mode register must be reprogrammed each time any of
the programmable features have to be modified. Figure 7-22 illustrates the content of a register
for a 4Mbit SDRAM. Figure 7-23 shows a summary of the SDRAM functionality.
The purpose of the DDR DRAM is to read data of an SDRAM at two times the frequency clock. The
device delivers data on both edges of the clock, doubling effective bandwidth at a given frequency.
A9 A8 A7 A6 A5 A4 A3 A2 A1 A0
OPCODE 0 LMODE BT BL
CLK: External Clock Input and Synchronous DRAM Operations Synchronize with this Signal
CLK
Command:
Command Controls Synchronous DRAM Operation
Active Read
(CS,RAS,CAS,WE) Command is a Combination of CS,
RAS, CAS, and WE Signals.
Address
(A0 to A8) Row Column
Bank Select:
Bank Select A Memory Contains 2 Banks (Areas)
BS BS that can be Controlled Independently.
(A9)
BANK SELECT is used to Select a Bank.