Cache Fundamentals
Cache memory is characterized by three parameters:
1. Associativity: Decides the cache locations where a block may be placed. There are three types of
block placement policies:
   1. Direct Mapped
   2. Fully Associative
   3. Set Associative
2. Block Size: The amount of data read/written in each cache operation. It is also known as the
cache line size.
3. Capacity: The total size of the cache.
How is a Block found in the Cache?
Apartment analogy of a cache:
(diagram omitted: the cache drawn as a grid of 2^s sets, each set holding A blocks of B bytes each)
Total cache capacity: C = S · A · B
where S = 2^s = number of sets
A = associativity of each set
B = block size
|          Block Address          | Block Offset |
|      Tag      |      Index      |
• Each memory address referenced by a CPU instruction is treated by the cache management unit as
consisting of two parts – the Block Address and the Block Offset.
• The block address component is further divided into two parts:
• The Index identifies the cache set to which the generated address maps.
• After the cache set is identified, the Tag field is used to distinguish between all possible
memory addresses that map to the same cache set.
• After the cached block is identified, the block offset identifies the offset within the cached
block where the referenced data may be found.
3C model: Cache misses
• Cold misses: Compulsory misses that occur the first time a memory block is accessed.
• Capacity misses: Occur when the amount of data accessed is greater than the actual cache
capacity; they occur even when a fully associative cache is in use.
• Conflict misses: Occur when the set to which the referenced address resolves is full.
4th C: Coherency misses, which occur in parallel architectures.
Example: MVM (matrix-vector multiplication):
y ← A(m×n) · x(n)
code:
do i = 0 to m-1,
    y[i] := 0
    do j = 0 to n-1,
        y[i] := y[i] + A[i,j] * x[j];
    od
od
Cold misses: Occur when A, x, and y are first accessed.
# of cold misses = (e/B) · (m + n + m·n)
where e = size of each element of the arrays A, x, and y,
B = block size.
Capacity misses:
None for small problem sizes.
The cold-miss count is fixed by the formula above; conflict misses increase with problem size (Why?).
Conflict Misses (Pathological Case):
Consider a language like FORTRAN, which stores matrices in column-major order. If the code
depicted earlier is executed as a FORTRAN program, each access to an element of A brings a
cache-block-sized chunk of data into the cache. Since the elements of A are stored in column-major
order, this means some portion of a column of A is brought into the cache. Now consider a
pathological case in which all elements in a row of matrix A map to a single cache set. In this case,
if the elements of A are accessed in row-major order, as done by the code above, each successive read
of A will evict a cache entry from that set.
In such a case, we can use the loop interchange transformation to avoid conflict misses.
{Assume all y[i]'s are 0}
do j = 0 to n-1,
    do i = 0 to m-1,
        y[i] := y[i] + A[i,j] * x[j];
    od
od
What kind of locality is exploited by the above code?
A: spatial, x: temporal, y: spatial
Can we do better than this?
− Blocked implementation of the algorithm.
Tradeoffs:
Principle: When any data block is brought into the cache, it should be used as much as possible
before it gets evicted.
New code (strip-mining technique; assume row-major order of storage):
do i = 0, m-1, B
    do j = 0 to n-1
        do k = i, min(i+B-1, m-1)
            y[k] = y[k] + A[k,j] * x[j];
        od
    od
od
In the new code, the core compute kernel performs an MVM of problem size B. B is chosen so that the
corresponding sub-arrays of A, x, and y all fit into the cache. The accesses to x can be blocked in a
similar fashion (not shown here). Such a computation, regardless of the problem size, suffers only
from cold misses. (This is also known as blocked matrix-vector multiply.)
Address Translation in Virtual Memory:
Virtual memory is a technique that gives an application program the impression that it has
contiguous working memory, while in fact the memory is physically fragmented and may even overflow
onto disk storage.
Page:
A page is a block of contiguous virtual memory addresses.
Page Tables:
Almost all implementations use page tables to translate the virtual addresses seen by the application
program into the physical addresses (also referred to as "real addresses") used by the hardware to
process instructions. Each entry in a page table contains the starting virtual address of the page and
either the real memory address at which the page is actually stored or an indicator that the page is
currently held in a disk file (if the system uses disk files to let applications use amounts of
virtual memory that exceed real memory).
All addresses in a program are virtual addresses.
Translation Lookaside Buffer (TLB):
A Translation Lookaside Buffer (TLB) is a CPU cache used by the memory management hardware to improve
the speed of virtual address translation. Recently used translations are stored in this cache, so that
future accesses to the same addresses can be resolved without going through a page table lookup, which
is expensive.
Why is the TLB important?
A miss in the TLB is very expensive (~100 to ~2000 cycles); even a 1% TLB miss rate really hurts.
Remember: not only data locality but also locality of referenced addresses is very important.