memory hierarchy and

virtual memory

Since 1980, CPU has outpaced DRAM

what is memory hierarchy?

In computer architecture the memory hierarchy is a concept used for storing & discussing performance issues in computer architectural design, algorithm predictions, and the lower level programming constructs such as involving locality of reference.

temporal (time)

Recently accessed items will be accessed in the near future.

ex. code in loops, top of stack

spatial (space)

Items at addresses close to the addresses of recently accessed items will be accessed in the near future (sequential code, elements of arrays)

principle of locality

Programs tend to use data and instructions with addresses near or equal to those they have used recently

memory hierarcyhy

CACHE MEMORY

Cache memory is memory actually embedded inside the CPU. Cache memory is very fast, typically taking only once cycle to access, but since it is embedded directly into the CPU there is a limit to how big it can be. In fact, there are several sub-levels of cache memory (termed L1, L2, L3) all with slightly increasing speeds.

CACHE FUNDAMENTALS

hit	data found in cache
hit time	time to access the higher cache
hit rate	% of time data is found in the higher cache

miss	data not found in cache
miss penaly	time to move data from lower to upper level, then to cpu
miss rate	1 - (hit rate)

types of cache misses (Three c's)

Compulsory	The first reference to a block of memory, starting with an empty cache.
Capacity	The cache is not big enough to hold every block you want to use.
Conflict	Two blocks are mapped to the same location and there is not enough room to hold both.

CACHE block placement

(Where can a block be placed in a cache?)

CACHE block Identification

(How is a block found if it is in the cache?)

Tag	used to check all the blocks in the set
Index	used to select the set
Block Offset	address of the desired data within the block

CACHE block REPLACEMENT

(Which block should be replaced on a miss?)

Direct Mapped	Easy (only one choice)
Set Associative or Fully Associative	Random LRU (Least Recently Used) FIFO (First-In First-Out)

CACHE WRITE STRATEGY

(What happens on a write?)

	Write-Back	Write-Through
Policy	Data written to cache also written to lower level memory	Write data only to cache Update lower level when a block falls out of the cache
Debug	Easy	Hard
Do read misses produces writes?	No	Yes
Do repeated writes make it to lower level?	Yes	No

how to improve cache performance?

Reduce the miss rate
Reduce the miss penalty
Reduce the hit time

cache performance optimizations

MAIN MEMORY

All instructions and storage addresses for the processor must come from RAM. Although RAM is very fast, there is still some significant time taken for the CPU to access it (this is termed latency). RAM is stored in separate, dedicated chips attached to the motherboard, meaning it is much larger than cache memory.

SRAM (used by cache)

static RAM

1-5 ns access time

6 transistors/bit

no periodic refresh

expensive

DRAM

dynamic RAM

40-60 ns

1 transistor/bit

need periodic refresh

cheaper

ex. PC2100 comes from 133 MHz × 2 × 8 bytes or 2100 MB/sec

frequency

cycles/second (ex. 1000MHz/s)

T = 1/f

= 1/1000MHz

= 0.1 ns (clock period)

latency

timing of the RAM

delay in number of clock cycles

CAS (Column Address Strobe) Latency or CL (ex. CL20)

Response Time = T * CL

= 0.1 ns * 10

= 10 ns

Response Time = T * CL

= 0.5 ns * 20

= 10 ns

DISC MEMORY

We are all familiar with software arriving on a floppy disk or CDROM, and saving our files to the hard disk. We are also familiar with the long time a program can take to load from the hard disk -- having physical mechanisms such as spinning disks and moving heads means disks are the slowest form of storage. But they are also by far the largest form of storage.

FLASH MEMORY

Expensive to manufacture

Not yet fully available as of 2015Source: http://theydiffer.com/difference-between-flash-memory-and-hard-drive/

Fast operating speed

Lasts a long time

Small in size

HARD DRIVE

Relatively cheap

Abundantly available

Slow operating speed

Short lifespan

Large in size

VIRTUAL MEMORY

Virtual Memory gives the illusion of unlimited memory.

Virtual memory is a feature of an operating system (OS) that allows a computer to compensate for shortages of physical memory by temporarily transferring pages of data from random access memory (RAM) to disk storage.

Text

Virtual memory takes the program address and maps it to an address space

Maps some of the program's address space to the disk
When we need it, we bring it into memory

Solving not enough memory

How do we use the holes left when programs quit?
We can map a program's addresses to RAM addresses however we like.

Solving Holes in the address space

Program 1's and Program 2's addresses map to different RAM addresses
Because each program has its own address space, they cannot access each other's data: security and reliability!

keeping programs secure

How virtual memory works?

conclusion

The speed gap between CPU, memory and mass storage continues to widen. Well written programs exhibit a property called locality. Memory hierarchies based on caching close the gap by exploiting locality.

Memory Hierarchy and Virtual Memory

By Mylene King

Memory Hierarchy and Virtual Memory

COE218 Advanced Computer Systems Architecture

2,046

memory hierarchy and

virtual memory

Since 1980, CPU has outpaced DRAM

what is memory hierarchy?

temporal (time)

spatial (space)

principle of locality

memory hierarcyhy

memory hierarcyhy

CACHE MEMORY

CACHE FUNDAMENTALS

types of cache misses (Three c's)

CACHE block placement

CACHE block Identification

CACHE block REPLACEMENT

CACHE WRITE STRATEGY

how to improve cache performance?

cache performance optimizations

MAIN MEMORY

SRAM (used by cache)

DRAM

frequency

latency

DISC MEMORY

FLASH MEMORY

HARD DRIVE

VIRTUAL MEMORY

Solving not enough memory

Solving Holes in the address space

keeping programs secure

How virtual memory works?

conclusion

Memory Hierarchy and Virtual Memory

More from Mylene King