Computer Organization and Structure

Homework #6

Due: 2004/1/13

1.      Here is a series of address references given as word addresses: 1, 4, 8, 5, 20, 17, 19, 56, 9, 11, 4, 43, 5, 6, 9, 17. Assuming a direct-mapped cache with 16 one-word blocks that is initially empty, label each reference in the list as a hit or a miss and show the final contents of the cache. Using the same series of references, show the hits and misses and final cache contents for a direct-mapped cache with four-word blocks and a total size of 16 words.

2.      Cache C1 is direct-mapped with 16 one-word blocks. Cache C2 is direct-mapped with 4 four-word blocks. Assume that the miss penalty for C1 is 8 clock cycles and the miss penalty for C2 is 11 clock cycles. Assuming that the caches are initially empty, find a reference string for which C2 has a lower miss rate but spends more cycles on cache misses than C1, and find a series of reference for which C2 has more misses than C1. Use word addresses.

3.      Here is a series of address references given as word addresses: 1, 4, 8, 5, 20, 17, 19, 56, 9, 11, 4, 43, 5, 6, 9, 17. Show the hits and misses and final cache contents for a two-way set-associative cache with one-word blocks and a total size of 16 words. Assume LRU replacement. The same questions but for a fully associative cache with one-word blocks and a total size of 16 words. The same questions but for a fully associative cache with four-word blocks and a total size of 16 words.

4.      Consider three machines with different cache configurations:

n        Cache 1: Direct-mapped with one-word blocks

n        Cache 2: Direct-mapped with four-word blocks

n        Cache 3: Two-way set associative with four-word blocks

The following miss rate measurements have been made:

n        Cache 1: Instruction miss rate is 4%; data miss rate is 8%.

n        Cache 2: Instruction miss rate is 2%; data miss rate is 5%.

n        Cache 3: Instruction miss rate is 2%; data miss rate is 4%.

For these machines, one-half of the instructions contain a data reference. Assume that the cache miss penalty is 6 + Block size in words. The CPI for this workload was measured on a machine with cache 1 and was found to be 2.0. Determine which machine spends the most cycles on cache misses. If the cycle times for the machines are 2ns for the first and second machines and 2.4ns for the third machine. Determine which machine is the fastest and which is the slowest.