TY - JOUR
T1 - A study on snoop cache systems for single-chip multiprocessors
AU - Terasawa, Takuya
AU - Inoue, Keisuke
AU - Kurosawa, Hitoshi
AU - Amano, Hideharu
PY - 1998/2
Y1 - 1998/2
N2 - Progress in device design and implementation technologies will change the structure of bus-connected multiprocessors and allow them to be implemented on a single chip. In such an implementation, the speed of the bus inside the chip is far faster than that of the backplane bus, and data transfer between the chip and external devices will become a bottleneck. Many studies on snoop cache protocols have been made, but they generally assume that the cache memory is on a printed circuit board. In this paper, we first classify the snoop cache protocols in terms of accesses to off-chip shared memory, which will be the principal cause of performance degradation, and then compare them quantitatively. Evaluations are made with an instruction-level multiprocessor simulator and practical parallel applications, varying the cache size or the access latency of shared memory. The results show that an I/N/C protocol that actively uses the line transfers between caches achieves the highest performance under all conditions.
AB - Progress in device design and implementation technologies will change the structure of bus-connected multiprocessors and allow them to be implemented on a single chip. In such an implementation, the speed of the bus inside the chip is far faster than that of the backplane bus, and data transfer between the chip and external devices will become a bottleneck. Many studies on snoop cache protocols have been made, but they generally assume that the cache memory is on a printed circuit board. In this paper, we first classify the snoop cache protocols in terms of accesses to off-chip shared memory, which will be the principal cause of performance degradation, and then compare them quantitatively. Evaluations are made with an instruction-level multiprocessor simulator and practical parallel applications, varying the cache size or the access latency of shared memory. The results show that an I/N/C protocol that actively uses the line transfers between caches achieves the highest performance under all conditions.
KW - Command-level simulator
KW - Single-chip microprocessor
KW - Snoop cache protocol
UR - http://www.scopus.com/inward/record.url?scp=5844313229&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=5844313229&partnerID=8YFLogxK
U2 - 10.1002/(SICI)1520-684X(199702)28:2<62::AID-SCJ7>3.0.CO;2-P
DO - 10.1002/(SICI)1520-684X(199702)28:2<62::AID-SCJ7>3.0.CO;2-P
M3 - Article
AN - SCOPUS:5844313229
VL - 28
SP - 62
EP - 72
JO - Systems and Computers in Japan
JF - Systems and Computers in Japan
SN - 0882-1666
IS - 2
ER -