TY - GEN
T1 - A new memory module for memory intensive applications
AU - Tanabe, Noboru
AU - Hakozaki, Hirotaka
AU - Nakatake, Masasige
AU - Dohi, Yasunori
AU - Nakajo, Hironori
AU - Amano, Hideharu
PY - 2004/12/1
Y1 - 2004/12/1
N2 - Some applications with gather / scatter operations are difficult to accelerate. These operations cause inefficient cache use in each processor and fine grain global communications in parallel systems. There are several applications with such characteristics particularly in electrical engineering. For examples, circuit simulation and power flow simulation with LU decomposition for random sparse matrix has such characteristics. This paper presents how to make inexpensive personal supercomputers to solve these problems. In order to get the merit of commercial-off-the-shelf (COTS) continuously after the death of vector supercomputer vendors, it is designed without any modification on CPU, bridge chips on motherboard and memory chips. Only plugging a new memory module with vector load / store function and communication functions make an inexpensive home-use personal computer into a node similar to Earth simulator's one. Applications with unit striding or indexed accesses are going to be accelerated. How to accelerate NAS CG is shown as an example.
AB - Some applications with gather / scatter operations are difficult to accelerate. These operations cause inefficient cache use in each processor and fine grain global communications in parallel systems. There are several applications with such characteristics particularly in electrical engineering. For examples, circuit simulation and power flow simulation with LU decomposition for random sparse matrix has such characteristics. This paper presents how to make inexpensive personal supercomputers to solve these problems. In order to get the merit of commercial-off-the-shelf (COTS) continuously after the death of vector supercomputer vendors, it is designed without any modification on CPU, bridge chips on motherboard and memory chips. Only plugging a new memory module with vector load / store function and communication functions make an inexpensive home-use personal computer into a node similar to Earth simulator's one. Applications with unit striding or indexed accesses are going to be accelerated. How to accelerate NAS CG is shown as an example.
UR - http://www.scopus.com/inward/record.url?scp=13944277269&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=13944277269&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:13944277269
SN - 0769520804
SN - 9780769520803
T3 - International Conference on Parallel Computing in Electrical Engineering: Workshop on System Design Automation, SDA, PARELEC 2004
SP - 123
EP - 128
BT - International Conference on Parallel Computing in Electrical Engineering
T2 - International Conference on Parallel Computing in Electrical Engineering: Workshop on System Design Automation, SDA, PARELEC 2004
Y2 - 7 September 2004 through 10 September 2004
ER -