TY - GEN
T1 - Performance improvement methodology for ClearSpeed's CSX600
AU - Nishikawa, Yuri
AU - Koibuchi, Michihiro
AU - Yoshimi, Masato
AU - Miura, Kenichi
AU - Amano, Hideharu
PY - 2007/12/1
Y1 - 2007/12/1
N2 - This paper focuses on the performance of the network-on-a-chip (NoC) and I/O of ClearSpeed's CSX600 coprocessor, which has 96 multithreaded processing elements. Two versions of the Himeno Benchmark were implemented on the CSX600 to evaluate its performance under frequent memory transfers between shared and local memories, or between local memories. To use the NoC bandwidth efficiently, the dataflow was customized to the one-dimensional array structure of the CSX600's NoC. The results of evaluation and profiling indicate that the performance was lower than 1/50 of the sustained performance. We identify three key points for improving performance in such cases: 1) exploiting the bandwidth between mono and poly memory, 2) further program tuning, and 3) architectural reform.
AB - This paper focuses on the performance of the network-on-a-chip (NoC) and I/O of ClearSpeed's CSX600 coprocessor, which has 96 multithreaded processing elements. Two versions of the Himeno Benchmark were implemented on the CSX600 to evaluate its performance under frequent memory transfers between shared and local memories, or between local memories. To use the NoC bandwidth efficiently, the dataflow was customized to the one-dimensional array structure of the CSX600's NoC. The results of evaluation and profiling indicate that the performance was lower than 1/50 of the sustained performance. We identify three key points for improving performance in such cases: 1) exploiting the bandwidth between mono and poly memory, 2) further program tuning, and 3) architectural reform.
UR - http://www.scopus.com/inward/record.url?scp=47249164386&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=47249164386&partnerID=8YFLogxK
U2 - 10.1109/ICPP.2007.66
DO - 10.1109/ICPP.2007.66
M3 - Conference contribution
AN - SCOPUS:47249164386
SN - 076952933X
SN - 9780769529332
T3 - Proceedings of the International Conference on Parallel Processing
BT - 2007 International Conference on Parallel Processing, ICPP
T2 - 36th International Conference on Parallel Processing in Xi'an, ICPP
Y2 - 10 September 2007 through 14 September 2007
ER -