TY - GEN
T1 - Implementation and evaluation of self-organizing map algorithm on a graphic processor
AU - Shitara, Akihiro
AU - Nishikawa, Yuri
AU - Yoshimi, Masato
AU - Amano, Hideharu
PY - 2009/12/1
Y1 - 2009/12/1
N2 - In this paper, we introduce an implementation of algorithm for self-organizing maps(SOM) using GPUs and discuss its evaluation. We used CUDA provided by NVIDIA Corporation for parallel programming, profiling, and data flow optimization so as to exploit inherent data-level parallelism of the algorithm. By using three NVIDIA's graphic cards for evaluation, we investigated the relationships among the number of processor elements, amount of memory device and performance. As the result of performance evaluation with various parameter combinations, we found that implementation on GTX280 achieved 150 times higher performance of Intel Core 2 Quad 2.40 GHz when parameters of map size, dimension of vectors and learning size were 1372×1372, 128 and 128, respectively.
AB - In this paper, we introduce an implementation of algorithm for self-organizing maps(SOM) using GPUs and discuss its evaluation. We used CUDA provided by NVIDIA Corporation for parallel programming, profiling, and data flow optimization so as to exploit inherent data-level parallelism of the algorithm. By using three NVIDIA's graphic cards for evaluation, we investigated the relationships among the number of processor elements, amount of memory device and performance. As the result of performance evaluation with various parameter combinations, we found that implementation on GTX280 achieved 150 times higher performance of Intel Core 2 Quad 2.40 GHz when parameters of map size, dimension of vectors and learning size were 1372×1372, 128 and 128, respectively.
KW - CUDA
KW - GPGPU
KW - GPU
KW - Self-organizing Map
UR - http://www.scopus.com/inward/record.url?scp=77952386820&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77952386820&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:77952386820
SN - 9780889868113
T3 - Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Systems
SP - 253
EP - 260
BT - Proceedings of the 21st IASTED International Conference on Parallel and Distributed Computing and Systems, PDCS 2009
T2 - 21st IASTED International Conference on Parallel and Distributed Computing and Systems, PDCS 2009
Y2 - 2 November 2009 through 4 November 2009
ER -