Abstract
Cost and energy efficient supercomputers have received attention not only for scientific computation but for big data processing. In the fields of social networks and biology, the relationship between data is often represented by large target graphs that require huge computation costs to analyze. A new parallel BFS method called degree-chain traversal (DC) is proposed and implemented on the energy efficient compact supercomputer Suiren. In DC, by treating vertices that have the same parents as a form of 'chain', both the communication amount and the number of memory accesses are reduced. Evaluation results show that the total amount of computation was reduced by 30%, and the execution time was shortened by 14%, when tasks are executed with four processes. We also tried to accelerate the execution with PEZY-SC, an MIMD accelerator attached to Suiren. However, the average execution time was not improved because of the large variation in the execution time depending on the root node. Through the analysis, an unbalanced task assignment and a bottleneck of the memory were pointed out. However, this bottleneck is eased by using new PEZY-SC2 which has wider memory bandwidth.
Original language | English |
---|---|
Title of host publication | Proceedings - 2016 4th International Symposium on Computing and Networking, CANDAR 2016 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 395-401 |
Number of pages | 7 |
ISBN (Electronic) | 9781509026555 |
DOIs | |
Publication status | Published - 2017 Jan 13 |
Event | 4th International Symposium on Computing and Networking, CANDAR 2016 - Hiroshima, Japan Duration: 2016 Nov 22 → 2016 Nov 25 |
Other
Other | 4th International Symposium on Computing and Networking, CANDAR 2016 |
---|---|
Country/Territory | Japan |
City | Hiroshima |
Period | 16/11/22 → 16/11/25 |
ASJC Scopus subject areas
- Computer Science Applications
- Hardware and Architecture
- Signal Processing
- Computer Networks and Communications