抄録
A building block convolutional neural network accelerator consists of a host and multiple accelerator chips which can scale the performance by changing the number of stacked chips. In order to program the host and the accelerators, an integrated programming development environment called NAMACHA is proposed. It includes compilers for convolutional neural network accelerators and a system level simulator including inter-chip communication latency. On the simulator, the total application runs 4390x faster than that of the logic level simulation with 1.27% difference of clock cycle counts. The simulation results of implementing AlexNet, SIMD instructions provided in the accelerator improved the performance by 70% on average. It demonstrates that NAMACHA can be used for architectural exploration as well as development of practical software.
本文言語 | English |
---|---|
ページ(範囲) | 81-90 |
ページ数 | 10 |
ジャーナル | International Journal of Computers and their Applications |
巻 | 24 |
号 | 2 |
出版ステータス | Published - 2017 6月 |
ASJC Scopus subject areas
- コンピュータ サイエンス(全般)