Parallel Implementation of CNN on Multi-FPGA Cluster

Yasuyu Fukushima, Kensuke Iizuka, Hideharu Amano

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

We developed a PYNQ cluster called M-KUBOS that consists of economical Zynq boards that are interconnected through low-cost high-performance GTH serial links. For the software environment, we employed the PYNQ open-source software platform. The PYNQ cluster is anticipated to be a multi-access edge computing (MEC) server for 5G mobile networks. We implemented the ResNet-50 inference accelerator on the PYNQ cluster for image recognition of MEC applications. By estimating the execution time of each ResNet-50 layer, layers of ResNet-50 were divided into four boards so that the execution time of each board would be as equal as possible for efficient pipeline processing. Owing to the PYNQ cluster in which FPGAs were directly connected by high-speed serial links, stream processing without network bottlenecks and pipeline processing between boards were readily realized. The implementation achieved 292 GOPS performance, 75.1 FPS throughput, and 5.15 GOPS/W power efficiency. It achieved 17 times faster speed and 86 times more power efficiency compared to the implementation on the CPU, and 3.8 times more power efficiency compared to the implementation on the GPU.

本文言語English
ホスト出版物のタイトルProceedings - 2021 IEEE 14th International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2021
出版社Institute of Electrical and Electronics Engineers Inc.
ページ77-83
ページ数7
ISBN(電子版)9781665438605
DOI
出版ステータスPublished - 2021
イベント14th IEEE International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2021 - Singapore, Singapore
継続期間: 2021 12月 202021 12月 23

出版物シリーズ

名前Proceedings - 2021 IEEE 14th International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2021

Conference

Conference14th IEEE International Symposium on Embedded Multicore/Many-Core Systems-on-Chip, MCSoC 2021
国/地域Singapore
CitySingapore
Period21/12/2021/12/23

ASJC Scopus subject areas

  • 人工知能
  • コンピュータ サイエンスの応用
  • ハードウェアとアーキテクチャ
  • 電子工学および電気工学

フィンガープリント

「Parallel Implementation of CNN on Multi-FPGA Cluster」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル