Acceleration of deep recurrent neural networks with an FPGA cluster

Yuxi Sun, Akram Ben Ahmed, Hideharu Amano

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Citations (Scopus)

Abstract

In this paper, we propose an acceleration methodology for deep recurrent neural networks (RNNs) implemented on a multi-FPGA platform called Flow-in-Cloud (FiC). RNNs have been proven effective for modeling temporal sequences, such as human speech and written text. However, the implementation of RNNs on traditional hardware is inefficient due to their long-range dependence and irregular computation patterns. This inefficiency manifests itself in the proportional increase of run time with respect to the number of layers of deep RNNs when running on traditional hardware platforms such as CPUs. Previous works have mostly focused on the optimization of a single RNN cell. In this work, we take advantage of the multi-FPGA system to demonstrate that we can reduce the run time of deep RNNs from O(k) to O(1), where k is the number of layers.

Original language: English
Title of host publication: Proceedings of the 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2019
Publisher: Association for Computing Machinery
ISBN (Electronic): 9781450372558
Publication status: Published - 2019 Jun 6
Event: 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2019 - Nagasaki, Japan
Duration: 2019 Jun 6 → 2019 Jun 7

Publication series

Name: ACM International Conference Proceeding Series

Conference

Conference: 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2019
Country/Territory: Japan
City: Nagasaki
Period: 19/6/6 → 19/6/7

Keywords

  • Acceleration
  • FPGAs
  • Recurrent Neural Networks

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications
