TY - GEN
T1 - A high speed design and implementation of dynamically reconfigurable processor using 28NM SOI technology
AU - Katagiri, Toru
AU - Amano, Hideharu
N1 - Publisher Copyright:
© 2014 Technical University of Munich (TUM).
PY - 2014/10/16
Y1 - 2014/10/16
N2 - Although dynamically reconfigurable processor arrays (DRPAs) are advantageous for embedded devices because of their high energy efficiency, many of the recent mobile devices are required to execute increasingly performance-centric jobs. One fairly straingtfoward way of increasing the clock frequency is introducing a pipelined structure into each PE. However, this results in frequent pipeline stalls due to the data hazard between multiple PEs. In order to mitigate the effect of data hazard between PEs, we propose a tiny vector instruction mechanism. With a single vector instruction, a small amount of data is continuously processed in the pipeline of the PE. Pipeline stalls are removed without increasing the number of hardware contexts, and thus the amount of configuration data. Evaluation results based on the implementation using 28nm SOI process technology, a DRPA with tiny vector instructions (DRPA-TVI) improves the performance by 2.4 three times compared to a base DRPA with just a small increase of area and power consumption.
AB - Although dynamically reconfigurable processor arrays (DRPAs) are advantageous for embedded devices because of their high energy efficiency, many of the recent mobile devices are required to execute increasingly performance-centric jobs. One fairly straingtfoward way of increasing the clock frequency is introducing a pipelined structure into each PE. However, this results in frequent pipeline stalls due to the data hazard between multiple PEs. In order to mitigate the effect of data hazard between PEs, we propose a tiny vector instruction mechanism. With a single vector instruction, a small amount of data is continuously processed in the pipeline of the PE. Pipeline stalls are removed without increasing the number of hardware contexts, and thus the amount of configuration data. Evaluation results based on the implementation using 28nm SOI process technology, a DRPA with tiny vector instructions (DRPA-TVI) improves the performance by 2.4 three times compared to a base DRPA with just a small increase of area and power consumption.
UR - http://www.scopus.com/inward/record.url?scp=84911091867&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84911091867&partnerID=8YFLogxK
U2 - 10.1109/FPL.2014.6927438
DO - 10.1109/FPL.2014.6927438
M3 - Conference contribution
AN - SCOPUS:84911091867
T3 - Conference Digest - 24th International Conference on Field Programmable Logic and Applications, FPL 2014
BT - Conference Digest - 24th International Conference on Field Programmable Logic and Applications, FPL 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 24th International Conference on Field Programmable Logic and Applications, FPL 2014
Y2 - 1 September 2014 through 5 September 2014
ER -