Implementation of bitsliced AES encryption on CUDA-Enabled GPU

Naoki Nishikawa, Hideharu Amano, Keisuke Iwai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

Table-based implementations have been mainly reported in research related to high-performance AES on GPUs, in which tables are stored in the shared memory. On the other hand, this kind of implementations is subject to timing attacks, due to the latency required to access tables in the shared memory. Thanks to the increasing number of registers every year, GPU programming has enabled memory intensive applications such as bitsliced AES algorithm to be easily implemented. However, researches of implementation of bitsliced AES algorithm on GPU have not so far been conducted sufficiently in terms of several parameters. For this reason, in this paper, we present an implementation of bitsliced AES encryption on CUDA-enabled GPU with several parameters, especially focusing on three kinds of parallel processing granularities. According to the conducted experiments, the throughput of bitsliced AES-ECB encryption with Bs64 granularity achieves 605.9 Gbps on Nvidia Tesla P100-PCIe resulting in an enhancement of 8.0% when compared to the table-based implementation.

Original languageEnglish
Title of host publicationNetwork and System Security - 11th International Conference, NSS 2017, Proceedings
PublisherSpringer Verlag
Pages273-287
Number of pages15
Volume10394 LNCS
ISBN (Print)9783319647005
DOIs
Publication statusPublished - 2017
Event11th International Conference on Network and System Security, NSS 2017 - Helsinki, Finland
Duration: 2017 Aug 212017 Aug 23

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10394 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other11th International Conference on Network and System Security, NSS 2017
CountryFinland
CityHelsinki
Period17/8/2117/8/23

Fingerprint

Encryption
Cryptography
Data storage equipment
Shared Memory
Granularity
Tables
Table
Timing Attack
Computer programming
Parallel Processing
Throughput
Latency
Programming
High Performance
Enhancement
Graphics processing unit
Processing
Experiments
Experiment

Keywords

  • AES
  • Bitslice
  • Cryptography
  • CUDA
  • GPU

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Nishikawa, N., Amano, H., & Iwai, K. (2017). Implementation of bitsliced AES encryption on CUDA-Enabled GPU. In Network and System Security - 11th International Conference, NSS 2017, Proceedings (Vol. 10394 LNCS, pp. 273-287). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10394 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-64701-2_20

Implementation of bitsliced AES encryption on CUDA-Enabled GPU. / Nishikawa, Naoki; Amano, Hideharu; Iwai, Keisuke.

Network and System Security - 11th International Conference, NSS 2017, Proceedings. Vol. 10394 LNCS Springer Verlag, 2017. p. 273-287 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10394 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Nishikawa, N, Amano, H & Iwai, K 2017, Implementation of bitsliced AES encryption on CUDA-Enabled GPU. in Network and System Security - 11th International Conference, NSS 2017, Proceedings. vol. 10394 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10394 LNCS, Springer Verlag, pp. 273-287, 11th International Conference on Network and System Security, NSS 2017, Helsinki, Finland, 17/8/21. https://doi.org/10.1007/978-3-319-64701-2_20
Nishikawa N, Amano H, Iwai K. Implementation of bitsliced AES encryption on CUDA-Enabled GPU. In Network and System Security - 11th International Conference, NSS 2017, Proceedings. Vol. 10394 LNCS. Springer Verlag. 2017. p. 273-287. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-64701-2_20
Nishikawa, Naoki ; Amano, Hideharu ; Iwai, Keisuke. / Implementation of bitsliced AES encryption on CUDA-Enabled GPU. Network and System Security - 11th International Conference, NSS 2017, Proceedings. Vol. 10394 LNCS Springer Verlag, 2017. pp. 273-287 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{716507bb9c7247b48ecb124444118873,
title = "Implementation of bitsliced AES encryption on CUDA-Enabled GPU",
abstract = "Table-based implementations have been mainly reported in research related to high-performance AES on GPUs, in which tables are stored in the shared memory. On the other hand, this kind of implementations is subject to timing attacks, due to the latency required to access tables in the shared memory. Thanks to the increasing number of registers every year, GPU programming has enabled memory intensive applications such as bitsliced AES algorithm to be easily implemented. However, researches of implementation of bitsliced AES algorithm on GPU have not so far been conducted sufficiently in terms of several parameters. For this reason, in this paper, we present an implementation of bitsliced AES encryption on CUDA-enabled GPU with several parameters, especially focusing on three kinds of parallel processing granularities. According to the conducted experiments, the throughput of bitsliced AES-ECB encryption with Bs64 granularity achieves 605.9 Gbps on Nvidia Tesla P100-PCIe resulting in an enhancement of 8.0{\%} when compared to the table-based implementation.",
keywords = "AES, Bitslice, Cryptography, CUDA, GPU",
author = "Naoki Nishikawa and Hideharu Amano and Keisuke Iwai",
year = "2017",
doi = "10.1007/978-3-319-64701-2_20",
language = "English",
isbn = "9783319647005",
volume = "10394 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "273--287",
booktitle = "Network and System Security - 11th International Conference, NSS 2017, Proceedings",
address = "Germany",

}

TY - GEN

T1 - Implementation of bitsliced AES encryption on CUDA-Enabled GPU

AU - Nishikawa, Naoki

AU - Amano, Hideharu

AU - Iwai, Keisuke

PY - 2017

Y1 - 2017

N2 - Table-based implementations have been mainly reported in research related to high-performance AES on GPUs, in which tables are stored in the shared memory. On the other hand, this kind of implementations is subject to timing attacks, due to the latency required to access tables in the shared memory. Thanks to the increasing number of registers every year, GPU programming has enabled memory intensive applications such as bitsliced AES algorithm to be easily implemented. However, researches of implementation of bitsliced AES algorithm on GPU have not so far been conducted sufficiently in terms of several parameters. For this reason, in this paper, we present an implementation of bitsliced AES encryption on CUDA-enabled GPU with several parameters, especially focusing on three kinds of parallel processing granularities. According to the conducted experiments, the throughput of bitsliced AES-ECB encryption with Bs64 granularity achieves 605.9 Gbps on Nvidia Tesla P100-PCIe resulting in an enhancement of 8.0% when compared to the table-based implementation.

AB - Table-based implementations have been mainly reported in research related to high-performance AES on GPUs, in which tables are stored in the shared memory. On the other hand, this kind of implementations is subject to timing attacks, due to the latency required to access tables in the shared memory. Thanks to the increasing number of registers every year, GPU programming has enabled memory intensive applications such as bitsliced AES algorithm to be easily implemented. However, researches of implementation of bitsliced AES algorithm on GPU have not so far been conducted sufficiently in terms of several parameters. For this reason, in this paper, we present an implementation of bitsliced AES encryption on CUDA-enabled GPU with several parameters, especially focusing on three kinds of parallel processing granularities. According to the conducted experiments, the throughput of bitsliced AES-ECB encryption with Bs64 granularity achieves 605.9 Gbps on Nvidia Tesla P100-PCIe resulting in an enhancement of 8.0% when compared to the table-based implementation.

KW - AES

KW - Bitslice

KW - Cryptography

KW - CUDA

KW - GPU

UR - http://www.scopus.com/inward/record.url?scp=85028448525&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85028448525&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-64701-2_20

DO - 10.1007/978-3-319-64701-2_20

M3 - Conference contribution

AN - SCOPUS:85028448525

SN - 9783319647005

VL - 10394 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 273

EP - 287

BT - Network and System Security - 11th International Conference, NSS 2017, Proceedings

PB - Springer Verlag

ER -