A Parallel GPU Implementation of SWIFFTX


Ulu M. E., CENK M.

8th International Conference on Mathematical Aspects of Computer and Information Sciences, MACIS 2019, Gebze, Türkiye, 13 - 15 Kasım 2019, ss.202-217 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası:
  • Doi Numarası: 10.1007/978-3-030-43120-4_16
  • Basıldığı Şehir: Gebze
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.202-217
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

© 2020, Springer Nature Switzerland AG.The SWIFFTX algorithm is one of the candidates of SHA-3 Hash Competition that uses the number theoretic transform (NTT). It has 256-byte input blocks and 65-byte output blocks. In this paper, a parallel implementation of the algorithm and particular techniques to make it faster on GPU are proposed. We target version 6.1 of NVIDIA®CUDA™compute architecture that employs an ISA (Instruction Set Architecture) called Parallel Thread Execution (PTX) which possesses special instrinsics, hence we modify the reference implementation for better results. Experimental results indicate almost 10x improvement in speed and 5 W decrease in power consumption per 216 hashes.