An efficient single unit T-box/T-1-box implementation for 128-bit AES on FPGA

In this paper, we present an area efficient Block RAM BRAM-based single unit design of T-box/T-1-box on a Field Programmable Gate Array FPGA for combined Advanced Encryption Standard AES encryption and decryption. Conventional FPGA designs for T-box module not only utilize several BRAMs for 16bytes parallel look-up operations but also because of asymmetric nature of AES, use separate hardware for T-1-box unit in decryption process. Alternatively in iterative architecture, BRAM in dual-port mode configuration takes eight clock cycles to access 16 look-up operations from a T-box because of its synchronous nature. Thus, resulting in an unoptimized solution not only in term of FPGA resources but also results in high latency for iterative architecture. Our proposed design uses single symmetric T-box/T-1-box table with same set of single resource-shared hardware for both the encryptor and decryptor and at the same time performs eight look-up operations from single BRAM in one clock cycle using efficient BRAM switching technique instead of using multirated clocking. Our complete 128-bit symmetric T-box/T-1-box design fits into just 2 BRAMs and 136 Slices. It occupies lowest area reported to date with 50% power saving and highest Throughput Per Slice TPS of 10.77. Copyright © 2014 John Wiley & Sons, Ltd.

[1]  Ishak Aris,et al.  Design of an ultra high speed AES processor for next generation IT security , 2011, Comput. Electr. Eng..

[2]  Chi-Jeng Chang,et al.  8-bit AES FPGA Implementation using Block RAM , 2007, IECON 2007 - 33rd Annual Conference of the IEEE Industrial Electronics Society.

[3]  Akashi Satoh,et al.  A 10 Gbps full-AES crypto design with a twisted-BDD S-Box architecture , 2002, Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors.

[4]  Pankaj Rohatgi,et al.  Introduction to differential power analysis , 2011, Journal of Cryptographic Engineering.

[5]  Jens-Peter Kaps,et al.  Investigation of DPA Resistance of Block RAMs in Cryptographic Implementations on FPGAs , 2010, 2010 International Conference on Reconfigurable Computing and FPGAs.

[6]  Antonino Mazzeo,et al.  An FPGA-Based Performance Analysis of the Unrolling, Tiling, and Pipelining of the AES Algorithm , 2003, FPL.

[7]  Arshad Aziz,et al.  A Look-Up-Table Implementation of AES , 2007, HPCNCS.

[8]  Jean-Didier Legat,et al.  Efficient Implementation of Rijndael Encryption in Reconfigurable Hardware: Improvements and Design Tradeoffs , 2003, CHES.

[9]  Arshad Aziz,et al.  Resource efficient implementation of T-Boxes in AES on Virtex-5 FPGA , 2010, Inf. Process. Lett..

[10]  Chi-Jeng Chang,et al.  Compact FPGA implementation of 32-bits AES algorithm using Block RAM , 2007, TENCON 2007 - 2007 IEEE Region 10 Conference.

[11]  Tim Güneysu,et al.  DSPs, BRAMs, and a Pinch of Logic: Extended Recipes for AES on FPGAs , 2010, TRETS.

[12]  Chi-Wu Huang,et al.  Block RAM Based Design of 8-bit AES Operation Modes , 2012 .

[13]  Frederic P. Miller,et al.  Advanced Encryption Standard , 2009 .

[14]  Tim Good,et al.  AES on FPGA from the Fastest to the Smallest , 2005, CHES.

[15]  Jean-Jacques Quisquater,et al.  Implementation of the AES-128 on Virtex-5 FPGAs , 2008, AFRICACRYPT.

[16]  John V. McCanny,et al.  Rijndael FPGA implementation utilizing look-up tables , 2001, 2001 IEEE Workshop on Signal Processing Systems. SiPS 2001. Design and Implementation (Cat. No.01TH8578).

[17]  Franc Novak,et al.  A compact AES core with on-line error-detection for FPGA applications with modest hardware resources , 2011, Microprocess. Microsystems.