L2_ISA++: Instruction set architecture extensions for 4G and LTE-advanced MPSoCs

High throughput and low latencies are essential for mobile communication systems. In recent years, particularly the physical layer implementation and its optimization have been focused on. As the bitrates of the physical layer increase, the upper layers of the protocol stack must keep pace with these increased data rates. This can be achieved in two ways. Apart from software optimizations for given hardware, the processor itself can be improved. In this paper, the processing performance of the 4G and LTE-Advanced layer 2 protocol stack of cellular communication systems is dramatically improved. In our approach, a processor is extended with a newly developed application-specific instruction set, called L2_ISA++. Each component of the 4G and LTE-Advanced layer 2 is profiled and analyzed. Afterwards, new instructions are iteratively developed and attached to increase processing performance. Experimental results show a processing performance increase of 50x compared with a RISC-based approach. The uplink as well as the downlink throughput is more than 2.5 Gbit/s for a single processor instance with a clock frequency of 500 MHz. Moreover, the processor with the newly developed instruction set L2_ISA++ is 12x energy efficient compared to the basic RISC core. Consequently, battery powered devices will especially benefit from our approach and an integration in heterogeneous MPSoCs for multi-gigabit data rates becomes feasible.

[1]  Farooq Khan,et al.  LTE for 4G Mobile Broadband: Air Interface Technologies and Performance , 2009 .

[2]  Yi Huang,et al.  Efficient design and implementation of LTE UE link-layer protocol stack , 2013, 2013 IEEE Wireless Communications and Networking Conference (WCNC).

[3]  Anas Showk,et al.  Performance analysis of LTE protocol processing on an ARM based mobile platform , 2009, 2009 International Symposium on System-on-Chip.

[4]  Vincent Rijmen,et al.  The Design of Rijndael: AES - The Advanced Encryption Standard , 2002 .

[5]  Viktor Öwall,et al.  Hardware acceleration of the robust header compression (RoHC) algorithm , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[6]  Sandra Dominikus,et al.  A Highly Regular and Scalable AES Hardware Architecture , 2003, IEEE Trans. Computers.

[7]  Valeria Bertacco,et al.  ForEVeR: A complementary formal and runtime verification approach to correct NoC functionality , 2014, ACM Trans. Embed. Comput. Syst..

[8]  Cheng-Wen Wu,et al.  A high-throughput low-cost AES processor , 2003, IEEE Communications Magazine.

[9]  Gerhard Fettweis,et al.  5G: Personal mobile internet beyond what cellular did to telephony , 2014, IEEE Communications Magazine.

[10]  Gerhard Fettweis,et al.  10.7 A 105GOPS 36mm2 heterogeneous SDR MPSoC with energy-aware dynamic scheduling and iterative detection-decoding for 4G in 65nm CMOS , 2014, 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC).

[11]  Sebastian Hessel,et al.  On-the-fly hardware acceleration for protocol stack processing in next generation mobile devices , 2009, CODES+ISSS '09.

[12]  Keshab K. Parhi,et al.  High-speed VLSI architectures for the AES algorithm , 2004, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[13]  Gerhard Fettweis,et al.  Tomahawk , 2014, ACM Trans. Embed. Comput. Syst..

[14]  Gerhard Fettweis,et al.  Instruction Set Architecture Extensions for a Dynamic Task Scheduling Unit , 2012, 2012 IEEE Computer Society Annual Symposium on VLSI.