An efficient GPU implementation of fixed-complexity sphere decoders for MIMO wireless systems

The use of many-core processors such as general purpose Graphic Processing Units GPUs has recently become attractive for the efficient implementation of signal processing algorithms for communication systems. This is due to the cost-effectiveness of GPUs together with their potential capability of parallel processing. This paper presents an implementation of the widely employed fixed-complexity sphere decoder on GPUs, which allows to considerably decrease the computational time required for the data detection stage in multiple-input multiple-output systems. Both, the hard-and soft-output versions of the method have been implemented. Speedup results show the proposed GPU implementation boosts the runtime of the parallel execution of the methods in a high performance multi-core CPU. In addition, the throughput of the algorithm is evaluated and is shown to outperform other recent implementations and to fulfill the real-time requirements of several LTE configurations.

[1]  Markus Rupp,et al.  Boosting sphere decoding speed through Graphic Processing Units , 2010, 2010 European Wireless Conference (EW).

[2]  Markku J. Juntti,et al.  A GPU implementation for two MIMO-OFDM detectors , 2010, 2010 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation.

[3]  Seungwon Choi,et al.  Implementation of an SDR system using graphics processing unit , 2010, IEEE Communications Magazine.

[4]  Tharmalingam Ratnarajah,et al.  A low-complexity soft-MIMO detector based on the fixed-complexity sphere decoder , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Lajos Hanzo,et al.  Multiuser MIMO-OFDM for Next-Generation Wireless Systems , 2007, Proceedings of the IEEE.

[6]  Joseph J. Boutros,et al.  Bit-interleaved coded modulations for multiple-input multiple-output channels , 2000, 2000 IEEE Sixth International Symposium on Spread Spectrum Techniques and Applications. ISSTA 2000. Proceedings (Cat. No.00TH8536).

[7]  Jose A. Belloch,et al.  Real-time massive convolution for audio applications on GPU , 2011, The Journal of Supercomputing.

[8]  Leonel Sousa,et al.  How GPUs can outperform ASICs for fast LDPC decoding , 2009, ICS.

[9]  Francisco-Jose Martínez-Zaldívar,et al.  Tridimensional block multiword LDPC decoding on GPUs , 2011, The Journal of Supercomputing.

[10]  Arnaud Tisserand,et al.  Power Consumption of GPUs from a Software Perspective , 2009, ICCS.

[11]  Stephan ten Brink,et al.  Achieving near-capacity on a multiple-antenna channel , 2003, IEEE Trans. Commun..

[12]  R. Riego,et al.  A low-cost 3 D human interface device using GPU-based optical fl ow algorithms , 2011 .

[13]  John S. Thompson,et al.  Extending a Fixed-Complexity Sphere Decoder to Obtain Likelihood Information for Turbo-MIMO Systems , 2008, IEEE Transactions on Vehicular Technology.

[14]  Björn E. Ottersten,et al.  The Error Probability of the Fixed-Complexity Sphere Decoder , 2009, IEEE Transactions on Signal Processing.

[15]  John S. Thompson,et al.  Fixing the Complexity of the Sphere Decoder for MIMO Detection , 2008, IEEE Transactions on Wireless Communications.

[16]  Almerico Murli,et al.  Integration of emerging computer technologies for an efficient image sequences analysis , 2011, Integr. Comput. Aided Eng..

[17]  Seungwon Choi,et al.  Implementation of an SDR platform using GPU and its application to a 2 × 2 MIMO WiMAX system , 2011 .

[18]  Helmut Bölcskei,et al.  Soft-output sphere decoding: algorithms and VLSI implementation , 2008, IEEE Journal on Selected Areas in Communications.

[19]  Georgios B. Giannakis,et al.  Approaching MIMO channel capacity with reduced-complexity soft sphere decoding , 2004, 2004 IEEE Wireless Communications and Networking Conference (IEEE Cat. No.04TH8733).

[20]  José Ranilla,et al.  A low-cost 3D human interface device using GPU-based optical flow algorithms , 2011, Integr. Comput. Aided Eng..

[21]  Esteban Walter Gonzalez Clua,et al.  Using graph cuts in GPUs for color based human skin segmentation , 2011, Integr. Comput. Aided Eng..

[22]  Helmut Bölcskei,et al.  An overview of MIMO communications - a key to gigabit wireless , 2004, Proceedings of the IEEE.

[23]  Miguel Lozano,et al.  A distributed visualization system for crowd simulations , 2011, Integr. Comput. Aided Eng..

[24]  Babak Hassibi,et al.  On the sphere-decoding algorithm I. Expected complexity , 2005, IEEE Transactions on Signal Processing.

[25]  Hannes Hartenstein,et al.  GPU-Based Architectures and Their Benefit for Accurate and Efficient Wireless Network Simulations , 2011, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems.

[26]  Joseph R. Cavallaro,et al.  Implementation of a High Throughput Soft MIMO Detector on GPU , 2011, J. Signal Process. Syst..

[27]  Jean-Charles Noyer,et al.  Correlation-based particle filter for 3D object tracking , 2009, Integr. Comput. Aided Eng..

[28]  Joseph R. Cavallaro,et al.  A GPU implementation of a real-time MIMO detector , 2009, 2009 IEEE Workshop on Signal Processing Systems.