A Parallel IRRWBF LDPC Decoder Based on Stream-Based Processor

Low-density parity check (LDPC) codes have gained much attention due to their use of various belief-propagation (BP) decoding algorithms to impart excellent error-correcting capability. The BP decoders are quite simple; however, their computation-intensive and repetitive process prohibits their use in energy-sensitive applications such as sensor networks. Bit flipping-based decoding algorithms, especially implementation-efficient, reliability ratio-based, weighted bit-flipping (IRRWBF) decoding; have shown an excellent tradeoff between error-correction performance and implementation cost. In this paper, we show that with IRRWBF, iterative re-computation can be replaced by iterative selective updating. When compared with the original algorithm, simulation results show that, decoding speed can be increased by 200 to 600 percent , as the number of decoding iterations is increased from 5 to 1,000. The decoding steps are broken down into various stages such that the update operations are mostly of the single-instruction, multiple-data (SIMD) type. In this paper, we show that by using Intel Wireless MMX 2 accelerating technology in the proposed algorithm, the speed increased by 500 to 1,500 percent. The results of implementing the proposed scheme using an Intel/Marvell PXA320 (806 MHz) CPU are presented. The proposed scheme can be used effectively in real-time LDPC codes for energy-sensitive mobile devices.

[1]  Uri C. Weiser,et al.  MMX technology extension to the Intel architecture , 1996, IEEE Micro.

[2]  Stanley Mazor,et al.  The history of the 4004 , 1996, IEEE Micro.

[3]  Lajos Hanzo,et al.  Reliability ratio based weighted bit-flipping decoding for low-density parity-check codes , 2004 .

[4]  Marc Tremblay,et al.  VIS speeds new media processing , 1996, IEEE Micro.

[5]  Xiaohu You,et al.  Parallel Weighted Bit-Flipping Decoding , 2007, IEEE Communications Letters.

[6]  Marc P. C. Fossorier,et al.  Quasi-Cyclic Low-Density Parity-Check Codes From Circulant Permutation Matrices , 2004, IEEE Trans. Inf. Theory.

[7]  Shih-Chun Chang,et al.  The /spl pi/-rotation low-density parity check codes , 2001, GLOBECOM'01. IEEE Global Telecommunications Conference (Cat. No.01CH37270).

[8]  Qiang Wu,et al.  A parallel decoding algorithm of LDPC codes using CUDA , 2008, 2008 42nd Asilomar Conference on Signals, Systems and Computers.

[9]  Leonel Sousa,et al.  Massively LDPC Decoding on Multicore Architectures , 2011, IEEE Transactions on Parallel and Distributed Systems.

[10]  Jong-Yeol Lee,et al.  A 1-Gb/s flexible LDPC decoder supporting multiple code rates and block lengths , 2008, IEEE Transactions on Consumer Electronics.

[11]  Wonyong Sung,et al.  VLSI Implementation of a High-Throughput Soft-Bit-Flipping Decoder for Geometric LDPC Codes , 2010, IEEE Transactions on Circuits and Systems I: Regular Papers.

[12]  Vítor Manuel Mendes da Silva,et al.  Serial LDPC decoding on a SIMD DSP using horizontal scheduling , 2006, 2006 14th European Signal Processing Conference.

[13]  G. Feng,et al.  Improved parallel weighted bit-flipping decoding algorithm for LDPC codes , 2009, IET Commun..

[14]  Wayne Wolf,et al.  Implementation-efficient reliability ratio based weighted bit-flipping decoding for LDPC codes , 2005 .

[15]  Robert G. Gallager,et al.  Low-density parity-check codes , 1962, IRE Trans. Inf. Theory.

[16]  T. Ikenaga,et al.  Two-chip MPEG-2 video encoder , 1996, IEEE Micro.

[17]  Amir H. Banihashemi,et al.  On implementation of min-sum algorithm and its modifications for decoding low-density Parity-check (LDPC) codes , 2005, IEEE Transactions on Communications.

[18]  Moinul H. Khan,et al.  Programming with Intel Wireless MMX Technology: A Developer's Guide to Mobile Multimedia Applications , 2004 .

[19]  N. C. Paver,et al.  Intel/spl reg/ wireless MMXTM technology: a 64-bit SIMD architecture for mobile multimedia , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[20]  Leonel Sousa,et al.  Parallel LDPC Decoding on GPUs Using a Stream-Based Computing Approach , 2009, Journal of Computer Science and Technology.

[21]  Brendan J. Frey,et al.  Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[22]  Ruby B. Lee Subword parallelism with MAX-2 , 1996, IEEE Micro.

[23]  Marc P. C. Fossorier,et al.  A modified weighted bit-flipping decoding of low-density Parity-check codes , 2004, IEEE Communications Letters.