Parallel turbo decoder using a low-latency Max-Log-MAP kernel for a VLIW DSP

Turbo-Codes have attracted great interest in digital mobile radio due to their remarkable error correcting capabilities. In this paper, we present an optimized Turbo decoder for wireless systems following recommended CDMA2000 standard (192 time frames with 8 states per frame). This decoder is implemented using TI's latest C64x digital signal processor. To achieve the maximum parallelism in TI's VLIW architecture, we have specially redesigned the trellis computation algorithm to improve the decoder throughput and reduce the number of computation operations required. In particular, our proposed algorithm transforms a number of add/subtract operations to multiplication operations. In this way, previously unused functional units become available and therefore, more parallel instructions can run simultaneously, leading to throughput increase and latency reduction. To our findings, current TI's compiler (Code Composer V1.2) fails to generate the optimized assembly code when our algorithms are directly implemented in C. To this end, we have illustrated optimized resource binding and timing schedules by applying code motion and loop transformation techniques. The optimized Turbo decoder can finish one decoding stage in 18.1 microseconds for a C64x DSP running at 400 MHz.

[1]  Quinn Li,et al.  cdma2000: A third-generation radio transmission technology , 1998, Bell Labs Technical Journal.

[2]  A. Glavieux,et al.  Near Shannon limit error-correcting coding and decoding: Turbo-codes. 1 , 1993, Proceedings of ICC '93 - IEEE International Conference on Communications.

[3]  John Cocke,et al.  Optimal decoding of linear codes for minimizing symbol error rate (Corresp.) , 1974, IEEE Trans. Inf. Theory.

[4]  J. Eyre,et al.  The digital signal processor Derby , 2001 .

[5]  Joachim Hagenauer,et al.  Iterative decoding of binary block and convolutional codes , 1996, IEEE Trans. Inf. Theory.

[6]  Jari Nurmi,et al.  A Flexible DSP Core for Embedded Systems , 1997, IEEE Des. Test Comput..

[7]  Wen Xu,et al.  Using powerful "turbo" codes for 14.4 kbit/s data service in GSM or PCS systems , 1997, GLOBECOM 97. IEEE Global Telecommunications Conference. Conference Record.

[8]  S. A. Mujtaba Trends in digital signal processors , 1999, 1999 International Symposium on VLSI Technology, Systems, and Applications. Proceedings of Technical Papers. (Cat. No.99TH8453).

[9]  S. S. Pietrobon A turbo/MAP decoder for use in satellite circuits , 1997, Proceedings of ICICS, 1997 International Conference on Information, Communications and Signal Processing. Theme: Trends in Information Systems Engineering and Wireless Multimedia Communications (Cat..

[10]  Peter Jung,et al.  Comparison of Turbo-Code Decoders Applied to Short Frame Transmission Systems , 1996, IEEE J. Sel. Areas Commun..

[11]  Guixia Kang,et al.  The implementation of Viterbi decoder on TMS320C6201 DSP in W-CDMA system , 2000, WCC 2000 - ICCT 2000. 2000 International Conference on Communication Technology Proceedings (Cat. No.00EX420).

[12]  Ih-Chin Chen,et al.  A 1-V programmable DSP for wireless communications [CMOS] , 1997 .

[13]  Kufsteiner StraBe Turbo-Codes and Turbo-Coded Modulation in CDMA Mobile Radio Systems for Short Frame , 1998 .

[14]  Patrick Robertson,et al.  A comparison of optimal and sub-optimal MAP decoding algorithms operating in the log domain , 1995, Proceedings IEEE International Conference on Communications ICC '95.

[15]  J. H. Gass,et al.  An application of turbo trellis-coded modulation to tactical communications , 1999, MILCOM 1999. IEEE Military Communications. Conference Proceedings (Cat. No.99CH36341).