18.7 A 2.4mm2 130mW MMSE-nonbinary-LDPC iterative detector-decoder for 4×4 256-QAM MIMO in 65nm CMOS

In this work, the authors demonstrate an MMSE-NBLDPC iterative detector-decoder for a 4×4 256-QAM MIMO system to achieve an excellent error rate that improves with iterations, as shown in Fig. 18.7.1. To minimize latency over the iterative loop and improve throughput, the MMSE detector is divided into 4 task-based coarse pipeline stages so that all stages can operate in parallel. Both the number of stages and the stage latency of the detector are minimized, and the long critical paths are interleaved and placed in a slow clock domain to support a high data rate in a cost-effective way. The resulting MMSE detector achieves an 82% higher throughput compared, and almost 3.5× the throughput of the latest SD detector. The NBLDPC decoder is implemented using 78 processing nodes to enable fully parallel message passing. Serial Galois field (GF) processing is pipelined using a data forwarding technique to cut the decoding latency by 30% over the latest design. The detector and decoder exchange symbol log-likelihood ratios (LLR) that are efficiently computed based on the L1 distance to the nearest neighbors in the QAM constellation. To lower the power consumption, automatic clock gating is applied to stage boundary and buffer registers to save 53% of the detector power and 61% of the decoder power. The results are demonstrated in a 65nm MMSE-NBLDPC iterative detector-decoder test chip that achieves 1.38Gb/s detection and 1.02Gb/s decoding (5 iterations), consuming 26.5mW and 103mW, respectively.

[1]  Christoph Studer,et al.  ASIC Implementation of Soft-Input Soft-Output MIMO Detection Using MMSE Parallel Interference Cancellation , 2011, IEEE Journal of Solid-State Circuits.

[2]  Zhengya Zhang,et al.  A 1.15Gb/s fully parallel nonbinary LDPC decoder with fine-grained dynamic clock gating , 2013, 2013 IEEE International Solid-State Circuits Conference Digest of Technical Papers.

[3]  Gerd Ascheid,et al.  A 2.78 mm2 65 nm CMOS gigabit MIMO iterative detection and decoding receiver , 2012, 2012 Proceedings of the ESSCIRC (ESSCIRC).

[4]  David Declercq,et al.  Getting Closer to MIMO Capacity with Non-Binary Codes and Spatial Multiplexing , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[5]  Zhengya Zhang,et al.  A Fully Parallel Nonbinary LDPC Decoder With Fine-Grained Dynamic Clock Gating , 2015, IEEE Journal of Solid-State Circuits.

[6]  Gerhard Fettweis,et al.  A 335Mb/s 3.9mm2 65nm CMOS flexible MIMO detection-decoding engine achieving 4G wireless data rates , 2012, 2012 IEEE International Solid-State Circuits Conference.

[7]  Gerhard Fettweis,et al.  10.7 A 105GOPS 36mm2 heterogeneous SDR MPSoC with energy-aware dynamic scheduling and iterative detection-decoding for 4G in 65nm CMOS , 2014, 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC).