TY - JOUR
T1 - Design of high-throughput fully parallel LDPC decoders based on wire partitioning
AU - Onizawa, Naoya
AU - Hanyu, Takahiro
AU - Gaudet, Vincent C.
N1 - Funding Information:
Manuscript received July 29, 2008; revised October 18, 2008. First published April 14, 2009; current version published February 24, 2010. This work was supported by the VLSI Design and Education Center (VDEC), University of Tokyo, Tokyo, Japan, in collaboration with Synopsys, Inc.
PY - 2010/3
Y1 - 2010/3
N2 - We present a method to design high-throughput fully parallel low-density parity-check (LDPC) decoders. With our method, a decoder's longest wires are divided into several short wires with pipeline registers. Log-likelihood ratio messages transmitted along with these pipelined paths are thus sent over multiple clock cycles, and the decoder's critical path delay can be reduced while maintaining comparable bit error rate performance. The number of registers inserted into paths is estimated by using wiring information extracted from initial placement and routing information with a conventional LDPC decoder, and thus only necessary registers are inserted. Also, by inserting an even number of registers into the longer wires, two different codewords can be simultaneously decoded, which improves the throughput at a small penalty in area. We present our design flow as well as post-layout simulation results for several versions of a length-1024, (3,6)-regular LDPC code. Using our technique, we achieve a maximum uncoded throughput of 13.21 Gb/s with an energy consumption of 0.098 nJ per uncoded bit at $ Eb/N0= 5 dB. This represents a 28% increase in throughput, a 30% decrease in energy per bit, and a 1.6% increase in core area with respect to a conventional parallel LDPC decoder, using a 90-nm CMOS technology.
AB - We present a method to design high-throughput fully parallel low-density parity-check (LDPC) decoders. With our method, a decoder's longest wires are divided into several short wires with pipeline registers. Log-likelihood ratio messages transmitted along with these pipelined paths are thus sent over multiple clock cycles, and the decoder's critical path delay can be reduced while maintaining comparable bit error rate performance. The number of registers inserted into paths is estimated by using wiring information extracted from initial placement and routing information with a conventional LDPC decoder, and thus only necessary registers are inserted. Also, by inserting an even number of registers into the longer wires, two different codewords can be simultaneously decoded, which improves the throughput at a small penalty in area. We present our design flow as well as post-layout simulation results for several versions of a length-1024, (3,6)-regular LDPC code. Using our technique, we achieve a maximum uncoded throughput of 13.21 Gb/s with an energy consumption of 0.098 nJ per uncoded bit at $ Eb/N0= 5 dB. This represents a 28% increase in throughput, a 30% decrease in energy per bit, and a 1.6% increase in core area with respect to a conventional parallel LDPC decoder, using a 90-nm CMOS technology.
KW - Forward error control (FEC)
KW - Iterative decoding
KW - Low-density parity-check (LDPC) codes
KW - VLSI
UR - http://www.scopus.com/inward/record.url?scp=77649191580&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77649191580&partnerID=8YFLogxK
U2 - 10.1109/TVLSI.2008.2011360
DO - 10.1109/TVLSI.2008.2011360
M3 - Article
AN - SCOPUS:77649191580
SN - 1063-8210
VL - 18
SP - 482
EP - 489
JO - IEEE Transactions on Very Large Scale Integration (VLSI) Systems
JF - IEEE Transactions on Very Large Scale Integration (VLSI) Systems
IS - 3
M1 - 4814500
ER -