Abstract
Datapath widths in state-of-the-art Turbo and Viterbi decoder implementations depend on estimated upper bounds of the differences of processed metrics. Aiming at highest area and energy efficiency, this paper presents guidelines for designing Turbo and Viterbi decoder datapaths with minimal widths. This is based on maximum absolute values of branch, state and path metric differences within theMax-Log-MAP respectively Viterbi decoding algorithm applying modulo normalization. The proposed methodology for determining the maximum absolute values covers punctured as well as n-input binary convolutional and Turbo codes so it accommodates higherradix add-compare-select operations. Maximum absolute values of metric differences and minimum datapath widths are presented for the 3GPP-LTE, DVB-RCS2 and IEEE-802.16 (WiMAX) compliant Turbo decoders and for the IEEE-802.11 (Wi-Fi), IEEE-802.16 (WiMAX) and 3GPP-LTE compliant Viterbi decoders. Besides, a new dynamic branch-metric saturation scheme is presented, which enables a further datapath width reduction within Turbo decoders. In total, a datapath width reduction of two bits (−20 %) is achieved applying radix-4 Max-Log-MAP arithmetic. An overall area-time-energy complexity reduction of 31% is achieved for a soft-input soft-output decoder and of 24% for the LTE Turbo decoder.
Similar content being viewed by others
References
Berrou, C., et al. (1993). Near Shannon Limit Error-Correcting Coding and Decoding: Turbo-Codes (1). IEEE International Conference on Communications, 2, 1064–1070.
Hagenauer, J., et al. (1996). Iterative Decoding of Binary Block and Convolutional Codes. IEEE Transactions on Information Theory, 42(2), 429–445.
Viterbi, A.J. (1967). Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm. IEEE Transactions on Information Theory, 13(2), 260–269.
Hagenauer, J., & Hoeher, P. (1989). A Viterbi Algorithm with Soft-decision Outputs and its Applications. IEEE Global Telecommunications Conference and Exhibition (GLOBECOM), 3, 1680–1686.
Bahl, L.R., et al. (1974). Optimal Decoding of Linear Codes for Minimizing Symbol Error Rate. IEEE Transactions on Information Theory, 284–287.
Hekstra, A.P. (1989). An Alternative to Metric Rescaling in Viterbi Decoders. IEEE Transactions on Communications, 37(11), 1220–1222.
Siegel, P.H., et al. (1991). Exact Bounds for Viterbi Detector Path Metric Differences. IEEE International Conference on Acoustics, Speech and Signal Processing, 1093–1096.
Worm, A. (2001). Implementation Issues of Turbo-Decoders, Ph.D. Dissertation. Kaiserslautern: Universität Kaiserslautern.
Wu, Y., et al. (2001). Data width Requirements in SISO Decoding with Modulo normalization. IEEE Transactions on Communications, 49(11), 1861–1868.
Onyszchuk, I.M., et al. (1993). Quantization Loss in Convolutional Decoding. IEEE Transactions on Communications, 41(2), 261–265.
Boutillon, E., et al. (2003). VLSI Architectures for the MAP Algorithm. IEEE Transactions on Communications, 51(2), 175–185.
Yan, Z., et al. (2016). High Performance Parallel Turbo Decoder with Configurable Interleaving Network for LTE Application. Integration VLSI Journal, 52, 77–90.
Wang, G., et al. (2014). Parallel Interleaver Design for a high Throughput HSPA /LTE Multi-standard Turbo Decoder. IEEE Transactions on Circuits and Systems, 61(5), 1376–1389.
Murugappa, P., et al. (2013). Parameterized Area-efficient Multi-standard Turbo Decoder. Proc. Design, Automation and Test in Europe, 109–114.
Studer, C., et al. (2012). Implementation Trade-offs of Soft-input Soft-output MAP Decoders for Convolutional Codes. IEEE Transactions on Circuits and Systems, 59(11), 2774–2783.
Ilnseher, T., et al. (2012). A 2.15 GBit/s Turbo Code Decoder for LTE Advanced Base Station Applications, 7th International Symposium on Turbo Codes and Iterative Information Processing (ISTC), pp 21–25.
Studer, C., et al. (2011). Design and Implementation of a Parallel turbo-decoder ASIC for 3GPP-LTE. IEEE Journal of Solid-State Circuits, 46(1), 8–17.
Ilnseher, T., et al. (2010). A Multi-mode 3GPP-LTE/HSDPA Turbo Decoder. IEEE International Conference on Communication Systems, 336–340.
May, M., et al. (2010). A 150Mbit/s 3GPP LTE Turbo Code Decoder. Proc. IEEE Conference Design, Automation and Test in Europe (DATE), 1420–1425.
Broich, M., & Noll, T.G. (2014). Optimal Data path Widths for Energy- and Area-efficient Max-Log-MAP Based LTE Turbo Decoders. International Symposium on System on Chip, 1–8.
Martina, M., & Masera, G. (2011). State Metric Compression Techniques for Turbo Decoder Architectures. IEEE Transactions on Circuits and Systems, 58(5), 1119–1128.
Yang, Q., et al. (2013). Low Complexity State Metric Compression technique in turbo decoder. IEICE Electronics Express, 10(15), 1–7.
Lee, Y., et al. (2015). Memory-reduced Turbo Decoding Architecture using NII Metric Compression.
3GPP and ETSI (2015). LTE; Evolved Universal Terrestrial Radio Access (E-UTRA); Multiplexing and channel coding: 3GPP TS 36.212 version 12.4.0 Release 12.
Robertson, P., et al. (1995). A Comparison of Optimal and Sub-optimal MAP Decoding Algorithms Operating in the Log Domain. IEEE International Conference on Communications, 2, 1009–1013.
Vogt, J., & Finger, A. (2000). Improving the Max-Log-MAP Turbo Decoder. Electronics Letters, 36(23), 1937–1939.
Worm, A., et al. (2000). Advanced Implementation Issues of Turbo-decoders, Proc. 2nd International Symposium on Turbo Codes & Related Topics, pp 351–354.
Shung, C.B., et al. (1990). VLSI Architectures for Metric normalization in the Viterbi Algorithm. IEEE International Conference on Communications, 1723–1728.
Wu, Y., & Woerner, B.D. (2000). Internal Data width in SISO Decoding Module with Modular renormalization. IEEE 51st Vehicular Technology Conference, 675–679.
Viterbi, A.J. (1971). Convolutional Codes and Their Performance in Communication Systems. IEEE Transactions on Communications Technology, 19(5), 751–772.
LAN/MAN Standards Committee of the IEEE Computer Society (2012). Part 11: Wireless LAN Medium Access Control (MAC) and Physical Layer (PHY) Specifications: IEEE Std 802.11–2012.
LAN/MAN Standards Committee of the IEEE Computer Society (2012). IEEE Standard for Air Interface for Broadband Wireless Access Systems: IEEE Std 802.16–2012.
ETSI (2014). Digital Video Broadcasting (DVB); Second Generation DVB Interactive Satellite System (DVB-RCS2); Part 2: Lower Layers for Satellite standard: EN 301 545-2 version 1.2.1.
Wu, D., et al. (2009). Implementation of a High-Speed Parallel Turbo Decoder for 3GPP LTE Terminals. IEEE 8th International Conference on ASIC, 481–484.
Broich, M., & Noll, T.G. (2012). Efficient VLSI Architectures of QPP Interleavers for LTE Turbo Decoders. International Symposium on System on Chip, 1–6.
Shrestha, R., & Paily, R.P. (2014). High-throughput Turbo Decoder with Parallel Architecture for LTE Wireless Communication Standards. IEEE Transactions on Circuits and Systems I: Regular Papers, 61(9), 2699–2710.
Roth, C., et al. (2014). Efficient Parallel Turbo-decoding for High-throughput Wireless Systems. IEEE Transactions on Circuits and Systems, 61(6), 1824–1835.
Lin, C.-H., et al. (2013). Reconfigurable Parallel Turbo Decoder Design for Multiple High-Mobility 4G Systems. Journal of Signal Processing Systems, 73(2), 109–122.
Chen, X., et al. (2013). A 691 Mbps 1.392mm2 Configurable Radix-16 Turbo Decoder ASIC for 3GPP-LTE and WiMAX Systems in 65nm CMOS. IEEE Asian Solid-State Circuits Conference, 157–160.
Al-Khayat, R., et al. (2012). Architecture Efficiency of Application-Specific Processors: a 170Mbit/s O.644mm2 Multi-standard Turbo Decoder, International Symposium on System on Chip, 1–7.
Asghar, R., et al. (2012). Implementation of a Radix-4, Parallel Turbo Decoder and Enabling the Multi-Standard Support. Journal of Signal Processing Systems, 66(1), 25–41.
Lin, C.-H., et al. (2011). Area-Efficient Scalable MAP Processor Design for High-Throughput Multistandard Convolutional Turbo Decoding. IEEE Transactions on VLSI Systems, 19(2), 305– 318.
Sun, Y., & Cavallaro, J.R. (2011). Efficient Hardware Implementation of a Highly-Parallel 3GPP LTE/LTE-advance Turbo Decoder. Integration VLSI Journal, 44, 305–315.
Cheng, C.-C., et al. (2010). A 0.077 to 0.168 nJ/bit/iteration Scalable 3GPP LTE Turbo Decoder with an Adaptive Sub-Block Parallel Scheme and an Embedded DVFS Engine. IEEE Custom Integrated Circuits Conference, 1–4.
Wong, C.-C., & Chang, H.-C. (2010). Reconfigurable Turbo Decoder With Parallel Architecture for 3GPP LTE System. IEEE Transactions on Circuits and Systems II: Express Briefs, 57(7), 566–570.
Kim, J.-H., & Park, I.-C. (2009). A Unified Parallel Radix-4 Turbo Decoder for Mobile WiMAX and 3GPP-LTE. IEEE 2009 Custom Integrated Circuits Conference, 487–490.
Wong, C.-C., & Chang, H.-C. (2011). High-Efficiency Processing Schedule for Parallel Turbo Decoders Using QPP Interleaver. IEEE Transactions on Circuits and Systems, 58(6), 1412– 1420.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Broich, M., Noll, T.G. Optimal Datapath Widths Within Turbo and Viterbi Decoders for High Area- and Energy-Efficiency. J Sign Process Syst 87, 299–325 (2017). https://doi.org/10.1007/s11265-016-1140-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11265-016-1140-7