A 270ps 20mW 108-bit End-around Carry Adder for Multiply-Add Fused Floating Point Unit

Zhang, Xiao Yan; Chan, Yiu-Hing; Montoye, Robert; Sigal, Leon; Schwarz, Eric; Kelly, Michael

doi:10.1007/s11265-008-0325-0

A 270ps 20mW 108-bit End-around Carry Adder for Multiply-Add Fused Floating Point Unit

Published: 10 January 2009

Volume 58, pages 139–144, (2010)
Cite this article

Journal of Signal Processing Systems Aims and scope Submit manuscript

Xiao Yan Zhang¹,
Yiu-Hing Chan¹,
Robert Montoye²,
Leon Sigal²,
Eric Schwarz¹ &
…
Michael Kelly¹

288 Accesses
4 Citations
Explore all metrics

Abstract

A power and area efficient 108-bit end-around carry adder is implemented using IBM 65nm SOI technology. The adder is used for a multiply-add fused (MAF) floating point unit. Careful balance of the adder structure and structure-aware layout techniques enabled this adder to have a latency of 270ps at power consumption of 20mW with 1V supply.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Implementation of Low Power and Area Efficient Floating-Point Fused Multiply-Add Unit

FPGA Implementation of 16-Bit and 32-Bit Heterogeneous Adders

A Low-Power High-Speed Double Manchester Carry Chain with Carry-Skip Using D3L

References

Park, J., et al. (2000). “470ps 64-bit parallel binary adder”, Digest of of 2000 Symposium on VLSI Circuits.
Mathew, S., et al. (2001). Sub-500-ps 64-b ALU in 0.18-µm SOI/Bulk CMOS: design and scaling trends. IEEE Journal of Solid-State Circuits, 36, No. 11, November.
Mathew, S., et al. (2003). A 4-Ghz 130-nm address generation unit with 32-bit sparse-tree adder core. IEEE Journal of Solid-State Circuits, 38, No. 5, May.
Shimazaki, Y., et al. (2004). A shared-well dual-supply-voltage 64-bit ALU. IEEE Journal of Solid-State Circuits, 39, No. 3, March.
Oklobdzija, V. G., et al. (2003). energy-delay estimation technique for high-performance microprocessor VLSI adders. Proceedings of the International Symposium on Computer Arithmetic, June 15–18.
Oklobdzija, V. G., et al. (2005). Comparison of high-performance adders in the energy-delay space. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 13, No. 6, June.
Zeydel, B., et al. (2005). Efficient mapping of addition recurrence algorithms in CMOS. 17th IEEE Symposium on Computer Arithmetic, June 27–29.
Montoye, R. K., et al. Design of the IBM RISC System/6000 floating-point execution unit. IBM J Res Develop, 34(1), 59.
Schwarz, E. (2006). Binary floating-point unit design, book chapter in High Performance Energy Efficient Microprocessor Design. R. Krishnam’urthy and V. G. Oklobdzija (Eds), Springer, March.
Leobandung, E., et al. “High Performance 65nm SOI Technology with Dual Stress Liner and Low capacitance SRAM cell”, 2005 Symposium on VLSI Technology.
Kogge, P. M., et al. (1973). A parallel algorithm for the efficient solution of recurrence equations. IEEE Trans Comput, C-22(8).
Knowles, S. (2001). A Family of Adders. Proc. 15th IEEE Symposium on Computer Arithmetic, June 11–13, Vail, CO, pp. 277–284.
Yu, X., et al. (2006). “A 5 GHz+ binary floating-point adder for the POWER6 processor”, European Solid-State Circuits Conference, September.
Conn, A. R., et al. (1999). Gradient-Based Optimization of Custom Circuits Using a Static-Timing Formulation. Proceedings of the Design Automation Conference, pp. 452–459, June.
Rao, V., et al. (1999). EinsTLT: transistor level timing with EinsTimer. ACM/IEEE International Workshop on Timing Issues in the Specification and Synthesis of Digital Systems, March 8–9.
Horowitz, M. “VLSI Scaling for Architects”, Presentation slides, Computer Systems Laboratory, Stanford University.
Neely, J. S., et al. (2000). CPAM: a common power analysis methodology for high-performance VLSI design. in Proceedings, IEEE 9th Topical Meeting on Electrical Performance of Electronic Packaging, pp. 303–306, October.

Download references

Acknowledgement

Authors would like to thank Kevin Nowka and Martin Schmookler for their valuable discussions and suggestions.

Author information

Authors and Affiliations

IBM Corp, 2455 South Rd, Poughkeepsie, NY, USA
Xiao Yan Zhang, Yiu-Hing Chan, Eric Schwarz & Michael Kelly
IBM T.J. Watson Research Center, 1101 Kitchawan Road, Yorktown Heights, NY, USA
Robert Montoye & Leon Sigal

Authors

Xiao Yan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yiu-Hing Chan
View author publications
You can also search for this author in PubMed Google Scholar
Robert Montoye
View author publications
You can also search for this author in PubMed Google Scholar
Leon Sigal
View author publications
You can also search for this author in PubMed Google Scholar
Eric Schwarz
View author publications
You can also search for this author in PubMed Google Scholar
Michael Kelly
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiao Yan Zhang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, X.Y., Chan, YH., Montoye, R. et al. A 270ps 20mW 108-bit End-around Carry Adder for Multiply-Add Fused Floating Point Unit. J Sign Process Syst Sign Image Video Technol 58, 139–144 (2010). https://doi.org/10.1007/s11265-008-0325-0

Download citation

Received: 10 August 2007
Revised: 21 November 2008
Accepted: 25 November 2008
Published: 10 January 2009
Issue Date: February 2010
DOI: https://doi.org/10.1007/s11265-008-0325-0

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A 270ps 20mW 108-bit End-around Carry Adder for Multiply-Add Fused Floating Point Unit

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Implementation of Low Power and Area Efficient Floating-Point Fused Multiply-Add Unit

FPGA Implementation of 16-Bit and 32-Bit Heterogeneous Adders

A Low-Power High-Speed Double Manchester Carry Chain with Carry-Skip Using D3L

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now