Abstract
A power and area efficient 108-bit end-around carry adder is implemented using IBM 65nm SOI technology. The adder is used for a multiply-add fused (MAF) floating point unit. Careful balance of the adder structure and structure-aware layout techniques enabled this adder to have a latency of 270ps at power consumption of 20mW with 1V supply.








Similar content being viewed by others
References
Park, J., et al. (2000). “470ps 64-bit parallel binary adder”, Digest of of 2000 Symposium on VLSI Circuits.
Mathew, S., et al. (2001). Sub-500-ps 64-b ALU in 0.18-µm SOI/Bulk CMOS: design and scaling trends. IEEE Journal of Solid-State Circuits, 36, No. 11, November.
Mathew, S., et al. (2003). A 4-Ghz 130-nm address generation unit with 32-bit sparse-tree adder core. IEEE Journal of Solid-State Circuits, 38, No. 5, May.
Shimazaki, Y., et al. (2004). A shared-well dual-supply-voltage 64-bit ALU. IEEE Journal of Solid-State Circuits, 39, No. 3, March.
Oklobdzija, V. G., et al. (2003). energy-delay estimation technique for high-performance microprocessor VLSI adders. Proceedings of the International Symposium on Computer Arithmetic, June 15–18.
Oklobdzija, V. G., et al. (2005). Comparison of high-performance adders in the energy-delay space. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 13, No. 6, June.
Zeydel, B., et al. (2005). Efficient mapping of addition recurrence algorithms in CMOS. 17th IEEE Symposium on Computer Arithmetic, June 27–29.
Montoye, R. K., et al. Design of the IBM RISC System/6000 floating-point execution unit. IBM J Res Develop, 34(1), 59.
Schwarz, E. (2006). Binary floating-point unit design, book chapter in High Performance Energy Efficient Microprocessor Design. R. Krishnam’urthy and V. G. Oklobdzija (Eds), Springer, March.
Leobandung, E., et al. “High Performance 65nm SOI Technology with Dual Stress Liner and Low capacitance SRAM cell”, 2005 Symposium on VLSI Technology.
Kogge, P. M., et al. (1973). A parallel algorithm for the efficient solution of recurrence equations. IEEE Trans Comput, C-22(8).
Knowles, S. (2001). A Family of Adders. Proc. 15th IEEE Symposium on Computer Arithmetic, June 11–13, Vail, CO, pp. 277–284.
Yu, X., et al. (2006). “A 5 GHz+ binary floating-point adder for the POWER6 processor”, European Solid-State Circuits Conference, September.
Conn, A. R., et al. (1999). Gradient-Based Optimization of Custom Circuits Using a Static-Timing Formulation. Proceedings of the Design Automation Conference, pp. 452–459, June.
Rao, V., et al. (1999). EinsTLT: transistor level timing with EinsTimer. ACM/IEEE International Workshop on Timing Issues in the Specification and Synthesis of Digital Systems, March 8–9.
Horowitz, M. “VLSI Scaling for Architects”, Presentation slides, Computer Systems Laboratory, Stanford University.
Neely, J. S., et al. (2000). CPAM: a common power analysis methodology for high-performance VLSI design. in Proceedings, IEEE 9th Topical Meeting on Electrical Performance of Electronic Packaging, pp. 303–306, October.
Acknowledgement
Authors would like to thank Kevin Nowka and Martin Schmookler for their valuable discussions and suggestions.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, X.Y., Chan, YH., Montoye, R. et al. A 270ps 20mW 108-bit End-around Carry Adder for Multiply-Add Fused Floating Point Unit. J Sign Process Syst Sign Image Video Technol 58, 139–144 (2010). https://doi.org/10.1007/s11265-008-0325-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11265-008-0325-0