An iterative division algorithm for FPGAs

Authors:
Jianhua Liu, University of California, San Diego, CA
Michael Chang, University of California, San Diego, CA
Chung-Kuan Cheng, University of California, San Diego, CA
FPGA '06: Proceedings of the 2006 ACM/SIGDA 14th international symposium on Field programmable gate arrays, February 2006, Pages 83–89
https://doi.org/10.1145/1117201.1117213

Published: 22 February 2006

ABSTRACT

Division is one of the most complicated and expensive arithmetic operations. Both clock frequency and operation delay are limited by the memory wall, even in LUT-based FPGA devices. To overcome this memory limitation, we propose a hybrid division algorithm that combines Prescaling, Series expansion, and Taylor expansion (PST). The proposed algorithm efficiently accelerates very-high-radix division. It is multiplicative, and is therefore well suited to modern FPGA devices with built-in multipliers. We implement the algorithm on Altera Stratix II FPGA devices and compare it with the division IP core generated by MegaWizard. The results show that the PST algorithm achieves a higher clock frequency, a lower execution time, and lower power consumption.
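
The abstract does not describe the PST datapath itself, so the following is only a generic sketch of multiplicative division by prescaling followed by a truncated series expansion, not the authors' algorithm. The function names, the table size (`index_bits`), the number of series terms (`terms`), and the use of floating point in place of fixed-point hardware arithmetic are all assumptions made for illustration. It shows why this style of division maps onto multipliers: after one small table lookup, the quotient is formed using only multiplications and additions.

```python
def reciprocal_table(index_bits):
    """Reciprocal approximations for d in [1, 2), indexed by d's top fraction bits.

    Hypothetical helper: the table size is an illustrative choice, not a value
    taken from the paper.
    """
    size = 1 << index_bits
    # Use the midpoint of each interval so the scaled divisor is centered on 1.
    return [1.0 / (1.0 + (i + 0.5) / size) for i in range(size)]


def divide(x, d, index_bits=6, terms=3, table=None):
    """Approximate x / d for d in [1, 2) with a lookup, multiplies, and adds."""
    if not 1.0 <= d < 2.0:
        raise ValueError("d must be normalized to [1, 2)")
    if table is None:
        table = reciprocal_table(index_bits)
    size = 1 << index_bits

    # Prescaling: pick r close to 1/d so that d * r is close to 1.
    r = table[int((d - 1.0) * size)]
    e = 1.0 - d * r  # scaled divisor is d * r = 1 - e, with |e| <= 2**-(index_bits + 1)

    # Series expansion: 1 / (1 - e) = 1 + e + e^2 + ...
    # Horner's rule evaluates the sum up to e**terms.
    series = 1.0
    for _ in range(terms):
        series = 1.0 + e * series

    # x / d = (x * r) / (d * r) = x * r * 1 / (1 - e)
    return x * r * series
```

With the default settings the relative truncation error is bounded by `e**4`, which is at most `(1/128)**4`, or roughly `3.7e-9`. A larger table or more series terms trades memory or multiplications for accuracy, which is the kind of trade-off a table-limited design has to balance.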

Index Terms

An iterative division algorithm for FPGAs
1. Hardware
  1. Integrated circuits
    1. Logic circuits
      1. Arithmetic and datapath circuits

Published in
FPGA '06: Proceedings of the 2006 ACM/SIGDA 14th international symposium on Field programmable gate arrays
February 2006
248 pages
ISBN: 1595932925
DOI: 10.1145/1117201
General Chair: Steve Wilton, University of British Columbia, Canada
Program Chair: André DeHon, California Institute of Technology, USA
Copyright © 2006 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
FPGA
division
high performance
low power
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate: 125 of 627 submissions, 20%