skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Optimizing the Fast Fourier Transform Using Mixed Precision on Tensor Core Hardware

Conference ·
 [1];  [2]; ORCiD logo [3];  [3];  [4]
  1. National Institute of Technology Tiruchirappalli
  2. The Hong Kong University of Science and Technology
  3. ORNL
  4. University of Tennessee, Knoxville (UTK)

The Fast Fourier Transform is a fundamental tool in scientific and technical computation. The highly parallelizable nature of the algorithm makes it a suitable candidate for GPU acceleration. This paper focuses on exploiting the speedup due to using the half precision multiplication capability of the latest GPUs' tensor core hardware without significantly degrading the precision of the Fourier Transform result. We develop an algorithm that dynamically splits the input single precision dataset into two half precision sets at the lowest level, uses half precision multiplication, and recombines the result at a later step. This work paves the way for using tensor cores for high precision inputs.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1559731
Resource Relation:
Conference: Paralle Fast Fourier Transforms (PFFT18) - Bengaluru, , India - 12/17/2018 5:00:00 AM-12/20/2018 5:00:00 AM
Country of Publication:
United States
Language:
English

References (6)

Fast Fourier Transforms conference January 1966
Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems journal November 2007
SPFP: Speed without compromise—A mixed precision model for GPU accelerated molecular dynamics simulations journal February 2013
Accelerating scientific computations with mixed precision algorithms journal December 2009
Fourier Transform Infrared Spectroscopic Analysis of Protein Secondary Structures journal August 2007
Implementation of a fast Fourier transform (FFT) for image processing applications journal December 1974

Similar Records

Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems
Journal Article · Wed Nov 25 00:00:00 EST 2020 · Proceedings of the Royal Society. A. Mathematical, Physical and Engineering Sciences · OSTI ID:1559731

APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
Conference · Sun Nov 14 00:00:00 EST 2021 · OSTI ID:1559731

MULTI-CORE AND OPTICAL PROCESSOR RELATED APPLICATIONS RESEARCH AT OAK RIDGE NATIONAL LABORATORY
Conference · Tue Jan 01 00:00:00 EST 2008 · OSTI ID:1559731

Related Subjects