MiniFloats on RISC-V Cores: ISA Extensions with Mixed-Precision Short Dot Products
Abstract
Low-precision floating-point (FP) formats have recently been intensely investigated in the context of machine learning inference and training applications. While 16-bit formats are already widely used, 8-bit FP data types have lately emerged as a viable option for neural network training when employed in a mixed-precision scenario and combined with rounding methods increasing the precision in compound additions, such as stochastic rounding. So far, hardware implementations supporting FP8 are mostly implemented within domain-specific accelerators. We propose two RISC-V instruction set architecture (ISA) extensions, enhancing respectively scalar and vector general-purpose cores with low and mixed-precision capabilities. The extensions support two 8-bit and two 16-bit FP formats and are based on dot-product instructions accumulating at higher precision. We develop a hardware unit supporting mixed-precision dot products and stochastic rounding and integrate it into an open-source floating-point unit (FPU). Finally, we integrate the enhanced FPU into a cluster of scalar cores, as well as a cluster of vector cores, and implement them in a 12 nm FinFET technology. The former achieves 575 GFLOPS/W on FP8-to-FP16 matrix multiplications at 0.8 V, 1.26 GHz; the latter reaches 860 GFLOPS/W at 0.8 V, 1.08 GHz, 1.93x higher efficiency than computing on FP16-to-FP32. Show more
Permanent link
https://doi.org/10.3929/ethz-b-000667948Publication status
publishedExternal links
Journal / series
IEEE Transactions on Emerging Topics in ComputingVolume
Pages / Article No.
Publisher
IEEESubject
Transprecision computing; RISC-V; ISA Extension; Floating-Point Architectures; Widening Dot Product; NN TrainingOrganisational unit
03996 - Benini, Luca / Benini, Luca
Funding
877056 - A Cognitive Fractal and Secure EDGE based on an unique Open-Safe-Reliable-Low Power Hardware Platform Node (EC)
101034126 - Pilot using Independent Local & Open Technologies (EC)
Related publications and datasets
Continues: https://doi.org/10.3929/ethz-b-000590801
More
Show all metadata
ETH Bibliography
yes
Altmetrics