skip to main content
10.1145/3564625.3564629acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacsacConference Proceedingsconference-collections
research-article

Parallel Small Polynomial Multiplication for Dilithium: A Faster Design and Implementation

Published: 05 December 2022 Publication History

Abstract

The lattice-based signature scheme CRYSTALS-Dilithium is one of the two signature finalists in the third round NIST post-quantum cryptography (PQC) standardization project. For applications of low-power Internet-of-Things (IoT) devices, recent research efforts have been focusing on the performance optimization of PQC algorithms on embedded systems. In particular, performance optimization is more demanding for PQC signature algorithms that are usually significantly more time-consuming than PQC public-key encryption counterparts. For most cryptographic algorithms based on algebraic lattices including Dilithium, the fundamental and most time-consuming operation is polynomial multiplication over rings. For this computational task, number theoretic transform (NTT) is the most efficient multiplication method for NTT-friendly rings, and is now the typical technique for performing fast polynomial multiplications when implementing lattice-based PQC algorithms.
The key observation of this work is that, besides multiplications of polynomials of standard forms, Dilithium involves a list of multiplications for polynomials of very small coefficients. Can we have more efficient methods for multiplying such polynomials of small coefficients? Under this motivation, we present in this work a parallel small polynomial multiplication algorithm to speed up the implementations of Dilithium. We complete both C reference implementation and ARM Neon implementation. Moreover, we conducted some speed tests in combination with Becker’s Neon NTT [4]. The results show that, in comparison with the C reference implementation of Dilithium submitted to the third round of the NIST PQC competition, our reference implementation with the proposed parallel small polynomial multiplication is faster: specifically, our Sign and Verify speed up 18% and 19% respectively for Dilithium-2 (30% and 7% for Dilithium-3, 27% and 3% for Dilithium-5, respectively). As for the Arm Neon implementation, we achieved a performance improvement of about 64% in Sign and 50% in Verify for Dilithium-2 (60% and 32% for Dilithium-3) compared with the C reference implementation of Dilithium submitted to the third round of the NIST PQC competition. We aslo compared our work with the state-of-the-art Arm Neon implementation of Dilithium [4], the results show our speed of Sign is 13.4% faster for Dilithium-2 and 8.0% faster for Dilithium-3, achieving a new record of fast Dilithium implementation.

References

[1]
Amin Abdulrahman, Vincent Hwang, Matthias J. Kannwischer, and Daan Sprenkels. 2022. Faster Kyber and Dilithium on the Cortex-M4. In Applied Cryptography and Network Security - 20th International Conference, ACNS 2022, Rome, Italy, June 20-23, 2022, Proceedings(Lecture Notes in Computer Science, Vol. 13269), Giuseppe Ateniese and Daniele Venturi (Eds.). Springer, 853–871. https://doi.org/10.1007/978-3-031-09234-3_42
[2]
Erdem Alkim, Yusuf Alper Bilgin, Murat Cenk, and François Gérard. 2020. Cortex-M4 optimizations for {R, M} LWE schemes. IACR Trans. Cryptogr. Hardw. Embed. Syst. 2020, 3 (2020), 336–357. https://doi.org/10.13154/tches.v2020.i3.336-357
[3]
Roberto Avanzi, Joppe Bos, and Léo Ducas. 2020. CRYSTALS-Dilithium. In Submission to the NIST Post-Quantum Cryptography Standardization Project. https://pq-crystals.org/dilithium
[4]
Hanno Becker, Vincent Hwang, Matthias J. Kannwischer, Bo-Yin Yang, and Shang-Yi Yang. 2022. Neon NTT: Faster Dilithium, Kyber, and Saber on Cortex-A72 and Apple M1. IACR Trans. Cryptogr. Hardw. Embed. Syst. 2022, 1 (2022), 221–244. https://doi.org/10.46586/tches.v2022.i1.221-244
[5]
Luke Beckwith, Duc Tri Nguyen, and Kris Gaj. 2022. High-Performance Hardware Implementation of Lattice-Based Digital Signatures. Cryptology ePrint Archive(2022).
[6]
Leon Botros, Matthias J Kannwischer, and Peter Schwabe. 2019. Memory-efficient high-speed implementation of Kyber on Cortex-M4. In International Conference on Cryptology in Africa. Springer, 209–228.
[7]
Nicolas Courtois, Alexander Klimov, Jacques Patarin, and Adi Shamir. 2000. Efficient algorithms for solving overdefined systems of multivariate polynomial equations. In International Conference on the Theory and Applications of Cryptographic Techniques. Springer, 392–407.
[8]
Léo Ducas, Tancrede Lepoint, Vadim Lyubashevsky, Peter Schwabe, Gregor Seiler, and Damien Stehlé. 2018. Crystals–dilithium: Digital signatures from module lattices. (2018).
[9]
Alexander El-Kady, Apostolos P Fournaris, Thanasis Tsakoulis, Evangelos Haleplidis, and Vassilis Paliouras. 2021. High-Level Synthesis design approach for Number-Theoretic Transform Implementations. In 2021 IFIP/IEEE 29th International Conference on Very Large Scale Integration (VLSI-SoC). IEEE, 1–6.
[10]
Ruben Gonzalez, Andreas Hülsing, Matthias J Kannwischer, Juliane Krämer, Tanja Lange, Marc Stöttinger, Elisabeth Waitz, Thom Wiggers, and Bo-Yin Yang. 2021. Verifying post-quantum signatures in 8 kb of RAM. In International Conference on Post-Quantum Cryptography. Springer, 215–233.
[11]
Denisa OC Greconici, Matthias J Kannwischer, and Daan Sprenkels. 2021. Compact dilithium implementations on Cortex-M3 and Cortex-M4. IACR Transactions on Cryptographic Hardware and Embedded Systems (2021), 1–24.
[12]
Naina Gupta, Arpan Jati, Anupam Chattopadhyay, and Gautam Jha. 2022. Lightweight Hardware Accelerator for Post-Quantum Digital Signature CRYSTALS-Dilithium. Cryptology ePrint Archive(2022).
[13]
James Howe and Bas Westerbaan. 2022. Benchmarking and Analysing the NIST PQC Finalist Lattice-Based Signature Schemes on the ARM Cortex M7. Cryptology ePrint Archive(2022).
[14]
Murat Burhan İlter, Neşe Koçak, Erkan Uslu, Oğuz Yayla, and Nergiz Yuca. 2021. On the Number of Arithmetic Operations in NTT-based Polynomial Multiplication in Kyber and Dilithium Cryptosystems. In 2021 14th International Conference on Security of Information and Networks (SIN), Vol. 1. IEEE, 1–7.
[15]
Patrick Karl, Jonas Schupp, Tim Fritzmann, and Georg Sigl. 2022. Post-Quantum Signatures on RISC-V with Hardware Acceleration. Cryptology ePrint Archive(2022).
[16]
Youngbeom Kim, Jingyo Song, Taek-Young Youn, and Seog Chung Seo. 2022. Crystals-Dilithium on ARMv8. Security and Communication Networks 2022 (2022).
[17]
Georg Land, Pascal Sasdrich, and Tim Güneysu. 2021. A hard crystal-implementing dilithium on reconfigurable hardware. In International Conference on Smart Card Research and Advanced Applications. Springer, 210–230.
[18]
Adeline Langlois and Damien Stehlé. 2015. Worst-case to average-case reductions for module lattices. Designs, Codes and Cryptography 75, 3 (2015), 565–599.
[19]
Jihye Lee, Whijin Kim, Sohyeon Kim, and Ji-Hoon Kim. 2022. Post-Quantum Cryptography Coprocessor for RISC-V CPU Core. In 2022 International Conference on Electronics, Information, and Communication (ICEIC). IEEE, 1–2.
[20]
Vadim Lyubashevsky, Léo Ducas, Eike Kiltz, Tancrède Lepoint, Peter Schwabe, Gregor Seiler, Damien Stehlé, and Shi Bai. 2017. Crystals-dilithium. Submission to the NIST Post-Quantum Cryptography Standardization [NIS] (2017).
[21]
Vadim Lyubashevsky and Daniele Micciancio. 2006. Generalized compact knapsacks are collision resistant. In International Colloquium on Automata, Languages, and Programming. Springer, 144–155.
[22]
Vadim Lyubashevsky, Chris Peikert, and Oded Regev. 2010. On ideal lattices and learning with errors over rings. In Annual international conference on the theory and applications of cryptographic techniques. Springer, 1–23.
[23]
Robert J McEliece. 1978. A public-key cryptosystem based on algebraic. Coding Thv 4244(1978), 114–116.
[24]
Daniele Micciancio and Oded Regev. 2009. Lattice-based cryptography. In Post-quantum cryptography. Springer, 147–191.
[25]
Duc Tri Nguyen and Kris Gaj. 2021. Optimized software implementations of CRYSTALS-Kyber, NTRU, and Saber using NEON-based special instructions of ARMv8. In Proceedings of the NIST 3rd PQC Standardization Conference (NIST PQC 2021).
[26]
Chris Peikert and Alon Rosen. 2006. Efficient collision-resistant hashing from worst-case assumptions on cyclic lattices. In Theory of Cryptography Conference. Springer, 145–166.
[27]
Cong Peng, Jianhua Chen, Sherali Zeadally, and Debiao He. 2019. Isogeny-based cryptography: A promising post-quantum technique. IT Professional 21, 6 (2019), 27–32.
[28]
Luis J Dominguez Perez 2021. Implementing CRYSTAL-Dilithium on FRDM-K64. In 2021 IEEE 12th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON). IEEE, 0178–0183.
[29]
Prasanna Ravi, Romain Poussier, Shivam Bhasin, and Anupam Chattopadhyay. 2020. On Configurable SCA Countermeasures Against Single Trace Attacks for the NTT - A Performance Evaluation Study over Kyber and Dilithium on the ARM Cortex-M4. In Security, Privacy, and Applied Cryptography Engineering - 10th International Conference, SPACE 2020, Kolkata, India, December 17-21, 2020, Proceedings(Lecture Notes in Computer Science, Vol. 12586), Lejla Batina, Stjepan Picek, and Mainack Mondal (Eds.). Springer, 123–146. https://doi.org/10.1007/978-3-030-66626-2_7
[30]
Sara Ricci, Lukas Malina, Petr Jedlicka, David Smékal, Jan Hajny, Peter Cibik, Petr Dzurenda, and Patrik Dobias. 2021. Implementing crystals-dilithium signature scheme on fpgas. In The 16th International Conference on Availability, Reliability and Security. 1–11.
[31]
Pakize Sanal, Emrah Karagoz, Hwajeong Seo, Reza Azarderakhsh, and Mehran Mozaffari-Kermani. 2021. Kyber on ARM64: Compact implementations of Kyber on 64-bit ARM Cortex-A processors. In International Conference on Security and Privacy in Communication Systems. Springer, 424–440.
[32]
Peter W Shor. 1999. Polynomial-time algorithms for prime factorization and discrete logarithms on a quantum computer. SIAM review 41, 2 (1999), 303–332.
[33]
Deepraj Soni and Ramesh Karri. 2021. Efficient hardware implementation of pqc primitives and pqc algorithms using high-level synthesis. In 2021 IEEE Computer Society Annual Symposium on VLSI (ISVLSI). IEEE, 296–301.
[34]
Cankun Zhao, Neng Zhang, Hanning Wang, Bohan Yang, Wenping Zhu, Zhengdong Li, Min Zhu, Shouyi Yin, Shaojun Wei, and Leibo Liu. 2022. A Compact and High-Performance Hardware Architecture for CRYSTALS-Dilithium.IACR Trans. Cryptogr. Hardw. Embed. Syst. 2022, 1 (2022), 270–295.

Cited By

View all
  • (2025)PUF-Dilithium: Design of a PUF-Based Dilithium Architecture Benchmarked on ARM ProcessorsACM Transactions on Embedded Computing Systems10.1145/371532824:2(1-20)Online publication date: 25-Jan-2025
  • (2025)Performance Analysis of Diverse Active Queue Management AlgorithmsInternational Journal of Networked and Distributed Computing10.1007/s44227-025-00056-113:1Online publication date: 18-Feb-2025
  • (2024)Optimizing Dilithium Implementation with AVX2/-512ACM Transactions on Embedded Computing Systems10.1145/368730923:6(1-30)Online publication date: 10-Aug-2024
  • Show More Cited By

Index Terms

  1. Parallel Small Polynomial Multiplication for Dilithium: A Faster Design and Implementation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ACSAC '22: Proceedings of the 38th Annual Computer Security Applications Conference
    December 2022
    1021 pages
    ISBN:9781450397599
    DOI:10.1145/3564625
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 December 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Badges

    Author Tags

    1. Arm Cortex A72
    2. CRYSTAL-Dilithium
    3. Digital signature
    4. Polynomial multiplication
    5. Post-quantum cryptography

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ACSAC

    Acceptance Rates

    Overall Acceptance Rate 104 of 497 submissions, 21%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)149
    • Downloads (Last 6 weeks)23
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)PUF-Dilithium: Design of a PUF-Based Dilithium Architecture Benchmarked on ARM ProcessorsACM Transactions on Embedded Computing Systems10.1145/371532824:2(1-20)Online publication date: 25-Jan-2025
    • (2025)Performance Analysis of Diverse Active Queue Management AlgorithmsInternational Journal of Networked and Distributed Computing10.1007/s44227-025-00056-113:1Online publication date: 18-Feb-2025
    • (2024)Optimizing Dilithium Implementation with AVX2/-512ACM Transactions on Embedded Computing Systems10.1145/368730923:6(1-30)Online publication date: 10-Aug-2024
    • (2024)Cybersecurity in Critical Infrastructures: A Post-Quantum Cryptography PerspectiveIEEE Internet of Things Journal10.1109/JIOT.2024.341070211:18(30217-30244)Online publication date: 15-Sep-2024
    • (2023)Towards Dependable RISC-V Cores for Edge Computing Devices2023 IEEE 29th International Symposium on On-Line Testing and Robust System Design (IOLTS)10.1109/IOLTS59296.2023.10224862(1-7)Online publication date: 3-Jul-2023
    • (2023)Conceptual Review on Number Theoretic Transform and Comprehensive Review on Its ImplementationsIEEE Access10.1109/ACCESS.2023.329444611(70288-70316)Online publication date: 2023

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media