Fast Multiple Montgomery Multiplications Using Intel AVX-512IFMA Instructions

Takahashi, Daisuke

doi:10.1007/978-3-030-58814-4_52

Daisuke Takahashi ORCID: orcid.org/0000-0003-1357-5770¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12253))

Included in the following conference series:

International Conference on Computational Science and Its Applications

1498 Accesses

Abstract

In this paper, we propose a fast implementation of multiple Montgomery multiplications using Intel AVX-512IFMA (Integer Fused Multiply-Add) instructions. The proposed implementation is based on a modified Montgomery multiplication. For Montgomery multiplication operands with 52 bits or fewer, the proposed implementation using Intel AVX-512IFMA instructions is up to approximately 12.22 and 4.30 times faster than the implementations using Intel 64 and Intel AVX-512F (Foundation) instructions on an Intel Core i3-8121U processor, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Accelerating Large Integer Multiplication Using Intel AVX-512IFMA

Low Complexity and High Speed Montgomery Multiplication Based on FFT

Research and Modification of Montgomery Multiplication Algorithm

References

Montgomery, P.L.: Modular multiplication without trial division. Math. Comput. 44, 519–521 (1985)
Article MathSciNet Google Scholar
Intel Corporation: Intel 64 and IA-32 architectures software developer’s manual, volume 1: Basic architecture (2019). https://software.intel.com/sites/default/files/managed/a4/60/253665-sdm-vol-1.pdf
Gueron, S., Krasnov, V.: Software implementation of modular exponentiation, using advanced vector instructions architectures. In: Özbudak, F., Rodríguez-Henríquez, F. (eds.) WAIFI 2012. LNCS, vol. 7369, pp. 119–135. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31662-3_9
Chapter MATH Google Scholar
Bos, J.W., Montgomery, P.L., Shumow, D., Zaverucha, G.M.: Montgomery multiplication using vector instructions. In: Lange, T., Lauter, K., Lisoněk, P. (eds.) SAC 2013. LNCS, vol. 8282, pp. 471–489. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-43414-7_24
Chapter Google Scholar
Drucker, N., Gueron, S.: Fast modular squaring with AVX512IFMA. In: Latifi, S. (ed.) 16th International Conference on Information Technology-New Generations (ITNG 2019). AISC, vol. 800, pp. 3–8. Springer, Cham (2019)
Chapter Google Scholar
Page, D., Smart, N.P.: Parallel cryptographic arithmetic using a redundant Montgomery representation. IEEE Trans. Comput. 53, 1474–1482 (2004)
Article Google Scholar
Bos, J.W.: High-performance modular multiplication on the Cell processor. In: Hasan, M.A., Helleseth, T. (eds.) WAIFI 2010. LNCS, vol. 6087, pp. 7–24. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13797-6_2
Chapter Google Scholar
Meng, L., Johnson, J.R., Franchetti, F., et al.: Spiral-generated modular FFT algorithms. In: Proceedings of 4th International Workshop on Parallel and Symbolic Computation (PASCO 2010), pp. 169–170 (2010)
Google Scholar
Takahashi, D.: Computation of the 100 quadrillionth hexadecimal digit of $\pi $ on a cluster of Intel Xeon Phi processors. Parallel Comput. 75, 1–10 (2018)
Article MathSciNet Google Scholar
Intel Corporation: Intel C++ compiler 19.0 developer guide and reference (2019). https://software.intel.com/sites/default/files/cpp_dev_guide_190_u5_1.pdf

Download references

Acknowledgments

This research was partially supported by JSPS KAKENHI Grant Number JP19K11989.

Author information

Authors and Affiliations

Center for Computational Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki, 305-8577, Japan
Daisuke Takahashi

Authors

Daisuke Takahashi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daisuke Takahashi .

Editor information

Editors and Affiliations

University of Perugia, Perugia, Italy
Osvaldo Gervasi
University of Basilicata, Potenza, Potenza, Italy
Beniamino Murgante
Chair- Center of ICT/ICE, Covenant University, Ota, Nigeria
Sanjay Misra
University of Cagliari, Cagliari, Italy
Chiara Garau
University of Cagliari, Cagliari, Italy
Ivan Blečić
Clayton School of Information Technology, Monash University, Clayton, VIC, Australia
David Taniar
Department of Information Science, Kyushu Sangyo University, Fukuoka, Japan
Bernady O. Apduhan
University of Minho, Braga, Portugal
Ana Maria A. C. Rocha
Polytechnic University of Bari, Bari, Italy
Eufemia Tarantino
Polytechnic University of Bari, Bari, Italy
Carmelo Maria Torre
Department of Neurology, University of Massachusetts Medical School, Worcester, MA, USA
Yeliz Karaca

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Takahashi, D. (2020). Fast Multiple Montgomery Multiplications Using Intel AVX-512IFMA Instructions. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2020. ICCSA 2020. Lecture Notes in Computer Science(), vol 12253. Springer, Cham. https://doi.org/10.1007/978-3-030-58814-4_52

Download citation

DOI: https://doi.org/10.1007/978-3-030-58814-4_52
Published: 03 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58813-7
Online ISBN: 978-3-030-58814-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Fast Multiple Montgomery Multiplications Using Intel AVX-512IFMA Instructions

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Accelerating Large Integer Multiplication Using Intel AVX-512IFMA

Low Complexity and High Speed Montgomery Multiplication Based on FFT

Research and Modification of Montgomery Multiplication Algorithm

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Fast Multiple Montgomery Multiplications Using Intel AVX-512IFMA Instructions

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Accelerating Large Integer Multiplication Using Intel AVX-512IFMA

Low Complexity and High Speed Montgomery Multiplication Based on FFT

Research and Modification of Montgomery Multiplication Algorithm

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation