skip to main content
10.1145/3225058.3225147acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

Optimization of the Spherical Harmonics Transform based Tree Traversals in the Helmholtz FMM Algorithm

Published: 13 August 2018 Publication History

Abstract

The fast multipole method (FMM) provides a fast and accurate method of solving large N-body problems with application in a broad range of physics problems. While there exists a large body of work for efficient implementation of the Laplace variant of the FMM algorithm, an optimized implementation of the Helmholtz variant of FMM which is used to study wave physics related phenomena is more complicated. In this study, we focus on the computationally expensive tree traversal operations of the Helmholtz FMM algorithm and describe a series of techniques to improve their performance. We demonstrate that the described techniques yield significant speedups (up to a factor of 4.71x) on a realistic surface geometry commonly encountered in Helmholtz problems.

References

[1]
Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Eric Darve, Matthias Messner, and Toru Takahashi. 2014. Task-based FMM for multicore architectures. SIAM Journal on Scientific Computing 36, 1 (2014), C66-C93.
[2]
A. Appel. 1985. An efficient program for many-body simulations. SIAM J. Sci. Comput. 6 (1985), 85--103.
[3]
J. Barnes and P. Hut. 1986. A hierarchical ((n log n) force calculation algorithm. Nature 324 (1986), 446--449.
[4]
Cris Cecka and Eric Darve. 2013. Fourier-based fast multipole method for the Helmholtz equation. SIAM Journal on Scientific Computing 35, 1 (2013), A79--A103.
[5]
W.C. Chew, V. Jandala, C.C. Lu, E. Michielssen, B. Shanker, J.M. Song, and J.S. Zhao. 1997. Fast multilevel techniques for solving integral equations in electromagnetics. In Microwave Conference Proceedings, 1997. APMC '97., 1997 Asia-Pacific, Vol. 1. 457--460 vol.1.
[6]
Ronald Coifman, Vladimir Rokhlin, and Stephen Wandzura. 1993. The fast multipole method for the wave equation: A pedestrian prescription. IEEE Antennas and Propagation Magazine 35, 3 (1993), 7--12.
[7]
Eric Darve. 2000. The fast multipole method: numerical implementation. J. Comput. Phys. 160, 1 (2000), 195--240.
[8]
Ozgur Ergul and Levent Gurel. 2006. Enhancing the accuracy of the interpolations and anterpolations in MLFMA. IEEE Antennas and Wireless Propagation Letters 5, 1 (2006).
[9]
Matteo Frigo and Steven G. Johnson. 2005. The Design and Implementation of FFTW3. Proc. IEEE 93, 2 (2005), 216--231. Special issue on "Program Generation, Optimization, and Platform Adaptation".
[10]
L. Greengard and V. Rokhlin. 1987. A fast algorithm for particle simulations. J. Comput. Phys. 20 (1987), 63--71.
[11]
S. Hughey, H. M. Aktulga, and B. Shanker. 2018. Scalable and Accurate Tree Traversal Operators for Helmholtz FMM. under submission (2018).
[12]
R Jakob-Chien and B K Alpert. 1997. A fast spherical filter with uniform resolution. J. Comp. Phys. 136 (1997), 580--584.
[13]
J. Sarvas. 2003. Performing Interpolation and Anterpolation by the Fast Fourier Transform in the 3D Multilevel Fast Multipole Algorithm. SIAM J. Numer. Anal. 41 (2003), 2180--2196.
[14]
B. Shanker, A.A. Ergin, Mingyu Lu, and E. Michielssen. 2003. Fast analysis of transient electromagnetic scattering phenomena using the multilevel plane wave time domain algorithm. Antennas and Propagation, IEEE Transactions on 51, 3 (March 2003), 628--641.
[15]
Jiming Song, Cai-Cheng Lu, and Weng Cho Chew. 1997. Multilevel fast multipole algorithm for electromagnetic scattering by large complex objects. IEEE Transactions on Antennas and Propagation 45, 10 (1997), 1488--1493.
[16]
J. M. Song and W. C. Chew. 1995. Multilevel fast-multipole algorithm for solving combined field integral equations of electromagnetic scattering. Microwave and Optical Technology Letters 10, 1 (1995), 14--19.
[17]
Hari Sundar, Rahul S Sampath, and George Biros. 2008. Bottom-up construction and 2: 1 balance refinement of linear octrees in parallel. SIAM Journal on Scientific Computing 30, 5 (2008), 2675--2708.
[18]
M. Vikram, He Huang, B. Shanker, and T. Van. 2009. A Novel Wideband FMM for Fast Integral Equation Solution of Multiscale Problems in Electromagnetics. Antennas and Propagation, IEEE Transactions on 57, 7 (July 2009), 2094--2104.
[19]
M.S. Warren and J.K. Salmon. 1993. A parallel hashed oct-tree N-body algorithm. In Proc. Supercomputing. 1--12.
[20]
Nils P Wedi, Mats Hamrud, and George Mozdzynski. 2013. A fast spherical harmonics transform for global NWP and climate models. Monthly Weather Review 141, 10 (2013), 3450--3461.
[21]
Lexing Ying and Denis Zorin. 2004. A simple manifold-based construction of surfaces of arbitrary smoothness. ACM Transactions on Graphics 23 (August 2004), 271--275. Issue 3.
[22]
Jun-Sheng Zhao and Weng Cho Chew. 2000. Integral Equation Solution of Maxwells Equations from Zero Frequency to Microwave Frequencies. IEEE Transactions on Antennas and Propagation 48 (2000), 1635--1645.

Cited By

View all
  • (2022)High Performance Evaluation of Helmholtz Potentials Using the Multi-Level Fast Multipole AlgorithmIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2022.316564933:12(3651-3666)Online publication date: 1-Dec-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICPP '18: Proceedings of the 47th International Conference on Parallel Processing
August 2018
945 pages
ISBN:9781450365109
DOI:10.1145/3225058
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • University of Oregon: University of Oregon

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 August 2018

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICPP 2018

Acceptance Rates

ICPP '18 Paper Acceptance Rate 91 of 313 submissions, 29%;
Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 19 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2022)High Performance Evaluation of Helmholtz Potentials Using the Multi-Level Fast Multipole AlgorithmIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2022.316564933:12(3651-3666)Online publication date: 1-Dec-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media