Abstract
Mesh generation is a critical component for many (bio-)engineering applications. However, parallel mesh generation codes, which are essential for these applications to take the fullest advantage of the high-end computing platforms, belong to the broader class of adaptive and irregular problems, and are among the most complex, challenging, and labor intensive to develop and maintain. As a result, parallel mesh generation is one of the last applications to be installed on new parallel architectures. In this paper we present a way to remedy this problem for new highly-scalable architectures. We present a multi-layered tetrahedral/triangular mesh generation approach capable of delivering and sustaining close to 1018 of concurrent work units. We achieve this by leveraging concurrency at different granularity levels using a hybrid algorithm, and by carefully matching these levels to the hierarchy of the hardware architecture. This paper makes two contributions: (1) a new evolutionary path for developing multi-layered parallel mesh generation codes capable of increasing the concurrency of the state-of-the-art parallel mesh generation methods by at least 10 orders of magnitude and (2) a new abstraction for multi-layered runtime systems that target parallel mesh generation codes, to efficiently orchestrate intra- and inter-layer data movement and load balancing for current and emerging multi-layered architectures with deep memory and network hierarchies.
This material is based upon work supported by the National Science Foundation under Grants No. CCF-0833081, CSR-0719929, and CCS-0750901 and by the John Simon Guggenheim Foundation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Antonopoulos, C.D., Ding, X., Chernikov, A.N., Blagojevic, F., Nikolopoulos, D.S., Chrisochoides, N.P.: Multigrain parallel Delaunay mesh generation: Challenges and opportunities for multithreaded architectures. In: Proceedings of the 19th Annual International Conference on Supercomputing, pp. 367–376. ACM Press, New York (2005)
Barker, K., Chrisochoides, N.: An evalaution of a framework for the dynamic load balancing of highly adaptive and irregular applications. In: Supercomputing Conference. ACM, New York (2003)
Barker, K., Chernikov, A., Chrisochoides, N., Pingali, K.: A load balancing framework for adaptive and asynchronous applications. IEEE Transactions on Parallel and Distributed Systems 15(2), 183–192 (2004)
Blelloch, G.E., Hardwick, J.C., Miller, G.L., Talmor, D.: Design and implementation of a practical parallel Delaunay algorithm. Algorithmica 24, 243–269 (1999)
Blelloch, G.E., Miller, G.L., Talmor, D.: Developing a practical projection-based parallel Delaunay algorithm. In: Proceedings of the 12th Annual ACM Symposium on Computational Geometry, Philadelphia, PA, May 1996, pp. 186–195 (1996)
Bowyer, A.: Computing Dirichlet tesselations. Computer Journal 24, 162–166 (1981)
Burstedde, C., Ghattas, O., Stadler, G., Tu, T., Wilcox, L.C.: Towards adaptive mesh PDE simulations on petascale computers. In: Proceedings of Teragrid (2008)
Chernikov, A.N., Chrisochoides, N.P.: Practical and efficient point insertion scheduling method for parallel guaranteed quality Delaunay refinement. In: Proceedings of the 18th Annual International Conference on Supercomputing, Malo, France, pp. 48–57. ACM Press, New York (2004)
Chernikov, A.N., Chrisochoides, N.P.: Parallel guaranteed quality Delaunay uniform mesh refinement. SIAM Journal on Scientific Computing 28, 1907–1926 (2006)
Chernikov, A.N., Chrisochoides, N.P.: Algorithm 872: Parallel 2D constrained Delaunay mesh generation. ACM Transactions on Mathematical Software 34(1), 1–20 (2008)
Chernikov, A.N., Chrisochoides, N.P.: Three-dimensional Delaunay refinement for multi-core processors. In: Proceedings of the 22nd Annual International Conference on Supercomputing, Island of Kos, Greece, pp. 214–224. ACM Press, New York (2008)
Paul Chew, L.: Guaranteed-quality triangular meshes. Technical Report TR89983, Cornell University, Computer Science Department (1989)
Chrisochoides, N., Barker, K., Nave, D., Hawblitzel, C.: Mobile object layer: a runtime substrate for parallel adaptive and irregular computations. Adv. Eng. Softw. 31(8-9), 621–637 (2000)
Chrisochoides, N.P.: A survey of parallel mesh generation methods. Technical Report BrownSC-2005-09, Brown University (2005); Also appears as a chapter in Bruaset, A.M., Tveito, A.: Numerical Solution of Partial Differential Equations on Parallel Computers. Springer, Heidelberg (2006)
Devine, K., Hendrickson, B., Boman, E., John, M.S., Vaughan, C.: Design of dynamic load-balancing tools for parallel applications. In: Proc. of the Int. Conf. on Supercomputing, Santa Fe (May 2000)
Devine, K.D., Boman, E.G., Riesen, L.A., Catalyurek, U.V., Chevalier, C.: Getting started with zoltan: A short tutorial. In: Proc. of 2009 Dagstuhl Seminar on Combinatorial Scientific Computing, Also available as Sandia National Labs Tech. Report SAND2009-0578C
Diachin, L., Bauer, A., Fix, B., Kraftcheck, J., Jansen, K., Luo, X., Miller, M., Ollivier-Gooch, C., Shephard, M.S., Tautges, T., Trease, H.: Interoperable mesh and geometry tools for advanced petascale simulations. Journal of Physics: Conference Series 78(1), 12015 (2007)
Dong, S., Lucor, D., Karniadakis, G.E.: Flow past a stationary and moving cylinder: DNS at Re=10,000. In: Proceedings of the 2004 Users Group Conference (DOD_UGC 2004), Williamsburg, VA, pp. 88–95 (2004)
George, P.-L., Borouchaki, H.: Delaunay Triangulation and Meshing. Application to Finite Elements. HERMES (1998)
Isenburg, M., Liu, Y., Shewchuk, J., Snoeyink, J.: Streaming computation of Delaunay triangulations. ACM Transactions on Graphics 25(3), 1049–1056 (2006)
Johnson, K., Kaashoek, M., Wallach, D.: CRL: High-performance all-software distributed shared memory. In: 15th Symp. on OS Prin (COSP15), December 1995, pp. 213–228 (1995)
Kadow, C.: Parallel Delaunay Refinement Mesh Generation. PhD thesis, Carnegie Mellon University (2004)
Kadow, C., Walkington, N.: Design of a projection-based parallel Delaunay mesh generation and refinement algorithm. In: 4th Symposium on Trends in Unstructured Mesh Generation, Albuquerque, NM (July 2003), http://www.andrew.cmu.edu/user/sowen/usnccm03/agenda.html
Kalé, L., Krishnan, S.: CHARM++: A portable concurrent object oriented system based on C++. In: Proceedings of OOPSLA 1993, pp. 91–108 (1993)
Kot, A., Chernikov, A., Chrisochoides, N.: Effective out-of-core parallel Delaunay mesh refinement using off-the-shelf software. In: Proceedings of the 20th IEEE International Parallel and Distributed Processing Symposium, Rhodes Island, Greece (April 2006). http://ieeexplore.ieee.org/search/wrapper.jsp?arnumber=1639361
Kot, A., Chernikov, A.N., Chrisochoides, N.P.: Out-of-core parallel Delaunay mesh generation. In: 17th IMACS World Congress Scientific Computation, Applied Mathematics and Simulation, Paris, France, Paper T1-R-00-0710 (2005)
Kulkarni, M., Pingali, K., Ramanarayanan, G., Walter, B., Bala, K., Chew, L.P.: Optimistic parallelism benefits from data partitioning. In: Architectural Support for Programming Languages and Operating Systems (2008)
Kulkarni, M., Pingali, K., Walter, B., Ramanarayanan, G., Bala, K., Chew, L.P.: Optimistic parallelism requires abstractions. SIGPLAN Not. 42(6), 211–222 (2007)
Linardakis, L., Chrisochoides, N.: Delaunay decoupling method for parallel guaranteed quality planar mesh refinement. SIAM Journal on Scientific Computing 27(4), 1394–1423 (2006)
Linardakis, L., Chrisochoides, N.: Algorithm 870: A static geometric medial axis domain decomposition in 2D Euclidean space. ACM Transactions on Mathematical Software 34(1), 1–28 (2008)
Linardakis, L., Chrisochoides, N.: Graded Delaunay decoupling method for parallel guaranteed quality planar mesh generation. SIAM Journal on Scientific Computing 30(4), 1875–1891 (2008)
Mitchell, S.A., Vavasis, S.A.: Quality mesh generation in higher dimensions. SIAM Journal for Computing 29(4), 1334–1370 (2000)
Nave, D., Chrisochoides, N., Chew, L.P.: Guaranteed–quality parallel Delaunay refinement for restricted polyhedral domains. In: Proceedings of the 18th ACM Symposium on Computational Geometry, Barcelona, Spain, pp. 135–144 (2002)
Nieplocha, J., Carpenter, B.: Armci: A portable remote memory copy library for distributed array libraries and compiler runtime systems. In: Proceedings RTSPP IPPS/SDP 1999 (1999) ID: bib:Nieplocha
Scott, M., Spear, M., Dalessandro, L., Marathe, V.: Delaunay triangulation with transactions and barriers. In: Proceedings of 2007 IEEE International Symposium on Workload Characterization (2007)
Shewchuk, J.R.: Triangle: Engineering a 2D Quality Mesh Generator and Delaunay Triangulator. In: Lin, M.C., Manocha, D. (eds.) FCRC-WS 1996 and WACG 1996. LNCS, vol. 1148, pp. 203–222. Springer, Heidelberg (1996)
Shewchuk, J.R.: Delaunay refinement algorithms for triangular mesh generation. Computational Geometry: Theory and Applications 22(1–3), 21–74 (2002)
Shöberl, J.: NETGEN: An advancing front 2d/3d-mesh generator based on abstract rules. Computing and Visualization in Science 1, 41–52 (1997)
Si, H., Gaertner, K.: Meshing piecewise linear complexes by constrained Delaunay tetrahedralizations. In: Proceedings of the 14th International Meshing Roundtable, San Diego, CA, pp. 147–163. Springer, Heidelberg (2005)
Sterling, T.: A hybrid technology multithreaded computer architecture for petaflops computing 1997. TY: STD; CAPSL Technical Memo 01, Jet Propulsion Library, California Institute of Technology, California (January 1997)
To, A.C., Liu, W.K., Olson, G.B., Belytschko, T., Chen, W., Shephard, M.S., Chung, Y.W., Ghanem, R., Voorhees, P.W., Seidman, D.N., Wolverton, C., Chen, J.S., Moran, B., Freeman, A.J., Tian, R., Luo, X., Lautenschlager, E., Challoner, A.D.: Materials integrity in microsystems: a framework for a petascale predictive-science-based multiscale modeling and simulation system. Computational Mechanics 42, 485–510 (2008)
von Eicken, T., Culler, D., Goldstein, S., Schauser, K.: Active messages: A mechanism for integrated communication and computation. In: Proceedings of the 19th Int. Symp. on Comp. Arch., pp. 256–266. ACM Press, New York (1992)
Walters, R.A.: Coastal ocean models: Two useful finite element methods. Recent Developments in Physical Oceanographic Modeling: Part II 25, 775–793 (2005)
Watson, D.F.: Computing the n-dimensional Delaunay tesselation with application to Voronoi polytopes. Computer Journal 24, 167–172 (1981)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chrisochoides, N., Chernikov, A., Fedorov, A., Kot, A., Linardakis, L., Foteinos, P. (2009). Towards Exascale Parallel Delaunay Mesh Generation. In: Clark, B.W. (eds) Proceedings of the 18th International Meshing Roundtable. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04319-2_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-04319-2_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04318-5
Online ISBN: 978-3-642-04319-2
eBook Packages: EngineeringEngineering (R0)