DOI: 10.1145/3581784.3607067

High-Performance and Programmable Attentional Graph Neural Networks with Global Tensor Formulations

Published: 11 November 2023

Abstract

Graph attention models (A-GNNs), a type of Graph Neural Network (GNN), have been shown to be more powerful than simpler convolutional GNNs (C-GNNs). However, A-GNNs are more complex to program and difficult to scale. To address this, we develop a novel mathematical formulation, based on tensors that group all the feature vectors, targeting both training and inference of A-GNNs. The formulation enables straightforward adoption of communication-minimizing routines, fosters optimizations such as vectorization, and allows seamless integration with established linear algebra DSLs and libraries such as GraphBLAS. Our implementation uses a data redistribution scheme developed explicitly for the sparse-dense tensor operations that dominate GNN workloads, together with fusion optimizations that further reduce memory usage and communication cost. We show theoretical asymptotic reductions in communicated data compared to the established message-passing GNN paradigm. Finally, we deliver excellent scalability and speedups of up to 4--5x over modern libraries such as Deep Graph Library.
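
To make the "global tensor formulation" idea concrete, below is a minimal single-process sketch of one GAT-style attention layer written entirely as whole-graph sparse-dense tensor operations rather than per-vertex message passing. This is an illustrative reconstruction under stated assumptions, not the paper's distributed implementation: the function name gat_layer and all variable names are hypothetical, and the data redistribution and fusion optimizations described in the abstract are omitted.

```python
# Minimal single-process sketch (assumption: NOT the paper's distributed
# implementation; gat_layer and all names here are illustrative) of one
# GAT-style attention layer expressed as whole-graph tensor operations.
import numpy as np
import scipy.sparse as sp

def gat_layer(A: sp.csr_matrix, H: np.ndarray, W: np.ndarray,
              a_src: np.ndarray, a_dst: np.ndarray) -> np.ndarray:
    """A: n x n sparse adjacency; H: n x f features; W: f x f' weights."""
    Z = H @ W                          # dense feature transform (GEMM)
    s = Z @ a_src                      # per-source attention term, shape (n,)
    t = Z @ a_dst                      # per-destination attention term, (n,)
    rows, cols = A.nonzero()           # edge list from the sparse structure
    e = s[rows] + t[cols]              # un-normalized score, one per edge
    e = np.where(e > 0.0, e, 0.2 * e)  # LeakyReLU with slope 0.2, as in GAT
    # Row-wise softmax over the sparse score matrix: subtracting the global
    # max keeps exp() stable and does not change per-row normalization.
    E = sp.csr_matrix((np.exp(e - e.max()), (rows, cols)), shape=A.shape)
    deg = np.asarray(E.sum(axis=1)).ravel()
    E = sp.diags(1.0 / np.maximum(deg, 1e-12)) @ E
    return E @ Z                       # masked-attention aggregation (SpMM)

# Toy usage: a 3-vertex path graph with random features.
A = sp.csr_matrix(np.array([[0, 1, 0],
                            [1, 0, 1],
                            [0, 1, 0]], dtype=np.float64))
rng = np.random.default_rng(0)
out = gat_layer(A, rng.standard_normal((3, 4)), rng.standard_normal((4, 8)),
                rng.standard_normal(8), rng.standard_normal(8))
print(out.shape)  # (3, 8)
```

Note that the whole layer reduces to two dense GEMMs, an edge-wise softmax, and one sparse-dense product (SpMM); it is exactly this structure that lets such formulations map onto GraphBLAS-style primitives and onto communication-avoiding distributed kernels.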

Supplemental Material

MP4 File: SC23 paper presentation recording for "High-Performance and Programmable Attentional Graph Neural Networks with Global Tensor Formulations", by Maciej Besta, Pawel Renc, Robert Gerstenberger, Paolo Sylos Labini, Alexandros Ziogas, Tiancheng Chen, Lukas Gianinazzi, Florian Scheidl, Kalman Szenes, Armon Carigiet, Patrick Iff, Grzegorz Kwasniewski, Raghavendra Kanakagiri, Chio Ge, Sammy Jaeger, Jarosław Wąs, Flavio Vella, and Torsten Hoefler.

Cited By

  • (2024) "High Performance Unstructured SpMM Computation Using Tensor Cores". In SC24: International Conference for High Performance Computing, Networking, Storage and Analysis, 1-14. DOI: 10.1109/SC41406.2024.00060. Online publication date: 17-Nov-2024.
  • (2023) "Multi-task Graph Neural Network for Optimizing the Structure Fairness". In Database and Expert Systems Applications, 347-362. DOI: 10.1007/978-3-031-39821-6_29. Online publication date: 28-Aug-2023.

Published In

SC '23: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
November 2023
1428 pages
ISBN:9798400701092
DOI:10.1145/3581784

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 November 2023

Author Tags

  1. graph attention models
  2. graph neural networks
  3. sparse-dense tensor operations

Qualifiers

  • Research-article

Conference

SC '23

Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%


Article Metrics

  • Downloads (last 12 months): 282
  • Downloads (last 6 weeks): 25
Reflects downloads up to 19 Feb 2025
