Exaflops Biomedical Knowledge Graph Analytics
- ORNL
- University of California, San Francisco
- Georgia Institute of Technology
- Georgia Institute of Technology, Atlanta
We are motivated by newly proposed methods for mining large-scale corpora of scholarly publications (e.g., full biomedical literature), which consists of tens of millions of papers spanning decades of research. In this setting, analysts seek to discover relationships among concepts. They construct graph representations from annotated text databases and then formulate the relationship-mining problem as an all-pairs shortest paths (APSP) and validate connective paths against curated biomedical knowledge graphs (e.g., Spoke). In this context, we present Coast (Exascale Communication-Optimized All-Pairs Shortest Path) and demonstrate 1.004 EF/s on 9,200 Frontier nodes (73,600 GCDs). We develop hyperbolic performance models (HYPERMOD), which guide optimizations and parametric tuning. The proposed Coast algorithm achieved the memory constant parallel efficiency of 99% in the single-precision tropical semiring. Looking forward, Coast will enable the integration of scholarly corpora like PubMed into the Spoke biomedical knowledge graph.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- DOE Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1997807
- Resource Relation:
- Conference: The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC22) - Dallas, Texas, United States of America - 11/13/2022 10:00:00 AM-11/18/2022 5:00:00 AM
- Country of Publication:
- United States
- Language:
- English
Similar Records
K-RET: knowledgeable biomedical relation extraction system
Visual Understanding of COVID-19 Knowledge Graph for Predictive Analysis