skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Scalable Data-Intensive Geocomputation: A Design for Real-Time Continental Flood Inundation Mapping

Conference ·

The convergence of data-intensive and extreme-scale computing enables an integrated software and data ecosystem for scientific discovery. Developments in this realm will fuel transformative research in data-driven interdisciplinary domains. Geocomputation provides computing paradigms in Geographic Information Systems (GIS) for interactive computing of geographic data, processes, models, and maps. Because GIS is data-driven, the computational scalability of a geocomputation workflow is directly related to the scale of the GIS data layers, their resolution and extent, as well as the velocity of the geo-located data streams to be processed. Unique in high user interactivity and low end-to-end latency requirements, geocomputation applications will dramatically benefit from the convergence of high-end data analytics (HDA) and high-performance computing (HPC). The application level challenge, however, is to identify and eliminate computational bottlenecks that arise along a geocomputation workflow. Indeed, poor scalability at any of the workflow components is detrimental to the entire end-to-end pipeline. Here, we study a large geocomputation use case in flood inundation mapping that handles multiple national-scale geospatial datasets and targets low end-to-end latency. We discuss benefits and challenges for harnessing both HDA and HPC for data-intensive geospatial data processing and intensive numerical modeling of geographic processes. We propose an HDA+HPC geocomputation architecture design that couples HDA (e.g., Spark)-based spatial data handling and HPC-based parallel data modeling. Key techniques for coupling HDA and HPC to bridge the two different software stacks are reviewed and discussed.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1657950
Resource Relation:
Journal Volume: 1315; Conference: Smoky Mountains Computational Sciences & Engineering Conference 2020 (SMC2020) - Kingsport, Tennessee, United States of America - 8/26/2020 12:00:00 PM-8/28/2020 12:00:00 PM
Country of Publication:
United States
Language:
English

Similar Records

Real-time GIS Programming and Geocomputation
Book · Sat Jan 01 00:00:00 EST 2022 · OSTI ID:1657950

ISAVS: Interactive Scalable Analysis and Visualization System
Conference · Mon Nov 27 00:00:00 EST 2017 · SA '17: SIGGRAPH Asia 2017 Symposium on Visualization · OSTI ID:1657950

Aggregation and Structuring of Materials and Chemicals Data from Diverse Sources
Technical Report · Tue Dec 31 00:00:00 EST 2019 · OSTI ID:1657950

Related Subjects