
Title: Parallel Multivariate Spatio-Temporal Clustering of Large Ecological Datasets on Hybrid Supercomputers

Conference

A proliferation of data from vast networks of remote sensing platforms (satellites, unmanned aircraft systems (UAS), airborne platforms, etc.), observational facilities (meteorological, eddy covariance, etc.), state-of-the-art sensors, and simulation models offers unprecedented opportunities for scientific discovery. Unsupervised classification is a widely applied data mining approach to derive insights from such data. However, classification of very large data sets is a complex computational problem that requires efficient numerical algorithms and implementations on high performance computing (HPC) platforms. Additionally, increasing power, space, cooling, and efficiency requirements have led to the deployment of hybrid supercomputing platforms with complex architectures and memory hierarchies, such as the Titan system at Oak Ridge National Laboratory. The advent of such accelerated computing architectures offers new challenges and opportunities for big data analytics in general and, in our case, for large-scale cluster analysis specifically. Although there is an existing body of work on parallel cluster analysis, those approaches do not fully meet the needs imposed by the nature and size of our large data sets. Moreover, they have scaling limitations and are mostly limited to traditional distributed-memory computing platforms. We present a parallel Multivariate Spatio-Temporal Clustering (MSTC) technique based on k-means cluster analysis that can target hybrid supercomputers like Titan. We developed a hybrid MPI, CUDA, and OpenACC implementation that can utilize both CPU and GPU resources on computational nodes. We describe performance results on Titan that demonstrate the scalability and efficacy of our approach in processing large ecological data sets.
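
As a rough illustration of the k-means decomposition that parallel clustering of this kind typically relies on, the sketch below splits the data into chunks, computes per-chunk partial sums and counts, and combines them before updating the centroids. It is not the MSTC implementation described in the abstract: the function names (`nearest`, `partial_sums`, `kmeans_step`, `kmeans`), the chunk layout, and the convergence tolerance are all assumptions made for this example. Each chunk stands in for the data held by one MPI rank, and the loop that combines the partials stands in for a sum-reduction across ranks.

```python
# Illustrative sketch only; hypothetical names, not the MSTC code.
# Each "chunk" stands in for the observations owned by one MPI rank (assumption).

def nearest(point, centroids):
    """Return the index of the centroid closest to point (squared Euclidean)."""
    best, best_d = 0, float("inf")
    for j, c in enumerate(centroids):
        d = sum((p - q) ** 2 for p, q in zip(point, c))
        if d < best_d:
            best, best_d = j, d
    return best

def partial_sums(chunk, centroids):
    """Per-chunk work: accumulate coordinate sums and counts per cluster."""
    k, dim = len(centroids), len(centroids[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for point in chunk:
        j = nearest(point, centroids)
        counts[j] += 1
        for d in range(dim):
            sums[j][d] += point[d]
    return sums, counts

def kmeans_step(chunks, centroids):
    """One Lloyd iteration: combine per-chunk partials, then update centroids."""
    k, dim = len(centroids), len(centroids[0])
    total = [[0.0] * dim for _ in range(k)]
    total_n = [0] * k
    for chunk in chunks:  # in a distributed run this would be a sum-reduction
        sums, counts = partial_sums(chunk, centroids)
        for j in range(k):
            total_n[j] += counts[j]
            for d in range(dim):
                total[j][d] += sums[j][d]
    # A cluster that received no points keeps its previous centroid.
    return [
        [total[j][d] / total_n[j] for d in range(dim)]
        if total_n[j] else list(centroids[j])
        for j in range(k)
    ]

def kmeans(chunks, centroids, max_iter=100, tol=1e-9):
    """Iterate until the largest squared centroid shift is at most tol."""
    for _ in range(max_iter):
        new = kmeans_step(chunks, centroids)
        shift = max(
            sum((a - b) ** 2 for a, b in zip(c_old, c_new))
            for c_old, c_new in zip(centroids, new)
        )
        centroids = new
        if shift <= tol:
            break
    return centroids

if __name__ == "__main__":
    chunks = [[(0.0, 0.0), (0.0, 1.0)], [(10.0, 10.0), (10.0, 11.0)]]
    print(kmeans(chunks, [[0.0, 0.0], [10.0, 10.0]]))
```

Because only the per-cluster sums and counts cross chunk boundaries, the amount of data exchanged per iteration depends on the number of clusters and variables rather than on the number of observations. In a hybrid setting, the per-chunk assignment loop is the part one would expect to offload to an accelerator.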

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1399976
Resource Relation:
Conference: IEEE Cluster 2017, Honolulu, Hawaii, United States of America, 9/5/2017-9/8/2017
Country of Publication:
United States
Language:
English
