Reference Hub3
Modeling and Computing Overlapping Aggregation of Large Data Sequences in Geographic Information Systems

Modeling and Computing Overlapping Aggregation of Large Data Sequences in Geographic Information Systems

Driss En-Nejjary, Francois Pinet, Myoung-Ah Kang
Copyright: © 2019 |Volume: 10 |Issue: 1 |Pages: 22
ISSN: 1947-8186|EISSN: 1947-8194|EISBN13: 9781522566601|DOI: 10.4018/IJISMD.2019010102
Cite Article Cite Article

MLA

En-Nejjary, Driss, et al. "Modeling and Computing Overlapping Aggregation of Large Data Sequences in Geographic Information Systems." IJISMD vol.10, no.1 2019: pp.20-41. http://doi.org/10.4018/IJISMD.2019010102

APA

En-Nejjary, D., Pinet, F., & Kang, M. (2019). Modeling and Computing Overlapping Aggregation of Large Data Sequences in Geographic Information Systems. International Journal of Information System Modeling and Design (IJISMD), 10(1), 20-41. http://doi.org/10.4018/IJISMD.2019010102

Chicago

En-Nejjary, Driss, Francois Pinet, and Myoung-Ah Kang. "Modeling and Computing Overlapping Aggregation of Large Data Sequences in Geographic Information Systems," International Journal of Information System Modeling and Design (IJISMD) 10, no.1: 20-41. http://doi.org/10.4018/IJISMD.2019010102

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Recently, in the field of information systems, the acquisition of geo-referenced data has made a huge leap forward in terms of technology. There is a real issue in terms of the data processing optimization, and different research works have been proposed to analyze large geo-referenced datasets based on multi-core approaches. In this article, different methods based on general-purpose logic on graphics processing unit (GPGPU) are modelled and compared to parallelize overlapping aggregations of raster sequences. Our methods are tested on a sequence of rasters representing the evolution of temperature over time for the same region. Each raster corresponds to a different data acquisition time period, and each raster geo-referenced cell is associated with a temperature value. This article proposes optimized methods to calculate the average temperature for the region for all the possible raster subsequences of a determined length, i.e., to calculate overlapping aggregated data summaries. In these aggregations, the same subsets of values are aggregated several times. For example, this type of aggregation can be useful in different environmental data analyses, e.g., to pre-calculate all the average temperatures in a database. The present article highlights a significant increase in performance and shows that the use of GPGPU parallel processing enabled us to run the aggregations up to more than 50 times faster than the sequential method including data transfer cost and more than 200 times faster without data transfer cost.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.