A robust framework for real-time distributed processing of satellite data

https://doi.org/10.1016/j.jpdc.2005.12.001Get rights and content

Abstract

It is estimated that future satellite instruments such as the Advanced Baseline Imager (ABI) and the Hyperspectral Environmental Suite (HES) on the GOES-R series of satellites will provide raw data volume of about 1.5 Terabyte per day. Due to the high data rate, satellite ground data processing will require considerable computing power to process data in real-time. Cluster technologies employing a multi-processor system present the only current economically viable option. To sustain high levels of system reliability and operability in a cluster-oriented operational environment, a fault-tolerant data processing framework is proposed to provide a platform for encapsulating science algorithms for satellite data processing. The science algorithms together with the framework are hosted on a Linux cluster.

In this paper we present an architectural model and a system prototype for providing performance, reliability, and scalability of candidate hardware and software for a satellite data processing system. Furthermore, benchmarking results are presented for a selected number of science algorithms for the Geosynchronous Imaging Fourier Transform Spectrometer (GIFTS) instrument showing that considerable performance can be gained without sacrificing the reliability and high availability constraints imposed on the operational cluster system.

Section snippets

Shahram Tehranian holds a M.S. in Fluid Dynamics from the Royal Institute of Technology, Stockholm, Sweden, and a M.S. in Computer Science from George Mason University, Virginia, USA. His research interests are high performance scientific computing on parallel and distributed platforms and his recent research focus has been on developing high performance data processing and archival systems for the National Oceanic and Atmospheric Administration (NOAA) using Linux cluster technology.

References (29)

  • K. Hawick et al.

    Distributed frameworks and parallel algorithms for processing large-scale geographic data

    Parallel Computing

    (2003)
  • R. Batchu et al.

    MPI/FT: architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing

  • Boost serialization library:...
  • A. Bouteiller et al.

    MPICH-V2: a fault tolerant MPI for volatile nodes based on the pessimistic sender based message logging

  • Data redistribution and remote method invocation in parallel component architectures, 19th IEEE International Parallel...
  • R.K. Garcia et al.

    Design studies for real-time distributed processing of infrared hyperspectral image data

  • R.K. Garcia et al.

    Component-oriented design studies for efficient processing of hyperspectral infrared imager data

  • GIFTS Downlink Subsystem User's Guide, Version 1.2, Document Reference 1201-USRG-001, ASRC Aerospace Corp., Maryland,...
  • GOES_R First Hyperspectral Sounder Data Compression Meeting, January 2002,...
  • R. Gordon

    Essential JNI: Java Native Interface

    (1998)
  • W. Gropp, S. Huss-Lederman, A. Lumsdaine, E. Lusk, MPI—The Complete Reference, vol. 2, The MPI Extensions, MIT Press,...
  • P. Haggerty et al.

    The NOAA MODIS/near-real-time System

  • K. Hawick, H. James, A. Silis, D. Grove, C. Pattern, J. Mathew, P. Coddington, K. Kerry, J. Hercus, F. Vaughan,...
  • High availability user's guide,...
  • Cited by (33)

    • Evaluating hedge fund downside risk using a multi-objective neural network

      2019, Journal of Visual Communication and Image Representation
      Citation Excerpt :

      Among them, there are two important risk measurement methods: dispersion risk measurement and risk exposure measurement. As the understanding of downside risk has gradually deepened, variance, semi-variance, and lower partial moment methods have been derived to measure dispersion risk [27–31]. The semi-variance method improves on the bilateral variance method by assuming that only downward fluctuations form the risk.

    • EDAWS: A distributed framework with efficient data analytics workspace towards discriminative services for critical infrastructures

      2018, Future Generation Computer Systems
      Citation Excerpt :

      Big data research gains the popularity and it is successfully applied to a broad variety of sustainability fields (e.g., home security, building management, smart cities, emergency control). With the help of analyzing large data in distributed computing environment, people thus benefit a lot from big data researches [24]. Today’s data mining software is confronted with enormous challenges about how to make use of newly developed paradigms of data processing.

    • SunwayMR: A distributed parallel computing framework with convenient data-intensive applications programming

      2017, Future Generation Computer Systems
      Citation Excerpt :

      Although MapReduce (e.g., Hadoop) gains popularity due to its simplicity, the main drawback of MapReduce is its inefficiency for processing structured (relational) data and graph data. In particular, Shahram, et al. propose a robust framework for real-time distributed processing of satellite data [10], with using the multiple-server redundancy backup manners to guarantee the framework’s reality. Nonetheless, it has few types of data to be processed.

    • A High-Performance Infrastructure for Remote Sensing Data Applications Using HPC Paradigms

      2022, International Journal of Electrical and Electronics Research
    View all citing articles on Scopus

    Shahram Tehranian holds a M.S. in Fluid Dynamics from the Royal Institute of Technology, Stockholm, Sweden, and a M.S. in Computer Science from George Mason University, Virginia, USA. His research interests are high performance scientific computing on parallel and distributed platforms and his recent research focus has been on developing high performance data processing and archival systems for the National Oceanic and Atmospheric Administration (NOAA) using Linux cluster technology.

    Yongsheng Zhao holds a Ph.D. degree in Aerospace Engineering from University of Maryland, and a M.S. and B.S. in Aerospace Engineering from Northwestern Polytechnical University, China. His current work is to develop real time, high performance, and fault tolerant software on distributed platforms such as Linux Clusters, with application to NOAA's satellite ground data processing systems.

    Tony Harvey received a BSc in Mathematics from London University in 1968. He has been involved in communications systems development from the early 1970s. Starting with terrestrial data and voice networks in Europe and then satellite ground systems in the USA, he has developed software for realtime communications systems for more than 30 years. In many of these systems he developed highly reliable software architectures in a variety of multi-processor environments. At AC Technologies, Tony was involved with the ground systems for NOAA's geostationary and polar orbiting satellites, implementing a one-way secure bridge for data flow out of the secure operations environment to provide public access to NOAA's weather data.

    Anand Swaroop holds a Ph.D. in Physics from the Indian Institute of Technology, Delhi, India, and a M.S. in Physics from Lucknow University, India. He is Program Manager with NORTEL Government Solutions, Inc., and supports National Oceanic and Atmospheric Administration (NOAA) satellite ground systems software development utilizing new hardware and software technologies. His research interests are real time high performance scientific computing on parallel and distributed platforms and developing high performance data processing and archival systems using Linux cluster technology for next generation satellite instrument such as GIFTS on board geosynchronous satellite.

    Keith McKenzie joined the NOAA National Environmental Satellite Data and Information Service (NESDIS) in 1980. He has worked as an engineering Project Manager managing GOES satellite ground system development projects since 1987. He is currently the manager of the Replacement Product Monitor (RPM), Modernized Sensor Processing System (MSPS), and GIFTS (level 0) calibration algorithm projects. He holds a M.S. in Telecommunications and Computers from George Washington University in Washington, D.C.

    View full text