A queuing network model for minimizing the total makespan of computational grids

https://doi.org/10.1016/j.compeleceng.2012.03.011Get rights and content

Abstract

This paper offers a mathematical solution based on queuing theory and a generalized stochastic Petri net model to minimize the total makespan of the grid computing environments. A grid manager could minimize the total makespan through cautious distribution of subtasks to the grid resources. Subtask arrival rates depend on the arrival rate of the grid tasks submitted to the grid manager, local tasks directly submitted to the grid resources and the processing speed of the resources. Modeling the grid environment using queuing network, the steady state analysis of the network will result in the mean response time of the resources. Therefore, the total makespan could be minimized by minimizing the longest mean response time of the resources. The accuracy of the values obtained for the subtasks arrival rates at each of the grid resources from solving the corresponding queuing network could be further evaluated by steady state analysis of the generalized stochastic Petri net modeling the same grid environment.

Highlights

► We model grid computing environment using queuing networks (QNs) and generalized stochastic Petri nets (GSPNs). ► The proposed models consider both grid and local tasks. ► The aim of the modeling is to find subtask arrival rates at each of the grid resources to reach minimum total makespan. ► The results obtained for subtasks arrival rates in both methods (QN and GSPN) are similar.

Introduction

Grid computing is a technology to build dynamically constructed problem solving environments using geographically and organizationally distributed computational resources connected via communication links [1]. Grid computing provides supercomputing like power on demand, just as a power grid which provides electricity on demand. Typically, when talking about grids and grid environments, the mean is referring to grid computing or computational grids. Based on definition, a computational grid is a large collection of computers linked via the Internet so that their combined processing power can be harnessed to work on difficult or time consuming problems. With the advent of processors that obtain higher and higher performance measures and the emergence of open system operating systems such as Linux, inexpensive clusters of multiple processor systems for medium- and higher-scale computations are more possible and also cost effective. Computational grids allow one to control the extra central processing unit (CPU) cycles available on network and apply those CPU cycles to other, more resource-intensive purposes [2]. The overall motivation for most current grid projects is to enable the resource and human interactions that facilitate large-scale science and engineering such as aerospace systems design, high-energy physics data analysis, climate research, large scale remote instrument operation, collaborative astrophysics and so on [1].

High throughput computing (HTC) is of great concern to grid computing environments, because tasks submitted to a grid manager (GM) are mostly independent and relatively large [3], [4], [5]. HTC is aimed at minimizing the total completion time of all the tasks submitted to a GM in long periods of time, whilst high performance computing (HPC) prefers fast response to individual tasks [6], [7], [8]. To achieve HTC in grid computing environments, GM could keep track of availability and processing power of the grid resources. Thereby, GM could dispatch the subtasks produced by splitting an entire task to the appropriate grid resources considering the total makespan of the grid environment. The makespan of a resource is the time slot between the start and completion of the sequence of tasks assigned to the resource [9], [10], [11]. The total makespan of a computational grid is defined as the largest makespan of the grid resources. Minimizing the total makespan of a grid computing environment, the throughput of the environment can be increased accordingly [6], [11].

Various task scheduling algorithms and dispatching strategies have been proposed in grid computing environments to appropriately dispatch tasks within grid resources to reach a predefined target. This target which is known as a goal of scheduling has a close relationship with quality of service (QoS) parameters in grid environments. Generally, grid scheduling algorithms are presented to optimize at least one of the QoS parameters. The total makespan of the system is one of the most interesting QoS parameters in grid computing environments. Many task scheduling algorithms have been presented to dispatch the grid tasks to grid resources minimizing the total makespan of the environment [4], [6], [8], [9], [12], [13]. Nevertheless, almost none of these algorithms can find a deterministic solution for the scheduling problem, because scheduling tasks between the resources is an NP-complete problem [6], [8], [9], [10], [11], [12], [14]. So, in the best cases, the algorithms find a near optimal solution for the problem and therefore, the estimated value for makespan is not the minimum possible value. In order to overcome this difficulty, some other scheduling and dispatching schemes have been presented to relax the problem into one can be solved with mathematical solutions [15], [16], [17], [18], [19], [20], [21]. Using these approaches, various QoS measures can be taken into account and considered as a goal of scheduling. Furthermore, some realistic assumptions such as local tasks’ arrivals and grid environments’ topologies can be considered in these models in which had been ignored in previously mentioned algorithms.

In this paper, a mathematical approach based on queuing theory is presented to minimize the total makespan of the grid computing environments. To do this, grid resources and tasks should be mapped to the related concepts in queuing theory. Makespan of a grid resource could be considered as the mean response time of the server nodes in steady state analysis of the open queuing network (QN) modeling the grid computing environment [22], [23]. Each of the grid resources can be considered as M/M/1 queuing systems [15], [17], [18] within such QN. In the steady state analysis of M/M/1 queuing systems, the inverse of the difference of task arrival and service rates is equal to the mean response time of the server [24], [25]. To minimize the maximum mean response time among the resources’ mean response times, the subtasks arrival rates to each of the resources has to be adjusted. To achieve this, in our proposed approach, a set of inequalities describing the subtasks arrival constraints and equalities describing the overall tasks arrival rates are built. The major constraint in steady state analysis of QNs is that the subtasks arrival rate has to be less than the service rate in each of the resources; otherwise, there will be an infinite queue of tasks waiting for the resource services. In addition, the overall task arrival rate to the queuing network has to be equal to the sum of the subtask arrival rates to the resources. After finding this equality and inequality system, the Simplex method [26] could be applied to solve it.

In order to provide a formal description of the scheduling scheme in grid environments and graphically represent the grid scheduling workflow, a generalized stochastic Petri net (GSPN) model is offered in this paper. GSPNs are an abstract formal graph model useful for representing concurrency, synchronization and communication. The probabilistic nature of GSPNs allows such operations to be described with a high level of abstraction. Using this extension of Petri nets, one can model time consuming actions of the system (such as task processing in grids) and immediate actions (such as tasks dispatching in GMs) simultaneously. Modeling the grid task scheduling using GSPNs, the results obtained from queuing network solution can be evaluated further. The proposed GSPN models the arrival of grid and local tasks and dispatch of the subtasks in the grid computing environment. In addition, the services provided by the grid resources to execute the submitted tasks are modeled by the presented GSPN. Using steady state analysis of the proposed GSPN and examining various weights for immediate transitions, modeling subtasks distribution, the best dispatching weights for each of the grid resources can be obtained. Applying these weights to the grid tasks arrival rate, the subtasks arrival rates at each of the resources could be estimated. Adjusting the subtasks arrival rates using both the GSPN and mathematical solution of the corresponding QN, the total makespan of the grid environment could be minimized. In Section 5, it is shown that the subtasks arrival rates obtained from steady state analysis of the proposed GSPN are almost the same as the arrival rates obtained by solving the equality and inequality system resulted from corresponding QN.

The remaining parts of the paper are organized as follows: in Section 2, the related works done around the modeling and analysis of the grid environments using QNs and Petri nets are introduced. Sections 3 The proposed QN solution for grid computing environment, 4 The proposed GSPN model for grid computing environment present the proposed approaches for tasks dispatching in grid computing environments using open QNs and GSPNs, respectively. In Section 5, two numerical examples are presented to illustrate the proposed approaches and to show the applicability of them. Finally, Section 6 concludes the paper and presents future work which can be done in this research field.

Section snippets

Related works

There are on-going research works on performance and throughput modeling and analysis of distributed systems (e.g. grid environments) [15], [16], [17], [18], [19], [21], [27], [28], [29], [30], [31], [32]. Some of these research works model the grid environments in the form of queuing networks and various extensions of Petri nets. The aim of the modeling is mainly to find several useful metrics for grid environments to estimate the quality of task scheduling and dispatching processes. However,

The proposed QN solution for grid computing environment

Queuing networks (QNs) consisting of several service stations are very suitable for representing the structure of many systems with large number of resources [24], [25]. Considering the properties of the grid computing environments (e.g. distribution function of tasks arrival rates and service rates of the resources), one can easily model the grid environment as a QN [15], [17], [18], [27]. To do this, the following assumptions are made about the environment. These assumptions provide a general

The proposed GSPN model for grid computing environment

Petri nets (PNs) are graphical tools for formal description of systems whose dynamics are characterized by concurrency, synchronization, mutual exclusion and conflicts which are typical features of distributed environments [38]. The formal definition of PNs is as follows:

Definition 1. A PN is a 5-tuple PN=(P,T,I-,I+,M0) where

  • P={p1,p2,,pn} is a finite and non-empty set of places,

  • T={t1,t2,,tm} is a finite and non-empty set of transitions,

  • PT=ϕ,

  • I-,I+:P×TN0 are the backward and forward incidence

Illustrative examples

In order to facilitate the explanation of the proposed methods and to highlight the generality of them, two different examples are considered in this section. The aim of this section is to show the applicability of the approaches on the real systems. Actually, no comparison has been done in this section, because other similar methods do not consider some details of the system such as local tasks arrival rates. On the other hand, the target of the scheduling and system parameters influencing the

Conclusion and future work

To attain HTC within grid environments, the total makespan of the environment could be minimized. The minimum makespan can be achieved by adjusting subtasks arrival rates at each of the grid resources. The subtasks arrival rates depend on the processing speed of the resources and the overall tasks arrival rate at GM. The arrival rates could be formulated through steady state analysis of a QN, modeling the grid environment. The result of the analysis will be an equality and inequality system

Acknowledgement

We gratefully acknowledge Mr. Khosrow Shakouri for helping us in preparation of this paper.

Saeed Parsa received the B.S. degree in mathematics and computer Science from Sharif University of Technology in Iran, and the M.S. and Ph.D. degrees in computer science from the University of Salford at England. He is an associate professor of computer science at Iran University of Science and Technology (IUST). His research interests include software engineering, soft Computing and algorithms.

References (41)

  • M.A. Azgomi et al.

    Task scheduling modelling and reliability evaluation of grid services using coloured Petri nets

    J Future Gener Comput Syst

    (2010)
  • A. Afzal et al.

    Capacity planning and scheduling in grid computing environments

    J Future Gener Comput Syst

    (2008)
  • J. Palmer et al.

    Optimal and heuristic policies for dynamic server allocation

    J Parallel Distrib Comput

    (2005)
  • W.E. Johnston

    Implementing production grids

  • A.J. Wells

    Grid application systems design

    (2008)
  • L. Wang et al.

    Grid computing: infrastructure, service and applications

    (2009)
  • Xue Y, Wang Y, Wang J, Luo Y, Hu Y, Zhong S, et al. High throughput computing for spatial information processing system...
  • X. He et al.

    A QoS guided min-min heuristic for grid task scheduling

    J Comput Sci Technol

    (2003)
  • S. Parsa et al.

    RASA: a new grid task scheduling algorithm

    Int J Digital Content Technol Appl

    (2009)
  • Entezari-Maleki R, Movaghar A. A genetic-based scheduling algorithm to minimize the makespan of the grid applications....
  • Cited by (7)

    View all citing articles on Scopus

    Saeed Parsa received the B.S. degree in mathematics and computer Science from Sharif University of Technology in Iran, and the M.S. and Ph.D. degrees in computer science from the University of Salford at England. He is an associate professor of computer science at Iran University of Science and Technology (IUST). His research interests include software engineering, soft Computing and algorithms.

    Reza Entezari-Maleki is currently a Ph.D. student in computer engineering (software discipline) at the Department of Computer Engineering in Sharif University of Technology in Tehran, Iran. He received his B.S. and M.S. degrees in computer engineering (software discipline) from Iran University of Science and Technology (IUST) in Tehran, Iran in 2007 and 2009, respectively. He is also a member of Iran’s National Elite Foundation. His main research interests include grid computing, performance evaluation, performability and dependability modeling, and task scheduling algorithms.

    Reviews processed and proposed for publication to Editor-in-Chief by Associate Editor Xiaolong Wu.

    View full text