Load Balancing in Cluster Using BLCR Checkpoint/Restart

Hariyale, Hemant; Vardhan, Manu; Pandey, Ankit; Mishra, Ankit; Kushwaha, Dharmender Singh

doi:10.1007/978-3-642-31513-8_74


Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 176)


Abstract

Modern computation is growing more complex, and its resource requirements are rising steadily. High Throughput Computing is one technique for dealing with this complexity. After running for a significant amount of time, computing clusters become heavily overloaded, which degrades performance. Load balancing is harder in Computer Supported Cooperative Working (CSCW) because there is no central coordinator. An overloaded node does not participate in a CSCW network. This paper proposes migrating computation-intensive jobs from overloaded nodes to underloaded nodes, which frees the overloaded nodes to participate in CSCW and so increases the number of participating nodes. Evaluation of the proposed approach shows that the availability and performance of CSCW clusters improve by 30%-40% with fault-tolerance-based load balancing.
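The migration idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the `Node` type, the load metric, and the `high`/`low` thresholds are assumptions made for illustration only. The sketch pairs each overloaded node with an underloaded one; the actual job transfer, which the paper's title attributes to BLCR checkpoint/restart, is outside its scope.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    load: float  # assumed metric: normalized load in the range 0.0-1.0


def plan_migrations(nodes, high=0.8, low=0.4):
    """Return (source, destination) name pairs for job migration.

    Nodes with load above `high` are treated as overloaded, nodes with
    load below `low` as underloaded. Both thresholds are hypothetical.
    The most loaded source is paired with the least loaded destination.
    """
    overloaded = sorted(
        (n for n in nodes if n.load > high),
        key=lambda n: n.load,
        reverse=True,
    )
    underloaded = sorted(
        (n for n in nodes if n.load < low),
        key=lambda n: n.load,
    )
    # zip stops at the shorter list, so surplus nodes stay unpaired.
    return [(src.name, dst.name) for src, dst in zip(overloaded, underloaded)]


cluster = [
    Node("a", 0.95),
    Node("b", 0.20),
    Node("c", 0.50),
    Node("d", 0.90),
    Node("e", 0.10),
]
plan = plan_migrations(cluster)
```

Each pair in `plan` names a node whose compute-intensive job would be checkpointed and the node on which it would be restarted. Because the pairing needs only per-node load figures, it does not depend on a central coordinator holding job state, though how nodes exchange load information is left open here.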



Author information

Authors and Affiliations

Department of Computer Science and Engineering, Motilal Nehru National Institute of Technology Allahabad, Allahabad, 211004, U. P., India
Hemant Hariyale, Manu Vardhan, Ankit Pandey, Ankit Mishra & Dharmender Singh Kushwaha


Corresponding author

Correspondence to Hemant Hariyale.

Editor information

Editors and Affiliations

Department of Computer Science, Jackson State University, John R. Lynch Street 1400, Jackson, 39217, USA
Natarajan Meghanathan
Wireilla Net Solutions PTY Ltd, Melbourne, Australia
Dhinaharan Nagamalai
Department of Computer Science & Eng., University of Calcutta, Calcutta, 700 073, India
Nabendu Chaki


Cite this paper

Hariyale, H., Vardhan, M., Pandey, A., Mishra, A., Kushwaha, D.S. (2012). Load Balancing in Cluster Using BLCR Checkpoint/Restart. In: Meghanathan, N., Nagamalai, D., Chaki, N. (eds) Advances in Computing and Information Technology. Advances in Intelligent Systems and Computing, vol 176. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31513-8_74


DOI: https://doi.org/10.1007/978-3-642-31513-8_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31512-1
Online ISBN: 978-3-642-31513-8
eBook Packages: Engineering, Engineering (R0)
