Abstract
Since I/O-intensive tasks running on a heterogeneous cluster need a highly effective usage of global I/O resources, previous CPU- or memory-centric load balancing schemes suffer significant performance drop under I/O-intensive workload due to the imbalance of I/O load. To solve this problem, we develop two I/O-aware load-balancing schemes, which consider system heterogeneity and migrate more I/O-intensive tasks from a node with high I/O utilization to those with low I/O utilization. If the workload is memory-intensive in nature, the new method applies a memory-based load balancing policy to assign the tasks. Likewise, when the workload becomes CPU-intensive, our scheme leverages a CPU-based policy as an efficient means to balance the system load. In doing so, the proposed approach maintains the same level of performance as the existing schemes when I/O load is low or well balanced. Results from a trace-driven simulation study show that, when a workload is I/O-intensive, the proposed schemes improve the performance with respect to mean slowdown over the existing schemes by up to a factor of 8. In addition, the slowdowns of almost all the policies increase consistently with the system heterogeneity.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Harchol-Balter, M., Downey, A.: Exploiting process lifetime distributions for load balancing. ACM Transactions on Computer Systems 15, 253–285 (1997)
Acharva, A., Setia, S.: Availability and utility of idle memory in workstation clusters. In: Proceedings of the ACM SIGMETRICS Conf. on Measuring and Modeling of Computer Systems (1999)
Xiao, L., Zhang, X., Qu, Y.: Effective load sharing on heterogeneous networks of workstations. In: Proc. of International Symposium on Parallel and Distributed Processing (2000)
Qin, X., Jiang, H., Zhu, Y., Swanson, D.: A dynamic load balancing scheme for I/O-intensive applications in distributed systems. In: Proceedings of the 32nd International Conference on Parallel Processing Workshops (2003)
Qin, X., Jiang, H., Zhu, Y., Swanson, D.: Boosting performance for I/O-intensive workload by preemptive job migrations in a cluster system. In: Proc. of the 15th Symp. on Computer Architecture and High Performance Computing, Brazil (2003)
Scheuermann, P., Weikum, G., Zabback, P.: Data partitioning and load balancing in parallel disk systems. The VLDB Journal, 48–66 (1998)
Cho, Y., Winslett, M.S., Kuo, J.L., Chen, Y.: Parallel I/O for scientific applications on heterogeneous clusters: A resource-utilization approach. In: Proceedings of Supercomputing (1999)
Zhu, Y., Jiang, H., Qin, X., Feng, D., Swanson, D.: Scheduling for improved write performance in a cost-effective, fault-tolerant parallel virtual file system (CEFTPVFS). In: The Fourth LCI International Conference on Linux Clusters (2003)
Zhu, Y., Jiang, H., Qin, X., Feng, D., Swanson, D.: Improved read performance in a cost-effective, fault-tolerant parallel virtual file system (ceft-pvfs). In: Proc. of the 3rd IEEE/ACM Intl. Symp. on Cluster Computing and the Grid (2003)
Ma, X., Winslett, M., Lee, J., Yu, S.: Faster collective output through active buffering. In: Proceedings of the International Symposium on Parallel and Distributed Processing (2002)
Qin, X., Jiang, H., Zhu, Y., Swanson, D.: Dynamic load balancing for I/O- and memory-intensive workload in clusters using a feedback control mechanism. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds.) Euro-Par 2003. LNCS, vol. 2790, pp. 224–229. Springer, Heidelberg (2003)
Forney, B., Arpaci-Dusseau, A.C., Arpaci-Dusseau, R.H.: Storage-aware caching: Revisiting caching for heterogeneous storage systems. In: Proceedings of the 1st Symposium on File and Storage Technology, Monterey, California, USA (2002)
Geoffray, P.: Opiom: Off-processor I/O with myrinet. Future Generation Computer Systems 18, 491–499 (2002)
Franklin, M., Govindan, V.: A general matrix iterative model for dynamic load balancing. Parallel Computing 33 (1996)
Eager, D., Lazowska, E., Zahorjan, J.: Adaptive load sharing in homogeneous distributed systems. IEEE Trans. on Software Eng. 12, 662–675 (1986)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Qin, X., Jiang, H., Zhu, Y., Swanson, D.R. (2003). Dynamic Load Balancing for I/O-Intensive Tasks on Heterogeneous Clusters. In: Pinkston, T.M., Prasanna, V.K. (eds) High Performance Computing - HiPC 2003. HiPC 2003. Lecture Notes in Computer Science, vol 2913. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24596-4_32
Download citation
DOI: https://doi.org/10.1007/978-3-540-24596-4_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20626-2
Online ISBN: 978-3-540-24596-4
eBook Packages: Springer Book Archive