ABSTRACT
With the recent rapid development of computing power, interest in machine learning research on large data sets is increasing significantly. The machine learning is used in a wide variety of fields, from information retrieval, data mining, and speech recognition to human-computer interaction and application development by non-experts using machine learning platforms. However, there is not enough research on load balancing for distributed systems composed of heterogeneous servers with different performances and architectures that process machine learning tasks.
Therefore, in this paper, we propose level hashing-based load balancing applicable to heterogeneous machine learning platforms. The proposed load balancing technique improves the execution time of all machine learning tasks in a machine learning platform by considering the characteristics of machine learning tasks and computing resources of each server.
- DongJun Choi, Kwang Sik Chung, and JinGon Shon. 2010. An Improvement on the Weighted Least-Connection Scheduling Algorithm for Load Balancing in Web Cluster Systems. In Grid and Distributed Computing, Control and Automation, Taihoon Kim, Stephen S. Yau, Osvaldo Gervasi, Byeong-Ho Kang, Adrian Stoica, and Dominik Ślęzak (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 127--134.Google Scholar
- D. M. Dias, W. Kish, R. Mukherjee, and R. Tewari. 1996. A scalable and highly available web server. In COMPCON '96. Technologies for the Information Superhighway Digest of Papers. 85--92.Google Scholar
- Ju-Yeon Jo and Yoohwan Kim. 2004. Hash-based Internet traffic load balancing. In Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, 2004. IRI 2004. 204--209.Google Scholar
- Dharmesh Kashyap and Jaydeep Viradiya. 2014. A Survey Of Various Load Balancing Algorithms In Cloud Computing. International Journal of Scientific & Technology Research 3 (2014), 115--119.Google Scholar
- Yasir Khalid, Muhammad Aleem, Radu Prodan, Muhammad Iqbal, and Arshad Islam. 2018. E-OSched: a load balancing scheduler for heterogeneous multicores. The Journal of Supercomputing 74 (05 2018), 5399--5431. https://doi.org/10.1007/s11227-018-2435-1Google ScholarDigital Library
- Heejin Kim, Younggwan Kim, and Jiman Hong. 2019. Cluster Management Framework for Autonomic Machine Learning Platform. In Proceedings of the Conference on Research in Adaptive and Convergent Systems (RACS '19). Association for Computing Machinery, New York, NY, USA, 128--130. https://doi.org/10.1145/3338840.3355691Google ScholarDigital Library
- Keon Myung Lee, Jaesoo Yoo, Sang Wook Kim, Jee Hyong Lee, and Jiman Hong. 2019. Autonomic machine learning platform. International Journal of Information Management 49 (Dec. 2019), 491--501. https://doi.org/10.1016/j.ijinfomgt.2019.07.003Google ScholarDigital Library
- Giang Nguyen, Stefan Dlugolinsky, Martin Bob'ak, Viet Tran, 'Alvaro L'opez Garc'ia, Ignacio Heredia, Peter Mat'ik, and Ladislav Hluch'y. 2019. Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey. Artificial Intelligence Review (2019), 1--48.Google ScholarDigital Library
- Rasmus Pagh and Flemming Friche Rodler. 2004. Cuckoo Hashing. J. Algorithms 51, 2 (May 2004), 122--144. https://doi.org/10.1016/j.jalgor.2003.12.002Google ScholarDigital Library
- Xiaoke Zhu, Qi Zhang, Ling Liu, Taining Cheng, Shaowen Yao, Wei Zhou, and Jing He. 2019. DLB: Deep Learning Based Load Balancing. (2019). arXiv:cs.DC/1910.08494Google Scholar
- Pengfei Zuo, Yu Hua, and Jie Wu. 2018. Write-Optimized and High-Performance Hashing Index Scheme for Persistent Memory. In Proceedings of the 13th USENIX Conference on Operating Systems Design and Implementation (OSDI'18). USENIX Association, USA, 461--476.Google ScholarDigital Library
Index Terms
- Load Balancing for Machine Learning Platform in Heterogeneous Distribute Computing Environment
Recommendations
Machine learning for load balancing in the Linux kernel
APSys '20: Proceedings of the 11th ACM SIGOPS Asia-Pacific Workshop on SystemsThe OS load balancing algorithm governs the performance gains provided by a multiprocessor computer system. The Linux's Completely Fair Scheduler (CFS) scheduler tracks process loads by average CPU utilization to balance workload between processor ...
A study on the application of existing load balancing algorithms for large, dynamic, heterogeneous distributed systems
SEPADS'10: Proceedings of the 9th WSEAS international conference on Software engineering, parallel and distributed systemsIn a large distributed computing environment, like Grid, tasks can be submitted at any host and the random arrival of tasks in such an environment can cause some hosts to be heavily loaded while others are idle or lightly loaded. So, in such environment,...
RILNET: A Reinforcement Learning Based Load Balancing Approach for Datacenter Networks
Machine Learning for NetworkingAbstractModern datacenter networks are facing various challenges, e.g., highly dynamic workloads, congestion, topology asymmetry. ECMP, as a traditional load balancing mechanism which is widely used in today’s datacenters, can balance load poorly and lead ...
Comments