Skip to main content

Resource Aware Scheduling for EDA Regression Jobs

  • Conference paper
  • First Online:
Euro-Par 2019: Parallel Processing Workshops (Euro-Par 2019)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11997))

Included in the following conference series:

Abstract

Typical Integrated Circuit (IC) design projects use Electronic Design Automation (EDA) tool flows to launch thousands of regressions every day on shared compute grids to complete the IC design verification process. These regressions in turn launch compute jobs with varied resource requirements and inter-job dependency constraints. Traditional grid schedulers, such as the Univa Grid Engine (UGE) [12] prioritize fairness over performance to maximize the number of jobs run with equal distribution of resources at any time. A constant challenge in day-to-day operations is to schedule these jobs for minimum overall job completion time so that developers can expect predictable regression turn-around time (TAT).

We propose a resource-aware scheduling mechanism that balances performance and fairness for real-word EDA-centric workloads. We present an analysis of historical profile information from a set of regressions with complex inter-job dependencies and highly variable resource requirements to show that many of these regression jobs are well suited for efficient packing on grid machines.

We formulate the regression scheduling problem as a variant of the bin packing problem, where the size of bins and balls may vary according to job-resource requirements and differing server configurations on the grid. We propose using two analytic techniques – namely k-means clustering [8] and adaptive binning [10], to solve this problem. We then evaluate the performance of our proposed solution using real workloads from daily regressions on an enterprise compute grid.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.ibm.com/support/knowledgecenter/en/SSFJTW_5.1.0.

  2. 2.

    https://www.pbspro.org/.

  3. 3.

    https://slurm.schedmd.com/.

  4. 4.

    http://www.univa.com/products/.

  5. 5.

    https://redis.io/.

  6. 6.

    https://www.elastic.co/products/beats/metricbeat.

References

  1. Gen, M., Cheng, R.: Genetic Algorithms and Engineering Optimization, vol. 7. Wiley, Hoboken (2000)

    Google Scholar 

  2. Grandl, R., Ananthanarayanan, G., Kandula, S., Rao, S., Akella, A.: Multi-resource packing for cluster schedulers. ACM SIGCOMM 44(4), 455–466 (2015)

    Article  Google Scholar 

  3. Hao, X., Lin, L., Gen, M., Ohno, K.: Effective estimation of distribution algorithm for stochastic job shop scheduling problem. Procedia Comput. Sci. 20, 102–107 (2013)

    Article  Google Scholar 

  4. Harchol-Balter, M., Schroeder, B., Agrawal, M., Bansal, N.: Size-based scheduling to improve web performance. ACM Trans. Comput. Syst. 21, 207–233 (2003)

    Article  Google Scholar 

  5. Holte, R.C.: Very simple classification rules perform well on most commonly used datasets. Mach. Learn. 11, 63–91 (1993). https://doi.org/10.1023/A:1022631118932

    Article  MATH  Google Scholar 

  6. Kamath, V., Giri, R., Muralidhar, R.: Experiences with a private enterprise cloud: providing fault tolerance and high availability for interactive EDA applications. In: Sixth IEEE International Conference on Cloud Computing (CLOUD), pp. 770–777, June 2013. https://doi.org/10.1109/CLOUD.2013.72

  7. Lawler, E.L., Lenstra, J.K., Kan, A.H.R., Shmoys, D.B.: Sequencing and scheduling: algorithms and complexity. Handb. Oper. Res. Manag. Sci. 4, 445–522 (1993)

    Google Scholar 

  8. Macqueen, J.: Some methods for classification and analysis of multivariate observations. In: 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)

    Google Scholar 

  9. Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  10. Poli, R., Woodward, J., Burke, E.K.: A histogram-matching approach to the evolution of bin-packing strategies. In: 2007 IEEE Congress on Evolutionary Computation, pp. 3500–3507. IEEE (2007)

    Google Scholar 

  11. Tibshirani, R., Walther, G., Hastie, T.: Estimating the number of clusters in a dataset via the gap statistic. J. R. Stat. Soc.: Ser. B (Stat. Methodol.) 63, 411–423 (2000)

    Article  Google Scholar 

  12. Univa Corporation: Univa® grid engine®. http://www.univa.com/. http://www.univa.com/products

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saurav Nanda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nanda, S., Parthasarathy, G., Choudhary, P., Venkatachar, A. (2020). Resource Aware Scheduling for EDA Regression Jobs. In: Schwardmann, U., et al. Euro-Par 2019: Parallel Processing Workshops. Euro-Par 2019. Lecture Notes in Computer Science(), vol 11997. Springer, Cham. https://doi.org/10.1007/978-3-030-48340-1_49

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-48340-1_49

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-48339-5

  • Online ISBN: 978-3-030-48340-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics