Towards effective science cloud provisioning for a large-scale high-throughput computing

Kim, Seoyoung; Kim, Jik-Soo; Hwang, Soonwook; Kim, Yoonhee

doi:10.1007/s10586-014-0371-2

Published: 19 April 2014

Volume 17, pages 1157–1169, (2014)

Cluster Computing

Abstract

The science cloud paradigm has been actively developed and investigated, but it still lacks a suitable system model that can support growing scientific computation needs with high performance. This paper presents an effective provisioning model for science clouds, aimed particularly at large-scale high-throughput computing applications. The model draws on job traces, applying a statistical method to select the features that most influence application performance. Using these features, the system determines where a VM is deployed (allocation) and which instance type is appropriate (provisioning). An adaptive evaluation step that follows job execution allows the model to adapt to dynamic computing environments. We demonstrate the performance gains by comparing the proposed model with other policies through experiments, and we expect the model to yield noticeable performance improvements as well as a reduced cost of resource consumption.
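The feature-selection step described in the abstract can be illustrated with a small sketch. The abstract does not name the statistical method, so this example assumes principal component analysis (PCA) and ranks features by the magnitude of their loading on the first principal component, found by power iteration. The feature names, the trace values, and every function name below are hypothetical and are not taken from the paper.

```python
import math

# Hypothetical job-trace records; column names and values are illustrative only.
FEATURES = ["runtime_s", "cpu_count", "memory_mb", "queue_id"]
TRACE = [
    [10, 1, 512, 3],
    [20, 2, 1024, 1],
    [30, 2, 1500, 2],
    [40, 4, 2048, 3],
    [50, 4, 2600, 1],
    [60, 8, 3000, 2],
    [70, 8, 3600, 3],
    [80, 8, 4096, 1],
]


def standardize(rows):
    """Scale every column to zero mean and unit sample variance."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    stds = []
    for j in range(d):
        var = sum((r[j] - means[j]) ** 2 for r in rows) / (n - 1)
        stds.append(math.sqrt(var) or 1.0)  # guard against constant columns
    return [[(r[j] - means[j]) / stds[j] for j in range(d)] for r in rows]


def correlation_matrix(rows):
    """Correlation matrix of the columns (covariance of standardized data)."""
    z = standardize(rows)
    n, d = len(z), len(z[0])
    return [[sum(r[i] * r[j] for r in z) / (n - 1) for j in range(d)]
            for i in range(d)]


def first_component(matrix, iterations=200):
    """Approximate the leading eigenvector of a symmetric matrix by power iteration."""
    d = len(matrix)
    # Slightly non-uniform start to reduce the chance of beginning
    # orthogonal to the leading eigenvector.
    v = [1.0 + 0.01 * i for i in range(d)]
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break
        v = [x / norm for x in w]
    return v


def rank_features(names, rows):
    """Return (name, loading) pairs sorted by absolute loading, largest first."""
    loadings = first_component(correlation_matrix(rows))
    return sorted(zip(names, loadings), key=lambda p: abs(p[1]), reverse=True)


if __name__ == "__main__":
    for name, loading in rank_features(FEATURES, TRACE):
        print(f"{name}: {loading:+.3f}")
```

In a pipeline of this kind, the highest-ranked features would be the inputs to the allocation and provisioning decisions; how the paper actually maps features to VM placement and instance type is not stated in the abstract, so that step is left out of the sketch.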

Acknowledgments

S.Y. Kim thanks S.-h. Nam for useful comments and support. This research was supported in part by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Science, ICT and Future Planning (NRF-2013R1A1A3007866).

Author information

Authors and Affiliations

National Institute of Supercomputing and Networking, KISTI, Daejeon, 305-806, Korea
Seoyoung Kim, Jik-Soo Kim & Soonwook Hwang
Department of Computer Science, Sookmyung Women’s University, Seoul, 140-742, Korea
Yoonhee Kim

Corresponding author

Correspondence to Yoonhee Kim.

About this article

Cite this article

Kim, S., Kim, JS., Hwang, S. et al. Towards effective science cloud provisioning for a large-scale high-throughput computing. Cluster Comput 17, 1157–1169 (2014). https://doi.org/10.1007/s10586-014-0371-2

Received: 29 September 2013
Revised: 25 February 2014
Accepted: 15 March 2014
Published: 19 April 2014
Issue Date: December 2014
DOI: https://doi.org/10.1007/s10586-014-0371-2
