ABSTRACT
This paper presents a performance evaluation of scientific workflows on networked cloud systems with particular emphasis on evaluating the effect of provisioned network bandwidth on application I/O performance. The experiments were run on ExoGENI, a widely distributed networked infrastructure as a service (NIaaS) testbed. ExoGENI orchestrates a federation of independent cloud sites located around the world along with backbone circuit providers. The evaluation used a representative data-intensive scientific workflow application called Montage. The application was deployed on a virtualized HTCondor environment provisioned dynamically from the ExoGENI networked cloud testbed, and managed by the Pegasus workflow manager.
The results of our experiments show the effect of modifying provisioned network bandwidth on disk I/O throughput and workflow execution time. The marginal benefit as perceived by the workflow reduces as the network bandwidth allocation increases to a point where disk I/O saturates. There is little or no benefit from increasing network bandwidth beyond this inflection point. The results also underline the importance of network and I/O performance isolation for predictable application performance, and are applicable for general data-intensive workloads. Insights from this work will also be useful for real-time monitoring, application steering and infrastructure planning for data-intensive workloads on networked cloud platforms.
- Amazon Elastic Compute Cloud (Amazon EC2). http://www.amazon.com/ec2.Google Scholar
- I. Baldine, Y. Xin, A. Mandal, P. Ruth, A. Yumerefendi, and J. Chase. Exogeni: A multi-domain infrastructure-as-a-service testbed. In 8th International ICST Conference on Testbeds and Research Infrastructures for the Development of Networks and Communities (TRIDENTCOM 2012), 2012.Google ScholarCross Ref
- J. Chase, L. Grit, D. Irwin, V. Marupadi, P. Shivam, and A. Yumerefendi. Beyond virtual data centers: Toward an open resource control architecture. In Selected Papers from the International Conference on the Virtual Computing Initiative (ACM Digital Library), May 2007.Google Scholar
- J. S. Chase, D. E. Irwin, L. E. Grit, J. D. Moore, and S. E. Sprenkle. Dynamic Virtual Clusters in a Grid Site Manager. In Proceedings of the Twelfth International Symposium on High Performance Distributed Computing (HPDC), June 2003. Google ScholarDigital Library
- E. Deelman, G. Juve, M. Malawski, and J. Nabrzyski. Hosted science: Managing computational workflows in the cloud. Parallel Processing Letters, 23(2), 2013.Google ScholarCross Ref
- E. Deelman, G. Singh, M.-H. Su, J. Blythe, Y. Gil, C. Kesselman, G. Mehta, K. Vahi, G. B. Berriman, J. Good, et al. Pegasus: A framework for mapping complex scientific workflows onto distributed systems. Scientific Programming, 13(3): 219--237, 2005. Google ScholarDigital Library
- Eucalyptus Systems. http://www.eucalyptus.com/.Google Scholar
- ExoGENI Wiki. http://wiki.exogeni.net.Google Scholar
- Y. Fu, J. Chase, B. Chun, S. Schwab, and A. Vahdat. SHARP: An Architecture for Secure Resource Peering. In Proceedings of the 19th ACM Symposium on Operating System Principles, October 2003. Google ScholarDigital Library
- D. Ghoshal, R. S. Canon, and L. Ramakrishnan. I/o performance of virtualized cloud environments. In Proceedings of the second international workshop on Data intensive computing in the clouds, DataCloud-SC '11, pages 71--80, New York, NY, USA, 2011. ACM. Google ScholarDigital Library
- A. Iosup, S. Ostermann, N. Yigitbasi, R. Prodan, T. Fahringer, and D. Epema. Performance analysis of cloud computing services for many-tasks scientific computing. IEEE Trans. Parallel Distrib. Syst., 22(6): 931--945, June 2011. Google ScholarDigital Library
- D. Irwin, J. S. Chase, L. Grit, A. Yumerefendi, D. Becker, and K. G. Yocum. Sharing Networked Resources with Brokered Leases. In Proceedings of the USENIX Technical Conference, June 2006. Google ScholarDigital Library
- K. Jackson, L. Ramakrishnan, K. Muriki, S. Canon, S. Cholia, J. Shalf, H. J. Wasserman, and N. Wright. Performance analysis of high performance computing applications on the amazon web services cloud. In Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on, pages 159--168, 2010. Google ScholarDigital Library
- J. C. Jacob, D. S. Katz, G. B. Berriman, J. C. Good, A. C. Laity, E. Deelman, C. Kesselman, G. Singh, M. Su, T. A. Prince, and R. Williams. Montage a grid portal and software toolkit for science grade astronomical image mosaicking. Int. J. Comput. Sci. Eng., 4(2): 73--87, July 2009. Google ScholarDigital Library
- G. Juve and E. Deelman. Scientific workflows and clouds. Crossroads, 16(3): 14--18, Mar. 2010. Google ScholarDigital Library
- G. Juve and E. Deelman. Scientific workflows in the cloud. In M. Cafaro and G. Aloisio, editors, Grids, Clouds and Virtualization, Computer Communications and Networks, pages 71--91. Springer London, 2011.Google Scholar
- G. Juve, E. Deelman, G. B. Berriman, B. P. Berman, and P. Maechling. An evaluation of the cost and performance of scientific workflows on amazon ec2. J. Grid Comput., 10(1): 5--21, 2012. Google ScholarDigital Library
- Z. Li, L. O'Brien, H. Zhang, and R. Cai. A factor framework for experimental design for performance evaluation of commercial cloud services. In CloudCom, pages 169--176. IEEE, 2012. Google ScholarDigital Library
- X. Lin, Y. Mao, F. Li, and R. Ricci. Towards fair sharing of block storage in a multi-tenant cloud. In Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing, HotCloud'12, pages 15--15, Berkeley, CA, USA, 2012. USENIX Association. Google ScholarDigital Library
- Linux iotop. http://guichaz.free.fr/iotop/.Google Scholar
- M. Malawski, G. Juve, E. Deelman, and J. Nabrzyski. Cost- and deadline-constrained provisioning for scientific workflow ensembles in iaas clouds. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC '12, pages 22:1--22:11, Los Alamitos, CA, USA, 2012. IEEE Computer Society Press. Google ScholarDigital Library
- A. Mandal, I. Baldine, Y. Xin, P. Ruth, and C. Heerman. Enabling persistent queries for cross-aggregate performance monitoring. Technical Report TR-13-01, Renaissance Computing Institute, 2013, http://www.renci.org/wp-content/uploads/2013/04/TR-13-01.pdf.Google Scholar
- A. Mandal, Y. Xin, I. Baldine, P. Ruth, C. Heerman, J. Chase, V. Orlikowski, and A. Yumerefendi. Provisioning and evaluating multi-domain networked clouds for hadoop-based applications. In Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on, pages 690--697, 2011. Google ScholarDigital Library
- D. Milojicic, I. M. Llorente, and R. S. Montero. Opennebula: A cloud management tool. IEEE Internet Computing, 15(2): 11--14, 2011. Google ScholarDigital Library
- OpenStack Cloud Software. http://openstack.org.Google Scholar
- S. Ostermann, A. Iosup, N. Yigitbasi, R. Prodan, T. Fahringer, and D. H. J. Epema. A performance analysis of ec2 cloud computing services for scientific computing. In D. R. Avresky, M. Diaz, A. Bode, B. Ciciani, and E. Dekel, editors, CloudComp, volume 34 of Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, pages 115--131. Springer, 2009.Google Scholar
- X. Pu, L. Liu, Y. Mei, S. Sivathanu, Y. Koh, and C. Pu. Understanding performance interference of i/o workload in virtualized cloud environments. In Proceedings of the 2010 IEEE 3rd International Conference on Cloud Computing, CLOUD '10, pages 51--58, Washington, DC, USA, 2010. IEEE Computer Society. Google ScholarDigital Library
- L. Ramakrishnan, L. Grit, A. Iamnitchi, D. Irwin, A. Yumerefendi, and J. Chase. Toward a Doctrine of Containment: Grid Hosting with Adaptive Resource Control. In Supercomputing (SC06), November 2006. Google ScholarDigital Library
- J. Shafer. I/o virtualization bottlenecks in cloud computing today. In Proceedings of the 2nd conference on I/O virtualization, WIOV'10, pages 5--5, Berkeley, CA, USA, 2010. USENIX Association. Google ScholarDigital Library
- R. Tudoran, A. Costan, G. Antoniu, and L. Bougé. A performance evaluation of azure and nimbus clouds for scientific applications. In Proceedings of the 2nd International Workshop on Cloud Computing Platforms, CloudCP '12, pages 4:1--4:6, New York, NY, USA, 2012. ACM. Google ScholarDigital Library
- J.-S. Vöckler, G. Juve, E. Deelman, M. Rynge, and B. Berriman. Experiences using cloud computing for a scientific workflow application. In Proceedings of the 2nd international workshop on Scientific cloud computing, ScienceCloud '11, pages 15--24, New York, NY, USA, 2011. ACM. Google ScholarDigital Library
- J. Wang and I. Altintas. Early cloud experiences with the kepler scientific workflow system. Procedia Computer Science, 9(0): 1630--1634, 2012.Google ScholarCross Ref
Index Terms
- Evaluating I/O aware network management for scientific workflows on networked clouds
Recommendations
Adapting scientific workflows on networked clouds using proactive introspection
UCC '15: Proceedings of the 8th International Conference on Utility and Cloud ComputingRecent advances in cloud technologies and on-demand network circuits have created an unprecedented opportunity to enable complex data-intensive scientific applications to run on dynamic, networked cloud infrastructure. However, there is a lack of tools ...
Comparing FutureGrid, Amazon EC2, and Open Science Grid for Scientific Workflows
Scientists have many computing infrastructures available to conduct their research, including grids and public or private clouds. This article explores the use of these cyberinfrastructures to execute scientific workflows, an important class of ...
Network Analysis of Scientific Workflows: A Gateway to Reuse
Online workflow repositories let scientists share successful experimental routines and compose new workflows from best practices and existing service components. The authors share the results of a social- network analysis of the myExperiment workflow ...
Comments