skip to main content
10.1145/2534695.2534698acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Evaluating I/O aware network management for scientific workflows on networked clouds

Published:17 November 2013Publication History

ABSTRACT

This paper presents a performance evaluation of scientific workflows on networked cloud systems with particular emphasis on evaluating the effect of provisioned network bandwidth on application I/O performance. The experiments were run on ExoGENI, a widely distributed networked infrastructure as a service (NIaaS) testbed. ExoGENI orchestrates a federation of independent cloud sites located around the world along with backbone circuit providers. The evaluation used a representative data-intensive scientific workflow application called Montage. The application was deployed on a virtualized HTCondor environment provisioned dynamically from the ExoGENI networked cloud testbed, and managed by the Pegasus workflow manager.

The results of our experiments show the effect of modifying provisioned network bandwidth on disk I/O throughput and workflow execution time. The marginal benefit as perceived by the workflow reduces as the network bandwidth allocation increases to a point where disk I/O saturates. There is little or no benefit from increasing network bandwidth beyond this inflection point. The results also underline the importance of network and I/O performance isolation for predictable application performance, and are applicable for general data-intensive workloads. Insights from this work will also be useful for real-time monitoring, application steering and infrastructure planning for data-intensive workloads on networked cloud platforms.

References

  1. Amazon Elastic Compute Cloud (Amazon EC2). http://www.amazon.com/ec2.Google ScholarGoogle Scholar
  2. I. Baldine, Y. Xin, A. Mandal, P. Ruth, A. Yumerefendi, and J. Chase. Exogeni: A multi-domain infrastructure-as-a-service testbed. In 8th International ICST Conference on Testbeds and Research Infrastructures for the Development of Networks and Communities (TRIDENTCOM 2012), 2012.Google ScholarGoogle ScholarCross RefCross Ref
  3. J. Chase, L. Grit, D. Irwin, V. Marupadi, P. Shivam, and A. Yumerefendi. Beyond virtual data centers: Toward an open resource control architecture. In Selected Papers from the International Conference on the Virtual Computing Initiative (ACM Digital Library), May 2007.Google ScholarGoogle Scholar
  4. J. S. Chase, D. E. Irwin, L. E. Grit, J. D. Moore, and S. E. Sprenkle. Dynamic Virtual Clusters in a Grid Site Manager. In Proceedings of the Twelfth International Symposium on High Performance Distributed Computing (HPDC), June 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. E. Deelman, G. Juve, M. Malawski, and J. Nabrzyski. Hosted science: Managing computational workflows in the cloud. Parallel Processing Letters, 23(2), 2013.Google ScholarGoogle ScholarCross RefCross Ref
  6. E. Deelman, G. Singh, M.-H. Su, J. Blythe, Y. Gil, C. Kesselman, G. Mehta, K. Vahi, G. B. Berriman, J. Good, et al. Pegasus: A framework for mapping complex scientific workflows onto distributed systems. Scientific Programming, 13(3): 219--237, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Eucalyptus Systems. http://www.eucalyptus.com/.Google ScholarGoogle Scholar
  8. ExoGENI Wiki. http://wiki.exogeni.net.Google ScholarGoogle Scholar
  9. Y. Fu, J. Chase, B. Chun, S. Schwab, and A. Vahdat. SHARP: An Architecture for Secure Resource Peering. In Proceedings of the 19th ACM Symposium on Operating System Principles, October 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Ghoshal, R. S. Canon, and L. Ramakrishnan. I/o performance of virtualized cloud environments. In Proceedings of the second international workshop on Data intensive computing in the clouds, DataCloud-SC '11, pages 71--80, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Iosup, S. Ostermann, N. Yigitbasi, R. Prodan, T. Fahringer, and D. Epema. Performance analysis of cloud computing services for many-tasks scientific computing. IEEE Trans. Parallel Distrib. Syst., 22(6): 931--945, June 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Irwin, J. S. Chase, L. Grit, A. Yumerefendi, D. Becker, and K. G. Yocum. Sharing Networked Resources with Brokered Leases. In Proceedings of the USENIX Technical Conference, June 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. K. Jackson, L. Ramakrishnan, K. Muriki, S. Canon, S. Cholia, J. Shalf, H. J. Wasserman, and N. Wright. Performance analysis of high performance computing applications on the amazon web services cloud. In Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on, pages 159--168, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. C. Jacob, D. S. Katz, G. B. Berriman, J. C. Good, A. C. Laity, E. Deelman, C. Kesselman, G. Singh, M. Su, T. A. Prince, and R. Williams. Montage a grid portal and software toolkit for science grade astronomical image mosaicking. Int. J. Comput. Sci. Eng., 4(2): 73--87, July 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. G. Juve and E. Deelman. Scientific workflows and clouds. Crossroads, 16(3): 14--18, Mar. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. G. Juve and E. Deelman. Scientific workflows in the cloud. In M. Cafaro and G. Aloisio, editors, Grids, Clouds and Virtualization, Computer Communications and Networks, pages 71--91. Springer London, 2011.Google ScholarGoogle Scholar
  17. G. Juve, E. Deelman, G. B. Berriman, B. P. Berman, and P. Maechling. An evaluation of the cost and performance of scientific workflows on amazon ec2. J. Grid Comput., 10(1): 5--21, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Z. Li, L. O'Brien, H. Zhang, and R. Cai. A factor framework for experimental design for performance evaluation of commercial cloud services. In CloudCom, pages 169--176. IEEE, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. X. Lin, Y. Mao, F. Li, and R. Ricci. Towards fair sharing of block storage in a multi-tenant cloud. In Proceedings of the 4th USENIX conference on Hot Topics in Cloud Ccomputing, HotCloud'12, pages 15--15, Berkeley, CA, USA, 2012. USENIX Association. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Linux iotop. http://guichaz.free.fr/iotop/.Google ScholarGoogle Scholar
  21. M. Malawski, G. Juve, E. Deelman, and J. Nabrzyski. Cost- and deadline-constrained provisioning for scientific workflow ensembles in iaas clouds. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC '12, pages 22:1--22:11, Los Alamitos, CA, USA, 2012. IEEE Computer Society Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Mandal, I. Baldine, Y. Xin, P. Ruth, and C. Heerman. Enabling persistent queries for cross-aggregate performance monitoring. Technical Report TR-13-01, Renaissance Computing Institute, 2013, http://www.renci.org/wp-content/uploads/2013/04/TR-13-01.pdf.Google ScholarGoogle Scholar
  23. A. Mandal, Y. Xin, I. Baldine, P. Ruth, C. Heerman, J. Chase, V. Orlikowski, and A. Yumerefendi. Provisioning and evaluating multi-domain networked clouds for hadoop-based applications. In Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on, pages 690--697, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. D. Milojicic, I. M. Llorente, and R. S. Montero. Opennebula: A cloud management tool. IEEE Internet Computing, 15(2): 11--14, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. OpenStack Cloud Software. http://openstack.org.Google ScholarGoogle Scholar
  26. S. Ostermann, A. Iosup, N. Yigitbasi, R. Prodan, T. Fahringer, and D. H. J. Epema. A performance analysis of ec2 cloud computing services for scientific computing. In D. R. Avresky, M. Diaz, A. Bode, B. Ciciani, and E. Dekel, editors, CloudComp, volume 34 of Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, pages 115--131. Springer, 2009.Google ScholarGoogle Scholar
  27. X. Pu, L. Liu, Y. Mei, S. Sivathanu, Y. Koh, and C. Pu. Understanding performance interference of i/o workload in virtualized cloud environments. In Proceedings of the 2010 IEEE 3rd International Conference on Cloud Computing, CLOUD '10, pages 51--58, Washington, DC, USA, 2010. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. L. Ramakrishnan, L. Grit, A. Iamnitchi, D. Irwin, A. Yumerefendi, and J. Chase. Toward a Doctrine of Containment: Grid Hosting with Adaptive Resource Control. In Supercomputing (SC06), November 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J. Shafer. I/o virtualization bottlenecks in cloud computing today. In Proceedings of the 2nd conference on I/O virtualization, WIOV'10, pages 5--5, Berkeley, CA, USA, 2010. USENIX Association. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. R. Tudoran, A. Costan, G. Antoniu, and L. Bougé. A performance evaluation of azure and nimbus clouds for scientific applications. In Proceedings of the 2nd International Workshop on Cloud Computing Platforms, CloudCP '12, pages 4:1--4:6, New York, NY, USA, 2012. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. J.-S. Vöckler, G. Juve, E. Deelman, M. Rynge, and B. Berriman. Experiences using cloud computing for a scientific workflow application. In Proceedings of the 2nd international workshop on Scientific cloud computing, ScienceCloud '11, pages 15--24, New York, NY, USA, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. J. Wang and I. Altintas. Early cloud experiences with the kepler scientific workflow system. Procedia Computer Science, 9(0): 1630--1634, 2012.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Evaluating I/O aware network management for scientific workflows on networked clouds

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        NDM '13: Proceedings of the Third International Workshop on Network-Aware Data Management
        November 2013
        84 pages
        ISBN:9781450325226
        DOI:10.1145/2534695
        • General Chairs:
        • Mehmet Balman,
        • Surendra Byna,
        • Brian L. Tierney

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 November 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        NDM '13 Paper Acceptance Rate9of14submissions,64%Overall Acceptance Rate14of23submissions,61%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader