skip to main content
10.1145/3366623.3368137acmconferencesArticle/Chapter ViewAbstractPublication PagesmiddlewareConference Proceedingsconference-collections
research-article

FaaS Orchestration of Parallel Workloads

Published:09 December 2019Publication History

ABSTRACT

Function as a Service (FaaS) is based on a reactive programming model where functions are activated by triggers in response to cloud events (e.g., objects added to an object store). The inherent elasticity and the pay-per-use model of serverless functions make them very appropriate for embarrassingly parallel tasks like data preprocessing, or even the execution of MapReduce jobs in the cloud.

But current Serverless orchestration systems are not designed for managing parallel fork-join workflows in a scalable and efficient way. We demonstrate in this paper that existing services like AWS Step Functions or Azure Durable Functions incur in considerable overheads, and only Composer at IBM Cloud provides suitable performance.

Successively, we analyze the architecture of OpenWhisk as an open-source FaaS systems and its orchestration features (Composer). We outline its architecture problems and propose guidelines for orchestrating massively parallel workloads using serverless functions.

References

  1. 2013. Camunda Open Source Workflow Platform. https://camunda.com/.Google ScholarGoogle Scholar
  2. 2014. Reactive Manifesto. https://www.reactivemanifesto.org/.Google ScholarGoogle Scholar
  3. 2019. Amazon AWS Serverless Definition. https://aws.amazon.com/serverless/.Google ScholarGoogle Scholar
  4. 2019. Apache Conductor Actions. https://github.com/apache/openwhisk/blob/master/docs/conductors.md.Google ScholarGoogle Scholar
  5. Ioana Baldini, Perry Cheng, Stephen J. Fink, Nick Mitchell, Vinod Muthusamy, Rodric Rabbah, Philippe Suter, and Olivier Tardieu. 2017. The Serverless Trilemma: Function Composition for Serverless Computing. In Onward! 2017. 89--103.Google ScholarGoogle Scholar
  6. Opher Etzion, Peter Niblett, and David C Luckham. 2011. Event processing in action. Manning Greenwich.Google ScholarGoogle Scholar
  7. Sadjad Fouladi, Riad S Wahby, Brennan Shacklett, Karthikeyan Vasuki Balasubramaniam, William Zeng, Rahul Bhalerao, Anirudh Sivaraman, George Porter, and Keith Winstein. 2017. Encoding, fast and slow: Low-latency video processing using thousands of tiny threads. In USENIX NSDI 17. 363--376.Google ScholarGoogle Scholar
  8. Apache Software Foundation. 2019. Apache OpenWhisk. https://github.com/apache/openwhisk.Google ScholarGoogle Scholar
  9. Apache Software Foundation. 2019. Apache OpenWhisk Composer. https://github.com/apache/openwhisk-composer.Google ScholarGoogle Scholar
  10. Pedro García-López, Marc Sánchez-Artigas, Gerard París, Daniel Barcelona-Pons, Álvaro Ruiz, and David Arroyo-Pinto. 2018. Comparison of FaaS Orchestration Systems. In WoSC4 - UCC Companion. 148--153.Google ScholarGoogle Scholar
  11. Mohammad Islam, Angelo K Huang, Mohamed Battisha, Michelle Chiang, Santhosh Srinivasan, Craig Peters, Andreas Neumann, and Alejandro Abdelnur. 2012. Oozie: towards a scalable workflow management system for hadoop. In Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies. ACM, 4:1-4:10.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Eric Jonas, Qifan Pu, Shivaram Venkataraman, Ion Stoica, and Benjamin Recht. 2017. Occupy the Cloud: Distributed Computing for the 99%. In SoCC'17. ACM, New York, NY, USA, 445--451.Google ScholarGoogle Scholar
  13. Youngbin Kim and Jimmy Lin. 2018. Serverless Data Analytics with Flint. In IEEE CLOUD'18. 451--455.Google ScholarGoogle Scholar
  14. Maciej Malawski, Adam Gajek, Adam Zima, Bartosz Balis, and Kamil Figiela. in press. Serverless execution of scientific workflows: Experiments with HyperFlow, AWS Lambda and Google Cloud Functions. Future Generation Computer Systems (in press).Google ScholarGoogle Scholar
  15. Josep Sampé, Gil Vernik, Marc Sánchez-Artigas, and Pedro García-López. 2018. Serverless Data Analytics in the IBM Cloud. In Proceedings of the 19th International Middleware Conference Industry (Middleware '18). 1--8.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. FaaS Orchestration of Parallel Workloads

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      WOSC '19: Proceedings of the 5th International Workshop on Serverless Computing
      December 2019
      59 pages
      ISBN:9781450370387
      DOI:10.1145/3366623

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 9 December 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader