Skip to main content

Running Serverless Function on Resource Fragments in Data Center

  • Conference paper
  • First Online:
Algorithms and Architectures for Parallel Processing (ICA3PP 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14489))

  • 80 Accesses

Abstract

Repetitive scheduling of resources in data centers leads to a large number of transient resource fragments. Serverless computing can decompose long-period mixed deployment applications into small-volume and short-period functions. Fine-grained functions can effectively respond to the transient fluctuation behavior of fragments. However, the fluctuation characteristics and predictability of resource fragments in the cloud environment vary greatly, which may still interfere with the execution process of functions.

In this paper, we introduce Tianxuan, a function-level resource scheduler. It classifies the resource fragments in the cloud environment according to their usage forms into unallocated and unused fragments and extracts the fluctuation characteristics from open-source datasets of each type of fragment. Then, it builds a fragment feature classification-aware spatiotemporal prediction model based on these characteristics and schedules functions on spatiotemporally complementary fragments to achieve resource efficiency by reducing the probability of resource contention. Experimental results show that Tianxuan improves the prediction accuracy of resource fragments by 15% compared to existing techniques and increases server CPU utilization by 16%–25% and function throughput by 17%–37%.

Y. Chu and W. Huang—Contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alibaba. [cp/ol]. https://github.com/alibaba/clusterdata (2018).

  2. Unicloud. [cp/ol]. https://www.unicloud.com/ (2018)

  3. Azure. [cp/ol]. https://github.com/Azure/AzurePublicDataset (2019)

  4. Google. [cp/ol]. https://github.com/google/cluster-data (2019)

  5. Openfaas. openfaas [cp/ol]. https://www.openfaas.com/ (2021)

  6. Mysql. [cp/ol]. https://www.mysql.com/ (2022)

  7. Redis. [cp/ol]. https://redis.com/ (2022)

  8. Abadi, M., et al.: \(\{\)TensorFlow\(\}\): a system for \(\{\)Large-Scale\(\}\) machine learning. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), pp. 265–283 (2016)

    Google Scholar 

  9. Ambati, P., et al.: Providing \(\{\)SLOs\(\}\) for \(\{\)Resource-Harvesting\(\}\)\(\{\)VMs\(\}\) in cloud platforms. In: 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20), pp. 735–751 (2020)

    Google Scholar 

  10. Chen, S., Delimitrou, C., Martínez, J.F.: Parties: Qos-aware resource partitioning for multiple interactive services. In: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 107–120 (2019)

    Google Scholar 

  11. Cook, H., Moreto, M., Bird, S., Dao, K., Patterson, D.A., Asanovic, K.: A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness. ACM SIGARCH Comput. Architect. News 41(3), 308–319 (2013)

    Article  Google Scholar 

  12. Ferdman, M., et al.: Clearing the clouds: a study of emerging scale-out workloads on modern hardware (2012). http://infoscience.epfl.ch/record/173764

  13. Fuerst, A., et al.: Memory-harvesting vms in cloud platforms. In: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. pp. 583–594 (2022)

    Google Scholar 

  14. Govindan, S., Liu, J., Kansal, A., Sivasubramaniam, A.: Cuanta: quantifying effects of shared on-chip resource interference for consolidated virtual machines. In: Proceedings of the 2nd ACM Symposium on Cloud Computing, pp. 1–14 (2011)

    Google Scholar 

  15. Javadi, S.A., Suresh, A., Wajahat, M., Gandhi, A.: Scavenger: A black-box batch workload resource manager for improving utilization in cloud environments. In: Proceedings of the ACM Symposium on Cloud Computing, pp. 272–285 (2019)

    Google Scholar 

  16. Jonas, E., et al.: Cloud programming simplified: a berkeley view on serverless computing. arXiv preprint arXiv:1902.03383 (2019)

  17. Kim, J., Lee, K.: Practical cloud workloads for serverless faas. In: Proceedings of the ACM Symposium on Cloud Computing.,pp. 477–477 (2019)

    Google Scholar 

  18. Li, Z., Guo, L., Cheng, J., Chen, Q., He, B., Guo, M.: The serverless computing survey: a technical primer for design architecture. ACM Comput. Surv. (CSUR) 54(10s), 1–34 (2022)

    Article  Google Scholar 

  19. Lo, D., Cheng, L., Govindaraju, R., Ranganathan, P., Kozyrakis, C.: Heracles: Improving resource efficiency at scale. In: Proceedings of the 42nd Annual International Symposium on Computer Architecture, pp. 450–462 (2015)

    Google Scholar 

  20. Machina, J., Sodan, A.: Predicting cache needs and cache sensitivity for applications in cloud computing on cmp servers with configurable caches. In: 2009 IEEE International Symposium on Parallel & Distributed Processing, pp. 1–8. IEEE (2009)

    Google Scholar 

  21. Maji, A.K., Mitra, S., Bagchi, S.: Ice: An integrated configuration engine for interference mitigation in cloud services. In: 2015 IEEE International Conference on Autonomic Computing, pp. 91–100. IEEE (2015)

    Google Scholar 

  22. Manikantan, R., Rajan, K., Govindarajan, R.: Probabilistic shared cache management (prism). In: 2012 39th Annual International Symposium on Computer Architecture (ISCA), pp. 428–439. IEEE (2012)

    Google Scholar 

  23. Mars, J., Tang, L., Hundt, R., Skadron, K., Soffa, M.L.: Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations. In: Proceedings of the 44th annual IEEE/ACM International Symposium on Microarchitecture, pp. 248–259 (2011)

    Google Scholar 

  24. Mei, Y., Liu, L., Pu, X., Sivathanu, S., Dong, X.: Performance analysis of network i/o workloads in virtualized data centers. IEEE Trans. Serv. Comput. 6(1), 48–63 (2011)

    Article  Google Scholar 

  25. Nathuji, R., Kansal, A., Ghaffarkhah, A.: Q-clouds: managing performance interference effects for qos-aware clouds. In: Proceedings of the 5th European Conference on Computer Systems, pp. 237–250 (2010)

    Google Scholar 

  26. Novaković, D., Vasić, N., Novaković, S., Kostić, D., Bianchini, R.: \(\{\)DeepDive\(\}\): Transparently identifying and managing performance interference in virtualized environments. In: 2013 USENIX Annual Technical Conference (USENIX ATC 13), pp. 219–230 (2013)

    Google Scholar 

  27. Srikantaiah, S., Kandemir, M., Wang, Q.: Sharp control: controlled shared cache management in chip multiprocessors. In: Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, pp. 517–528 (2009)

    Google Scholar 

  28. Suresh, A., Gandhi, A.: Servermore: Opportunistic execution of serverless functions in the cloud. In: Proceedings of the ACM Symposium on Cloud Computing, pp. 570–584 (2021)

    Google Scholar 

  29. Wang, Y., et al.: Smartharvest: harvesting idle cpus safely and efficiently in the cloud. In: Proceedings of the Sixteenth European Conference on Computer Systems, pp. 1–16 (2021)

    Google Scholar 

  30. Yang, H., Breslow, A., Mars, J., Tang, L.: Bubble-flux: Precise online qos management for increased utilization in warehouse scale computers. ACM SIGARCH Comput. Arch. News 41(3), 607–618 (2013)

    Article  Google Scholar 

  31. Zhang, Y., et al.: Faster and cheaper serverless computing on harvested resources. In: Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles, pp. 724–739 (2021)

    Google Scholar 

  32. Zhang, Y., Prekas, G., Fumarola, G.M., Fontoura, M., Goiri, Í., Bianchini, R.: \(\{\)History-Based\(\}\) harvesting of spare cycles and storage in \(\{\)Large-Scale\(\}\) datacenters. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), pp. 755–770 (2016)

    Google Scholar 

  33. Zhao, L., et al.: Rhythm: component-distinguishable workload deployment in datacenters. In: Proceedings of the Fifteenth European Conference on Computer Systems, pp. 1–17 (2020)

    Google Scholar 

  34. Zhu, H., Erez, M.: Dirigent: enforcing qos for latency-critical tasks on shared multicore systems. In: Proceedings of the Twenty-first International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 33–47 (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Contributions

Yukang Chu and Wenhao Huang : contributed equally to this work.

Corresponding author

Correspondence to Laiping Zhao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chu, Y., Huang, W., Zhao, L. (2024). Running Serverless Function on Resource Fragments in Data Center. In: Tari, Z., Li, K., Wu, H. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2023. Lecture Notes in Computer Science, vol 14489. Springer, Singapore. https://doi.org/10.1007/978-981-97-0798-0_26

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0798-0_26

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0797-3

  • Online ISBN: 978-981-97-0798-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics