Running Serverless Function on Resource Fragments in Data Center

Chu, Yukang; Huang, Wenhao; Zhao, Laiping

doi:10.1007/978-981-97-0798-0_26

Yukang Chu¹⁰,
Wenhao Huang¹⁰ &
Laiping Zhao¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14489))

Included in the following conference series:

International Conference on Algorithms and Architectures for Parallel Processing

80 Accesses

Abstract

Repetitive scheduling of resources in data centers leads to a large number of transient resource fragments. Serverless computing can decompose long-period mixed deployment applications into small-volume and short-period functions. Fine-grained functions can effectively respond to the transient fluctuation behavior of fragments. However, the fluctuation characteristics and predictability of resource fragments in the cloud environment vary greatly, which may still interfere with the execution process of functions.

In this paper, we introduce Tianxuan, a function-level resource scheduler. It classifies the resource fragments in the cloud environment according to their usage forms into unallocated and unused fragments and extracts the fluctuation characteristics from open-source datasets of each type of fragment. Then, it builds a fragment feature classification-aware spatiotemporal prediction model based on these characteristics and schedules functions on spatiotemporally complementary fragments to achieve resource efficiency by reducing the probability of resource contention. Experimental results show that Tianxuan improves the prediction accuracy of resource fragments by 15% compared to existing techniques and increases server CPU utilization by 16%–25% and function throughput by 17%–37%.

Y. Chu and W. Huang—Contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alibaba. [cp/ol]. https://github.com/alibaba/clusterdata (2018).
Unicloud. [cp/ol]. https://www.unicloud.com/ (2018)
Azure. [cp/ol]. https://github.com/Azure/AzurePublicDataset (2019)
Google. [cp/ol]. https://github.com/google/cluster-data (2019)
Openfaas. openfaas [cp/ol]. https://www.openfaas.com/ (2021)
Mysql. [cp/ol]. https://www.mysql.com/ (2022)
Redis. [cp/ol]. https://redis.com/ (2022)
Abadi, M., et al.: \(\{\)TensorFlow\(\}\): a system for \(\{\)Large-Scale\(\}\) machine learning. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), pp. 265–283 (2016)
Google Scholar
Ambati, P., et al.: Providing \(\{\)SLOs\(\}\) for \(\{\)Resource-Harvesting\(\}\)\(\{\)VMs\(\}\) in cloud platforms. In: 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20), pp. 735–751 (2020)
Google Scholar
Chen, S., Delimitrou, C., Martínez, J.F.: Parties: Qos-aware resource partitioning for multiple interactive services. In: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 107–120 (2019)
Google Scholar
Cook, H., Moreto, M., Bird, S., Dao, K., Patterson, D.A., Asanovic, K.: A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness. ACM SIGARCH Comput. Architect. News 41(3), 308–319 (2013)
Article Google Scholar
Ferdman, M., et al.: Clearing the clouds: a study of emerging scale-out workloads on modern hardware (2012). http://infoscience.epfl.ch/record/173764
Fuerst, A., et al.: Memory-harvesting vms in cloud platforms. In: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. pp. 583–594 (2022)
Google Scholar
Govindan, S., Liu, J., Kansal, A., Sivasubramaniam, A.: Cuanta: quantifying effects of shared on-chip resource interference for consolidated virtual machines. In: Proceedings of the 2nd ACM Symposium on Cloud Computing, pp. 1–14 (2011)
Google Scholar
Javadi, S.A., Suresh, A., Wajahat, M., Gandhi, A.: Scavenger: A black-box batch workload resource manager for improving utilization in cloud environments. In: Proceedings of the ACM Symposium on Cloud Computing, pp. 272–285 (2019)
Google Scholar
Jonas, E., et al.: Cloud programming simplified: a berkeley view on serverless computing. arXiv preprint arXiv:1902.03383 (2019)
Kim, J., Lee, K.: Practical cloud workloads for serverless faas. In: Proceedings of the ACM Symposium on Cloud Computing.,pp. 477–477 (2019)
Google Scholar
Li, Z., Guo, L., Cheng, J., Chen, Q., He, B., Guo, M.: The serverless computing survey: a technical primer for design architecture. ACM Comput. Surv. (CSUR) 54(10s), 1–34 (2022)
Article Google Scholar
Lo, D., Cheng, L., Govindaraju, R., Ranganathan, P., Kozyrakis, C.: Heracles: Improving resource efficiency at scale. In: Proceedings of the 42nd Annual International Symposium on Computer Architecture, pp. 450–462 (2015)
Google Scholar
Machina, J., Sodan, A.: Predicting cache needs and cache sensitivity for applications in cloud computing on cmp servers with configurable caches. In: 2009 IEEE International Symposium on Parallel & Distributed Processing, pp. 1–8. IEEE (2009)
Google Scholar
Maji, A.K., Mitra, S., Bagchi, S.: Ice: An integrated configuration engine for interference mitigation in cloud services. In: 2015 IEEE International Conference on Autonomic Computing, pp. 91–100. IEEE (2015)
Google Scholar
Manikantan, R., Rajan, K., Govindarajan, R.: Probabilistic shared cache management (prism). In: 2012 39th Annual International Symposium on Computer Architecture (ISCA), pp. 428–439. IEEE (2012)
Google Scholar
Mars, J., Tang, L., Hundt, R., Skadron, K., Soffa, M.L.: Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations. In: Proceedings of the 44th annual IEEE/ACM International Symposium on Microarchitecture, pp. 248–259 (2011)
Google Scholar
Mei, Y., Liu, L., Pu, X., Sivathanu, S., Dong, X.: Performance analysis of network i/o workloads in virtualized data centers. IEEE Trans. Serv. Comput. 6(1), 48–63 (2011)
Article Google Scholar
Nathuji, R., Kansal, A., Ghaffarkhah, A.: Q-clouds: managing performance interference effects for qos-aware clouds. In: Proceedings of the 5th European Conference on Computer Systems, pp. 237–250 (2010)
Google Scholar
Novaković, D., Vasić, N., Novaković, S., Kostić, D., Bianchini, R.: \(\{\)DeepDive\(\}\): Transparently identifying and managing performance interference in virtualized environments. In: 2013 USENIX Annual Technical Conference (USENIX ATC 13), pp. 219–230 (2013)
Google Scholar
Srikantaiah, S., Kandemir, M., Wang, Q.: Sharp control: controlled shared cache management in chip multiprocessors. In: Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, pp. 517–528 (2009)
Google Scholar
Suresh, A., Gandhi, A.: Servermore: Opportunistic execution of serverless functions in the cloud. In: Proceedings of the ACM Symposium on Cloud Computing, pp. 570–584 (2021)
Google Scholar
Wang, Y., et al.: Smartharvest: harvesting idle cpus safely and efficiently in the cloud. In: Proceedings of the Sixteenth European Conference on Computer Systems, pp. 1–16 (2021)
Google Scholar
Yang, H., Breslow, A., Mars, J., Tang, L.: Bubble-flux: Precise online qos management for increased utilization in warehouse scale computers. ACM SIGARCH Comput. Arch. News 41(3), 607–618 (2013)
Article Google Scholar
Zhang, Y., et al.: Faster and cheaper serverless computing on harvested resources. In: Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles, pp. 724–739 (2021)
Google Scholar
Zhang, Y., Prekas, G., Fumarola, G.M., Fontoura, M., Goiri, Í., Bianchini, R.: \(\{\)History-Based\(\}\) harvesting of spare cycles and storage in \(\{\)Large-Scale\(\}\) datacenters. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), pp. 755–770 (2016)
Google Scholar
Zhao, L., et al.: Rhythm: component-distinguishable workload deployment in datacenters. In: Proceedings of the Fifteenth European Conference on Computer Systems, pp. 1–17 (2020)
Google Scholar
Zhu, H., Erez, M.: Dirigent: enforcing qos for latency-critical tasks on shared multicore systems. In: Proceedings of the Twenty-first International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 33–47 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Intelligence and Computing (CIC), Tianjin University, Tianjin, China
Yukang Chu, Wenhao Huang & Laiping Zhao

Authors

Yukang Chu
View author publications
You can also search for this author in PubMed Google Scholar
Wenhao Huang
View author publications
You can also search for this author in PubMed Google Scholar
Laiping Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Yukang Chu and Wenhao Huang : contributed equally to this work.

Corresponding author

Correspondence to Laiping Zhao .

Editor information

Editors and Affiliations

Royal Melbourne Institute of Technology, Melbourne, VIC, Australia
Zahir Tari
Tianjin University, Tianjin, China
Keqiu Li
University of Arizona, Tucson, AZ, USA
Hongyi Wu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chu, Y., Huang, W., Zhao, L. (2024). Running Serverless Function on Resource Fragments in Data Center. In: Tari, Z., Li, K., Wu, H. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2023. Lecture Notes in Computer Science, vol 14489. Springer, Singapore. https://doi.org/10.1007/978-981-97-0798-0_26

Download citation

DOI: https://doi.org/10.1007/978-981-97-0798-0_26
Published: 01 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0797-3
Online ISBN: 978-981-97-0798-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Running Serverless Function on Resource Fragments in Data Center