Abstract
Memory disaggregation has attracted increasing attention in recent years because it is a cost-efficient approach to scale memory capacity for applications in a data center. However, the latency of remote memory access is a major concern in disaggregated memory systems. This paper presents VANDI, a virtual memory paging mechanism that allows applications to use remote memory pools transparently. VANDI enables effective data caching and prefetching mechanisms to address the problem of high access latency in disaggregated memory systems. VANDI exploits a low-complexity cache replacement strategy to optimize the asynchronous staging queue so that the remote write latency can be significantly reduced. VANDI can also prefetch data in multi-granularity from a remote memory pool in an adaptive manner, and thus further improves the hit rate of the local cache to reduce the read latency of remote memory. Our extensive experiments using micro-benchmarks show that VANDI can improve the performance of typical remote paging system–Infiniswap by up to 15\(\times \)–102\(\times \). VANDI can also improve the performance of state-of-the-art disaggregated memory system–Valet by 1.2\(\times \)–2.7\(\times \). For typical machine learning workloads, VANDI can achieve 20% to 80% performance improvement compared with the state-of-the-art Valet.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amaro, E., et al.: Can far memory improve job throughput? In: Proceedings of the Fifteenth European Conference on Computer Systems, pp. 1–16 (2020)
Bae, J., Su, G., Iyengar, A., Wu, Y., Liu, L.: Efficient orchestration of host and remote shared memory for memory intensive workloads. In: Proceedings of The International Symposium on Memory Systems, pp. 194–208 (2020)
Duan, Z., et al.: Gengar: an RDMA-based distributed hybrid memory pool. In: Proceedings of the 41st IEEE International Conference on Distributed Computing Systems, pp. 92–103 (2021)
Elmeleegy, K., Olston, C., Reed, B.: Spongefiles: mitigating data skew in mapreduce using distributed memory. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 551–562 (2014)
Endo, W., Sato, S., Taura, K.: MENPS: a decentralized distributed shared memory exploiting RDMA. In: Proceedings of IEEE/ACM Fourth Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware, pp. 9–16 (2020)
Gao, P.X., et al.: Network requirements for resource disaggregation. In: Proceedings of 12th USENIX Symposium on Operating Systems Design and Implementation, pp. 249–264 (2016)
Gu, J., Lee, Y., Zhang, Y., Chowdhury, M., Shin, K.G.: Efficient memory disaggregation with infiniswap. In: Proceedings of 14th USENIX Symposium on Networked Systems Design and Implementation, pp. 649–667 (2017)
Guo, Z., Shan, Y., Luo, X., Huang, Y., Zhang, Y.: Clio: a hardware-software co-designed disaggregated memory system. In: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 417–433 (2022)
Jia, Y., et al.: Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 675–678 (2014)
Lim, K., et al.: System-level implications of disaggregated memory. In: Proceedings of IEEE International Symposium on High-Performance Computer Architecture, pp. 1–12 (2012)
Liu, L., Cao, W., Sahin, S., Zhang, Q., Bae, J., Wu, Y.: Memory disaggregation: research problems and opportunities. In: Proceedings of IEEE 39th International Conference on Distributed Computing Systems, pp. 1664–1673 (2019)
Magoutis, K.: Memory management support for multi-programmed Remote Direct Memory Access (RDMA) systems. In: Proceedings of IEEE International Conference on Cluster Computing, pp. 1–8 (2005)
Meena, J.S., Sze, S.M., Chand, U., Tseng, T.Y.: Overview of emerging nonvolatile memory technologies. Nanoscale Res. Lett. 9(1), 1–33 (2014)
Nelson, J., et al.: Latency-tolerant software distributed shared memory. In: Proceedings of USENIX Annual Technical Conference, pp. 291–305 (2015)
Nitu, V., Teabe, B., Tchana, A., Isci, C., Hagimont, D.: Welcome to zombieland: practical and energy-efficient memory disaggregation in a datacenter. In: Proceedings of the Thirteenth European Conference on Computer Systems, pp. 1–12 (2018)
Oura, H., Midorikawa, H., Kitagawa, K., Kai, M.: Design and evaluation of page-swap protocols for a remote memory paging system. In: Proceedings of IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, pp. 1–8 (2017)
Reiss, C., Tumanov, A., Ganger, G.R., Katz, R.H., Kozuch, M.A.: Heterogeneity and dynamicity of clouds at scale: google trace analysis. In: Proceedings of the Third ACM Symposium on Cloud Computing, pp. 1–13 (2012)
Ruan, Z., Schwarzkopf, M., Aguilera, M.K., Belay, A.: AIFM: high-performance, application-integrated far memory. In: Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, pp. 315–332 (2020)
Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The Hadoop distributed file system. In: Proceedings of IEEE 26th Symposium on Mass Storage Systems and Technologies, pp. 1–10 (2010)
Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of USENIX Workshop on Hot Topics in Cloud Computing (2010)
Zhang, P., Li, X., Chu, R., Wang, H.: HybridSwap: a scalable and synthetic framework for guest swapping on virtualization platform. In: Proceedings of IEEE Conference on Computer Communications, pp. 864–872 (2015)
Acknowledgements
This work is supported jointly by National Natural Science Foundation of China (NSFC) under grants No. 62072198, 61832006, 61825202, 61929103.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, T., Liu, H., Jin, H. (2023). Efficient Remote Memory Paging for Disaggregated Memory Systems. In: Meng, W., Lu, R., Min, G., Vaidya, J. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2022. Lecture Notes in Computer Science, vol 13777. Springer, Cham. https://doi.org/10.1007/978-3-031-22677-9_1
Download citation
DOI: https://doi.org/10.1007/978-3-031-22677-9_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-22676-2
Online ISBN: 978-3-031-22677-9
eBook Packages: Computer ScienceComputer Science (R0)