ABSTRACT
A VM-based disaggregated cloud memory platform (DCM) virtualizes the memory device of a remote server connected to a highspeed network as an expansion of local memory. DCM provides large memory to applications to increase throughput. However, DCM is not well-suited to managing fair memory usage between processes when they run concurrently in a VM. This is because DCM has no mechanism to provide independent memory space to each process. As a result, DCM does not guarantee fairness and performance to processes. Partitioning memory for each process is a way to solve this problem. However, in DCM, the host kernel running DCM cannot obtain the memory page information of a process (including memory page address and PID) running in the guest kernel. So it can not segregate memory pages according to the process. Therefore, this paper proposes an efficient method for the host kernel to obtain the memory page information to partition the memory for each process in DCM, called MFence. The MFence was evaluated using two Linux servers connected by a 100 Gbps IB network. Extensive evaluation has confirmed that MFence ideally provides memory partitioning to provide fairness between processes and improve overall performance.
- Emmanuel Amaro, Christopher Branner-Augmon, Zhihong Luo, Amy Ousterhout, Marcos K. Aguilera, Aurojit Panda, Sylvia Ratnasamy, and Scott Shenker. 2020. Can Far Memory Improve Job Throughput?. In Proceedings of the 15th European Conference on Computer Systems (EuroSys '20). 1--16.Google ScholarDigital Library
- Blake Caldwell, Sepideh Goodarzy, Sangtae Ha, Richard Han, Eric Keller, Eric Rozner, and Youngbin Im. 2020. FluidMem: Full, Flexible, and Fast Memory Disaggregation for the Cloud. In Proceedings of the IEEE 40th International Conference on Distributed Computing Systems (ICDCS '20). 665--677.Google ScholarCross Ref
- Intel corporation. 2015. Intel® SSD 750 Series Product Specification. https://www.intel.com/content/dam/www/public/us/en/documents/product-specifications/ssd-750-spec.pdfGoogle Scholar
- Assaf Eisenman, Darryl Gardner, Islam AbdelRahman, Jens Axboe, Siying Dong, Kim Hazelwood, Chris Petersen, Asaf Cidon, and Sachin Katti. 2018. Reducing DRAM Footprint with NVM in Facebook. In Proceedings of the 13th EuroSys Conference (EuroSys '18). 1--13.Google ScholarDigital Library
- Donghyun Gouk, Sangwon Lee, Miryeong Kwon, and Myoungsoo Jung. 2022. Direct Access, High-Performance Memory Disaggregation with DirectCXL. In Proceedings of the USENIX Annual Technical Conference (ATC '22). 287--294.Google Scholar
- Juncheng Gu, Youngmoon Lee, Yiwen Zhang, Mosharaf Chowdhury, and Kang G. Shin. 2017. Efficient Memory Disaggregation with Infiniswap. In Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI '17). 649--667.Google Scholar
- Zhiyuan Guo, Yizhou Shan, Xuhao Luo, Yutong Huang, and Yiying Zhang. 2022. Clio: A Hardware-Software Co-Designed Disaggregated Memory System. In Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '22). 417--433.Google ScholarDigital Library
- Munira Hussain. 2018. Need for Speed : Comparing FDR and EDR InfiniBand. https://downloads.dell.com/manuals/all-products/esuprt_software/esuprt_it_ops_datcentr_mgmt/high-computing-solution-resources_white-papers77_en-us.pdfGoogle Scholar
- Yang Jisoo and Seymour Julian. 2018. Pmbench: A Micro-Benchmark for Profiling Paging Performance on a System with Low-Latency SSDs. In Proceedings of the Information Technology New Generations (ITNG '18). 627--633.Google Scholar
- Awais Khan, Attique Muhammad, Youngjae Kim, Sungyong Park, and Byungchul Tak. 2018. EDGESTORE: A Single Namespace and Resource-Aware Federation File System for Edge Servers. In Proceedings of the IEEE International Conference on Edge Computing (EDGE '18). 101--108.Google ScholarCross Ref
- Awais Khan, Hyogi Sim, Sudharshan S Vazhkudai, and Youngjae Kim. 2021. Mosiqs: Persistent Memory Object Storage with Metadata Indexing and Querying for Scientific computing. IEEE Access 9 (2021), 85217--85231.Google ScholarCross Ref
- Awais Khan, Hyogi Sim, Sudharshan S Vazhkudai, Jinsuk Ma, Myeong-Hoon Oh, and Youngjae Kim. 2020. Persistent Memory Object Storage and Indexing for Scientific Computing. In Proceedings of the IEEE/ACM Workshop on Memory Centric High Performance Computing (MCHPC '20). IEEE, 1--9.Google ScholarCross Ref
- Kwangwon Koh, Kangho Kim, Seunghyub Jeon, and Jaehyuk Huh. 2019. Disaggregated Cloud Memory with Elastic Block Management. IEEE Trans. Comput. 68, 1 (2019), 39--52.Google ScholarDigital Library
- Andres Lagar-Cavilla, Junwhan Ahn, Suleiman Souhlal, Neha Agarwal, Radoslaw Burny, Shakeel Butt, Jichuan Chang, Ashwin Chaugule, Nan Deng, Junaid Shahid, Greg Thelen, Kamil Adam Yurtsever, Yu Zhao, and Parthasarathy Ranganathan. 2019. Software-Defined Far Memory in Warehouse-Scale Computers. In Proceedings of the 24th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '19). 317--330.Google ScholarDigital Library
- Ling Liu, Wenqi Cao, Semih Sahin, Qi Zhang, Juhyun Bae, and Yanzhao Wu. 2019. Memory Disaggregation: Research Problems and Opportunities. In Proceedings of the IEEE 39th International Conference on Distributed Computing Systems (ICDCS '21). 1664--1673.Google ScholarCross Ref
- Qifan Pu, Haoyuan Li, Matei Zaharia, Ali Ghodsi, and Ion Stoica. 2016. FairRide: Near-Optimal, Fair Cache Sharing. In Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI '16). 393--406.Google Scholar
- Zhenyuan Ruan, Malte Schwarzkopf, Marcos K. Aguilera, and Adam Belay. 2020. AIFM: High-Performance, Application-Integrated Far Memory. In Proceedings of of the 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI '20). 315--332.Google Scholar
- Jun Xiao, Yixian Shen, and Andy D. Pimentel. 2022. Cache Interference-Aware Task Partitioning for Non-Preemptive Real-Time Multi-Core Systems. ACM Transactions on Embedded Computing Systems, 21, 3 (2022), 1--28.Google ScholarDigital Library
- Heechul Yun, Waqar Ali, Santosh Gondi, and Siddhartha Biswas. 2017. BWLOCK: A Dynamic Memory Access Control Framework for Soft Real-Time Applications on Multicore Platforms. IEEE Trans. Comput. 66, 7 (2017), 1247--1252.Google ScholarDigital Library
Index Terms
- MFence: Defending Against Memory Access Interference in a Disaggregated Cloud Memory Platform
Recommendations
Dynamic Memory Pressure Aware Ballooning
MEMSYS '15: Proceedings of the 2015 International Symposium on Memory SystemsHardware virtualization is a major component of large scale server and data center deployments due to their facilitation of server consolidation and scalability. Virtualization, however, comes at a high cost in terms of system main memory utilization. ...
Cooperating Write Buffer Cache and Virtual Memory Management for Flash Memory Based Systems
RTAS '11: Proceedings of the 2011 17th IEEE Real-Time and Embedded Technology and Applications SymposiumFlash memory is becoming the storage media of choice for mobile devices and embedded systems. The performance of flash memory is impacted by the asymmetric speed of read and write operations, limited number of erase times and the absence of in-place ...
Writeback-aware partitioning and replacement for last-level caches in phase change main memory systems
Special Issue on High-Performance Embedded Architectures and CompilersPhase-Change Memory (PCM) has emerged as a promising low-power main memory candidate to replace DRAM. The main problems of PCM are that writes are much slower and more power hungry than reads, write bandwidth is much lower than read bandwidth, and ...
Comments