Abstract
The performance of distributed database system heavily relies on network to assist collaboration among nodes, while the traditional TCP/IP network has become the major bottleneck of the distributed database. Meanwhile, the emerging technology RDMA, featured with CPU offloading, bypassing the operating system kernel and zero copy, is envisioned to achieve low latency and high throughput data transmission between nodes. For this purpose, in this paper, we design and implement a high performance data transmission scheme based on RDMA and apply it to the distributed database system. Firstly, the paper analyzes the application scenarios and internal communication requirements of distributed database systems, combined with RDMA hardware characteristics. Secondly, based on the analysis results, we perform targeted design optimizations in three aspects, RDMA data transmission and memory region management, data placement, and congestion control. We propose efficient variable length data transmission mechanism based on the sliding window, application friendly data placement mechanism, and priority based adaptive congestion control mechanism to optimize the three aspects respectively. We further implement our idea with the typical distributed database GoldenX, and perform comprehensive experiments. Results shows that, compared with using RDMA network directly, the performance of GoldenX integrated with the optimizations is increased to 2.95 times, and the throughput is up to 2.61 times.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Binnig, C., Crotty, A., Galakatos, A., Kraska, T., Zamanian, E.: The end of slow networks: it’s time for a redesign. arXiv preprint arXiv:1504.01048 (2015)
Chen, J., et al.: A parallel random forest algorithm for big data in a spark cloud computing environment. IEEE Trans. Parallel Distrib. Syst. 28(4), 919–933 (2016)
Fent, P., van Renen, A., Kipf, A., Leis, V., Neumann, T., Kemper, A.: Low-latency communication for fast DBMS using RDMA and shared memory. In: 2020 IEEE 36th International Conference on Data Engineering (ICDE), pp. 1477–1488. IEEE (2020)
Kalia, A., Kaminsky, M., Andersen, D.G.: Fasst: fast, scalable and simple distributed transactions with two-sided (RDMA) datagram RPCS. In: 12th USENIX Symposium on Operating Systems Design and Implementation (\(\{\)OSDI\(\}\) 2016), pp. 185–201 (2016)
Lu, F., et al.: Improving the performance of MongoDB with RDMA. In: 2019 IEEE 21st International Conference on High Performance Computing and Communications (HPCC/SmartCity/DSS), pp. 1004–1010. IEEE (2019)
Ma, T., et al.: X-RDMA: effective RDMA middleware in large-scale production environments. In: 2019 IEEE International Conference on Cluster Computing (CLUSTER), pp. 1–12. IEEE (2019)
Peng, S., et al.: An immunization framework for social networks through big data based influence modeling. IEEE Trans. Dependable Secure Comput. 16(6), 984–995 (2017)
Yu, S., Liu, M., Dou, W., Liu, X., Zhou, S.: Networking for big data: a survey. IEEE Commun. Surv. Tutor. 19(1), 531–549 (2016)
Zamanian, E., Yu, X., Stonebraker, M., Kraska, T.: Rethinking database high availability with RDMA networks. Proc. VLDB Endow. 12(11), 1637–1650 (2019)
Acknowledgement
This work was supported in part by the National Key Research and Development Program of China under Grant 2019YFB2102000; in part by the Natural Science Foundation of Jiangsu Province under Grant BK20200067.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Tu, Y., Han, Y., Jin, H., Chen, Z., Zhao, Y. (2021). RDMA Based Performance Optimization on Distributed Database Systems: A Case Study with GoldenX. In: Liu, Z., Wu, F., Das, S.K. (eds) Wireless Algorithms, Systems, and Applications. WASA 2021. Lecture Notes in Computer Science(), vol 12938. Springer, Cham. https://doi.org/10.1007/978-3-030-86130-8_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-86130-8_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86129-2
Online ISBN: 978-3-030-86130-8
eBook Packages: Computer ScienceComputer Science (R0)