Abstract
RDMA is becoming increasingly popular not only in HPC but also in data centers, where high throughput and low latency are critical requirements. RDMA supports several transport types with different characteristics, so users can choose the one that best meets their requirements. Reliable connected (RC) transport is easy to use but scales poorly, whereas unreliable datagram (UD) transport scales well but is hard to use. Dynamically connected (DC) transport was introduced more recently to address the limitations of these existing transports, delivering both usability and scalability. Despite these merits, DC transport has not yet been widely adopted because of its hardware dependency. To eliminate this dependency, we design and implement SoftDC, an entirely software-based DC transport. SoftDC emulates the behavior of DC transport with the basic RDMA primitives: UD transport for connection establishment and RC transport for data transfer. We build a SoftDC-based application and evaluate it against an RC-based counterpart to demonstrate its effectiveness. Our experimental results show that the scheme has the potential to provide the advantages of both RC and UD transports, although our prototype incurs large connection establishment costs, which can be amortized across a large data transfer.
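The amortization claim can be illustrated with a simple cost model. This is a minimal sketch, not the evaluation method used in the paper: it assumes each transfer pays one fixed connection setup cost followed by a transfer at constant link bandwidth. The function names and the numeric values (`SETUP_SECONDS`, `LINK_BPS`) are hypothetical and chosen only for illustration; they are not measurements of SoftDC.

```python
def effective_throughput(payload_bytes, setup_seconds, link_bytes_per_second):
    """Bytes per second delivered when a transfer first pays a fixed setup cost."""
    transfer_seconds = payload_bytes / link_bytes_per_second
    return payload_bytes / (setup_seconds + transfer_seconds)


def link_efficiency(payload_bytes, setup_seconds, link_bytes_per_second):
    """Fraction of the raw link bandwidth actually achieved (between 0 and 1)."""
    achieved = effective_throughput(payload_bytes, setup_seconds, link_bytes_per_second)
    return achieved / link_bytes_per_second


# Hypothetical values for illustration only (not results from the paper).
SETUP_SECONDS = 1e-3  # assumed connection establishment cost per transfer
LINK_BPS = 5e9        # assumed link bandwidth in bytes per second

if __name__ == "__main__":
    # Larger payloads spread the fixed setup cost over more bytes.
    for size in (4 * 1024, 1024 ** 2, 1024 ** 3):
        eff = link_efficiency(size, SETUP_SECONDS, LINK_BPS)
        print(f"{size:>12d} B  efficiency = {eff:.4f}")
```

Under this model, efficiency reaches one half when the payload equals the setup time multiplied by the link bandwidth, and approaches one as the payload grows beyond that point.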
Acknowledgements
This research was supported by the Next-Generation Information Computing Development Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Science and ICT (2015M3C4A7065646, NRF-2017R1A2B4005681).
About this article
Cite this article
Park, J., Son, Y., Yeom, H.Y. et al. SoftDC: software-based dynamically connected transport. Cluster Comput 23, 347–357 (2020). https://doi.org/10.1007/s10586-019-02926-0
DOI: https://doi.org/10.1007/s10586-019-02926-0