Skip to main content

Advertisement

Log in

SoftDC: software-based dynamically connected transport

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

RDMA is increasingly becoming popular not only in HPC but also in data centers where high throughput and low latency are critical requirements. RDMA supports several types of transports, each of which has different characteristics, so that users can choose the right one to meet their requirements. Reliable connected (RC) transport has advantages on usability but disadvantages on scalability while unreliable datagram (UD) transport is scalable but hard to use. Dynamically connected (DC) transport has been newly introduced to address these limitations when using one of the existing transports while delivering both usability and scalability. However, despite all of these merits, DC transport is not yet generally adopted in the related fields due to hardware dependency. To eliminate the hardware dependency, in this paper, we design and implement SoftDC, a totally software-based DC transport. SoftDC uses the basic RDMA primitives to emulate the behavior of DC transport; UD transport for connection and RC transport for data transfer. We build and evaluate a SoftDC transport-based application to prove its effectiveness compared with RC transport-based one. Our experimental results show that our scheme has potential to provide the advantages of both RC and UD transports although our prototype has large connection establishment costs that can be amortized across a large data transfer.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Park, J., Yeom, H.Y.: Design and implementation of software-based dynamically connected transport. In: International Workshop on Autonomic Management of High Performance Grid and Cloud Computing (AMGCC 2018), (2018)

    Article  Google Scholar 

  2. Koop, M.J., Jones, T., Panda, D.K.: MVAPICH-Aptus: scalable high-performance multi-transport MPI over InfiniBand. In: IEEE International Symposium on Parallel and Distributed Processing, 2008, pp. 1–12. IPDPS 2008. IEEE (2008)

  3. Jose, J., Subramoni, H., Kandalla, K., Wasi-ur-Rahman, M., Wang, H., Narravula, S., Panda, D. K.: Scalable memcached design for infiniband clusters using hybrid transports. In: Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), pp. 236–243. IEEE Computer Society (2012)

  4. MVAPICH2-X: Unified MPI+PGAS Communication Runtime over OpenFabrics/Gen2 for Exascale Systems, http://mvapich.cse.ohio-state.edu/

  5. Dragojevic, A., Narayanan, D., Hodson, O., Castro, M.: FaRM: Fast remote memory. In: Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation, pp. 401–414 (2014)

  6. Haynes, T.: Network File System (NFS) Version 4 Minor Version 2 Protocol (2016)

  7. Axboe, J.: Fio-flexible i/o tester synthetic benchmark. https://github.com/axboe/fio (2005). Accessed 28 Dec 2018

  8. Kalia, A., Kaminsky, M., Andersen, D.G.: Using RDMA efficiently for key-value services. In: ACM SIGCOMM Computer Communication Review, vol. 44, pp. 295–306. ACM (2014)

  9. Kalia, A., Kaminsky, M., Andersen, D.G.: FaSST: fast, scalable and simple distributed transactions with two-sided (RDMA) Datagram RPCs. In: OSDI, vol. 16, pp. 185–201 (2016)

  10. Su, M., Zhang, M., Chen, K., Guo, Z., Wu, Y.: Rfp: When rpc is faster than server-bypass with rdma. In: Proceedings of the 12th European Conference on Computer Systems, pp. 1–15. ACM (2017)

  11. Kalia, A., Kaminsky, M., Andersen, D.G.: Design guidelines for high performance RDMA systems. In: 2016 USENIX Annual Technical Conference, p. 437 (2016)

  12. Memcached: A distributed Memory Object Caching System. https://memcached.org

  13. Wei, X., Shi, J., Chen, Y., Chen, R., Chen, H.: Fast in-memory transaction processing using RDMA and HTM. In: Proceedings of the 25th Symposium on Operating Systems Principles, pp. 87–104. ACM (2015)

  14. Subramoni, H., Hamidouche, K., Venkatesh, A., Chakraborty, S., Panda, D.K.: Designing MPI library with dynamic connected transport (DCT) of InfiniBand: early experiences. In: International Supercomputing Conference, pp. 278–295. Springer, Cham (2014)

    Google Scholar 

Download references

Acknowledgements

This research was supported by Next-Generation Information Computing Development Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT  (2015M3C4A7065646, NRF-2017R1A2B4005681).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yoonhee Kim.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Park, J., Son, Y., Yeom, H.Y. et al. SoftDC: software-based dynamically connected transport. Cluster Comput 23, 347–357 (2020). https://doi.org/10.1007/s10586-019-02926-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-019-02926-0

Keywords

Navigation