skip to main content
10.1145/3230718.3230731acmconferencesArticle/Chapter ViewAbstractPublication PagesancsConference Proceedingsconference-collections
research-article

FTvNF: fault tolerant virtual network functions

Published:23 July 2018Publication History

ABSTRACT

One of the major concerns about Network Function Virtualization (NFV) is the reduced stability of virtual network functions (VNFs), compared to dedicated hardware appliances. Stateful VNFs make recovery a complex process, where a major concern is how to handle non-determinism such as multi-threaded processing, time dependence, and randomness.

In this paper we present FTvNF --- a new approach for network functions recovery with very low overhead in failure-free time. This is in contrast to previous suggestions to take snapshots of the VNF state at certain checkpoints or to store the VNF state externally. Compared with state-of-the-art approaches, our approach significantly reduces the latency overhead incurred by the network elements, both in failure-free operations and when failures occur. In addition, our approach better suits the common case of NFV service chaining, as our mechanisms are applied once per chain, thus significantly improve the performance over approaches that treat each VNF separately.

References

  1. M. Balakrishnan, D. Malkhi, V. Prabhakaran, T. Wobber, M. Wei, and J. D. Davis. 2012. CORFU: A Shared Log Design for Flash Clusters.. In NSDI. 1--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Bremler-Barr, Y. Harchol, and D. Hay. 2016. OpenBox: A Software-Defined Framework for Developing, Deploying, and Managing Network Functions. In SIGCOMM. 511--524. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. B. Cully, G. Lefebvre, D. Meyer, M. Feeley, N. Hutchinson, and A. Warfield. 2008. Remus: High Availability via Asynchronous Virtual Machine Replication. In NSDI. 161--174. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Y. Z. Dong, W. Ye, Y. H. Jiang, I. Pratt, S. Q. Ma, J. Li, and H. B. Guan. 2013. COLO: COarse-grained LOck-stepping Virtual Machines for Non-stop Service. In SOCC. Article 3, 16 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. W. Dunlap, D. G. Lucchetti, M. A. Fetterman, and P. M. Chen. 2008. Execution Replay of Multiprocessor Virtual Machines. In VEE. 121--130. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. ETSI. 2012. Network Functions Virtualization - Introductory White Paper. (2012). http://portal.etsi.org/NFV/NFV_White_Paper.pdf.Google ScholarGoogle Scholar
  7. S. K. Fayazbakhsh, L. Chiang, V. Sekar, M. Yu, and J. C. Mogul. 2014. Enforcing Network-wide Policies in the Presence of Dynamic Middlebox Actions Using Flowtags. In NSDI. 533--546. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Gember-Jacobson, R. Viswanathan, C. Prakash, R. Grandl, J. Khalid, S. Das, and A. Akella. 2014. OpenNF: Enabling Innovation in Network Function Control. In SIGCOMM. 163--174. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Gross, T. Sridhar, P. Garg, C. Wright, I. Ganga, P. Agarwal, K. Duda, D. Dutt, and J. Hudson. 2015. Geneve: Generic Network Virtualization Encapsulation. IETF Internet-Draft. (November 2015). https://tools. ietf.org/html/draft-ietf-nvo3-geneve-00.Google ScholarGoogle Scholar
  10. M. Kablan, A. Alsudais,, E. Keller, and F. Le. 2017. Stateless Network Functions: Breaking the Tight Coupling of State and Processing. In NSDI. 97--112. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Y. Kanizo, O. Rottenstreich, I. Segall, and J. Yallouz. 2017. Optimizing Virtual Backup Allocation for Middleboxes. ACM Transactions on Networking 25, 5 (Oct. 2017), 2759--2772. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. Kanizo, O. Rottenstreich, I. Segall, and J. Yallouz. 2018. Designing Optimal Middlebox Recovery Schemes with Performance Guarantees. In INFOCOM.Google ScholarGoogle Scholar
  13. E. Kohler, R. Morris, B. Chen, J. Jannotti, and M. F. Kaashoek. 2000. The Click Modular Router. ACM TOCS 18, 3 (2000), 263--297. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. KVM. 2017. Kernel Virtual Machine. (2017). https://www.nnux-kvm.org/page/Main_Page.Google ScholarGoogle Scholar
  15. O. Laadan, N. Viennot, and J. Nieh. 2010. Transparent, Lightweight Application Execution Replay on Commodity Multiprocessor Operating Systems. In SIGMETRICS. 155--166. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Lattner and V. Adve. 2004. LLVM: A Compilation Framework for Lifelong Program Analysis and Transformation. In CGO. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. R. Lorch, A. Baumann, L. Glendenning, D. Meyer, and A. Warfield. 2015. Tardigrade: Leveraging Lightweight Virtual Machines to Easily and Efficiently Construct Fault-Tolerant Services. In NSDI. 575--588. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. B. Pfaff, J. Pettit, T. Koponen, E. Jackson, A. Zhou, J. Rajahalme, J. Gross, A. Wang, J. Stringer, P. Shelar, K. Amidon, and M. Casado. 2015. The Design and Implementation of Open vSwitch. In NSDI. 117--130. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. Potharaju and N. Jain. 2013. Demystifying the Dark Side of the Middle: A Field Study of Middlebox Failures in Datacenters. IMC. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. S. Rajagopalan, D. Williams, and H. Jamjoom. 2013. Pico Replication: A High Availability Framework for Middleboxes. In SOCC. 1:1--1:15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Sherry, P. X. Gao, S. Basu, A. Panda, A. Krishnamurthy, C. Maciocco, M. Manesh, J. Martins, S. Ratnasamy, L. Rizzo, and S. Shenker. 2015. Rollback-recovery for middleboxes. In ACM SIGCOMM Computer Communication Review, Vol. 45. ACM, 227--240. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. FTvNF: fault tolerant virtual network functions

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ANCS '18: Proceedings of the 2018 Symposium on Architectures for Networking and Communications Systems
          July 2018
          181 pages
          ISBN:9781450359023
          DOI:10.1145/3230718

          Copyright © 2018 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 23 July 2018

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate88of314submissions,28%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader