skip to main content
10.1145/2967938.2974051acmconferencesArticle/Chapter ViewAbstractPublication PagespactConference Proceedingsconference-collections
poster

POSTER: Fault-tolerant Execution on COTS Multi-core Processors with Hardware Transactional Memory Support

Published:11 September 2016Publication History

ABSTRACT

Software-based fault-tolerance mechanisms can increase the reliability of multi-core CPUs while being cheaper and more flexible than hardware solutions like lockstep architectures. However, checkpoint creation, error detection and correction entail high performance overhead if implemented in software. We propose a software/hardware hybrid approach, which leverages Intel's hardware transactional memory (TSX) to support implicit checkpoint creation and fast rollback. Hardware enhancements are proposed and evaluated, leading to a resulting performance overhead of 19% on average.

References

  1. F. Haas, S. Weis, S. Metzlaff, and T. Ungerer. Exploiting Intel TSX for Fault-Tolerant Execution in Safety-Critical Systems. In Proceedings of the International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT), pages 197--202, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  2. P. Hammarlund, A. J. Martinez, A. A. Bajwa, D. L. Hill, E. Hallnor, et al. Haswell: The Fourth-Generation Intel Core Processor. IEEE Micro, 34(2):6--20, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  3. A. Shye, J. Blomstedt, T. Moseley, V. J. Reddi, and D. A. Connors. PLR: A Software Approach to Transient Fault Tolerance for Multicore Architectures. IEEE Transactions on Dependable and Secure Computing (TDSC), 6(2):135--148, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. C. Smolens, B. T. Gold, B. Falsafi, and J. C. Hoe. Reunion: Complexity-Effective Multicore Redundancy. In Proceedings of the International Symposium on Microarchitecture (MICRO), pages 223--234, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. POSTER: Fault-tolerant Execution on COTS Multi-core Processors with Hardware Transactional Memory Support

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation
              September 2016
              474 pages
              ISBN:9781450341219
              DOI:10.1145/2967938

              Copyright © 2016 Owner/Author

              Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 11 September 2016

              Check for updates

              Qualifiers

              • poster

              Acceptance Rates

              PACT '16 Paper Acceptance Rate31of119submissions,26%Overall Acceptance Rate121of471submissions,26%

              Upcoming Conference

              PACT '24
              International Conference on Parallel Architectures and Compilation Techniques
              October 14 - 16, 2024
              Southern California , CA , USA

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader