Skip to main content

On the Impact of Hardware Faults – An Investigation of the Relationship between Workload Inputs and Failure Mode Distributions

  • Conference paper
Book cover Computer Safety, Reliability, and Security (SAFECOMP 2012)

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 7612))

Included in the following conference series:

Abstract

Technology scaling of integrated circuits is making transistors increasingly sensitive to process variations, wear-out effects and ionizing particles. This may lead to an increasing rate of transient and intermittent errors in future microprocessors. In order to assess the risk such errors pose to safety critical systems, it is essential to investigate how temporary errors in the instruction set architecture (ISA) registers and main memory locations influence the behaviour of executing programs. To this end, we investigate – by means of extensive fault injection experiments – how such errors affect the execution of four target programs. The paper makes three contributions. First, we investigate how the failure modes of the target programs vary for different input sets. Second, we evaluate the error coverage of a software-implemented hardware fault tolerant technique that relies on triple-time redundant execution, majority voting and forward recovery. Third, we propose an approach based on assembly language metrics which can be used to correlate the dynamic fault-free behaviour of a program with its failure mode distribution obtained by fault injection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Borkar, S.: Designing reliable systems from unreliable components: the challenges of transistor variability and degradation. IEEE Micro 25(6), 10–16 (2005)

    Article  Google Scholar 

  2. Rebaudengo, M., Sonza Reorda, M., Violante, M.: A new approach to software-implemented fault tolerance. Journal of Electronic Testing: Theory and Applications 20(4), 433–437 (2004)

    Article  Google Scholar 

  3. Reis, G.A., et al.: SWIFT: Software implemented fault tolerance. In: Int. Symp. on Code generation and optimization (CGO 2005), pp. 243–254 (2005)

    Google Scholar 

  4. Skarin, D., Karlsson, J.: Software implemented detection and recovery of soft errors in a brake-by-wire System. In: 7th European Dependable Computing Conf. (EDDC-07), pp. 145–154 (2008)

    Google Scholar 

  5. Segall, Z., et al.: FIAT-fault injection based automated testing environment. In: 18th Int. Symp. on Fault-Tolerant Computing (FTCS-18), pp. 102–107 (1988)

    Google Scholar 

  6. Folkesson, P., Karlsson, J.: Considering Workload Input Variations in Error Coverage Estimation. In: Hlavicka, J., Maehle, E., Pataricza, A. (eds.) EDDC 1999. LNCS, vol. 1667, pp. 171–190. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  7. Alexandersson, R., Karlsson, J.: Fault injection-based assessment of aspect-oriented implementation of fault tolerance. In: 41st Int. Dependable Systems & Networks Conf (DSN), pp. 303–314 (2011)

    Google Scholar 

  8. Barbosa, R., Vinter, J., Folkesson, P., Karlsson, J.: Assembly-Level Pre-injection Analysis for Improving Fault Injection Efficiency. In: Dal Cin, M., Kaâniche, M., Pataricza, A. (eds.) EDCC 2005. LNCS, vol. 3463, pp. 246–262. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  9. Mibench Version 1, http://www.eecs.umich.edu/mibench/

  10. Skarin, D., Barbosa, R., Karlsson, J.: GOOFI-2: A tool for experimental dependability assessment. In: 40th Int. Dependable Systems & Networks Conf. (DSN), pp. 557–562 (2010)

    Google Scholar 

  11. Carreira, J., Madeira, H., Silva, J.G.: A technique for the experimental evaluation of dependability in modern computer system. IEEE Trans. Soft. Eng. 24(2), 125–136 (1998)

    Article  Google Scholar 

  12. Jain, A., Murty, M., Flynn, P.: Data clustering: a review. ACM Computing Surveys (CSUR) 31(3), 264–323 (1999)

    Article  Google Scholar 

  13. Natella, R., Cotroneo, D., Duraes, J., Madeira, H.: On fault representativeness of software fault injection. IEEE Trans. Soft. Eng (2011) (in press) (preprint)

    Google Scholar 

  14. Kanawati, G.A., Kanawati, N.A., Abraham, J.A.: FERRARI: a tool for the validation of system dependability properties. In: 22nd Int. Symp. on Fault-Tolerant Computing (FTCS-22), pp. 336–344 (1992)

    Google Scholar 

  15. Madeira, H., Rela, M., Moreira, F., Silva, J.G.: RIFLE: A general purpose pin-level fault injector. In: 1st European Dependable Computing Conf (EDDC-01), pp. 199–216 (1994)

    Google Scholar 

  16. Arlat, J., et al.: Comparison of physical and software-implemented fault injection techniques. IEEE Trans. on Computers 52(9), 1115–1133 (2003)

    Article  Google Scholar 

  17. Martinez-Alvarez, A., et al.: Compiler-Directed soft error mitigation for embedded systems. IEEE Trans. on Dependable and Secure Computing 9(2), 159–172 (2012)

    Article  Google Scholar 

  18. Ruiz, J.C., Gil, P., Yeste, P., de Andrés, D.: Dependability Benchmarking of automotive control system. In: Dependability Benchmarking for Computer Systems. John Wiley & Sons, Inc. (2008)

    Google Scholar 

  19. Eeckhout, L., Sampson, J., Calder, B.: Exploiting program microarchitecture independent characteristics and phase behavior for reduced benchmark suite simulation. In: IEEE Int. Workload Characterization Symp., pp. 2–12 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Di Leo, D., Ayatolahi, F., Sangchoolie, B., Karlsson, J., Johansson, R. (2012). On the Impact of Hardware Faults – An Investigation of the Relationship between Workload Inputs and Failure Mode Distributions. In: Ortmeier, F., Daniel, P. (eds) Computer Safety, Reliability, and Security. SAFECOMP 2012. Lecture Notes in Computer Science, vol 7612. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33678-2_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33678-2_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33677-5

  • Online ISBN: 978-3-642-33678-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics