Skip to main content

Concurrent Error Detection Using Watchdog Processors in the Multiprocessor System MEMSY

  • Conference paper
Fault-Tolerant Computing Systems

Part of the book series: Informatik-Fachberichte ((INFORMATIK,volume 283))

Abstract

In this paper a proposal for an architecture of a general purpose watchdog processor is made. This watchdog processor monitors the behavior of the main processor by checking the control flow of processes using the Extended Signature Integrity Checking method (ESIC). The watchdog processor is independent of the architecture of the main processor because it is linked to the main processor by a memory interface.

The watchdog processor is well usable for multiprocessor systems based on standard components and a RISC/CISC processor with large cache as main processor. For the usage in multiprocessor systems the watchdog processors have their own communication network to exchange diagnosis information. As example for the distributed shared memory multiprocessor system MEMSY the realization of this system-level error detection mechanism is shown.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Andrews, D. M.: Using Executable Assertions for Testing and Fault-Tolerance, Proc. 9th FTCS, 102–105, 1979

    Google Scholar 

  2. Daniels, S. F.: A Concurrent Test Technique for Standard Microprocessors, Dig. Papers Compcon Spring 83, 389–394, 1983

    Google Scholar 

  3. Eifert, J. B.; Shen, J. P.: Processor Monitoring Using Asynchronous Signatured Instruction Streams, Proc. 14th FTCS, 394–399, 1984

    Google Scholar 

  4. Fritsch, G.; Henning, W.; Hessenauer, H.; Klar, R.; Linster, C.U.; Oehlrich, C.W.; Schlenk, P.; Volkert, J.: Distributed Shared Memory Multiprocessor Architecture MEMS Y for High Performance Parallel Computations, ACM Computer Architecture News, vol$117, no. 6,22–35, 1989

    Google Scholar 

  5. Hackbusch, W. et. al.: Multigrid Methods, Proceedings of the Conference held at Cologne, Nov 23–27,1981, Springer Verlag, Berlin 1982

    Book  MATH  Google Scholar 

  6. Hildebrand, U.: Transputer-basiertes Testsystem für Verbindungsnetzwerke speichergekoppelter Multiprozessoren, Workshop über Parallelverarbeitung, Segau, June 24.–30., 1990

    Google Scholar 

  7. Hönig, J.: Implementierung eines Präprozessors zur Generierung von Watchdogprogrammen, Diplomarbeit, IMMD in, Uni-Erlangen, 1990

    Google Scholar 

  8. Horton, G.: Parallel Solution of the Navier-Stokes Equations in Complex Geometries; NNFM Vol. 31, pp. 120–128; Vieweg, Braunschweig 1991

    Google Scholar 

  9. Lehmann-Emilius, L.: Rekonfiguration und Rückwärtsfehlerbehebung für Multiprozessoren mit begrenzter Nachbarschaft - eine Untersuchung zur verteilten Recovery, Dissertationsschrift, Arbeitsberichte des IMMD, Uni-Erlangen, Bd. 23, No. 2, 1990

    Google Scholar 

  10. Lu, D. J.: Watchdog Processors and Structural Integrity Checking, IEEE,T.o.C., 31 /7, 681–685, 1982

    Google Scholar 

  11. Mahmood, A; McCluskey, E. J.: Concurrent Error Detection Using Watchdog Processors - A Survey, IEEE,T.o.C., 37 /2, 160–174, 1988

    Google Scholar 

  12. Namjoo, M.: Cerberus-16: An Architecture for a General Purpose Watchdog Processor, Proc. 13th FTCS, 216–219, 1983

    Google Scholar 

  13. Namjoo, M.; McCluskey, E. J.: Watchdog Processors and Capability Checking, Proc. 12th FTCS, 245–248, 1982

    Google Scholar 

  14. Schuette, M. A.; Shen, J. P.: Processor Control Flow Monitoring Using Signatured Instruction Streams, IEEE T.o.C., 36 /2, 264–275, 1987.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1991 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Michel, E., Hohl, W. (1991). Concurrent Error Detection Using Watchdog Processors in the Multiprocessor System MEMSY. In: Cin, M.D., Hohl, W. (eds) Fault-Tolerant Computing Systems. Informatik-Fachberichte, vol 283. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76930-6_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-76930-6_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-54545-3

  • Online ISBN: 978-3-642-76930-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics