Surge handling as a measure of real-time system dependability

  • Workshop on Parallel and Distributed Real-Time Systems (David Andrews, UofA, USA; P.D.V. van der Stok, Eindhoven University, The Netherlands; Kenji Toda, Electro-tech Laboratory, Japan)
  • Conference paper
Parallel and Distributed Processing (IPPS 1998)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1388))

Abstract

Traditional reliability measures for computer systems can be classified as either computer-centric or application-centric. The former concentrate on hardware resources and ignore the application's needs. The latter focus on the requirements of the specific application being executed and therefore require knowledge of all its details, information that may not always be readily available. Moreover, viewing the system's reliability through a single application is too restrictive: it reveals very little about how the system will handle other applications.

In this paper we present new measures for real-time system reliability. These measures are application-sensitive rather than application-centric, and are especially suitable for systems executing various applications with different attributes, some of which may not be known in advance.

Our proposed measures capture the capability of a real-time system to respond successfully to unexpected surges in the workload. These surges may result from a phase change in the system's mission, an application-related emergency situation or the failure of some system resources. The ability of the system to handle such surges determines, to a large extent, its chances of survival and meeting its applications' deadlines.

Editor information

José Rolim

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Koren, Z., Koren, I., Krishna, C.M. (1998). Surge handling as a measure of real-time system dependability. In: Rolim, J. (eds) Parallel and Distributed Processing. IPPS 1998. Lecture Notes in Computer Science, vol 1388. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64359-1_776

  • DOI: https://doi.org/10.1007/3-540-64359-1_776

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64359-3

  • Online ISBN: 978-3-540-69756-5

  • eBook Packages: Springer Book Archive
