Elsevier

Performance Evaluation

Volume 22, Issue 1, February 1995, Pages 111-133
Performance Evaluation

Data integrity analysis of disk array systems with analytic modeling of coverage

https://doi.org/10.1016/0166-5316(93)E0041-3Get rights and content

Abstract

Detailed dependability models of various disk array organizations are developed taking into account both the hard disk failures and transient errors. Various error and failure modes of individual disks and the disk array are identified. A small proportion of transient errors account for uncovered failures in a disk array. This leads to analytic computation of probability of disk failures based on several factors including byte error rate of a disk and ECC code etc. Traditionally used measures like MTTDL and data availability remain virtually unchanged with change in mean recovery time. New dependability measures such as degraded capacity time are considered to bring out the effect of mean recovery time on dependability of disk arrays. Our analysis also reveals that mirrored disk organization has higher MTTDL than other disk array organizations if the only failure mode considered is data loss while catastrophic errors are ignored. However, if catastrophic errors are taken into account, then RAID-3,4,5 organizations have higher data-integrity than other disk array schemes. We also develop models that take into account the reliability of support hardware components and different placement schemes for arranging support hardware such as power supply. Our analysis reveals that RAID-1 benefits the most from orthogonal placement of support hardware.

References (26)

  • M. Malhotra et al.

    Reliability analysis of redundant arrays of inexpensive disks

    J. Parallel Distrib. Comput.

    (1993)
  • E.R. Berlekamp

    The technology of error-correcting codes

  • D. Bitton et al.

    Disk shadowing

  • G.A. Gibson

    Performance and reliability in redundant arrays of inexpensive disks

  • G.A. Gibson et al.

    Failure correction techniques for large disk arrays

  • G.A. Gibson et al.

    Designing disk arrays for high data reliability

    J. Parallel Distrib. Comput.

    (1993)
  • J. Gray et al.

    Parity striping of disc arrays: low-cost reliable storage with acceptable throughput

  • D. Heimann et al.

    Availability and reliability modeling of computer systems

  • M. Holland, Personal communication,...
  • M. Holland et al.

    Parity declustering for continuous operation in redundant disk arrays

  • T.D. Howell

    Analysis of correctable errors in IBM 3380 disk file

    IBM J. Res. Develop.

    (1984)
  • R.H. Katz et al.

    Disk system architectures for high performance computing

  • J.A. Katzman

    A fault-tolerant computing system

  • Cited by (8)

    View all citing articles on Scopus

    This work was supported in part by the National Science Foundation under Grant CCR-9108114 and by the Naval Surface Warfare Center N60921-92-C-0161.

    Current address: AT&T Bell Labs., Holmdel, NJ 07733, USA.

    View full text