skip to main content
research-article

A Hybrid Approach to Failed Disk Recovery Using RAID-6 Codes: Algorithms and Performance Evaluation

Published: 01 October 2011 Publication History

Abstract

The current parallel storage systems use thousands of inexpensive disks to meet the storage requirement of applications. Data redundancy and/or coding are used to enhance data availability, for instance, Row-diagonal parity (RDP) and EVENODD codes, which are widely used in RAID-6 storage systems, provide data availability with up to two disk failures. To reduce the probability of data unavailability, whenever a single disk fails, disk recovery will be carried out. We find that the conventional recovery schemes of RDP and EVENODD codes for a single failed disk only use one parity disk. However, there are two parity disks in the system, and both can be used for single disk failure recovery. In this article, we propose a hybrid recovery approach that uses both parities for single disk failure recovery, and we design efficient recovery schemes for RDP code (RDOR-RDP) and EVENODD code (RDOR-EVENODD). Our recovery scheme has the following attractive properties: (1) “read optimality” in the sense that our scheme issues the smallest number of disk reads to recover a single failed disk and it reduces approximately 1/4 of disk reads compared with conventional schemes; (2) “load balancing property” in that all surviving disks will be subjected to the same (or almost the same) amount of additional workload in rebuilding the failed disk.
We carry out performance evaluation to quantify the merits of RDOR-RDP and RDOR-EVENODD on some widely used disks with DiskSim. The offline experimental results show that RDOR-RDP and RDOR-EVENODD outperform the conventional recovery schemes of RDP and EVENODD codes in terms of total recovery time and recovery workload on individual surviving disk. However, the improvements are less than the theoretical value (approximately 25%), as RDOR-RDP and RDOR-EVENODD change the disk access pattern from purely sequential to a more random one compared with their conventional schemes.

References

[1]
Baker, M., Shah, M., Rosenthal, D. S. H., Roussopoulos, M., Maniatis, P., Giuli, T., and Bungale, P. 2006. A fresh look at the reliability of long-term digital storage. In Proceedings of the EuroSys Conference (EuroSys’06). ACM, 221--234.
[2]
Blaum, M., Brady, J., Bruck, J., and Menon, J. 1995. EVENODD: An efficient scheme for tolerating double disk failures in RAID architectures. IEEE Trans. Comput. 44, 2, 192--202.
[3]
Bucy, J., Schindler, J., Schlosser, S., and Ganger, G. 2008. The DiskSim simulation environment (v4.0). Tech. rep. CMU_PDL_08_101, Carnegie Melon University.
[4]
Cassidy, B. and Hafner, J. L. 2007. Space efficient matrix methods for lost data reconstruction in erasure codes. Tech. rep. RJ10415, IBM Research.
[5]
Chen, P. M., Lee, E. K., Gibson, G. A., Katz, R. H., and Patterson, D. A. 1994. RAID: High-performance, reliable secondary storage. ACM Comput. Surv. 26, 145--185.
[6]
Corbett, P., English, B., Goel, A., Grcanac, T., Kleiman, S., Leong, J., and Sankar, S. 2004. Row-diagonal parity for double disk failure correction. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST’04). USENIX Association, Berkeley, CA, 1--14.
[7]
Dimakis, A. G., Godfrey, P. B., Wu, Y., Wainwright, M. J., and Ramchandran, K. 2010. Network coding for distributed storage systems. IEEE Trans. Inform. Theory 56, 4539--4551.
[8]
Emrich, T., Graf, F., Kriegel, H.-P., Schubert, M., and Thoma, M. 2010. On the impact of flash ssds on spatial indexing. In Proceedings of the 6th International Workshop on Data Management on New Hardware (DaMoN’10). ACM, New York, NY, 3--8.
[9]
Ghemawat, S., Gobioff, H., and Leung, S.-T. 2003. The Google file system. In Proceedings of the 19th ACM Symposium on Operating Systems Principles (SOSP’03). ACM, New York, NY, 29--43.
[10]
Greenan, K. M., Li, X., and Wylie, J. J. 2010. Flat XOR-based erasure codes in storage systems: Constructions, efficient recovery, and tradeoffs. In Procceedings of the 26th Symposium on Mass Storage Systems and Technologies (MSST’10). IEEE, Los Alamitos, CA, 1--14.
[11]
Hafner, J. L., Deenadhayalan, V., Rao, K. K., and Tomlin, J. A. 2005. Matrix methods for lost data reconstruction in erasure codes. In Proceedings of the 4th USENIX Conference on File and Storage Technologies (FAST’05). USENIX Association, Berkeley, CA, 15--30.
[12]
Holland, M. 1994. On-line data reconstruction in redundant disk arrays. Ph.D. thesis, Carnegie Mellon University, Pittsburgh, PA.
[13]
Holland, M. and Gibson, G. A. 1992. Parity declustering for continuous operation in redundant disk arrays. In Proceedings of the 5th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-V). ACM, New York, NY, 23--35.
[14]
Holland, M., Gibson, G. A., and Siewiorek, D. P. 1993. Fast, on-line failure recovery in redundant disk arrays. In Proceedings of the 23rd Annual International Symposium on Fault-Tolerant Computing (FTCS’93). 422--431.
[15]
Holland, M., Gibson, G. A., and Siewiorek, D. P. 1994. Architectures and algorithms for on-line failure recovery in redundant disk arrays. Distrib. Paral. Datab. 2, 3, 295--335.
[16]
Joukov, N., Krishnakumar, A. M., Patti, C., Rai, A., Satnur, S., Traeger, A., and Zadok, E. 2007. RAIF: Redundant array of independent filesystems. In Proceedings of 24th IEEE Conference on Mass Storage Systems and Technologies (MSST’07). IEEE, Los Alamitos, CA, 199--212.
[17]
Kubiatowicz, J., Bindel, D., Chen, Y., Czerwinski, S., Eaton, P., Geels, D., Gummadi, R., Rhea, S., Weatherspoon, H., Weimer, W., Wells, C., and Zhao, B. 2000. Oceanstore: An architecture for global-scale persistent storage. In Proceedings of the 9th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS’00). 190--201.
[18]
Lee, J. Y. B. and Lui, J. C. S. 2002. Automatic recovery from disk failure in continuous-media servers. IEEE Trans. Paral. Distrib. Syst. 13, 5, 499--515.
[19]
Lueth, C. 2004. RAID-DP: Network Appliance implementation of RAID double parity for data protection. Tech. rep. No. 3298, Network Appliance Inc.
[20]
Lyman, P. and Varian, H. R. 2003. How much information? http://www.sims.berkeley.edu/how-much-info-2003.
[21]
Ma, S. L. 1994. A survey of partial difference sets. Des. Codes Cryptog. 4, 3, 221--261.
[22]
Menon, J. and Mattson., D. 1992. Comparison of sparing alternative for disk arrays. In Proceedings of the International Symposium on Computer Architecture (ISCA’92). 318--329.
[23]
Merchant, A. and Yu, P. S. 1996. Analytic modeling of clustered RAID with mapping based on nearly random permutation. IEEE Trans. Comput. 45, 3, 367--373.
[24]
Muntz, R. R. and Lui, J. C. S. 1990. Performance analysis of disk arrays under failure. In Proceedings of the 16th International Conference on Very Large Databases (VLDB’90). Morgan Kaufmann Publishers Inc., San Francisco, CA, 162--173.
[25]
Pinheiro, E., Weber, W.-D., and Barroso, L. A. 2007. Failure trends in a large disk drive population. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST’07). USENIX Association, Berkeley, CA, 17--28.
[26]
Plank, J. S. 2008. The RAID-6 liberation codes. In Proceedings of the 6th USENIX Conference on File and Storage Technologies (FAST’08). USENIX Association, Berkeley, CA, 1--14.
[27]
Plank, J. S., Luo, J., Schuman, C. D., Xu, L., and Wilcox-O’Hearn, Z. 2009. A performance evaluation and examination of open-source erasure coding libraries for storage. In Proccedings of the 7th Conference on File and Storage Technologies (FAST’09). USENIX Association, Berkeley, CA, 253--265.
[28]
Pless, V. 1998. Introduction to the Theory of Error-Correcting Codes. Wiley Interscience.
[29]
Schroeder, B. and Gibson, G. A. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST’07). USENIX Association, Berkeley, CA, 1--16.
[30]
Seagate. 2007. Cheetah® 15K.5 Fibre Channel 146-GB Hard Drive ST3146855FC Product Manual. Tech. rep. Cheetah 15K.5 FC, Seagate Inc.
[31]
Thomasian, A. and Menon, J. 1997. RAID-5 performance with distributed sparing. IEEE Trans. Paral. Distrib. Syst. 8, 6, 640--657.
[32]
Tian, L., Feng, D., Jiang, H., Zhou, K., Zeng, L., Chen, J., Wang, Z., and Song, Z. 2007. PRO: A popularity-based multi-threaded reconstruction optimization for RAID-structured storage systems. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST’05). USENIX Association, Berkeley, CA, 301--314.
[33]
van Lint, J., Wilson, R. M., and Hale, J. K. 1993. A Course in Combinatorics. Cambridge University Press, Cambridge, UK.
[34]
Wikipedia. 2010. DDR2 SDRAM. http://en.wikipedia.org/wiki/DDR2_SDRAM.
[35]
Wu, S., Jiang, H., Feng, D., Tian, L., and Mao, B. 2009. Workout: I/O workload outsourcing for boosting RAID reconstruction performance. In Proccedings of the 7th USENIX Conference on File and Storage Technologies (FAST’09). USENIX Association, Berkeley, CA, 239--252.
[36]
Xiang, L., Xu, Y., Lui, J. C., and Chang, Q. 2010. Optimal recovery of single disk failure in RDP code storage systems. In Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’10). ACM, New York, NY, 119--130.
[37]
Xin, Q., Miller, E. L., Schwarz, T., Long, D. D. E., Brandt, S. A., and Litwin, W. 2003. Reliability mechanisms for very large storage systems. In Proceedings of the 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies (MSST’03). IEEE, Los Alamitos, CA, 146--156.
[38]
Xin, Q., Miller, E. L., and Schwarz, T. J. E. 2004. Evaluation of distributed recovery in large-scale storage systems. In Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing (HPDC’04). IEEE, Los Alamitos, CA, 172--181.
[39]
Xu, L. and Bruck, J. 1999. X-code: MDS array codes with optimal encoding. IEEE Trans. Inform. Theory 45, 1, 272--276.

Cited By

View all
  • (2024)A Survey of the Past, Present, and Future of Erasure Coding for Storage SystemsACM Transactions on Storage10.1145/3708994Online publication date: 31-Dec-2024
  • (2023)Optimal Repair Algorithm of Single-Disk Failure for Array Codes with Local Properties2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386568(2319-2325)Online publication date: 15-Dec-2023
  • (2022)SA-RSR: a read-optimal data recovery strategy for XOR-coded distributed storage systemsSA-RSR: 一种适用于异或类纠删码分布式存储系统的数据读取最优恢复方法Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.210024223:6(858-875)Online publication date: 5-Jul-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Storage
ACM Transactions on Storage  Volume 7, Issue 3
October 2011
120 pages
ISSN:1553-3077
EISSN:1553-3093
DOI:10.1145/2027066
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2011
Accepted: 01 May 2011
Revised: 01 April 2011
Received: 01 October 2010
Published in TOS Volume 7, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Disk failure
  2. EVENODD code
  3. RAID recovery
  4. RDP code
  5. recovery algorithm

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)16
  • Downloads (Last 6 weeks)0
Reflects downloads up to 24 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Survey of the Past, Present, and Future of Erasure Coding for Storage SystemsACM Transactions on Storage10.1145/3708994Online publication date: 31-Dec-2024
  • (2023)Optimal Repair Algorithm of Single-Disk Failure for Array Codes with Local Properties2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386568(2319-2325)Online publication date: 15-Dec-2023
  • (2022)SA-RSR: a read-optimal data recovery strategy for XOR-coded distributed storage systemsSA-RSR: 一种适用于异或类纠删码分布式存储系统的数据读取最优恢复方法Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.210024223:6(858-875)Online publication date: 5-Jul-2022
  • (2022)A New Architecture of Task Classification Oriented Atomization Computing Scheduling2022 International Conference on Intelligent Manufacturing, Advanced Sensing and Big Data (IMASBD)10.1109/IMASBD57215.2022.00020(82-86)Online publication date: Jul-2022
  • (2022)BibliographyStorage Systems10.1016/B978-0-32-390796-5.00023-1(641-693)Online publication date: 2022
  • (2022)Redundant Arrays of Independent Disks - RAIDStorage Systems10.1016/B978-0-32-390796-5.00014-0(269-336)Online publication date: 2022
  • (2021)A single failure correction accelerated RAID-6 code2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)10.1109/ICESIT53460.2021.9696995(120-123)Online publication date: 22-Nov-2021
  • (2021)An Efficient RAID6 System Based on XOR Accelerator2021 3rd International Conference on Computer Communication and the Internet (ICCCI)10.1109/ICCCI51764.2021.9486809(171-175)Online publication date: 25-Jun-2021
  • (2020)Binary MDS Array Codes With Optimal RepairIEEE Transactions on Information Theory10.1109/TIT.2019.293911166:3(1405-1422)Online publication date: Mar-2020
  • (2020)AZ-Recovery: An Efficient Crossing-AZ Recovery Scheme for Erasure Coded Cloud Storage Systems2020 International Symposium on Reliable Distributed Systems (SRDS)10.1109/SRDS51746.2020.00031(236-245)Online publication date: Sep-2020
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media