Skip to main content
Log in

An efficient data layout scheme for better I/O balancing in RAID-6 storage systems

  • Published:
Frontiers of Information Technology & Electronic Engineering Aims and scope Submit manuscript

Abstract

Among redundant arrays of independent disks (RAID)-6 codes, maximum distance separable (MDS) based RAID-6 codes are popular because they have the optimal storage efficiency. Although vertical MDS codes exhibit better load balancing compared to horizontal MDS codes in partial stripes, an I/O unbalancing problem still exists in some vertical codes. To address this issue, we propose a novel efficient data layout, uniform P-code (UPC), to support highly balanced I/Os among P-coded disk arrays (i.e., PC). In UPC, the nonuniformly distributed information symbols in each parity chain of P-code are moved along their columns to other rows, thus enabling the parity chain to keep original parity relationships and tolerate double disk failures. The UPC scheme not only achieves optimal storage efficiency, computational complexity, and update complexity, but also supports better I/O balancing in the context of large-scale storage systems. We also conduct a performance study on reconstruction algorithms using an analytical model. Besides extensive theoretical analysis, comparative performance experiments are conducted by replaying real-world workloads under various configurations. Experimental results illustrate that our UPC scheme significantly outperforms the PC scheme in terms of average user response time. In particular, in the case of a 12-disk array, the UPC scheme can improve the access performance of the RAID-6 storage system by 29.9% compared to the PC scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Bachmat, E., Ofek, Y., Zakai, A., et al., 2004. Load Balancing on Disk Array Storage Device. US Patent 6711649.

    Google Scholar 

  • Blaum, M., Roth, R.M., 1999. On lowest density MDS codes. IEEE Trans. Inform. Theory, 45(1):46–59. [doi:10.1109/18.746771]

    Article  MATH  MathSciNet  Google Scholar 

  • Blaum, M., Brady, J., Bruck, J., et al., 1995. EVENODD: an efficient scheme for tolerating double disk failures in RAID architectures. IEEE Trans. Comput., 44(2):192–202. [doi:10.1109/12.364531]

    Article  MATH  Google Scholar 

  • Corbett, P., English, B., Goel, A., et al., 2004. Row-diagonal parity for double disk failure correction. Proc. 3rd USENIX Conf. on File and Storage Technologies, p.1–14.

    Google Scholar 

  • Ganger, G.R., Worthington, B.L., Hou, R.Y., et al., 1993. Disk subsystem load balancing: disk striping vs. conventional data placement. Proc. 26th Hawaii Int. Conf. on System Sciences, p.40–49. [doi:10.1109/HICSS.1993.270759]

    Google Scholar 

  • Greenan, K.M., Li, X.Z., Wylie, J.J., 2010. Flat XOR-based erasure codes in storage systems: constructions, efficient recovery, and tradeoffs. Proc. IEEE 26th Symp. on Mass Storage Systems and Technologies, p.1–14. [doi:10.1109/MSST.2010.5496983]

    Google Scholar 

  • Hafner, J.L., 2005. WEAVER codes: highly fault tolerant erasure codes for storage systems. Proc. 4th USENIX Conf. on File and Storage Technologies, p.16.

    Google Scholar 

  • Holland, M., Gibson, G.A., 1992. Parity declustering for continuous operation in redundant disk arrays. Proc. 5th Int. Conf. on Architectural Support for Programming Languages and Operating Systems, p.23–35. [doi:10.1145/143365.143383]

    Google Scholar 

  • Huang, C., Chen, M., Li, J., 2007. Pyramid codes: flexible schemes to trade space for access efficiency in reliable data storage systems. Proc. 6th IEEE Int. Symp. on Network Computing and Applications, p.79–86. [doi:10.1109/NCA.2007.37]

    Google Scholar 

  • Jantz, R.M., 1999. Method for Host-Based I/O Workload Balancing on Redundant Array Controllers. US Patent 5937428.

    Google Scholar 

  • Jin, C., Jiang, H., Feng, D., et al., 2009. P-code: a new RAID-6 code with optimal properties. Proc. 23rd Int. Conf. on Supercomputing, p.360–369. [doi:10.1145/1542275.1542326]

    Google Scholar 

  • Khan, O., Burns, R.C., Plank, J.S., et al., 2012. Rethinking erasure codes for cloud file systems: minimizing I/O for recovery and degraded reads. Proc. 11th USENIX Conf. on File and Storage Technologies, p.20.

    Google Scholar 

  • Patterson, D.A., Gibson, G., Katz, R.H., 1988. A case for redundant arrays of inexpensive disks (RAID). Proc. ACM SIGMOD Int. Conf. on Management of Data, p.109–116. [doi:10.1145/50202.50214]

    Google Scholar 

  • Plank, J.S., Luo, J., Schuman, C.D., et al., 2009. A performance evaluation and examination of open-source erasure coding libraries for storage. Proc. 8th USENIX Conf. on File and Storage Technologies, p.253–265.

    Google Scholar 

  • Reed, I.S., Solomon, G., 1960. Polynomial codes over certain finite fields. J. Soc. Ind. Appl. Math., 8(2):300–304. [doi:10.1137/0108018]

    Article  MATH  MathSciNet  Google Scholar 

  • Scheuermann, P., Weikum, G., Zabback, P., 1998. Data partitioning and load balancing in parallel disk systems. VLDB J., 7(1):48–66. [doi:10.1007/s007780050053]

    Article  Google Scholar 

  • Schroeder, B., Gibson, G.A., 2007. Disk failures in the real world: what does an MTTF of 1,000,000 hours mean to you? Proc. 6th USENIX Conf. on File and Storage Technologies, p.1–16.

    Google Scholar 

  • Wan, S., Cao, Q., Xie, C.S., et al., 2010. Code-M: a non-MDS erasure code scheme to support fast recovery from up to two-disk failures in storage systems. Proc. IEEE/IFIP Int. Conf. on Dependable Systems and Networks, p.51–60. [doi:10.1109/DSN.2010.5545009]

    Google Scholar 

  • Wang, Z.Y., Dimakis, A.G., Bruck, J., 2010. Rebuilding for array codes in distributed storage systems. Proc. IEEE GLOBECOM Workshops, p.1905–1909. [doi:10.1109/GLOCOMW.2010.5700274]

    Google Scholar 

  • Wu, S., Jiang, H., Feng, D., et al., 2009. WorkOut: I/O workload outsourcing for boosting RAID reconstruction performance. Proc. 8th USENIX Conf. on File and Storage Technologies, p.239–252.

    Google Scholar 

  • Xiang, L.P., Xu, Y.L., Lui, J., et al., 2011. A hybrid approach to failed disk recovery using RAID-6 codes: algorithms and performance evaluation. ACM Trans. Stor., 7(3), Article 11. [doi:10.1145/2027066.2027071]

    Google Scholar 

  • Xie, P., Huang, J.Z., Cao, Q., et al., 2015. A new non- MDS RAID-6 code to support fast reconstruction and balanced I/Os. Comput. J., in press. [doi:10.1093/comjnl/bxv006]

    Google Scholar 

  • Xu, L.H., Bruck, J., 1999. X-code: MDS array codes with optimal encoding. IEEE Trans. Inform. Theory, 45(1):272–276. [doi:10.1109/18.746809]

    Article  MATH  MathSciNet  Google Scholar 

  • Zhu, Y.F., Lee, P.P.C., Hu, Y.C., et al., 2012a. On the speedup of single-disk failure recovery in XOR-coded storage systems: theory and practice. Proc. IEEE 28th Symp. on Mass Storage Systems and Technologies, p.1–12. [doi:10.1109/MSST.2012.6232371]

    Google Scholar 

  • Zhu, Y.F., Lee, P.P.C., Xiang, L.P., et al., 2012b. A costbased heterogeneous recovery scheme for distributed storage systems with RAID-6 codes. Proc. 42nd Annual IEEE/IFIP Int. Conf. on Dependable Systems and Networks, p.1–12. [doi:10.1109/DSN.2012.6263934]

    Chapter  Google Scholar 

  • Zomaya, A.Y., Teh, Y.H., 2001. Observations on using genetic algorithms for dynamic load-balancing. IEEE Trans. Parall. Distr. Syst., 12(9):899–911. [doi:10.1109/71.954620]

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian-zhong Huang.

Additional information

Project supported by the National Basic Research Program (973) of China (No. 2011CB302303) and the National High-Tech R&D Program (863) of China (No. 2013AA013203)

ORCID: Ping XIE, http://orcid.org/0000-0001-9122-8534

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xie, P., Huang, Jz., Dai, Ew. et al. An efficient data layout scheme for better I/O balancing in RAID-6 storage systems. Frontiers Inf Technol Electronic Eng 16, 335–345 (2015). https://doi.org/10.1631/FITEE.1400362

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1631/FITEE.1400362

Key words

Navigation