ABSTRACT
SSDs exhibit very different failure characteristics compared to hard drives. In particular, the Bit Error Rate (BER) of an SSD climbs as it receives more writes. As a result, RAID arrays composed from SSDs are subject to correlated failures. By balancing writes evenly across the array, RAID schemes can wear out devices at similar times. When a device in the array fails towards the end of its lifetime, the high BER of the remaining devices can result in data loss. We propose Diff-RAID, a parity-based redundancy solution that creates an age differential in an array of SSDs. Diff-RAID distributes parity blocks unevenly across the array, leveraging their higher update rate to age devices at different rates. To maintain this age differential when old devices are replaced by new ones, Diff-RAID reshuffles the parity distribution on each drive replacement. We evaluate Diff-RAID's reliability by using real BER data from 12 flash chips on a simulator and show that it is more reliable than RAID-5, in some cases by multiple orders of magnitude. We also evaluate Diff-RAID's performance using a software implementation on a 5-device array of 80 GB Intel X25-M SSDs and show that it offers a trade-off between throughput and reliability.
- D. G. Andersen, J. Franklin, M. Kaminsky, A. Phanishayee, L. Tan, and V. Vasudevan. FAWN: A Fast Array of Wimpy Nodes. In Proceedings of the 22nd Symposium on Operating Systems Principles (SOSP'09), Big Sky, MT, October 2009. Google ScholarDigital Library
- R. Bitar. Deploying Hybrid Storage Pools With Sun Flash Technology and the Solaris ZFS File System. Technical Report SUN-820-5881-10, Sun Microsystems, October 2008.Google Scholar
- P. Desnoyers. Empirical Evaluation of NAND Flash Memory Performance. In The First Workshop on Hot Topics in Storage (HotStorage'09), Big Sky, MT, October 2009.Google Scholar
- Fusion-io. MySpace Case Study. http://www.fusionio.com/case-studies/myspace-case-study.pdf.Google Scholar
- K. M. Greenan, D. D. Long, E. L. Miller, T. J. E. Schwarz, and A.Wildani. Building Flexible, Fault-Tolerant Flash-based Storage Systems. In The Fifth Workshop on Hot Topics in Dependability (HotDep'09), Lisbon, Portugal, June 2009.Google Scholar
- L. M. Grupp, A. M. Caulfield, J. Coburn, S. Swanson, E. Yaakobi, P. H. Siegel, and J. K. Wolf. Characterizing Flash Memory: Anomalies, Observations, and Applications. In MICRO'09, New York, December 2009. Google ScholarDigital Library
- Intel Corporation. Intel X18-M/X25-M SATA Solid State Drive. http://download.intel.com/design/flash/nand/mainstream/mainstream-sata-ssd-datasheet.pdf.Google Scholar
- A. Kadav, M. Balakrishnan, V. Prabhakaran, and D. Malkhi. Differential RAID: Rethinking RAID for SSD Reliability. In The First Workshop on Hot Topics in Storage (HotStorage' 09), Big Sky, MT, October 2009.Google Scholar
- J. Matthews, S. Trika, D. Hensgen, R. Coulson, and K. Grimsrud. Intel R turbo memory: Nonvolatile disk caches in the storage hierarchy of mainstream computer systems. Transactions on Storage, 4(2):1--24, 2008. Google ScholarDigital Library
- N. Mielke, T. Marquart, N. Wu, J. Kessenich, H. Belgal, E. Schares, F. Trivedi, E. Goodness, and L. R. Nevill. Bit error rate in NAND Flash memories. In IEEE International Reliability Physics Symposium (IRPS), pages 9--19, April 2008.Google ScholarCross Ref
- D. Narayanan, E. Thereska, A. Donnelly, S. Elnikety, and A. Rowstron. Migrating server storage to SSDs: analysis of tradeoffs. In Proceedings of the 4th ACM European conference on Computer systems, pages 145--158, 2009. Google ScholarDigital Library
- Robin Harris. Why RAID-5 stops working in 2009. http://blogs.zdnet.com/storage/?p=162.Google Scholar
- Woody Hutsell. An In-depth Look at the RamSan-500 Cached Flash Solid State Disk. http://www.texmemsys.com/files/f000233.pdf.Google Scholar
Index Terms
- Differential RAID: rethinking RAID for SSD reliability
Recommendations
Differential RAID: Rethinking RAID for SSD reliability
SSDs exhibit very different failure characteristics compared to hard drives. In particular, the bit error rate (BER) of an SSD climbs as it receives more writes. As a result, RAID arrays composed from SSDs are subject to correlated failures. By ...
HPDA: A hybrid parity-based disk array for enhanced performance and reliability
Flash-based Solid State Drive (SSD) has been productively shipped and deployed in large scale storage systems. However, a single flash-based SSD cannot satisfy the capacity, performance and reliability requirements of the modern storage systems that ...
Elastic RAID: Implementing RAID over SSDs with Built-in Transparent Compression
SYSTOR '23: Proceedings of the 16th ACM International Conference on Systems and StorageThis paper studies how RAID (redundant array of independent disks) could take full advantage of modern SSDs (solid-state drives) with built-in transparent compression. In current practice, RAID users are forced to choose a specific RAID level (e.g., ...
Comments