ABSTRACT
In order to suffice the storage requirements under the big data environment, scaling method is generally adopted to increase the storage capacity of the storage system with the exponential growth of data in the current. RAID has received wide attention in the academic and the industry due to good independence and redundancy. After long-term development, RAID has derived different RAID levels according to distinct requirements of users. In order to achieve load balance including old and new, some data need to be migrated from old data disks to new data disks. However, there will generate the disk I/O operations and the XOR computational operations because of the existence of independent parity disk during the migration process. These affect the efficiency of scaling to a certain extent and lead the problem of long scaling time. In this paper, the scaling process for RAID-4 is optimized by reducing the disk I/O operations and the XOR computational overhead. In the comparison experiment analysis of real storage system scaling, the scaling time of optimization approach is reduced by 49.9% to 57.3% compared with the traditional scaling approach.
- Patterson D A, Gibson G, Katz R H. A case for redundant arrays of inexpensive disks (RAID){C}// Proc. ACM SIGMOD Conference. 1988:109--116. Google ScholarDigital Library
- Chen P M, Lee E K, Gibson G A, et al. RAID: high-performance, reliable secondary storage{J}. Acm Computing Surveys, 1994, 26(2):145--185. Google ScholarDigital Library
- Zhang G, Zheng W, Shu J. ALV: A New Data Redistribution Approach to RAID-5 Scaling{J}. IEEE Transactions on Computers, 2010, 59(3):345--357. Google ScholarDigital Library
- Wu C, He X. GSR: A Global Stripe-Based Redistribution Approach to Accelerate RAID-5 Scaling{C}// International Conference on Parallel Processing. IEEE Computer Society, 2012:460--469. Google ScholarDigital Library
- Zhang G, Zheng W, Li K. Rethinking RAID-5 Data Layout for Better Scalability{J}. IEEE Transactions on Computers, 2014, 63(11):2816--2828. Google ScholarDigital Library
- Mao Y, Wan J, Zhu Y, et al. A New Parity-Based Migration Method to Expand RAID-5{J}. IEEE Transactions on Parallel & Distributed Systems, 2014, 25(8):1945--1954.Google ScholarCross Ref
- Liang J, Xu Y, Li Y, et al. ISM- An Intra-Stripe Data Migration Approach for RAID-5 Scaling{C}// International Conference on Networking, Architecture, and Storage. IEEE, 2017:1--10.Google Scholar
- Wu C, He X, Han J, et al. SDM: A Stripe-Based Data Migration Scheme to Improve the Scalability of RAID-6{C}// IEEE International Conference on Cluster Computing. IEEE, 2012. Google ScholarDigital Library
- Zhang G, Li K, Wang J, et al. Accelerate RDP RAID-6 Scaling by Reducing Disk I/Os and XOR Operations{J}. IEEE Transactions on Computers, 2014, 64(1):32--44.Google ScholarCross Ref
- Wan J, Xu P, He X, et al. H-Scale:A Fast Approach to Scale Disk Arrays via Hybrid Stripe Deployment{J}. Acm Transactions on Storage, 2016, 12(3):1--30. Google ScholarDigital Library
- Xia S, Mao Y, Tan M, et al. HCS: Expanding H-Code RAID 6 without Recalculating Parity Blocks in Big Data Circumstance{C}// International Conference of Young Computer Scientists, Engineers and Educators. Springer, Berlin, Heidelberg, 2015:65--72.Google Scholar
- Zhang G, Wu G, Lu Y, et al. Xscale: Online X-Code RAID-6 Scaling Using Lightweight Data Reorganization{J}. IEEE Transactions on Parallel & Distributed Systems, 2016, 27(12):3687--3700. Google ScholarDigital Library
- Gonzalez J L, Cortes T. Increasing the capacity of RAID5 by online gradual assimilation{C}// International Workshop on Storage Network Architecture and Parallel I/os. ACM, 2004:17--24. Google ScholarDigital Library
- Goel A, Shahabi C, Yao S Y D, et al. SCADDAR: An Efficient Randomized Technique to Reorganize Continuous Media Blocks{C}// International Conference on Data Engineering. IEEE Computer Society, 2002:473. Google ScholarDigital Library
- Zheng W, Zhang G. FastScale: accelerate RAID scaling by minimizing data migration{C}// Usenix Conference on File and Stroage Technologies. USENIX Association, 2011:11--11. Google ScholarDigital Library
- Zhang G, Wang J, Li K, et al. Redistribute Data to Regain Load Balance during RAID-4 Scaling{J}. IEEE Transactions on Parallel & Distributed Systems, 2014, 26(1):219--229.Google ScholarCross Ref
- XIE Ping. Research on a New RAID-4 Capacity Expand Layout. Journal of Chinese Computer Systems, 2018, 39(7): 1518--1521.Google Scholar
- Jin H, Zhou X, Feng D, et al. Improving partial stripe write performance in RAID level 5{C}// Second IEEE International Caracas Conference on Devices, Circuits and Systems. IEEE, 1998:396--400.Google Scholar
Index Terms
- Accelerate RAID scaling by reducing disk I/Os and XOR operations
Recommendations
Design and Evaluation of a New Approach to RAID-0 Scaling
Scaling up a RAID-0 volume with added disks can increase its storage capacity and I/O bandwidth simultaneously. For preserving a round-robin data distribution, existing scaling approaches require all the data to be migrated. Such large data migration ...
Striping in a RAID level 5 disk array
SIGMETRICS '95/PERFORMANCE '95: Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systemsRedundant disk arrays are an increasingly popular way to improve I/O system performance. Past research has studied how to stripe data in non-redundant (RAID Level 0) disk arrays, but none has yet been done on how to stripe data in redundant disk arrays ...
Striping in a RAID level 5 disk array
Redundant disk arrays are an increasingly popular way to improve I/O system performance. Past research has studied how to stripe data in non-redundant (RAID Level 0) disk arrays, but none has yet been done on how to stripe data in redundant disk arrays ...
Comments