A reliable and energy-efficient storage system with erasure coding cache

Wan, Ji-guang; Li, Da-ping; Qu, Xiao-yang; Yin, Chao; Wang, Jun; Xie, Chang-sheng

doi:10.1631/FITEE.1600972

Published: 27 October 2017

Volume 18, pages 1370–1384, (2017)

Frontiers of Information Technology & Electronic Engineering

Ji-guang Wan¹ (ORCID: orcid.org/0000-0003-0489-6783), Da-ping Li¹, Xiao-yang Qu¹, Chao Yin¹, Jun Wang¹ & Chang-sheng Xie¹

Abstract

In modern energy-saving replication storage systems, a primary group of disks is always powered up to serve incoming requests, while the other disks are often spun down during slack periods to save energy. However, because new writes cannot be immediately synchronized to all disks, system reliability is degraded. In this paper, we develop a high-reliability, energy-efficient replication storage system, named RERAID, based on RAID10. RERAID uses part of the free space in the primary disk group and applies erasure coding to construct a code cache at the front end that absorbs new writes. Because the code cache uses erasure coding to support recovery from the failure of two or more disks, RERAID guarantees reliability comparable to that of a RAID10 storage system. In addition, we develop an algorithm, called erasure coding write (ECW), that buffers many small random writes into a few large writes, which are then written to the code cache sequentially and in parallel to improve write performance. Experimental results show that RERAID significantly improves write performance and saves more energy than existing solutions.
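
The idea of buffering small random writes into full stripes protected by an erasure code can be illustrated with a minimal sketch. This is not the authors' ECW implementation, whose details are not given in the abstract. It assumes a RAID-6 style P+Q code over GF(2^8) as a stand-in for the erasure code, and the names `WriteBuffer`, `encode`, and `recover_two`, as well as the chunk size and stripe width, are hypothetical choices made for illustration only.

```python
from dataclasses import dataclass, field

CHUNK = 8                    # bytes per chunk; tiny on purpose (assumed value)
DATA_DISKS = 4               # data chunks per stripe (assumed value)
STRIPE = CHUNK * DATA_DISKS  # bytes of user data in one full stripe

def gf_mul(a, b):
    """Multiply two bytes in GF(2^8) modulo x^8 + x^4 + x^3 + x^2 + 1."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11D
        b >>= 1
    return r

def gf_pow(a, n):
    """Raise a to the n-th power in GF(2^8) by repeated multiplication."""
    r = 1
    for _ in range(n):
        r = gf_mul(r, a)
    return r

def encode(chunks):
    """Return (P, Q): P is the XOR of the chunks, Q weights chunk i by 2^i."""
    p, q = bytearray(CHUNK), bytearray(CHUNK)
    for i, c in enumerate(chunks):
        coef = gf_pow(2, i)
        for k in range(CHUNK):
            p[k] ^= c[k]
            q[k] ^= gf_mul(coef, c[k])
    return bytes(p), bytes(q)

def recover_two(chunks, x, y, p, q):
    """Rebuild lost data chunks x and y (x != y) from the survivors, P and Q."""
    zero = bytes(CHUNK)
    survivors = [zero if i in (x, y) else c for i, c in enumerate(chunks)]
    p2, q2 = encode(survivors)          # parity contributed by the survivors
    gx, gy = gf_pow(2, x), gf_pow(2, y)
    inv = gf_pow(gx ^ gy, 254)          # inverse, since a^255 == 1 for a != 0
    dx, dy = bytearray(CHUNK), bytearray(CHUNK)
    for k in range(CHUNK):
        pd = p[k] ^ p2[k]               # equals Dx ^ Dy
        qd = q[k] ^ q2[k]               # equals gx*Dx ^ gy*Dy
        dx[k] = gf_mul(inv, qd ^ gf_mul(gy, pd))
        dy[k] = pd ^ dx[k]
    return bytes(dx), bytes(dy)

@dataclass
class WriteBuffer:
    """Appends small writes to a log and emits one encoded stripe when full."""
    pending: bytearray = field(default_factory=bytearray)
    log_map: dict = field(default_factory=dict)  # logical address -> log offset
    stripes: list = field(default_factory=list)  # flushed (chunks, P, Q) tuples
    written: int = 0

    def write(self, lba, data):
        self.log_map[lba] = self.written
        self.written += len(data)
        self.pending += data
        while len(self.pending) >= STRIPE:
            block = bytes(self.pending[:STRIPE])
            del self.pending[:STRIPE]
            chunks = [block[i:i + CHUNK] for i in range(0, STRIPE, CHUNK)]
            self.stripes.append((chunks, *encode(chunks)))

buf = WriteBuffer()
for lba in (900, 17, 4242, 8):          # four small writes to scattered addresses
    buf.write(lba, bytes([lba % 256]) * CHUNK)
chunks, p, q = buf.stripes[0]           # the four writes filled exactly one stripe
lost = [None if i in (1, 3) else c for i, c in enumerate(chunks)]
rebuilt = recover_two(lost, 1, 3, p, q) # simulate losing two data chunks
```

In a real array, each data chunk and the two parity chunks of a flushed stripe would go to different disks, so one stripe becomes a single large sequential write per disk issued in parallel. The sketch only keeps the stripes in memory and handles the loss of two data chunks; loss of a parity chunk would be repaired by re-encoding.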

Author information

Authors and Affiliations

Wuhan National Laboratory for Optoelectronics, Department of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China
Ji-guang Wan, Da-ping Li, Xiao-yang Qu, Chao Yin, Jun Wang & Chang-sheng Xie

Corresponding author

Correspondence to Ji-guang Wan.

Additional information

Project supported by the National Natural Science Foundation of China (Nos. 61472152, 61432007, 61572209, and 61300047), the Fundamental Research Funds for the Central Universities, China (No. 2015QN069), the Director Fund of Wuhan National Laboratory for Optoelectronics (WNLO), and the MOE Key Laboratory of Data Storage System, China.

About this article

Cite this article

Wan, Jg., Li, Dp., Qu, Xy. et al. A reliable and energy-efficient storage system with erasure coding cache. Frontiers Inf Technol Electronic Eng 18, 1370–1384 (2017). https://doi.org/10.1631/FITEE.1600972

Received: 31 January 2016
Accepted: 03 May 2016
Published: 27 October 2017
Issue Date: September 2017
DOI: https://doi.org/10.1631/FITEE.1600972

Key words

CLC number

TP316.4
