DDUC: an erasure-coded system with decoupled data updating and coding

Tu, Yaofeng; Xiao, Rong; Han, Yinjun; Chen, Zhenghua; Jin, Hao; Qi, Xuecheng; Sun, Xinyuan

doi:10.1631/FITEE.2200466

DDUC: an erasure-coded system with decoupled data updating and coding

DDUC: 数据更新与编码解耦的纠删码系统

Research Article
Published: 02 June 2023

Volume 24, pages 716–730, (2023)
Cite this article

Frontiers of Information Technology & Electronic Engineering Aims and scope Submit manuscript

Yaofeng Tu (屠要峰) ORCID: orcid.org/0000-0002-2616-2273^1,2,
Rong Xiao (肖蓉)²,
Yinjun Han (韩银俊) ORCID: orcid.org/0000-0001-5578-2351^1,2,
Zhenghua Chen (陈正华)²,
Hao Jin (金浩)²,
Xuecheng Qi (齐学成)² &
…
Xinyuan Sun (孙辛远)²

87 Accesses
Explore all metrics

Abstract

In distributed storage systems, replication and erasure code (EC) are common methods for data redundancy. Compared with replication, EC has better storage efficiency, but suffers higher overhead in update. Moreover, consistency and reliability problems caused by concurrent updates bring new challenges to applications of EC. Many works focus on optimizing the EC solution, including algorithm optimization, novel data update method, and so on, but lack the solutions for consistency and reliability problems. In this paper, we introduce a storage system that decouples data updating and EC encoding, namely, decoupled data updating and coding (DDUC), and propose a data placement policy that combines replication and parity blocks. For the (N, M) EC system, the data are placed as N groups of M+1 replicas, and redundant data blocks of the same stripe are placed in the parity nodes, so that the parity nodes can autonomously perform local EC encoding. Based on the above policy, a two-phase data update method is implemented in which data are updated in replica mode in phase 1, and the EC encoding is done independently by parity nodes in phase 2. This solves the problem of data reliability degradation caused by concurrent updates while ensuring high concurrency performance. It also uses persistent memory (PMem) hardware features of the byte addressing and eight-byte atomic write to implement a lightweight logging mechanism that improves performance while ensuring data consistency. Experimental results show that the concurrent access performance of the proposed storage system is 1.70–3.73 times that of the state-of-the-art storage system Ceph, and the latency is only 3.4%–5.9% that of Ceph.

摘要

在分布式存储系统中, 常用的数据冗余方法包括副本和纠删码(erasure code, EC)。相较于副本, EC具有更好的存储效率, 但是在更新方面的开销更大。此外, 并发更新带来的一致性和可靠性问题给EC应用带来了新的挑战。许多研究工作都致力于优化EC技术, 包括算法优化、数据更新方法创新等, 但并发更新的一致性和可靠性问题尚未得到很好解决。本文介绍了一种将数据更新与EC编码解耦的存储系统, 命名为DDUC, 并提出了一种副本与校验块结合的放置策略。对于(N, M)的EC系统, 按照N组M+1的副本进行数据布局, 并将同一条带的冗余数据块都放置在校验节点上, 使得校验节点可以自主地执行本地EC编码。基于上述策略, 实现了一种两阶段数据更新方法, 在第一阶段按照副本模式进行数据更新, 在第二阶段由校验节点独立完成EC编码。这样在保证高并发性能的同时, 解决了并发更新导致的数据可靠性降低的问题。同时利用PMem硬件的字节寻址和8字节原子写特性实现了一种轻量级的日志机制, 在提升性能的同时保证了数据的一致性。实验结果表明, 和当前主流的存储系统Ceph相比, 本文所提出的存储系统并发访问性能提升至1.70–3.73倍, 时延仅为Ceph的3.4%–5.9%。

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Data availability

Due to the nature of this research, participants of this study did not agree for their data to be shared publicly, so supporting data are not available.

References

Aguilera MK, Janakiraman R, Xu LH, 2005a. On the erasure recoverability of MDS codes under concurrent updates. Proc IEEE Int Symp on Information Theory, p.1358–1362. https://doi.org/10.1109/ISIT.2005.1523564
Aguilera MK, Janakiraman R, Xu LH, 2005b. Using erasure codes efficiently for storage in a distributed system. Int Conf on Dependable Systems and Networks, p.336–345. https://doi.org/10.1109/DSN.2005.96
Chan JCW, Ding Q, Lee PPC, et al., 2014. Parity logging with reserved space: towards efficient updates and recovery in erasure-coded clustered storage. Proc 12^th USENIX Conf on File and Storage Technologies, p.163–176.
Ghemawat S, Gobioff H, Leung ST, 2003. The Google file system. Proc 19^th ACM Symp on Operating Systems Principles, p.29–43. https://doi.org/10.1145/945445.945450
Gong GW, Shen ZR, Wu SZ, et al., 2021. Optimal rack-coordinated updates in erasure-coded data centers. 40^th IEEE Conf on Computer Communications, p.1–10. https://doi.org/10.1109/INFOCOM42981.2021.9488813
Huang C, Simitci H, Xu YK, et al., 2012. Erasure coding in Windows Azure Storage. Proc USENIX Annual Technical Conf, p.2.
Huang JZ, Xia J, Qin X, et al., 2019. Optimization of small updates for erasure-coded in-memory stores. Comput J, 62(6):869–883. https://doi.org/10.1093/comjnl/bxz003
Article Google Scholar
Jiang TY, Zhang GY, Huang ZC, et al., 2021. Fusion-RAID: achieving consistent low latency for commodity SSD arrays. 19^th USENIX Conf on File and Storage Technologies, p.355–370.
Konwar KM, Prakash N, Lynch N, et al., 2017. A layered architecture for erasure-coded consistent distributed storage. Proc ACM Symp on Principles of Distributed Computing, p.63–72. https://doi.org/10.1145/3087801.3087832
Li HB, Zhang YM, Zhang ZM, et al., 2017. PARIX: speculative partial writes in erasure-coded systems. Proc USENIX Annual Technical Conf, p.581–587.
Liu YJ, Wei B, Wu JG, et al., 2021. Erasure-coded multiblock updates based on hybrid writes and common XORs first. 39^th Int Conf on Computer Design, p.472–479.
Meng YL, Zhang LL, Xu D, et al., 2019. A dynamic erasure code based on block code. Proc Int Conf on Embedded Wireless Systems and Networks, p.379–383.
Ousterhout J, Agrawal P, Erickson D, et al., 2010. The case for RAMClouds: scalable high-performance storage entirely in DRAM. ACM SIGOPS Oper Syst Rev, 43(4):92–105. https://doi.org/10.1145/1713254.1713276
Article Google Scholar
Peter K, Reinefeld A, 2012. Consistency and fault tolerance for erasure-coded distributed storage systems. Proc 5^th Int Workshop on Data-Intensive Distributed Computing, p.23–32. https://doi.org/10.1145/2286996.2287002
Pless V, 1998. Introduction to the Theory of Error-Correcting Codes (3^rd Ed.). John Wiley & Sons, Hoboken, USA. https://doi.org/10.1002/9781118032749
Book MATH Google Scholar
Pu WJ, Chen NJ, Zhong QW, 2020. SDCUP: software-defined-control based erasure-coded collaborative data update mechanism. IEEE Access, 8:180646–180660. https://doi.org/10.1109/ACCESS.2020.3028381
Article Google Scholar
Rizzo L, 1997. Effective erasure codes for reliable computer communication protocols. ACM SIGCOMM Comput Commun Rev, 27(2):24–36. https://doi.org/10.1145/263876.263881
Article Google Scholar
Shen ZR, Lee PPC, 2018. Cross-rack-aware updates in erasure-coded data centers. Proc 47^th Int Conf on Parallel Processing, Article 80. https://doi.org/10.1145/3225058.3225065
Wang F, Tang YJ, Xie YW, et al., 2019. XORInc: optimizing data repair and update for erasure-coded systems with XOR-based in-network computation. Proc 35^th Symp on Mass Storage Systems and Technologies, p.244–256. https://doi.org/10.1109/MSST.2019.00005
Wang YJ, Pei XQ, Ma XK, et al., 2018. TA-update: an adaptive update scheme with tree-structured transmission in erasure-coded storage systems. IEEE Trans Parall Distrib Syst, 29(8):1893–1906. https://doi.org/10.1109/TPDS.2017.2717981
Article Google Scholar
Weatherspoon H, Kubiatowicz JD, 2002. Erasure coding vs. replication: a quantitative comparison. 1^st Int Workshop on Peer-to-Peer Systems, p.328–337. https://doi.org/10.1007/3-540-45748-8_31
Xiong YL, Zhou J, Su L, et al., 2021. ECCH: erasure coded consistent hashing for distributed storage systems. IEEE Int Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking, p.177–184. https://doi.org/10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00036

Download references

Author information

Authors and Affiliations

State Key Laboratory of Mobile Network and Mobile Multimedia Technology, Shenzhen, 518000, China
Yaofeng Tu (屠要峰) & Yinjun Han (韩银俊)
ZTE Corporation, Nanjing, 210000, China
Yaofeng Tu (屠要峰), Rong Xiao (肖蓉), Yinjun Han (韩银俊), Zhenghua Chen (陈正华), Hao Jin (金浩), Xuecheng Qi (齐学成) & Xinyuan Sun (孙辛远)

Authors

Yaofeng Tu (屠要峰)
View author publications
You can also search for this author in PubMed Google Scholar
Rong Xiao (肖蓉)
View author publications
You can also search for this author in PubMed Google Scholar
Yinjun Han (韩银俊)
View author publications
You can also search for this author in PubMed Google Scholar
Zhenghua Chen (陈正华)
View author publications
You can also search for this author in PubMed Google Scholar
Hao Jin (金浩)
View author publications
You can also search for this author in PubMed Google Scholar
Xuecheng Qi (齐学成)
View author publications
You can also search for this author in PubMed Google Scholar
Xinyuan Sun (孙辛远)
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Yaofeng TU designed the research. Yinjun HAN and Zhenghua CHEN completed the detailed design. Rong XIAO drafted the paper. Hao JIN implemented the scheme. Xuecheng QI processed the data. Xinyuan SUN revised and finalized the paper.

Corresponding author

Correspondence to Yinjun Han (韩银俊).

Additional information

Compliance with ethics guidelines

Yaofeng TU, Rong XIAO, Yinjun HAN, Zhenghua CHEN, Hao JIN, Xuecheng QI, and Xinyuan SUN declare that they have no conflict of interest.

Project supported by the National Key Research and Development Program of China (No. 2021YFB3101100)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tu, Y., Xiao, R., Han, Y. et al. DDUC: an erasure-coded system with decoupled data updating and coding. Front Inform Technol Electron Eng 24, 716–730 (2023). https://doi.org/10.1631/FITEE.2200466

Download citation

Received: 15 October 2022
Accepted: 12 February 2023
Published: 02 June 2023
Issue Date: May 2023
DOI: https://doi.org/10.1631/FITEE.2200466

Key words

关键词

CLC number

TP333

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

DDUC: an erasure-coded system with decoupled data updating and coding

Abstract

摘要

Access this article

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Additional information

Compliance with ethics guidelines

Rights and permissions

About this article

Cite this article

Share this article

Key words

关键词

CLC number

Search

Navigation