An efficient and highly available framework of data recency enhancement for eventually consistent data stores

Tang, Yu; Sun, Hailong; Wang, Xu; Liu, Xudong

doi:10.1007/s11704-016-6041-1


Research Article
Published: 07 April 2017

Volume 11, pages 88–104, (2017)

Frontiers of Computer Science

Yu Tang¹, Hailong Sun¹, Xu Wang¹ & Xudong Liu¹


Abstract

Data items are usually replicated in modern distributed data stores to obtain high performance and availability. However, data replication involves availability-consistency and latency-consistency trade-offs, so system designers tend to choose weak consistency models, such as eventual consistency, which may result in stale reads. Since stale data items may lead to serious application semantic problems, we consider how to increase the probability of data recency, which provides a uniform view of recent versions of data items for all clients. In this work, we propose HARP, a framework that can enhance the data recency of eventually consistent distributed data stores in an efficient and highly available way. By detecting possible stale reads, whether or not failures occur, HARP performs reread operations to eliminate stale results only when needed, based on our analysis of the write/read processes. We also present solutions for dealing with some practical anomalies in HARP, including delayed, reordered, and dropped messages and clock drift, and show how to extend HARP to multiple datacenters. Finally, we implement HARP on top of Cassandra, and the experiments show that HARP can effectively eliminate stale reads with a low overhead (less than 6.9%) compared with the original eventually consistent Cassandra.
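
As an illustration only, the idea of rereading just when a stale result is suspected might be sketched as follows. The abstract does not say how HARP detects possible stale reads, so the detection rule used here (comparing the result against the newest write timestamp the coordinator has acknowledged) and all names (`Replica`, `Coordinator`, `latest_acked`, `read_from`) are assumptions made for this sketch, not the authors' design or Cassandra's API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

Version = Tuple[int, str]  # (write timestamp, value)


@dataclass
class Replica:
    store: Dict[str, Version] = field(default_factory=dict)

    def write(self, key: str, version: Version) -> None:
        # Last-write-wins: keep only the version with the newest timestamp.
        current = self.store.get(key)
        if current is None or version[0] > current[0]:
            self.store[key] = version

    def read(self, key: str) -> Optional[Version]:
        return self.store.get(key)


@dataclass
class Coordinator:
    replicas: List[Replica]
    # Hypothetical bookkeeping: newest acknowledged write timestamp per key.
    latest_acked: Dict[str, int] = field(default_factory=dict)
    rereads: int = 0

    def write(self, key: str, value: str, ts: int, w: int) -> None:
        # Only the first w replicas apply the write; the others lag behind,
        # which models asynchronous propagation under eventual consistency.
        for replica in self.replicas[:w]:
            replica.write(key, (ts, value))
        self.latest_acked[key] = max(ts, self.latest_acked.get(key, 0))

    def _newest(self, key: str, replicas: List[Replica]) -> Optional[Version]:
        versions = [r.read(key) for r in replicas]
        found = [v for v in versions if v is not None]
        return max(found, key=lambda v: v[0], default=None)

    def read(self, key: str, read_from: List[int]) -> Optional[Version]:
        result = self._newest(key, [self.replicas[i] for i in read_from])
        expected = self.latest_acked.get(key)
        # Assumed staleness check: the result is older than a write we know of.
        stale = expected is not None and (result is None or result[0] < expected)
        if stale:
            # Reread from every replica only when staleness is suspected.
            self.rereads += 1
            result = self._newest(key, self.replicas)
        return result


coord = Coordinator([Replica() for _ in range(3)])
coord.write("k", "v1", ts=1, w=3)  # all three replicas hold v1
coord.write("k", "v2", ts=2, w=1)  # only replica 0 holds v2 so far
fresh = coord.read("k", read_from=[0])      # replica 0 is current: no reread
recovered = coord.read("k", read_from=[2])  # replica 2 is stale: one reread
```

In this toy model the common case (a read that already returns the newest acknowledged version) costs a single round, and the extra round is paid only for the read that would otherwise have returned the old value.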



Acknowledgements

This work was supported partly by the National High-tech Research and Development Program (863 Program) of China (2015AA01A202), and partly by the National Natural Science Foundation of China (Grant Nos. 61370057 and 61421003).

Author information

Authors and Affiliations

School of Computer Science and Engineering, Beihang University, Beijing, 100191, China
Yu Tang, Hailong Sun, Xu Wang & Xudong Liu


Corresponding author

Correspondence to Xu Wang.

Additional information

Yu Tang received the BS degree from Beihang University, China in 2011. Currently, he is working towards the PhD degree in the School of Computer Science and Engineering, Beihang University. His research interests include the areas of distributed systems and availability.

Hailong Sun received the BS degree in computer science from Beijing Jiaotong University, China in 2001. He received the PhD degree in computer software and theory from Beihang University, China in 2008. He is an associate professor in the School of Computer Science and Engineering, Beihang University. His research interests include services computing, cloud computing and distributed systems. He is a member of the IEEE and the ACM.

Xu Wang received the BS degree from Beihang University, China in 2008. He received the PhD degree in computer software and theory from Beihang University in 2015. His research interests include the areas of distributed systems, service computing, replication, and availability.

Xudong Liu is a professor and dean of the School of Computer Science and Engineering, Beihang University, China. He has led several China 863 key projects and e-government projects. He has more than 30 published papers and more than 10 patents. His research interests include software middleware technology, software development methods and tools, large-scale information technology projects and application of research and teaching.

Electronic supplementary material

Supplementary material, approximately 288 KB.


About this article

Cite this article

Tang, Y., Sun, H., Wang, X. et al. An efficient and highly available framework of data recency enhancement for eventually consistent data stores. Front. Comput. Sci. 11, 88–104 (2017). https://doi.org/10.1007/s11704-016-6041-1


Received: 20 January 2016
Accepted: 05 May 2016
Published: 07 April 2017
Issue Date: February 2017
DOI: https://doi.org/10.1007/s11704-016-6041-1
